Visual programming language Benefits

Language-Guided Audio-Visual Learning for Long-Term Sports Assessment

Abstract: Long-term sports assessment is a challenging task in video understanding since it requires judging complex movement variations and action-music coordination. However, there is no direct ...

IEEE

RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation

Abstract: We leverage Large Language Models (LLM) for zero-shot Semantic Audio Visual Navigation (SAVN). Existing methods utilize extensive training demonstrations for rein-forcement learning, yet ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Language-Guided Audio-Visual Learning for Long-Term Sports Assessment

RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation

Trending now