Abstract: With the rising interest in research on Large Multi-modal Models (LMMs) for video understanding, many studies have emphasized general video comprehension capabilities, neglecting the ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results