Computerized Visual Detection Task

Detection of Videos with Audio-Visual Inconsistency for Video Representation Learning

Abstract: Audio-visual alignment using video data is a conventional approach for the self-supervision of multi-modal representation learning. Nevertheless, the presence of background music, external ...
