Learning Semantic Consistency for Audio-Visual Zero-Shot Learning
Academic Background
In the field of artificial intelligence, Zero-Shot Learning (ZSL) is an extremely challenging task that aims to recognize unseen classes by leveraging knowledge from seen classes. Audio-Visual Zero-Shot Learning (AVZSL), a branch of ZSL, seeks to classify unseen classes by combining audio and visual information. However, many ex...