We released a lightweight multimodal baseline for Emotion Recognition in Conversations (SemEval-2024 Task 3, Friends-based dataset): transformer text classifier + self-supervised speech reps + late fusion arxiv.org/abs/2602.00914
#Multimodal #NLP #Speech #EmotionRecognition #MachineLearning #SemEval