Building custom data loader, experiment logging, tips for improving metrics, and GitHub repo if you’d like to follow along — Why Audio Data? NLP for audio data is not getting enough recognition, compared to NLP for text and computer vision tasks. Time to change that! Task Emotion recognition — recognize whether spoken audio exhibits anger, happiness, sadness, disgust, surprise, or neutral emotions.