News

What happens when you move AI from the cloud to your PC? In this episode of DEMO, host Keith Shaw visits Qualcomm HQ in San Diego to explore the power of AI at the edge. Jeff Monday, VP of Global ...
Abstract: Automatic emotion recognition from speech plays a fundamental role towards advanced emotional intelligence in human-machine interaction systems. The discriminative knowledge from speech for ...
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon. Speech Emotion Recognition using .
Abstract: In this article, we target speech translation (ST). We propose lightweight approaches that generally improve either ASR or end-to-end ST models. We leverage continuous representations of ...
Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), support for SafeTensors/BF16, voice cloning, dialogue generation, and ...