Abstract: We present a novel Automatic Speech Recognition (ASR) dataset for the Oromo language, a widely spoken language in Ethiopia and neighboring regions. The dataset was collected through a ...
AssemblyAI builds advanced speech language models that power next-generation voice AI applications. AssemblyAI builds advanced speech language models that power next-generation voice AI applications.
EduScan is a high-performance biometric solution designed to automate student attendance using real-time facial recognition. The system features a multi-role dashboard architecture and deep ...
Abstract: Multimodal speech emotion recognition (SER) has emerged as pivotal for improving human–machine interaction. Researchers are increasingly leveraging both speech and textual information ...