Audio signal processing. Corpus processing & management. Text normalization.
Language modeling: Training, evaluating and optimizing language models for voice-enabled Conversational AI applications.
Data augmentation for Language Models using various approaches including Generative AI.
Phonetic lexicon curation.
Developing and optimizing acoustic models for various languages, environmental conditions, and channels.
Analyzing performance and optimizing existing Voice Activity Detection (VAD) models.
Working with individuals from diverse backgrounds to analyze requirements and customize the ASR solution for enhancing the applicationβs performance.
Skills Required
Machine Learning
Python
Find more jobs at Omilia
Find out why Omilia was named a Leader in the 2023 Gartner Magic Quadrantβ’ for Enterprise Conversational AI Platforms for the second time in a row!