top of page

Senior Machine Learning Engineer – Speech

Apply Now  Please email resume to careers@ellipsishealth.com 

Ellipsis Health is creating cutting-edge AI/ML products that solve healthcare staffing issues and administrative burdens using conversational AI and our patented voice biomarker technology in the delivery of better healthcare for everyone. We are headquartered in Silicon Valley and are funded and supported by some of the most preeminent venture capital teams.


We are currently looking for a Senior Machine Learning Engineer specializing in speech. In this role, you will be responsible for designing, developing, and optimizing state-of-the-art speech recognition, synthesis and processing systems. Additionally, you will play a key role in developing and implementing evaluation metrics to ensure the accuracy, reliability, and overall quality of speech models and systems.


Ellipsis Health is located in the San Francisco Bay Area, but we are open to remote candidates for this role.

RESPONSIBILITIES

  • Design and implement advanced machine learning models for speech recognition (ASR), speech-to-speech, and text-to-speech (TTS) systems.

  • Research and evaluate new algorithms, frameworks, and techniques in speech and audio processing.

  • Develop efficient workflows for curating, cleaning, and annotating large-scale speech datasets, ensuring data quality and relevance for training and evaluation.

  • Preprocess speech data by handling noise reduction, segmentation, feature extraction (e.g., MFCCs, spectrograms), and augmentation techniques to enhance model robustness.

  • Develop robust evaluation frameworks and metrics for assessing the performance of speech models.

  • Fine-tune and deploy pre-trained open-source models for speech-related tasks, such as ASR, speech diarization, time-stamps alignment, role detection, PII removal, and speech language detection.

  • Collaborate with cross-functional teams, including data engineers, software developers, and product managers, to integrate speech technologies into end-user applications.

  • Stay up to date with advancements in speech technology and contribute to the company’s technical strategy in this domain.

REQUIREMENTS

  • Bachelor’s, Master’s, or Ph.D. in Computer Science, Electrical Engineering, or a related field.

  • 5+ years of experience in machine learning, with at least 2 years focused on speech or audio processing.

  • Strong understanding of speech processing concepts, including ASR, TTS, and acoustic modeling.

  • Proficiency in Python and experience with PyTorch.

  • Knowledge of deep learning architectures like CNNs, and transformers.

  • Solid knowledge of signal processing techniques and feature extraction methods (e.g., MFCCs, spectrograms).

  • Strong understanding and quality of speech evaluation techniques and metrics.

  • Familiarity with cloud platforms (e.g., AWS, GCP, Azure) and ML operations (MLOps) best practices.

  • Strong problem-solving skills and the ability to work independently on complex projects.

  • Hands-on experience with open-source speech models and datasets. (a plus)

  • Speech applications in healthcare (a plus).

Ellipsis Health is an inclusive company where diversity is celebrated. We are an equal opportunity workplace and affirmative action employer. We are committed to equal opportunity regardless of race, color, religion, sex, gender identity, ancestry, citizenship, age, physical or mental disability, military or veteran status, marital status, domestic partner status or any other basis protected by local, state or federal laws.

BENEFITS

  • Competitive salary

  • Meaningful stock options

  • Generous PTO policy

  • Health insurance (medical, dental, vision)

  • 401(k)

Apply Now  Please email resume to careers@ellipsishealth.com 

bottom of page