Added
24 hours ago
Type
Full time
Salary
Upgrade to Premium to se...

Related skills

python pytorch elevenlabs nvidia nemo espnet

๐Ÿ“‹ Description

  • TTS backend: integrate & optimize vendor APIs; lead R&D for TTS.
  • Linguistic optimization: ensure natural TTS with SSML and pronunciation.
  • Turn design: craft context-specific utterances to optimize turns and trust.
  • Prompt and persona management: manage LLM and TTS prompts to define agent personalities.
  • UI exposure: expose voice attributes (speed, pitch, tone) to the product UI for customization.
  • Cross-functional R&D: partner with ASR and Audio AI to ensure end-to-end voice quality and low latency in the pipeline.

๐ŸŽฏ Requirements

  • Python strong; deep learning (PyTorch).
  • 3+ yrs in TTS/Voice; NeMo/ESPnet/Coqui; APIs ElevenLabs, Rime, Cartesia.
  • Degree in Computational Linguistics/CS/AI-ML; phonetics & prosody.
  • Prompt engineering: craft/evaluate LLM prompts and templates.
  • Backend APIs; multi-vendor integration; GCP preferred.
  • Speech quality metrics (MOS, latency); design A/B tests.

๐ŸŽ Benefits

  • Work at the center of AI transformation in business communications.
  • Build and ship agentic AI products redefining operations.
  • Collaborative team with strong training and growth.
  • Competitive benefits and growth opportunities.
  • Inclusive offices and Great Place to Work culture.
Share job

Meet JobCopilot: Your Personal AI Job Hunter

Automatically Apply to Engineering Jobs. Just set your preferences and Job Copilot will do the rest โ€” finding, filtering, and applying while you focus on what matters.

Related Engineering Jobs

See more Engineering jobs โ†’