Summary
Hume AI is an empathic AI research lab that provides AI models and APIs for real-time voice interactions, text-to-speech, and expression measurement. Their offerings include the Empathic Voice Interface (EVI) for emotionally intelligent conversational AI and Octave TTS for expressive voice generation. They also offer tools for measuring emotional expression across various modalities and provide enterprise solutions.
Features: 7/13
Must Have
5 of 5
Conversational AI
API Access
Safety & Alignment Framework
Fine-Tuning & Custom Models
Enterprise Solutions
Other
2 of 8
Multimodal AI
Research & Publications
Image Generation
Code Generation
Security & Red Teaming
Synthetic Media Provenance
Threat Intelligence Reporting
Global Affairs & Policy
Pricing: Tiered
Free
- 10,000 characters of text-to-speech per month (~10 minutes)
- Unlimited custom voices
Starter
- 30,000 characters of text-to-speech per month (~30 minutes)
- Unlimited custom voices
- 20 projects
- Commercial license
Creator
- 100,000 characters of text-to-speech per month (~100 minutes)
- Usage-based pricing for additional characters ($0.20 per 1,000)
- Unlimited custom voices
- 1,000 projects
- Commercial license
Pro
- 500,000 characters of text-to-speech per month (~500 minutes)
- Usage-based pricing for additional characters ($0.15 per 1,000)
- Unlimited custom voices
- 3,000 projects
- Commercial license
Scale
- 2,000,000 characters of text-to-speech per month (~2,000 minutes)
- Usage-based pricing for additional characters ($0.13 per 1,000)
- Unlimited custom voices
- 10,000 projects
- Commercial license
Business
- 10,000,000 characters of text-to-speech per month (~10,000 minutes)
- Usage-based pricing for additional characters ($0.10 per 1,000)
- Unlimited custom voices
- 20,000 projects
- Commercial license
Enterprise
- As much usage as you need
- Custom terms & assurance around DPA/SLAs
- Security questionnaires
- Unlimited custom voices
- Significantly discounted pricing at scale
- Priority support
- Commercial license
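The overage rates listed for the Creator, Pro, Scale, and Business tiers can be turned into a rough monthly estimate. The sketch below is illustrative only: `TIERS` and `overage_cost` are hypothetical names, the figures are copied from the tier list above, and it assumes extra characters are billed proportionally rather than rounded up to whole blocks of 1,000. Base subscription fees are not listed here, so only the overage portion is computed.

```python
# Included monthly characters and overage rate (USD per 1,000 extra characters),
# copied from the tier list above. Free and Starter list no overage rate, and
# Enterprise pricing is custom, so those tiers are omitted.
TIERS = {
    "Creator": (100_000, 0.20),
    "Pro": (500_000, 0.15),
    "Scale": (2_000_000, 0.13),
    "Business": (10_000_000, 0.10),
}


def overage_cost(tier: str, characters_used: int) -> float:
    """Estimate the overage charge in USD for one month (hypothetical helper).

    Assumption: extra characters are billed proportionally, not in whole
    1,000-character blocks. The actual billing rule may differ.
    """
    included, rate_per_1000 = TIERS[tier]
    extra = max(0, characters_used - included)  # no charge within the allowance
    return round(extra / 1000 * rate_per_1000, 2)


if __name__ == "__main__":
    # 150,000 characters on Creator is 50,000 over the allowance.
    print(overage_cost("Creator", 150_000))
    # Usage below the allowance incurs no overage.
    print(overage_cost("Pro", 400_000))
```

Under these assumptions, heavy use on a lower tier can be compared with the allowance of the next tier up before deciding whether to upgrade.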
Rationale
Hume AI's Empathic Voice Interface (EVI) is a real-time, emotionally intelligent voice AI, which satisfies the Conversational AI criterion. API access is available for the TTS, EVI, and Expression Measurement models. 'Octave Voice Design' and the 'Custom Models API' indicate support for fine-tuning and custom models, and the 'Enterprise' pricing tier and solutions address enterprise needs. The company's core focus on 'empathic AI' and on 'aligning them with human well-being' suggests a safety and alignment framework. The 'Expression Measurement' models analyze expression across voice, face, and language, which indicates multimodal AI capability. Finally, the site has 'Research' and 'Publications' sections, which satisfies the Research & Publications criterion.