Summary

Tavus provides an AI platform for building realistic human-AI interactions through conversational video interfaces. Its core offering is a set of APIs for generating digital twins and custom replicas, enabling real-time video conversations with AI agents that can see, hear, and respond. It targets developers and product teams and offers several plans, including enterprise solutions.

Features
6/13

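Must Have (4 of 5)
  • Conversational AI: matched
  • API Access: matched
  • Fine-Tuning & Custom Models: matched
  • Enterprise Solutions: matched
  • Safety & Alignment Framework: not matched

Other (2 of 8)
  • Image Generation: matched
  • Multimodal AI: matched
  • Code Generation: not matched
  • Research & Publications: not matched
  • Security & Red Teaming: not matched
  • Synthetic Media Provenance: not matched
  • Threat Intelligence Reporting: not matched
  • Global Affairs & Policy: not matched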

Pricing
Tiered

Free

$0.00 monthly
  • White-labeled APIs
  • 25 free minutes of AI conversational video
  • 5 free minutes of AI video generation
  • 5 free minutes of lip sync video
  • Access to 25 stock replicas
  • Support for 30+ languages
  • Watermark free
  • Limited minutes for AI conversational video, AI video generation, and lip sync video
  • Limited access to stock replicas

Starter

$59.00 monthly
Popular
  • Everything in Free, plus:
  • 3 custom replica trainings
  • 100 minutes of AI conversational video
  • 10 minutes of AI video generation
  • 10 minutes of lip sync video
  • Pay-as-you-go overage with no limits
  • Up to 3 concurrent streams

Growth

$397.00 monthly
  • Everything in Starter, plus:
  • 7 custom replica trainings
  • 500 minutes of AI conversational video
  • 100 minutes of AI video generation
  • 100 minutes of lip sync video
  • Access to 100+ stock replicas
  • Conversation recordings
  • Up to 15 concurrent streams

Enterprise

Custom
  • Everything in Growth, plus:
  • 100% white-labeled experience
  • Scaling discounts
  • Custom concurrency limits
  • Top-tier support
  • Enterprise-grade security and compliance
  • Guaranteed SLAs for speed and compute
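
As a rough illustration of how the tiers compare, the sketch below encodes the listed prices and included conversational-video minutes and picks the cheapest listed plan that covers a given need. The `Plan` class and `cheapest_plan` function are hypothetical names introduced here, not part of any Tavus API. The sketch ignores pay-as-you-go overage (the listing gives no overage rates), omits Enterprise (its price is "Custom"), and assumes a single stream for the Free plan, for which no concurrency figure is listed.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Plan:
    name: str
    monthly_usd: float
    conversational_minutes: int
    concurrent_streams: Optional[int]  # None where the listing gives no figure


# Figures copied from the pricing listing; Enterprise is omitted (price is "Custom").
PLANS = [
    Plan("Free", 0.00, 25, None),
    Plan("Starter", 59.00, 100, 3),
    Plan("Growth", 397.00, 500, 15),
]


def cheapest_plan(minutes_needed: int, streams_needed: int = 1) -> Optional[Plan]:
    """Return the lowest-priced listed plan whose included allowance covers the need.

    Overage billing is ignored because the listing states no overage rates.
    """
    for plan in sorted(PLANS, key=lambda p: p.monthly_usd):
        if plan.conversational_minutes < minutes_needed:
            continue
        # Assumption: a plan with no listed stream limit supports one stream.
        limit = plan.concurrent_streams if plan.concurrent_streams is not None else 1
        if limit >= streams_needed:
            return plan
    return None  # nothing listed covers this; Enterprise would need a custom quote
```

For example, `cheapest_plan(80, streams_needed=5)` skips Starter because of its 3-stream limit and returns the Growth plan, while a request for 600 included minutes returns `None`, which under these assumptions points to an Enterprise quote or overage billing.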
Rationale

Tavus describes its product as an 'OS for Human-AI Interaction', with building blocks that let AI agents see, hear, respond, and look human in real time, which maps directly to the 'conversational-ai' feature. It explicitly mentions 'easy-to-use APIs' and 'white-labeled APIs' for integration, fulfilling 'api-access'. The ability to create 'digital twins' and 'custom replicas' from user videos, together with 'fine-tuning' capabilities, matches 'fine-tuning-and-custom-models'. The 'Enterprise' plan, with 'Enterprise-grade security and compliance' and 'scaling discounts', satisfies 'enterprise-solutions'. Although Tavus does not offer 'image generation' in the DALL-E sense, its 'Phoenix-3' model focuses on 'full-face rendering' and 'lifelike digital replicas', which is a form of AI-generated visual content. The combination of face rendering, vision, speech, and emotional intelligence in one system, plus the ability to swap in an LLM, RAG, or TTS component, points to 'multimodal-ai'. The website does not explicitly mention a 'safety-alignment-framework' or 'code-generation' as core offerings.