Oracle Cloud Infrastructure Speech

oracle.com
Summary

Oracle Cloud Infrastructure (OCI) Speech is an AI service that provides speech-to-text and text-to-speech capabilities. It offers real-time transcription, supports multiple languages, and integrates into applications through APIs and SDKs. The service targets enterprise use cases such as customer service call analysis, medical dictation, and search over digital media content.

Features
5 of 13

Must Have

4 of 5

Conversational AI

API Access

Fine-Tuning & Custom Models

Enterprise Solutions

Safety & Alignment Framework

Other

1 of 8

Multimodal AI

Image Generation

Code Generation

Research & Publications

Security & Red Teaming

Synthetic Media Provenance

Threat Intelligence Reporting

Global Affairs & Policy

Pricing
Usage-based

Transcription

$0.00 per use
  • Speech to Text
  • Text to Speech

Transcription

$0.35 per use
  • Speech to Text
  • Text to Speech
Rationale

Oracle Cloud Infrastructure (OCI) Speech offers both speech-to-text and text-to-speech capabilities, which aligns directly with conversational AI. REST APIs, SDKs, and CLIs are explicitly mentioned for integration, which fulfills the API access feature. The service also supports customization, which is counted here as fine-tuning. Although it is not explicitly labeled "enterprise solutions," OCI is an enterprise-grade cloud platform, and use cases such as customer service calls and medical dictation suggest enterprise applicability. The mention of OCI Language for sentiment analysis alongside OCI Speech is treated as a multimodal approach, even though the service does not directly generate multimodal content.