With the power of AI, language barriers should no longer prevent enterprises from being able to reach or serve global market segments anywhere, anytime. Oracle Cloud Infrastructure (OCI) Speech Service now offers advancements across its speech recognition and text-to-speech offerings. Enterprises can seamlessly transcribe audio content in diverse languages without delay, empowering rapid responses and insights in multinational workflows, customer service operations, and content localization efforts.

These enhancements make it easier than ever to deploy robust, scalable voice solutions that meet the needs of diverse users in real time, whether for customer engagement, accessibility, or operational insight.

OCI Speech Service now supports Multilingual Real-Time ASR Powered by Whisper

Oracle Cloud Infrastructure (OCI) Speech Service introduces a significant upgrade with the integration of a near real-time Automatic Speech Recognition (ASR) capability within a fine-tuned Whisper model from OpenAI. This enhancement enables near real time multilingual ASR, now supporting 57 global languages.

Figure 1: Over 57 languages are offered via OCI fine-tuned Whisper model from OpenAI.

Expanded Multilingual Text-to-Speech (TTS) Support

Alongside ASR, OCI Speech Service has rolled out expanded Multilingual Text-to-Speech (TTS) capabilities. The service now supports a wider array of languages, with new additions including Spanish (ES), Brazilian Portuguese (PT-BR), Japanese (JP), Chinese (ZH), Hindi (HI), Italian (IT), French (FR), and Arabic (AR). This update allows organizations to deliver natural, high-quality spoken output in key markets, facilitating more engaging customer interactions, automated voice assistants, learning platforms, and accessibility tools.

Major Async ASR Service Optimization Delivers Over 50% Reduction in Job Latency

The OCI Speech async ASR platform has also undergone major infrastructure optimization, resulting in a more than 50% reduction in job. This dramatic improvement means customers can either process significantly higher volumes of audio data concurrently scaling faster to meet demand spikes or reducing latency to make application more responsive with the same workload. With these performance gains, OCI is positioned to offer even more flexible pricing options making the file based async service both efficient and cost-effective in high-volume or competitive market situations.

Unlock New Global Business Opportunities with AI-Powered Multilingual Features

With Oracle Speech Service’s scalable and highly performant Automatic Speech Recognition (ASR) and expanded multilingual Text-to-Speech (TTS) capabilities, we grow a step closer to the elegant vision of the “Universal Translator” from Star Trek. This streamlined, elegant translation experience helps enterprises serve customers with fewer language barriers impeding key collaboration. With these enhanced speech automation features, enterprises can not only expand their global market segments but also focus on building greater, more meaningful relationships with customers.  

Get started today with OCI Speech.

For more information, see the following resources: