Azure Speech in Foundry Tools

Build multilingual voice AI apps with speech-to-text, text-to-speech, and translation APIs

Voice & Speech

DEVELOPER

Microsoft

WEBSITE

SOCIAL
NETWORKS

SUPPORTED
PLATFORMS

STARTING PRICE

Pay as you go

FREE TRIAL

PRICING TYPE

Freemium, Pay as you go

CARD REQUIRED

BEST FOR

Business

SUPPORTED
LANGUAGES

+ N more

See all

AI TEHNOLOGIES

Use cases

Transcribing call center conversations in real time to assist agents and automate post-call analytics
Generating captions and subtitles for audio and video content in more than 100 languages
Building conversational voice bots and assistants with natural-sounding prebuilt or custom neural voices
Enabling real-time speech-to-speech translation for multilingual communication applications
Creating custom neural voices that reflect a brand's identity for differentiated user experiences
Powering voice-enabled AI agents with end-to-end speech including customized transcription and avatars
Transcribing and summarizing meeting conversations for productivity and documentation workflows
Providing pronunciation assessment feedback to language learners in real-time
Building text-to-speech avatar videos for customer service, education, and marketing content
Deploying on-device speech recognition and synthesis in environments with intermittent connectivity
Analyzing audio and video call recordings to extract business insights using foundation models
Integrating fast batch transcription for voicemail processing, media captioning, and archiving workflows

Features

Speech-to-text transcription, Text-to-speech synthesis, Custom neural voice, Speech translation, Voice Live API, Speaker recognition, Text-to-speech avatar, Pronunciation assessment, Embedded speech, Fast transcription

Azure Speech in Foundry Tools

Description

Use cases

Features