Online Speech Recognition Software - Page 3

Last updated: March 2026

89 software options

Castel Detect Live

Cloud-based speech recognition solution

Castel Detect Live is a voice recognition solution that helps firms of all sizes manage contact center speech analytics with alerts, reminders, scripting, and call scoring. The platform lets users manage quality assurance through live call analysis, post-call audits, and data-driven feedback.

Translation Worldwide Software

Translation management tool for healthcare & medical sector

Translation Worldwide Software by JBI SOFTWARE is designed to help businesses across healthcare, legal, medical, insurance, banking, and other industries manage language translation projects. The artificial intelligence (AI)-enabled solution allows employees to handle text interpretation and translation processes and reduce lawsuits.

SoapBox

Speech recognition software designed to detect kid speech

SoapBox Labs builds speech recognition technology for kids. Its proprietary technology has been built to deliver private and accurate results for kids ages 2 to 12 of all accents and dialects. SoapBox Labs also takes into account kids’ unpredictable speech patterns and behaviors.

DeepTranscript

Transcribe large volumes of conversation

DeepTranscript is an automatic speech recognition provider for professionals, designed for large volumes and high accuracy. Its plug-and-play API collects the data available in conversations, talks, and interviews.

Kili

Training data platform for enterprise AI

Kili is a training data platform designed to help businesses in banking, manufacturing, and healthcare industries streamline the entire training process for artificial intelligence (AI) and machine learning models, from connecting with raw data sources to preparing models, processing information, and training. The platform enables organizations to handle multiple machine learning projects and process videos, images, text, and other types of data.

Maestra

Speech to text, closed captioning & transcription software

Maestra is speech-to-text software designed to help educators, researchers, marketers, journalists, and media houses automatically add transcriptions, captions, subtitles, and voiceovers to audio and video files in real time. The platform enables professionals to translate text into various languages, including English, French, Spanish, and German.

inspeech

Extract the wealth hidden in your client’s voice

Transform the valuable information contained in the calls you already have into competitive intelligence, customer satisfaction and new business.

ELSA Speech Recognition API

Leading Speech Recognition API for Language Learning

ELSA uses proprietary speech recognition technology and artificial intelligence to help language learners improve their English pronunciation.

Sesame

Voice biometric identification system

Sesame by Utopia.AI is a cloud-based voice biometric identification solution which uses natural speech to identify callers in real time, by creating voice prints from previous calls without requiring caller enrollment. The software can also analyze caller vocabulary, sentiment, and emotional state.

Akkadu

AI subtitles & interpretation for global chat.

Akkadu offers a range of innovative solutions for making meetings and events multilingual, whether they are on-site, hybrid, or online. With Akkadu, users can add remote simultaneous interpretation (RSI), AI subtitles, or human live captioning to enhance language accessibility for participants.

Ebby

Cloud-based transcription software

Ebby helps lawyers, podcasters, journalists, researchers, and academic professionals convert audio recordings into text documents using AI technology. The built-in editor automatically synchronizes and plays audio or video files with text data, letting users review and edit transcripts in real-time.

Voci

Speech analytics software with event-level metadata

Voci is a cloud-based and on-premise speech analytics software designed to help businesses gain insights into voice data using AI technology and deep learning algorithms. It lets teams transcribe large volumes of audio into analyzable text via high-speed DDR4 SDRAMs.

AI-Powered Voice Assistants

Customer experience software for eCommerce businesses

AI-Powered Voice Assistants is a conversational marketing software that helps businesses recognize speech, interpret human language and optimize communications. Administrators can automate various repetitive tasks including insurance premium payment reminders and debt collection processes.

Voximal

A phone platform based on Asterisk, powered by Voximal

Voximal is a VoiceXML engine for Asterisk with state-of-the-art text-to-speech and speech-to-text, offered on-premise or online.

Picovoice

Developer-first platform for adding voice to anything

The first and only ubiquitous on-device voice AI platform. Picovoice offers speech-to-text, voice search, wake word, intent and voice activity detection engines. Its stack can run on anything from embedded devices to web browsers, providing an immersive experience not achievable by any Big Tech.

Dictalogic

Cloud speech recognition solution

Dictalogic converts voice to text on the fly: you record audio and send it for transcription as you normally would, and the audio is converted to text before it reaches the transcriber. Multiple assignment options are available to explore.

Yactraq

Speech Analytics for Contact Centers

Yactraq is an interactive analytics solution that integrates emotion detection, speech analytics, and predictive intelligence to understand customer communications across multiple channels (phone, email, social media).

CommPeak Speech-to-Text

AI call insights with transcripts, summaries, tone analysis

Turn call recordings into actionable insights with AI analysis. Access searchable transcripts, AI-generated summaries, and sentiment analysis to identify which conversations need follow-up attention. Use real conversations to coach agents and strengthen training programs.

GoVivace

Conversational AI and speech analytics solution

GoVivace is a conversational AI and speech analytics solution. It provides intelligent omnichannel chatbots and voice bots for businesses of all sizes.

TENIOS Voice API

Platform for integrating telephony applications

TENIOS Voice API integrates speech services into your cloud telephony using standard web technologies. The API includes a variety of functions that let software applications initiate and receive calls, so developers do not need to handle telecommunications technologies directly.

Listener

Speech to Text - fast and reliable

Listener is a product that transcribes speech to text in real-time. It supports multiple languages and domains and provides high accuracy, speech adaptation, timestamps, speaker diarization, and flexible model deployment.

Authenti

Voice biometrics solution for identification & verification

Authenti is a voice biometric solution that verifies identity using vocal characteristics. It can be used for various applications, such as transactions, IVR, remote access, digital signatures, multi-factor authentication, workforce management, healthcare, travel, and education.

Uniphore

So every person, on every call, can finally be heard.

Uniphore is the global leader in Conversational Service Automation (CSA), which combines the power of artificial intelligence, automation technology and machine learning.

Wavel

Full Stack Voice AI Solutions for Videos And Localization

Wavel is an AI-powered video assistant that uses advanced text-to-speech and speech-to-text technology to create captions, subtitles, and dubbing in more than 40 languages. It offers voiceover customization with 250+ emotions and integrates with popular platforms like YouTube and Vimeo.

Philips SpeechExec

Use the power of your voice with professional dictation

Philips SpeechExec Pro Dictation and Transcription Software is designed for authors to focus on recording with their preferred voice recorder, download dictations quickly, and automatically route to assistants or speech recognition to transcribe files.
