getapp-logo

App comparison

Add up to 4 apps below to see how they compare. You can also use the "Compare" buttons while browsing.

GetApp offers objective, independent research and verified user reviews. We may earn a referral fee when you visit a vendor through our links. 

Speech Recognition Software with Speech-to-Text Analysis (2026) - Page 2

Last updated: April 2026

1 filter applied

Features


Integrated with

No filters available


Pricing model


Devices supported


Organization types


User rating


63 software options

CogniAIX logo

AI-based tool turning conversations into tracked tasks

learn more
CogniAIX is an AI-based productivity tool that transcribes conversations and extracts actionable items from audio recordings. The software allows users to upload audio files or record directly through their microphone, then automatically identifies decisions, commitments, and action items from the transcribed content. CogniAIX converts these extracted elements into assigned tasks with designated owners and provides automated task tracking and follow-up capabilities.

Read more about CogniAIX

Users also considered
OneVoice logo

Enterprise voicemail transcription and translation tool.

learn more
OneVoice is part of a unified messaging platform for Office 365 and Gmail. It is an audio transcription, voicemail, and translation tool developed by Donoma. It aims to help sales and customer service agents perform their daily tasks by providing a range of accessible and inclusive features.

Read more about OneVoice

Users also considered
Rythmex logo

Speech to text, transcription, medical transcription

learn more
Rythmex is an AI transcription solution offering real-time, multilingual transcription services in a user-friendly interface. With its intuitive features, API integrations, and robust data security, Rythmex is the go-to solution for individuals and businesses seeking accurate transcription solution.

Read more about Rythmex

Users also considered
Verbit logo

Verbit makes video and audio accessible and more engaging.

learn more
Verbit provides accurate captions & transcription of live and recorded video to make them accessible & engaging to all audiences. These tools help educators, government entities, legal agencies, business leaders and media producers meet the needs of those with disabilities & all engage viewers.

Read more about Verbit

Users also considered
Speechmatics logo

Global experts in deep learning and speech recognition

learn more
Global experts in deep learning and speech recognition

Read more about Speechmatics

Users also considered
GoSpeech logo

Saas solution for transcription and subtitling

learn more
Saas solution to convert speech to text based on artificial intelligence

Read more about GoSpeech

Users also considered
3Play Media logo

Closed captioning and transcription solution

learn more
3Play Media is designed to help businesses across media, entertainment, eCommerce, fitness, education, and government sectors handle closed captioning, transcription, audio description, live captioning, and subtitling operations. It enables users to manage podcasts, enhance search engine optimization (SEO) activities, and track audience engagement via a unified platform.

Read more about 3Play Media

Users also considered
Rev.ai logo

Speech recognition software using asynchronous API

learn more
Rev.ai’s suite of speech-to-text APIs allows businesses to build downstream applications. Speech recognition software built from speech engine trained to transcribe content on various topics with various accents for various industries.

Read more about Rev.ai

Users also considered
Castel Detect Live logo

Cloud-based speech recognition solution

learn more
Castel Detect LIVE is a voice recognition solution which helps firms of all sizes manage contact center speech analytics with alerts, reminders, scripting and call scoring. The platform allows users to regulate quality assurance via live calls analysis, post-call audits, and data-driven feedback.

Read more about Castel Detect Live

Users also considered
Translation Worldwide Software logo

Translation management tool for healthcare & medical sector

learn more
Translation Worldwide Software by JBI SOFTWARE is designed to help businesses across healthcare, legal, medical, insurance, banking, and other industries manage language translation projects. The artificial intelligence (AI)-enabled solution allows employees to handle text interpretation and translation processes and reduce lawsuits.

Read more about Translation Worldwide Software

Users also considered
SoapBox logo

Speech recognition software designed to detect kid speech

learn more
SoapBox Labs builds speech recognition technology for kids. Its proprietary technology has been built to deliver private and accurate results for kids ages 2 to 12 of all accents and dialects. Soapbox Labs also takes into account kids’ unpredictable speech patterns and behaviors.

Read more about SoapBox

Users also considered
Maestra logo

Speech to text, closed captioning & transcription software

learn more
Maestra is a speech to text software designed to help educators, researchers, marketers, journalists, and media houses automatically add transcriptions, captions, subtitles, and voiceovers to audio and video files in real-time. The platform enables professionals to translate text into various languages including English, French, Spanish, and German.

Read more about Maestra

Users also considered
inspeech logo

Extract the wealth hidden in your client’s voice

learn more
Transform the valuable information contained in the calls you already have into competitive intelligence, customer satisfaction and new business.

Read more about inspeech

Users also considered
Sesame logo

Voice biometric identification system

learn more
Sesame by Utopia.AI is a cloud-based voice biometric identification solution which uses natural speech to identify callers in real time, by creating voice prints from previous calls without requiring caller enrollment. The software can also analyze caller vocabulary, sentiment, and emotional state.

Read more about Sesame

Users also considered
Ebby logo

Cloud-based transcriptions software

learn more
Ebby helps lawyers, podcasters, journalists, researchers, and academic professionals convert audio recordings into text documents using AI technology. The built-in editor automatically synchronizes and plays audio or video files with text data, letting users review and edit transcripts in real-time.

Read more about Ebby

Users also considered
Dictalogic logo

Cloud speech recognition solution

learn more
With the use of digital transformation, we allow a voice to text conversion on the fly, where you just record audio and send it to transcribe as you normally would and the audio converts to text before it reaches the transcriber. We have multiple options on assignment for you to explore.

Read more about Dictalogic

Users also considered
Picovoice logo

Developer-first platform for adding voice to anything

learn more
The first and only ubiquitous on-device voice AI platform. Picovoice offers speech-to-text, voice search, wake word, intent and voice activity detection engines. Its stack can run on anything from embedded devices to web browsers, providing an immersive experience not achievable by any Big Tech.

Read more about Picovoice

Users also considered
Yactraq logo

Speech Analytics for Contact Centers

learn more
Yactraq is interactive analytics that integrates emotion detection, speech analytics, and predictive intelligence to understand customer communications across multiple channels (phone, email, social media).

Read more about Yactraq

Users also considered
CommPeak Speech-to-Text logo

AI call insights with transcripts, summaries, tone analysis

learn more
Turn call recordings into actionable insights with AI analysis. Access searchable transcripts, AI-generated summaries, and sentiment analysis to identify which conversations need follow-up attention. Use real conversations to coach agents and strengthen training programs.

Read more about CommPeak Speech-to-Text

Users also considered
GoVivace logo

Conversational AI and speech analytics solution

learn more
GoVivace is a conversational AI and speech analytics solution. It provides intelligent omnichannel chatbots and voice bots for businesses of all sizes.

Read more about GoVivace

Users also considered
Uniphore  logo

So every person, on every call, can finally be heard.

learn more
Uniphore is the global leader in Conversational Service Automation (CSA), which combines the power of artificial intelligence, automation technology and machine learning.

Read more about Uniphore

Users also considered
Listener logo

Speech to Text - fast and reliable

learn more
Listener is a product that transcribes speech to text in real-time. It supports multiple languages and domains and provides high accuracy, speech adaptation, timestamps, speaker diarization, and flexible model deployment.

Read more about Listener

Users also considered
Authenti logo

Voice biometrics solution for identification & verification

learn more
Authenti is a voice biometric solution that verifies identity using vocal characteristics. It can be used for various applications, such as transactions, IVR, remote access, digital signatures, multi-factor authentication, and workforce management, healthcare, travel and education.

Read more about Authenti

Users also considered
Voci logo

Speech analytics software with event-level metadata

learn more
Voci is a cloud-based and on-premise speech analytics software designed to help businesses gain insights into voice data using AI technology and deep learning algorithms. It lets teams transcribe large volumes of audio into analyzable text via high-speed DDR4 SDRAMs.

Read more about Voci

Users also considered
SpeechPulse logo

Speed up your typing using Whisper voice recognition

learn more
SpeechPulse is a dictation utility for Windows 10 and 11 and Apple Silicon Macs. It operates totally offline and can type into any text input field, including text editors, web browsers, and office applications. SpeechPulse can also use NVIDIA GPUs to speed up the transcription.

Read more about SpeechPulse

Users also considered