getapp-logo

App comparison

Add up to 4 apps below to see how they compare. You can also use the "Compare" buttons while browsing.

GetApp offers objective, independent research and verified user reviews. We may earn a referral fee when you visit a vendor through our links. 

Speech Recognition Software - Page 4

Last updated: April 2026

Filter results

Features


Integrated with

No filters available


Pricing model


Devices supported


Organization types


User rating


101 software options

Philips SpeechExec logo

Use the power of your voice with professional dictation

learn more
Philips SpeechExec Pro Dictation and Transcription Software is designed for authors to focus on recording with their preferred voice recorder, download dictations quickly, and automatically route to assistants or speech recognition to transcribe files.

Read more about Philips SpeechExec

Users also considered
Voximal logo

A phone platform based on Asterisk propulsed by Voximal

learn more
Voximal is Asterisk's VoiceXML engine with state-of-the-art of latest text to speech and speech to text on-premise or online offer.

Read more about Voximal

Users also considered
GoVivace logo

Conversational AI and speech analytics solution

learn more
GoVivace is a conversational AI and speech analytics solution. It provides intelligent omnichannel chatbots and voice bots for businesses of all sizes.

Read more about GoVivace

Users also considered
Voice To Text Online logo

Speech recognition in 55 languages via browser

learn more
Voice to Text Online converts spoken words into written text in real-time across more than fifty-five languages. The browser-based tool processes audio locally using the Web Speech API, requiring no account creation or software installation. Users can transcribe live speech, save projects for later editing, and export transcripts in multiple formats including TXT, SRT, VTT, and JSON with automatic speaker identification and confidence-level indicators.

Read more about Voice To Text Online

Users also considered
ValueFlow logo

AI voice agents for conducting interviews

learn more
Interview agents that conduct voice-based interviews automatically. Simply share pre-configured interview links with your audience.

Read more about ValueFlow

Users also considered
Infercall logo

AI phone answering service for small businesses

learn more
Infercall is an AI-powered phone answering service that handles incoming calls for businesses around the clock. The platform trains on business information by extracting data from websites and uploaded documents, enabling it to answer questions about services, pricing, and availability. Infercall supports simultaneous call handling, appointment scheduling, lead qualification, call transfers, and provides full call transcripts and recordings with analytics.

Read more about Infercall

Users also considered
AICHE logo

AI-enabled software that transforms voice into text

learn more
AICHE transforms voice into polished text with one hotkey. Speak naturally - the AI delivers clean, structured output instantly copied to your clipboard. Available on Windows, Mac, Linux with privacy-first zero audio retention.

Read more about AICHE

Users also considered
AI-Powered Voice Assistants logo

Customer experience software for eCommerce businesses

learn more
AI-Powered Voice Assistants is a conversational marketing software that helps businesses recognize speech, interpret human language and optimize communications. Administrators can automate various repetitive tasks including insurance premium payment reminders and debt collection processes.

Read more about AI-Powered Voice Assistants

Users also considered
Gladia logo

Multilingual speech to text transcription API

learn more
Gladia provides an audio transcription API that converts speech to text through both asynchronous and real-time processing capabilities. The platform supports over one hundred languages and offers features including speaker diarization, sentiment analysis, named entity recognition, and word-level timestamps with sub-three-hundred-millisecond latency for real-time transcription.

Read more about Gladia

Users also considered
Sesame logo

Voice biometric identification system

learn more
Sesame by Utopia.AI is a cloud-based voice biometric identification solution which uses natural speech to identify callers in real time, by creating voice prints from previous calls without requiring caller enrollment. The software can also analyze caller vocabulary, sentiment, and emotional state.

Read more about Sesame

Users also considered
Amical logo

AI-based open-source speech-to-text application

learn more
Amical is an open-source speech-to-text application powered by generative AI technology that enables users to convert spoken words into text without using a keyboard. The application automatically understands context across different platforms, formatting dictation appropriately whether for professional emails or casual social media posts, while maintaining user privacy and delivering accurate transcriptions.

Read more about Amical

Users also considered
Call automation bot logo

Automate customer phone calls

learn more
Call Automation Bot is a cloud-based conversational AI platform that provides features such as call center coordination, call forwarding, speech-to-text processing, intent recognition, and solution flows.

Read more about Call automation bot

Users also considered
Picovoice logo

Developer-first platform for adding voice to anything

learn more
The first and only ubiquitous on-device voice AI platform. Picovoice offers speech-to-text, voice search, wake word, intent and voice activity detection engines. Its stack can run on anything from embedded devices to web browsers, providing an immersive experience not achievable by any Big Tech.

Read more about Picovoice

Users also considered
TalkMark logo

AI‑enabled transcription and summarization tool

learn more
TalkMark is an advanced AI‑powered transcription and summarization tool designed to convert speech to highly accurate text (95%+), with speaker identification, fast processing, and secure EU‑hosted infrastructure - ideal for professionals, students, creators, and enterprises.

Read more about TalkMark

Users also considered
VALT logo

Speech recognition solution

learn more
VALT is browser-based audio/video capture software for healthcare, education, government, and corporate use. It lets users record, manage, stream, and search content with features, including live observation of nine sessions, customizable data templates, and secure sharing.

Read more about VALT

Users also considered
Akkadu logo

AI subtitles & interpretation for global chat.

learn more
Akkadu offers a range of innovative solutions for making meetings and events multilingual, whether they are on-site, hybrid, or online. With Akkadu, users can add remote simultaneous interpretation (RSI), AI subtitles, or human live captioning to enhance language accessibility for participants.

Read more about Akkadu

Users also considered
CommPeak Speech-to-Text logo

AI call insights with transcripts, summaries, tone analysis

learn more
Turn call recordings into actionable insights with AI analysis. Access searchable transcripts, AI-generated summaries, and sentiment analysis to identify which conversations need follow-up attention. Use real conversations to coach agents and strengthen training programs.

Read more about CommPeak Speech-to-Text

Users also considered
Help Genie logo

Support That Sells

learn more
Help Genie is a fully branded AI voice and chat support platform for small and medium businesses. Every Genie is trained on your documentation, speaks in your brand voice, and handles calls and chats 24/7, no technical setup required.

Read more about Help Genie

Users also considered
Heynds logo

AI-enabled writing and speech assistant

learn more
Heynds is an AI Writing and Speech Assistant desktop app for Mac and Windows, coming soon to Linux. It's designed to make your writing workflow much faster and easier. You can say goodbye to slow typing, writer's block, and endless editing.

Read more about Heynds

Users also considered
Authenti logo

Voice biometrics solution for identification & verification

learn more
Authenti is a voice biometric solution that verifies identity using vocal characteristics. It can be used for various applications, such as transactions, IVR, remote access, digital signatures, multi-factor authentication, and workforce management, healthcare, travel and education.

Read more about Authenti

Users also considered
Uniphore  logo

So every person, on every call, can finally be heard.

learn more
Uniphore is the global leader in Conversational Service Automation (CSA), which combines the power of artificial intelligence, automation technology and machine learning.

Read more about Uniphore

Users also considered
Speak2mail logo

Voice to text email generator for Gmail

learn more
Speak2mail is a Chrome extension that converts voice dictation into email text directly within Gmail. The tool uses speech recognition technology to transcribe spoken words in real time and applies artificial intelligence to generate context-aware email responses. Speak2mail integrates with Gmail, supports writing in fourteen languages, and includes translation capabilities for composing messages in recipients' preferred languages.

Read more about Speak2mail

Users also considered
SpeechPulse logo

Speed up your typing using Whisper voice recognition

learn more
SpeechPulse is a dictation utility for Windows 10 and 11 and Apple Silicon Macs. It operates totally offline and can type into any text input field, including text editors, web browsers, and office applications. SpeechPulse can also use NVIDIA GPUs to speed up the transcription.

Read more about SpeechPulse

Users also considered
Voci logo

Speech analytics software with event-level metadata

learn more
Voci is a cloud-based and on-premise speech analytics software designed to help businesses gain insights into voice data using AI technology and deep learning algorithms. It lets teams transcribe large volumes of audio into analyzable text via high-speed DDR4 SDRAMs.

Read more about Voci

Users also considered
Yactraq logo

Speech Analytics for Contact Centers

learn more
Yactraq is interactive analytics that integrates emotion detection, speech analytics, and predictive intelligence to understand customer communications across multiple channels (phone, email, social media).

Read more about Yactraq

Users also considered