GetApp offers objective, independent research and verified user reviews. We may earn a referral fee when you visit a vendor through our links. 

Speech Recognition Software with Audio Capture (2026) - Page 2

Last updated: March 2026

60 software options

BigHand Workflow Management logo

Speech recognition and transcription software

Automatically delegate legal tasks to the right support staff, at the right cost to the firm, with BigHand Workflow Management. Assign support tasks & receive work seamlessly, whilst using the output reports to make data-driven decisions.

Reportex logo

Audio transcription & editing solution

Reportex from Sony is a cloud-based audio transcription and editing solution that allows users to automatically transcribe audio from multiple file formats, edit and correct transcriptions, create and share video clips of transcribed audio, download edited files, and more.

Speech Recognition Engine logo

AI-enabled speech recognition engine

LumenVox’s speech and voice software leverages artificial intelligence, natural language understanding, and deep machine learning technologies to deliver speech recognition technology. It includes neural networks to improve the ability to add new languages and dialects and serve a more diverse base of users.

Twixor logo

Customer communications management software

Twixor EnCaps is a low-code customer engagement platform that helps businesses deliver personalized interactions. The software utilizes generative AI and natural language processing functionalities on messaging platforms to create customer journeys. The CX platform combines digital assistant and intelligent process automation to deliver personalized interactions.

Express Dictate logo

Record and send dictation directly from your computer

Record and send dictation directly from your computer with Express Dictate Digital Dictation Software.

OneVoice logo

Enterprise voicemail transcription and translation tool.

OneVoice is part of a unified messaging platform for Office 365 and Gmail. It is an audio transcription, voicemail, and translation tool developed by Donoma. It aims to help sales and customer service agents perform their daily tasks by providing a range of accessible and inclusive features.

Rythmex logo

Speech to text, transcription, medical transcription

Rythmex is an AI transcription solution offering real-time, multilingual transcription services in a user-friendly interface. With its intuitive features, API integrations, and robust data security, Rythmex is the go-to solution for individuals and businesses seeking an accurate transcription solution.

GoSpeech logo

SaaS solution for transcription and subtitling

SaaS solution that uses artificial intelligence to convert speech to text.

Castel Detect Live logo

Cloud-based speech recognition solution

Castel Detect LIVE is a voice recognition solution which helps firms of all sizes manage contact center speech analytics with alerts, reminders, scripting and call scoring. The platform allows users to regulate quality assurance via live calls analysis, post-call audits, and data-driven feedback.

Translation Worldwide Software logo

Translation management tool for healthcare & medical sector

Translation Worldwide Software by JBI SOFTWARE is designed to help businesses across healthcare, legal, medical, insurance, banking, and other industries manage language translation projects. The artificial intelligence (AI)-enabled solution allows employees to handle text interpretation and translation processes and reduce lawsuits.

SoapBox logo

Speech recognition software designed to detect kid speech

SoapBox Labs builds speech recognition technology for kids. Its proprietary technology has been built to deliver private and accurate results for kids ages 2 to 12 of all accents and dialects. SoapBox Labs also takes into account kids’ unpredictable speech patterns and behaviors.

Maestra logo

Speech to text, closed captioning & transcription software

Maestra is speech-to-text software designed to help educators, researchers, marketers, journalists, and media houses automatically add transcriptions, captions, subtitles, and voiceovers to audio and video files in real time. The platform enables professionals to translate text into various languages including English, French, Spanish, and German.

inspeech logo

Extract the wealth hidden in your client’s voice

Transform the valuable information contained in the calls you already have into competitive intelligence, customer satisfaction and new business.

Voci logo

Speech analytics software with event-level metadata

Voci is speech analytics software, available in the cloud or on-premise, designed to help businesses gain insights into voice data using AI technology and deep learning algorithms. It lets teams transcribe large volumes of audio into analyzable text via high-speed DDR4 SDRAMs.

Picovoice logo

Developer-first platform for adding voice to anything

The first and only ubiquitous on-device voice AI platform. Picovoice offers speech-to-text, voice search, wake word, intent and voice activity detection engines. Its stack can run on anything from embedded devices to web browsers, providing an immersive experience not achievable by any Big Tech.

Wavel logo

Full Stack Voice AI Solutions for Videos And Localization

Wavel is an AI-powered video assistant that uses advanced text-to-speech and speech-to-text technology to create captions, subtitles, and dubbing in more than 40 languages. It offers voiceover customization with 250+ emotions and integrates with popular platforms like YouTube and Vimeo.

Uniphore logo

So every person, on every call, can finally be heard.

Uniphore is the global leader in Conversational Service Automation (CSA), which combines the power of artificial intelligence, automation technology and machine learning.

Dictalogic logo

Cloud speech recognition solution

We convert voice to text on the fly: you record audio and send it for transcription as you normally would, and the audio is converted to text before it reaches the transcriber. We offer multiple assignment options for you to explore.

SpeechPulse logo

Speed up your typing using Whisper voice recognition

SpeechPulse is a dictation utility for Windows 10 and 11 and Apple Silicon Macs. It operates totally offline and can type into any text input field, including text editors, web browsers, and office applications. SpeechPulse can also use NVIDIA GPUs to speed up the transcription.

GoVivace logo

Conversational AI and speech analytics solution

GoVivace is a conversational AI and speech analytics solution. It provides intelligent omnichannel chatbots and voice bots for businesses of all sizes.

CommPeak Speech-to-Text logo

AI call insights with transcripts, summaries, tone analysis

Turn call recordings into actionable insights with AI analysis. Access searchable transcripts, AI-generated summaries, and sentiment analysis to identify which conversations need follow-up attention. Use real conversations to coach agents and strengthen training programs.

Aveni Assist logo

Automated meeting capture and compliance for advisers

Aveni Assist transcribes adviser–client meetings with high accuracy, applies speaker diarisation, and links content to CRM records. Compliance checks analyse transcripts for risks and maintain a searchable audit trail.

CogniAIX logo

AI-based tool turning conversations into tracked tasks

CogniAIX is an AI-based productivity tool that transcribes conversations and extracts actionable items from audio recordings. The software allows users to upload audio files or record directly through their microphone, then automatically identifies decisions, commitments, and action items from the transcribed content. CogniAIX converts these extracted elements into assigned tasks with designated owners and provides automated task tracking and follow-up capabilities.

Amical logo

AI-based open-source speech-to-text application

Amical is an open-source speech-to-text application powered by generative AI technology that enables users to convert spoken words into text without using a keyboard. The application automatically understands context across different platforms, formatting dictation appropriately whether for professional emails or casual social media posts, while maintaining user privacy and delivering accurate transcriptions.
