GetApp offers objective, independent research and verified user reviews. We may earn a referral fee when you visit a vendor through our links. 

Speech Recognition Software with Third-Party Integrations (2026) - Page 2

Last updated: March 2026

40 software options

3Play Media

Closed captioning and transcription solution

3Play Media is designed to help businesses across media, entertainment, eCommerce, fitness, education, and government sectors handle closed captioning, transcription, audio description, live captioning, and subtitling operations. It enables users to manage podcasts, enhance search engine optimization (SEO) activities, and track audience engagement via a unified platform.

Kili

Training data platform for enterprise AI

Kili is a training data platform designed to help businesses in banking, manufacturing, and healthcare industries streamline the entire training process for artificial intelligence (AI) and machine learning models, from connecting with raw data sources to preparing models, processing information, and training. The platform enables organizations to handle multiple machine learning projects and process videos, images, text, and other types of data.

inspeech

Extract the wealth hidden in your client’s voice

Transform the valuable information contained in the calls you already have into competitive intelligence, customer satisfaction and new business.

Voci

Speech analytics software with event-level metadata

Voci is speech analytics software, available in the cloud or on-premises, designed to help businesses gain insights into voice data using AI technology and deep learning algorithms. It lets teams transcribe large volumes of audio into analyzable text via high-speed DDR4 SDRAMs.

AI-Powered Voice Assistants

Customer experience software for eCommerce businesses

AI-Powered Voice Assistants is conversational marketing software that helps businesses recognize speech, interpret human language, and optimize communications. Administrators can automate various repetitive tasks, including insurance premium payment reminders and debt collection processes.

Voximal

A phone platform based on Asterisk, powered by Voximal

Voximal is a VoiceXML engine for Asterisk that offers state-of-the-art text-to-speech and speech-to-text, available on-premises or online.

GoVivace

Conversational AI and speech analytics solution

GoVivace is a conversational AI and speech analytics solution. It provides intelligent omnichannel chatbots and voice bots for businesses of all sizes.

VALT

Speech recognition solution

VALT is browser-based audio/video capture software for healthcare, education, government, and corporate use. It lets users record, manage, stream, and search content, with features including live observation of nine sessions, customizable data templates, and secure sharing.

Amical

AI-based open-source speech-to-text application

Amical is an open-source speech-to-text application powered by generative AI technology that enables users to convert spoken words into text without using a keyboard. The application automatically understands context across different platforms, formatting dictation appropriately whether for professional emails or casual social media posts, while maintaining user privacy and delivering accurate transcriptions.

Aveni Assist

Automated meeting capture and compliance for advisers

Aveni Assist transcribes adviser–client meetings with high accuracy, applies speaker diarisation, and links content to CRM records. Compliance checks analyse transcripts for risks and maintain a searchable audit trail.

Vocova

AI transcription & translation for audio/video

Vocova is an AI-powered transcription tool that converts audio and video files into text across more than one hundred languages. The software features automatic speaker identification, word-level timestamps, and the ability to import content directly from over one thousand platforms including YouTube, TikTok, and various podcast hosts. Users can translate transcripts into more than one hundred forty languages and export results in multiple formats such as PDF, DOCX, SRT, and VTT.

Reteta

Cloud-based medical transcription tool for doctors

Reteta is a cloud-based healthcare technology solution that transforms patient-physician conversations into comprehensive medical diagnoses and treatment notes. The platform provides automatic speech recognition (ASR) models that recognize medical terminology, medication names, and multiple speakers, so medical professionals can generate detailed clinical documentation.

AICHE

AI-enabled software that transforms voice into text

AICHE transforms voice into polished text with one hotkey. Speak naturally, and the AI delivers clean, structured output that is instantly copied to your clipboard. Available on Windows, Mac, and Linux, with privacy-first zero audio retention.

Gladia

Multilingual speech to text transcription API

Gladia provides an audio transcription API that converts speech to text through both asynchronous and real-time processing capabilities. The platform supports over one hundred languages and offers features including speaker diarization, sentiment analysis, named entity recognition, and word-level timestamps with sub-three-hundred-millisecond latency for real-time transcription.
