GetApp offers objective, independent research and verified user reviews. We may earn a referral fee when you visit a vendor through our links. 

Data Extraction Software for Linux - Page 4

Last updated: March 2026

110 software options

Base64.ai

Intelligent Document Processing AI

Base64.ai is a cutting-edge artificial intelligence platform that automates document processes. It understands all document types worldwide, including IDs, passports, Covid tests, vaccinations, invoices, checks, and forms.

Kili

Training data platform for enterprise AI

Kili is a training data platform designed to help businesses in banking, manufacturing, and healthcare industries streamline the entire training process for artificial intelligence (AI) and machine learning models, from connecting with raw data sources to preparing models, processing information, and training. The platform enables organizations to handle multiple machine learning projects and process videos, images, text, and other types of data.

DataLark

Data management tool

DataLark is an SAP-focused no-code/low-code data management platform that simplifies the migration and integration of business-critical data.

PDFix SDK

Cross-platform solution for PDF & data extraction processes

Machine learning techniques help us to create an algorithm that allows you to extract data in an easily structured way. Export data as HTML or JSON or use PDFix API calls to use data directly in your workflows.

Web Scraping API

Effortlessly scrape web data you need

Smartproxy offers fully equipped data collection tools for price comparison, product research, competitor analysis, SEO research, and brand intelligence. Smartproxy’s Web Scraping API lets you scrape web data effortlessly: it combines proxies and a web scraper for a complete scraping experience.

SingularityAI

Your Real-Time AI To Transform Data To Insight

SingularityAI is an artificial intelligence platform that supports document processing via optical character recognition (OCR), computer vision, natural language processing (NLP), machine learning (ML), and more.

PLANET AI

Intelligent document analysis software suite

PLANET AI’s Intelligent Document Analysis (IDA) software suite offers comprehensive capabilities for customers with the common desire for short time-to-value automation and high-quality data capture, extraction, and understanding.

Diyotta

Effortless Data Integration for Analytics Teams

Diyotta is data integration software that provides businesses with tools to automatically source, process, and analyze collected data on a centralized platform. Administrators can gain an overview of all synchronized data and variation trends through graphs and actionable analytics.

ExB

AI-enabled data extraction and mining software

Cognitive Workbench is an artificial intelligence (AI) enabled platform designed to help businesses in industries such as healthcare, mobility, insurance, and others streamline text mining processes using natural language processing (NLP) and machine learning algorithms.

Tom Sawyer

Data visualization application with interactive graph layout

Tom Sawyer Perspectives is a data-driven web, desktop, and cloud-based platform for building graph and data visualization and analysis applications.

Xtract.io

Artificial intelligence (AI)-enabled data extraction system

Xtract.io is designed to help organizations collect business data from various websites, PDF files, or text files and securely store it on local disks. It enables data analysts to conduct predictive analytics, handle image recognition, and streamline natural language processing operations.

AllRead

Spot it. Read it. Digitize it.

AllRead MLT is deep tech and computer vision software for tracking and monitoring goods and vehicles in ports and intermodal platforms. It offers high accuracy and fast integration, and it is hardware-agnostic.

amberSearch

Making Enterprise Search a No Brainer

amberSearch is an intelligent enterprise search engine that combines the knowledge of all data sources within your company.

ITyX

Artificial intelligence-enabled process automation platform

ITyX is an artificial intelligence-enabled platform designed to help businesses automate, capture, and manage processes across emails and documents. Administrators can import, analyze, classify, enrich, and validate different text structures and use pre-configured formats and workflows to automate corporate processes.

PDFix Desktop Pro

Professional Accessibility Remediation Software Tool

PDFix Desktop Pro is a comprehensive solution for PDF accessibility, PDF conversion, and data extraction, designed for professionals and businesses of all sizes.

Suadeo

Self-data service (SDS) platform for data management

Suadeo is a self-data service (SDS) platform that helps businesses discover new perspectives and identify opportunities by accessing a collaborative environment for team members. It provides an ecosystem for informed decision-making and lets users control the entire lifecycle of data within the organization.

TIMi

Online system for developing analytical & predictive models

TIMi is a platform for developing analytical and predictive models. It consists of four tools that work together: Anatella, Modeler, StarDust, and Kibella.

DOCBrains

Cloud-based data extraction and processing software

DOCBrains is a cloud-based data extraction tool that helps businesses find specific documents, capture data for audit workflow, and recover files from electronic vaults on a unified platform.

Price Trakker

Real-time competitor price tracking and trustworthy insights

Price Trakker is cloud-based software that helps businesses manage their pricing.

Patent Monitor

Patent classification with NLP and ML technologies

Patent Monitor is a SaaS solution for classifying and filtering large numbers of patents. By using a unique combination of NLP and Machine Learning, Patent Monitor can reduce manual workloads by around 80% and can reproduce your expert's classification behavior.

FaceMRI

Face recognition, demographics and report, marketing insight

FaceMRI is a cloud-based data extraction platform that helps users boost their marketing and demographics insights. It takes images, videos, and other files and creates attendance reports using face recognition. It lets users gain insight into crowds at rallies, street events, malls, and foot traffic in the area. FaceMRI also has the world's first chart and analytics reporting for face, race, and age demographics.

elevait

AI-enabled sustainable solution

elevait is a sustainable AI solution that automates recurring business processes through a generic knowledge base using artificial intelligence technology. The software processes documents, manages incoming data, and digitizes 2D plan documents.

SL Professional

Solution for in-depth data-driven investigations

SL Professional is an all-in-one OSINT solution for conducting in-depth investigations across social media, blockchains, messengers, and the Dark Web. It provides access to over 500 open data sources and over 1000 integrated search methods.

Gorilla ROI

Syncs Amazon data to Google Sheets in real-time.

Gorilla ROI is a cloud-based data extraction solution that helps retailers synchronize data from Amazon Seller Central into Google Sheets. It links Seller Central data to spreadsheets and updates them in real time. The connector also pulls product details, inventory, sales, fees, and more.

Datactics

Augmented Data Quality from Datactics delivers trust in data

Augmented Data Quality from Datactics provides trust in data through AI-suggested data quality rules, connectivity
