sourceforge.net

Best Open Source Linux Text Processing Software 2025

  • ️Fri May 02 2025

Browse free open source Text Processing software and projects for Linux below. Use the toggles on the left to filter open source Text Processing software by OS, license, language, programming language, and project status.

  • Comprehensive Cybersecurity to Safeguard Your Organization | SOCRadar Icon

    Comprehensive Cybersecurity to Safeguard Your Organization | SOCRadar

    See what hackers already know about your organization – and stop them from getting in.

    Protect your organization from cyber threats with SOCRadar’s cutting-edge threat intelligence. Gain 360° visibility into your digital assets, monitor the dark web, and stay ahead of hackers with real-time insights. Start for free and transform your cybersecurity today.

  • Our Free Plans just got better! | Auth0 Icon

    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

  • 1

    Scribus

    Scribus

    Powerful desktop publishing software

    Scribus is an Open Source program that brings professional page layout to Linux, BSD UNIX, Solaris, OpenIndiana, GNU/Hurd, Mac OS X, OS/2 Warp 4, eComStation, and Windows desktops with a combination of press-ready output and new approaches to page design. Underneath a modern and user-friendly interface, Scribus supports professional publishing features, such as color separations, CMYK and spot colors, ICC color management, and versatile PDF creation.

    Leader badge

    Downloads: 24,414 This Week

    Last Update: 4 hours ago

    See Project

  • 2

    Former home for the DocBook stylesheets. The development of DocBook XSL has moved to Github at https://github.com/docbook/xslt10-stylesheets.

    Leader badge

    Downloads: 25,884 This Week

    Last Update: 2018-07-13

    See Project

  • 3

    KDiff3

    KDiff3

    A graphical text difference analyzer

    This repository is no longer maintained and is kept for archival purposes. See https://invent.kde.org/sdk/kdiff3 for the newest code and https://download.kde.org/stable/kdiff3/ for release bundles. All bugs should be filed at bugs.kde.org. KDiff3 is a graphical text difference analyzer for up to 3 input files, provides character-by-character analysis and a text merge tool with integrated editor. It can also compare and merge directories. Platform-independant.

    Leader badge

    Downloads: 2,572 This Week

    Last Update: 7 days ago

    See Project

  • 4

    OmegaT - multiplatform CAT tool

    OmegaT is a free and open source multiplatform Computer Assisted Translation tool with fuzzy matching, translation memory, keyword search, glossaries, and translation leveraging into updated projects.

    Leader badge

    Downloads: 2,125 This Week

    Last Update: 6 days ago

    See Project

  • Passwordless Authentication and Passwordless Security Icon

    Passwordless Authentication and Passwordless Security

    Identity is everything. Protect it with Duo.

    It’s no secret — passwords can be a real headache, both for the people who use them and the people who manage them. Over time, we’ve created hundreds of passwords, it’s easy to lose track of them and they’re easily compromised. Fortunately, passwordless authentication is becoming a feasible reality for many businesses. Duo can help you get there.

  • 5

    XMLStarlet is a set of command line utilities (tools) to transform, query, validate, and edit XML documents and files using simple set of shell commands in similar way it is done for text files with UNIX grep, sed, awk, diff, patch, join, etc utilities.

    Leader badge

    Downloads: 964 This Week

    Last Update: 2018-02-04

    See Project

  • 6

    XML Copy Editor

    XML Copy Editor is a fast, free, validating XML editor.

    Leader badge

    Downloads: 914 This Week

    Last Update: 2022-10-08

    See Project

  • 7

    TeXworks

    TeXworks

    A simple interface for working with TeX documents

    TeXworks is a free and simple working environment for authoring TeX (LaTeX, ConTeXt and XeTeX) documents. Inspired by Dick Koch's award-winning TeXShop program for Mac OS X, it makes entry into the TeX world easier for those using desktop operating systems other than OS X. It provides an integrated, easy-to-use environment for users on other platforms particularly GNU/Linux and Windows and features a clean, simple interface accessible to casual and non-technical users.

    Downloads: 145 This Week

    Last Update: 2025-02-14

    See Project

  • 8

    Bowtie, an ultrafast, memory-efficient short read aligner for short DNA sequences (reads) from next-gen sequencers. Please cite: Langmead B, et al. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10:R25.

    Leader badge

    Downloads: 553 This Week

    Last Update: 2024-05-17

    See Project

  • 9

    Swiss File Knife

    Create zip files, extract zip files, replace text in files, search in files using expressions, stream text editor, instant command line ftp and http server, send folder via network, copy folder excluding sub folders and files, find duplicate files, run a command on all files of a folder, split and join large files, make md5 checksum lists of files, remove tab characters, convert CR/LF, list newest or biggest files of a folder, compare folders, treesize, show first or last lines of a file, find filenames fast using index files, rename many files using expressions, copy part of a file, change times of a file, set file time from filename, print colored text to terminal, convert csv to tab separated, download files from web, send http or udp requests, print tcp or udp traffic, create hexdump of files, join many text files into one, list nested .zip .tar .tar.gz .tar.bz2 archive contents. Fully portable, for USB stick, without installation. For Windows, Linux and Mac OS/X.

    Leader badge

    Downloads: 424 This Week

    Last Update: 2025-04-16

    See Project

  • Red Hat Enterprise Linux on Microsoft Azure Icon

    Red Hat Enterprise Linux on Microsoft Azure

    Deploy Red Hat Enterprise Linux on Microsoft Azure for a secure, reliable, and scalable cloud environment, fully integrated with Microsoft services.

    Red Hat Enterprise Linux (RHEL) on Microsoft Azure provides a secure, reliable, and flexible foundation for your cloud infrastructure. Red Hat Enterprise Linux on Microsoft Azure is ideal for enterprises seeking to enhance their cloud environment with seamless integration, consistent performance, and comprehensive support.

  • 10

    biber

    Biber is a sophisticated bibliography processing backend for the LaTeX biblatex package. It supports a unsurpassed feature set for automated conformance to complex bibliography style requirements such as labelling, sorting and name handling. It has comprehensive Unicode support.

    Leader badge

    Downloads: 250 This Week

    Last Update: 2024-12-15

    See Project

  • 11

    Imaginary Teleprompter

    Free teleprompter software. Built with web technologies so its easy to customize. Features include: mirroring, dual-screen support, rich text editing, image support, custom styles, and auto-save.

    Leader badge

    Downloads: 350 This Week

    Last Update: 2021-08-06

    See Project

  • 12

    jq

    jq

    Lightweight and flexible command-line JSON processor

    jq is like sed for JSON data - you can use it to slice, filter, map and transform structured data with the same ease that sed, awk, grep and friends let you play with text. jq is written in portable C, and it has zero runtime dependencies. You can download a single binary, scp it to a far away machine of the same type, and expect it to work. jq can mangle the data format that you have into the one that you want with very little effort, and the program to do so is often shorter and simpler than you'd expect. Data in jq is represented as streams of JSON values - every jq expression runs for each value in its input stream, and can produce any number of values to its output stream. jq filters run on a stream of JSON data. The input to jq is parsed as a sequence of whitespace-separated JSON values which are passed through the provided filter one at a time. The output(s) of the filter are written to standard out, again as a sequence of whitespace-separated JSON data.

    Downloads: 48 This Week

    Last Update: 2023-12-13

    See Project

  • 13

    Diffuse

    Diffuse is a graphical tool for comparing and merging text files. It can retrieve files for comparison from Bazaar, CVS, Darcs, Git, Mercurial, Monotone, RCS, Subversion, and SVK repositories.

    Leader badge

    Downloads: 233 This Week

    Last Update: 2019-01-05

    See Project

  • 14

    Utilities for general- and special-purpose documentation. Includes reStructuredText, the easy to read, easy to use, what-you-see-is-what-you-get plaintext markup language.

    Leader badge

    Downloads: 131 This Week

    Last Update: 1 day ago

    See Project

  • 15

    A Swiss Army Knife GUI application for PDF documents: combine, split, rotate, reorder (n-up, booklet), watermark, edit bookmarks/fileinfo/pagetransition, compress, encrypt, decrypt, sign, repair, edit attachments and more.

    Leader badge

    Downloads: 99 This Week

    Last Update: 2023-04-19

    See Project

  • 16

    iText®, a JAVA PDF library

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest versions of iText build on the success of previous versions and feature an improved document engine, high and low-level programming capabilities, and a more efficient modular structure. iText represents the next level for developers looking to leverage PDF in document workflows. The main project page for iText is now on GitHub, and all the latest releases, code samples, open source add-ons and tools, etc. can be found at https://github.com/itext/.

    Leader badge

    Downloads: 521 This Week

    Last Update: 2024-06-01

    See Project

  • 17

    PDF-Shuffler

    PDF-Shuffler is a small python-gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface. It is a frontend for python-pyPdf.

    Leader badge

    Downloads: 63 This Week

    Last Update: 2018-11-18

    See Project

  • 18

    Find And Replace Text command line utility. New & improved version of the well-known grep command, with advanced features such as: case-adaption of the replace string; find (& replace) in filenames, auto CVS edit. Moved to https://github.com/lionello/fart-it

    Leader badge

    Downloads: 73 This Week

    Last Update: 2019-07-05

    See Project

  • 19

    Notepad--

    Notepad--

    Notepad for Windows, Linux, and Mac platforms

    Notepad-- a text editor written in C++ that works seamlessly across Windows, Linux, and Mac platforms. Our aim is to eventually surpass Notepad++, with a particular focus on the MacOS and Chinese UOS operating system. Unlike Notepad++, our advantage lies in our cross-platform compatibility and support for various OSes. If you are using MacOS and want to find a useful text editor, please try Ndd, it won't disappoint you. My Github homepage is: https://github.com/cxasm/notepad--

    Leader badge

    Downloads: 170 This Week

    Last Update: 2025-04-07

    See Project

  • 20

    ANTLR

    ANTLR

    Parser generator to read, process, or translate structured text

    ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files. It's widely used to build languages, tools, and frameworks. From a grammar, ANTLR generates a parser that can build and walk parse trees. It’s widely used in academia and industry to build all sorts of languages, tools, and frameworks. Twitter search uses ANTLR for query parsing, with over 2 billion queries a day. The languages for Hive and Pig, the data warehouse and analysis systems for Hadoop, both use ANTLR. Lex Machina uses ANTLR for information extraction from legal texts. Oracle uses ANTLR within SQL Developer IDE and their migration tools. NetBeans IDE parses C++ with ANTLR. The HQL language in the Hibernate object-relational mapping framework is built with ANTLR.

    Downloads: 10 This Week

    Last Update: 2024-08-03

    See Project

  • 21

    XSLT syntax highlighting

    Please note that project moved to GitHub: https://github.com/xmlark/xslthl This is an implementation of syntax highlighting as an extension module for XSLT processors (Xalan, Saxon), so if you have e.g. article about programming written in DocBook, code examples can be automatically syntax highlighted during the XSLT processing phase.

    Leader badge

    Downloads: 93 This Week

    Last Update: 2020-07-16

    See Project

  • 22

    FAR - Find And Replace

    Search and replace operations on file content accross multiple files. Recursive operations within entire directory trees. FAR comes with support for regular expressions (regex) over multiple lines, automatic backup and various character encodings. Run grep like extractions to condense or rearrange sources, or perform bulk file renaming.

    Downloads: 38 This Week

    Last Update: 2020-11-20

    See Project

  • 23

    Command-line/Ant-task/embeddable text file preprocessor. Macros, flow control, expressions. Recursive directory processing. Extensible in Java to display data from any data sources (as database). Can generate complete homepages (tree of HTML-s, images, etc.)

    Leader badge

    Downloads: 69 This Week

    Last Update: 2018-09-03

    See Project

  • 24

    Pdftohtml is a tool based on the Xpdf package which translates pdf documents into html format.

    Downloads: 34 This Week

    Last Update: 2013-04-24

    See Project

  • 25

    FCKeditor

    FCKeditor is the previous version of CKEditor and has been discontinued after version 2. The new CKEditor is redesigned from the ground up, offering more WYSIWYG text editing features, enhanced security and better integration. Don’t force yourself with retro FCKeditor. Switch to the new, cool CKEditor at ckeditor.com

    Downloads: 27 This Week

    Last Update: 2017-10-24

    See Project