Categories: News

iText launches iText pdfOCR, a powerful open-source product enabling text recognition in scanned documents and conversion into editable PDFs

LONDON, UK – Media
OutReach – 30 June 2020 – iText Group NV, a globally recognized thought-leader and
innovator in PDF libraries and solutions, today announced the launch of iText
pdfOCR, the newest addition to their award-winning software offering. 

 

iText
pdfOCR, which is part of the renowned iText 7 PDF SDK, offers Optical
Character Recognition (OCR) functionality to convert printed text in scanned
documents and images into a fully searchable PDF/A-3u compliant format (PDF
version 1.7) and make accessing those texts easier and faster. Without
machine-readable text, printed or scanned documents cannot be searched, indexed
or interpreted. Logical follow-up actions could be data extraction with iText pdf2Data, secure content
redaction with iText pdfSweep, or multilingual
document recreation with iText pdfCalligraph. With repurposing data
with the low-code document generator iText DITO® often being the final
cherry on the cake.

The iText
pdfOCR add-on is built on the Tesseract OCR engine technology. Tesseract
supports over 100 languages and was originally developed by Hewlett-Packard
(’85), and was released under the Apache open source license in 2005. Since
2006, its development has been sponsored by Google.

 

“With COVID-19
urging companies to accelerate their digital transformation projects,
organizations are forced to explore new ways of accessing and managing their
data — existing and new. Being a leader in the digital documents space, we’re
pleased to be at the forefront of this new era. As such, I am very proud to
announce the latest addition to our PDF library for today’s new world: thanks
to the OCR capabilities of iText pdfOCR many new opportunities
will open up for users and enterprises that want to maximize their data
potential.” Yeonsu Kim, CEO at iText Group NV stated.

 

“Staying true
to our open-source roots, we’ve decided to build iText
pdfOCR upon the open-source Tesseract OCR Engine. With this, we wish
to reconfirm our positioning as an open-source company – a value which is
appreciated by our millions of users and clients.”

 

“With this new
addition to our PDF library, developers will now be able to leverage data
locked away in documents which until now weren’t accessible. Our latest product
enables them to enlarge their digital workflow capabilities by accessing the
data buried in scanned files and deploy it for any action or purpose they or
their end-user would like.” Tony Van den Zegel, VP of Products & Marketing
at iText Group NV and General Manager at iText Software Belgium, said.

 

The applications
of iText
pdfOCR are various: for instance, archiving of historical documents,
translations of legal documents, automatic data entry while processing all
sorts of physical applications or claims, and sorting of otherwise not editable
printed or scanned documents.

 

Please tune in
for live demos on 9 July 2020. More information can be found here.

Miscw.com

Recent Posts

SEAX Global Acquires Major Stake in Interlink Telecom to Accelerate ASEAN Expansion

Acquisition creates a fully integrated regional connectivity platform, enabling seamless, low-latency connectivity that supports ASEAN’s…

1 day ago

M7 World Championship unveils first Opening Ceremony, tournament draw, and M7 Pass—elevates Indonesia on global stage

The Swiss Stage sets up early mouthwatering clashes between tournament favourites—Team Liquid face Aurora Gaming…

1 day ago

A First in the Asia-Pacific – Taiwan Leads the Launch of the Early Kidney Disease Annual Report, Opening a New Era in Advancing Kidney Care

TAIPEI, TAIWAN - Media OutReach Newswire - 11 December 2025 - Taiwan's chronic disease care…

2 days ago

The Government of Angola Launches an International Public Tender for the Management of the Namibe Corridor

LUANDA, ANGOLA - EQS Newswire - 12 December 2025 - The Ministry of Transport of…

2 days ago

VinFast VF 8 – Crafted for high-expectation markets in the GCC

Premium design meets industry-leading warranty and long-term supportDUBAI, UAE - Media OutReach Newswire - 12…

2 days ago

The 7th Hainan Island International Film Festival Concludes Successfully, With ‘Black Rabbit, White Rabbit’ Receiving the Top Honor

SANYA, CHINA - Media OutReach Newswire - 12 December 2025 - The 7th Hainan Island…

2 days ago