How to Install and Uninstall ocrmypdf Package on Ubuntu 21.10 (Impish Indri)

Last updated: May 03,2024

1. Install "ocrmypdf" package

Please follow the step by step instructions below to install ocrmypdf on Ubuntu 21.10 (Impish Indri)

$ sudo apt update $ sudo apt install ocrmypdf

2. Uninstall "ocrmypdf" package

In this section, we are going to explain the necessary steps to uninstall ocrmypdf on Ubuntu 21.10 (Impish Indri):

$ sudo apt remove ocrmypdf $ sudo apt autoclean && sudo apt autoremove

3. Information about the ocrmypdf package on Ubuntu 21.10 (Impish Indri)

Package: ocrmypdf
Architecture: all
Version: 10.3.1+dfsg-1
Priority: optional
Section: universe/graphics
Origin: Ubuntu
Maintainer: Ubuntu Developers
Original-Maintainer: Sean Whitton
Bugs: https://bugs.launchpad.net/ubuntu/+filebug
Installed-Size: 558
Depends: ghostscript (>= 9.18~dfsg~), icc-profiles-free, liblept5, python3-pdfminer (>= 20181108+dfsg-3), python3-pil, python3-pkg-resources, python3-reportlab, python3-pluggy, python3-coloredlogs, tesseract-ocr (>= 4.0.0), zlib1g, python3-cffi-backend-api-min (<= 9729), python3-cffi-backend-api-max (>= 9729), python3-img2pdf (>= 0.3.0), python3-pikepdf (>= 1.7.0), python3-tqdm, python3:any
Recommends: unpaper, pngquant
Suggests: ocrmypdf-doc, python-watchdog, img2pdf
Filename: pool/universe/o/ocrmypdf/ocrmypdf_10.3.1+dfsg-1_all.deb
Size: 113336
MD5sum: ccc496df49d46b339dc9b4c0d764f4a2
SHA1: 919818469b6b36ac9df09698206140ecf84d1d2f
SHA256: 43b5f949c262a97d471fea44a29c927b74591570be9f9d478c728b5ccecc833f
SHA512: 04f20034c26dc6fcae0b1bdaa0b6c2737625bf827c41fca6b3873da466c9a904e21ce6851b3b1dbecd44af04cc34ad6453e55235c23bf0e95b44f1ab4a569b38
Homepage: https://github.com/jbarlow83/OCRmyPDF
Description-en: add an OCR text layer to PDF files
OCRmyPDF generates a searchable PDF/A file from a regular PDF
containing only images, allowing it to be searched.
.
It uses the Tesseract OCR engine and so supports all the languages
that Tesseract does.
.
Some other main features:
.
* Places OCR text accurately below the image to ease copy / paste
* Keeps the exact resolution of the original embedded images
* When possible, inserts OCR information as a lossless operation
without rendering vector information
* Keeps file size about the same
* If requested deskews and/or cleans the image before performing OCR
* Validates input and output files
* Provides debug mode to enable easy verification of the OCR results
* Processes pages in parallel when more than one CPU core is
available
* Battle-tested on thousands of PDFs, a test suite and continuous
integration.
Description-md5: 92e84e27a8b71a2a3c36765dc4aab039