How To Install tesseract-ocr on Ubuntu 20.04

In this tutorial we learn how to install tesseract-ocr on Ubuntu 20.04. tesseract-ocr is Tesseract command line OCR tool

Introduction

In this tutorial we learn how to install tesseract-ocr on Ubuntu 20.04.

What is tesseract-ocr

tesseract-ocr is:

Tesseract is an open source Optical Character Recognition (OCR) Engine. It can be used directly, or (for programmers) using an API to extract printed text from images. It supports a wide variety of languages. This package includes the command line tool.

There are three methods to install tesseract-ocr on Ubuntu 20.04. We can use apt-get, apt and aptitude. In the following sections we will describe each method. You can choose one of them.

Install tesseract-ocr Using apt-get

Update apt database with apt-get using the following command.

sudo apt-get update

After updating apt database, We can install tesseract-ocr using apt-get by running the following command:

sudo apt-get -y install tesseract-ocr

Install tesseract-ocr Using apt

Update apt database with apt using the following command.

sudo apt update

After updating apt database, We can install tesseract-ocr using apt by running the following command:

sudo apt -y install tesseract-ocr

Install tesseract-ocr Using aptitude

If you want to follow this method, you might need to install aptitude first since aptitude is usually not installed by default on Ubuntu. Update apt database with aptitude using the following command.

sudo aptitude update

After updating apt database, We can install tesseract-ocr using aptitude by running the following command:

sudo aptitude -y install tesseract-ocr

How To Uninstall tesseract-ocr on Ubuntu 20.04

To uninstall only the tesseract-ocr package we can use the following command:

sudo apt-get remove tesseract-ocr

Uninstall tesseract-ocr And Its Dependencies

To uninstall tesseract-ocr and its dependencies that are no longer needed by Ubuntu 20.04, we can use the command below:

sudo apt-get -y autoremove tesseract-ocr

Remove tesseract-ocr Configurations and Data

To remove tesseract-ocr configuration and data from Ubuntu 20.04 we can use the following command:

sudo apt-get -y purge tesseract-ocr

Remove tesseract-ocr configuration, data, and all of its dependencies

We can use the following command to remove tesseract-ocr configurations, data and all of its dependencies, we can use the following command:

sudo apt-get -y autoremove --purge tesseract-ocr

References

Summary

In this tutorial we learn how to install tesseract-ocr package on Ubuntu 20.04 using different package management tools: apt, apt-get and aptitude.