How To Install tesseract-ocr on Kali Linux
Introduction
In this tutorial we learn how to install tesseract-ocr on Kali Linux.
What is tesseract-ocr
tesseract-ocr is:
Tesseract is an open source Optical Character Recognition (OCR) Engine. It can be used directly, or (for programmers) using an API to extract printed text from images. It supports a wide variety of languages. This package includes the command line tool.
There are three methods to install tesseract-ocr on Kali Linux. We can use apt-get, apt and aptitude. In the following sections we will describe each method. You can choose one of them.
Install tesseract-ocr Using apt-get
Update apt database with apt-get using the following command.
sudo apt-get updateAfter updating apt database, We can install tesseract-ocr using apt-get by running the following command:
sudo apt-get -y install tesseract-ocrInstall tesseract-ocr Using apt
Update apt database with apt using the following command.
sudo apt updateAfter updating apt database, We can install tesseract-ocr using apt by running the following command:
sudo apt -y install tesseract-ocrInstall tesseract-ocr Using aptitude
If you want to follow this method, you might need to install aptitude on Kali Linux first since aptitude is usually not installed by default on Kali Linux. Update apt database with aptitude using the following command.
sudo aptitude updateAfter updating apt database, We can install tesseract-ocr using aptitude by running the following command:
sudo aptitude -y install tesseract-ocrHow To Uninstall tesseract-ocr on Kali Linux
To uninstall only the tesseract-ocr package we can use the following command:
sudo apt-get remove tesseract-ocrUninstall tesseract-ocr And Its Dependencies
To uninstall tesseract-ocr and its dependencies that are no longer needed by Kali Linux, we can use the command below:
sudo apt-get -y autoremove tesseract-ocrRemove tesseract-ocr Configurations and Data
To remove tesseract-ocr configuration and data from Kali Linux we can use the following command:
sudo apt-get -y purge tesseract-ocrRemove tesseract-ocr configuration, data, and all of its dependencies
We can use the following command to remove tesseract-ocr configurations, data and all of its dependencies, we can use the following command:
sudo apt-get -y autoremove --purge tesseract-ocrDependencies
tesseract-ocr have the following dependencies:
- libarchive13
- libc6
- libcairo2
- libfontconfig1
- libgcc-s1
- libglib2.0-0
- libicu67
- liblept5
- libpango-1.0-0
- libpangocairo-1.0-0
- libpangoft2-1.0-0
- libstdc++6
- libtesseract4
- tesseract-ocr-eng
- tesseract-ocr-osd
References
Summary
In this tutorial we learn how to install tesseract-ocr package on Kali Linux using different package management tools: apt, apt-get and aptitude.