How To Install tesseract-ocr-jpn on Ubuntu 18.04

In this tutorial we learn how to install tesseract-ocr-jpn on Ubuntu 18.04. tesseract-ocr-jpn provides the Tesseract OCR language files for Japanese.

What is tesseract-ocr-jpn

tesseract-ocr-jpn is:

Tesseract is an open source Optical Character Recognition (OCR) engine. It can be used directly from the command line or, for programmers, through an API to extract printed text from images. This package contains the data needed to process images containing Japanese text.
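As a rough illustration of using the engine directly, the sketch below wraps a command-line call that selects the Japanese data with the jpn language code. The function name ocr_japanese and the file name scan.png are placeholders chosen for this example, and the sketch assumes the tesseract binary itself is already installed.

```shell
# Hypothetical helper: run OCR on one image using the Japanese language data.
# "jpn" is the language code supplied by the tesseract-ocr-jpn package.
ocr_japanese() {
    image="$1"
    if [ ! -f "$image" ]; then
        echo "no such image: $image" >&2
        return 1
    fi
    if ! command -v tesseract >/dev/null 2>&1; then
        echo "tesseract is not installed" >&2
        return 127
    fi
    # Using "stdout" as the output base prints the recognized text
    # to the terminal instead of writing a .txt file.
    tesseract "$image" stdout -l jpn
}

# Example call (scan.png is a placeholder file name):
# ocr_japanese scan.png
```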

There are three ways to install tesseract-ocr-jpn on Ubuntu 18.04: apt-get, apt, and aptitude. The following sections describe each method; choose whichever you prefer.

Install tesseract-ocr-jpn Using apt-get

Update the apt database with apt-get using the following command:

sudo apt-get update

After updating the apt database, we can install tesseract-ocr-jpn using apt-get by running the following command:

sudo apt-get -y install tesseract-ocr-jpn
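After the install command finishes, it can be useful to confirm that the package is actually present. The following is a minimal sketch that queries the dpkg database; the helper name pkg_installed is made up for this example.

```shell
# Hypothetical helper: succeed only if dpkg reports the package as fully installed.
pkg_installed() {
    status=$(dpkg-query -W -f='${Status}' "$1" 2>/dev/null) || return 1
    [ "$status" = "install ok installed" ]
}

if pkg_installed tesseract-ocr-jpn; then
    echo "tesseract-ocr-jpn is installed"
else
    echo "tesseract-ocr-jpn is not installed"
fi

# If the tesseract binary is present, "jpn" should also be listed by:
# tesseract --list-langs
```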

Install tesseract-ocr-jpn Using apt

Update the apt database with apt using the following command:

sudo apt update

After updating the apt database, we can install tesseract-ocr-jpn using apt by running the following command:

sudo apt -y install tesseract-ocr-jpn
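To see which language data files the package placed on the system, one option is to list the package contents with dpkg -L and keep only the .traineddata entries. This is a sketch; the helper name list_traineddata is an assumption for illustration, and the exact install path depends on the Tesseract version shipped with the release.

```shell
# Hypothetical helper: print the .traineddata files owned by a package.
list_traineddata() {
    dpkg -L "$1" 2>/dev/null | grep '\.traineddata$'
}

# Prints the matching paths, or a hint if nothing was found.
list_traineddata tesseract-ocr-jpn ||
    echo "no traineddata files found (is the package installed?)"
```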

Install tesseract-ocr-jpn Using aptitude

If you want to follow this method, you might need to install aptitude first, since aptitude is usually not installed by default on Ubuntu. Update the apt database with aptitude using the following command:

sudo aptitude update

After updating the apt database, we can install tesseract-ocr-jpn using aptitude by running the following command:

sudo aptitude -y install tesseract-ocr-jpn
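Because aptitude may be missing on a default Ubuntu system, a quick check before running the commands in this section can save a failed attempt. The sketch below only reports what it finds and suggests an install command; the helper name aptitude_hint is hypothetical.

```shell
# Hypothetical helper: report whether aptitude is available on this system.
aptitude_hint() {
    if command -v aptitude >/dev/null 2>&1; then
        echo "aptitude is available"
    else
        echo "aptitude not found; install it with: sudo apt-get install aptitude"
    fi
}

aptitude_hint
```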

How To Uninstall tesseract-ocr-jpn on Ubuntu 18.04

To uninstall only the tesseract-ocr-jpn package we can use the following command:

sudo apt-get remove tesseract-ocr-jpn

Uninstall tesseract-ocr-jpn And Its Dependencies

To uninstall tesseract-ocr-jpn together with any dependencies that are no longer needed by the system, we can use the command below:

sudo apt-get -y autoremove tesseract-ocr-jpn
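Since autoremove can take other packages with it, it may be worth previewing the result first. The sketch below uses apt-get's -s (simulate) option, which prints the planned actions without changing the system; the helper name preview_removal is an assumption for this example.

```shell
# Hypothetical helper: show what autoremove would do without doing it.
# -s (--simulate) performs a dry run, so sudo is not required.
preview_removal() {
    if ! command -v apt-get >/dev/null 2>&1; then
        echo "apt-get not found" >&2
        return 127
    fi
    apt-get -s autoremove "$1"
}

# Example call:
# preview_removal tesseract-ocr-jpn
```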

Remove tesseract-ocr-jpn Configurations and Data

To remove tesseract-ocr-jpn configuration and data from Ubuntu 18.04 we can use the following command:

sudo apt-get -y purge tesseract-ocr-jpn

Remove tesseract-ocr-jpn Configuration, Data, and All of Its Dependencies

To remove tesseract-ocr-jpn configuration, data, and all of its dependencies that are no longer needed, we can use the following command:

sudo apt-get -y autoremove --purge tesseract-ocr-jpn
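To check what state the package is left in after any of the removal commands, the dpkg database can be queried for its abbreviated status. This is a sketch under the assumption that dpkg-query supports the db:Status-Abbrev field (available in the dpkg versions shipped with recent Ubuntu releases); the helper name pkg_state is hypothetical.

```shell
# Hypothetical helper: print dpkg's abbreviated status for a package.
#   "ii"      installed
#   "rc"      removed, but configuration files remain
#   "unknown" dpkg has no record of the package (for example after a purge)
pkg_state() {
    dpkg-query -W -f='${db:Status-Abbrev}' "$1" 2>/dev/null || echo "unknown"
}

pkg_state tesseract-ocr-jpn
```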

Summary

In this tutorial we learned how to install the tesseract-ocr-jpn package on Ubuntu 18.04 using three package management tools: apt-get, apt, and aptitude. We also covered how to uninstall it.