How To Install ucto on Ubuntu 18.04

In this tutorial we learn how to install ucto on Ubuntu 18.04. ucto is Unicode Tokenizer

Introduction

In this tutorial we learn how to install ucto on Ubuntu 18.04.

What is ucto

ucto is:

Ucto can tokenize UTF-8 encoded text files (i.e. separate words from punctuation, split sentences, generate n-grams), and offers several other basic preprocessing steps that make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation.

This package provides the command-line tool itself.

Ucto was written by Maarten van Gompel and Ko van der Sloot. Work on Ucto was funded by NWO, the Netherlands Organisation for Scientific Research, under the Implicit Linguistics project, the CLARIN-NL program, and the CLARIAH project.

Ucto is a product of the Centre of Language and Speech Technology (Radboud University Nijmegen), and previously the ILK Research Group (Tilburg University, The Netherlands).

If you are interested in machine parsing of UTF-8 encoded text files, e.g. to do scientific research in natural language processing, ucto will likely be of use to you.

There are three methods to install ucto on Ubuntu 18.04. We can use apt-get, apt and aptitude. In the following sections we will describe each method. You can choose one of them.

Install ucto Using apt-get

Update apt database with apt-get using the following command.

sudo apt-get update

After updating apt database, We can install ucto using apt-get by running the following command:

sudo apt-get -y install ucto

Install ucto Using apt

Update apt database with apt using the following command.

sudo apt update

After updating apt database, We can install ucto using apt by running the following command:

sudo apt -y install ucto

Install ucto Using aptitude

If you want to follow this method, you might need to install aptitude first since aptitude is usually not installed by default on Ubuntu. Update apt database with aptitude using the following command.

sudo aptitude update

After updating apt database, We can install ucto using aptitude by running the following command:

sudo aptitude -y install ucto

How To Uninstall ucto on Ubuntu 18.04

To uninstall only the ucto package we can use the following command:

sudo apt-get remove ucto

Uninstall ucto And Its Dependencies

To uninstall ucto and its dependencies that are no longer needed by Ubuntu 18.04, we can use the command below:

sudo apt-get -y autoremove ucto

Remove ucto Configurations and Data

To remove ucto configuration and data from Ubuntu 18.04 we can use the following command:

sudo apt-get -y purge ucto

Remove ucto configuration, data, and all of its dependencies

We can use the following command to remove ucto configurations, data and all of its dependencies, we can use the following command:

sudo apt-get -y autoremove --purge ucto

References

Summary

In this tutorial we learn how to install ucto package on Ubuntu 18.04 using different package management tools: apt, apt-get and aptitude.