Статьи
Published 2022-11-12
Keywords
- NLP,
- artificial intelligence,
- tokenization,
- stemming,
- lemmatization
- categorization,
- NLTK,
- tagging ...More
How to Cite
Xusainova, Z. (2022). NLP: TOKENIZATION, STEMMING, LEMMATIZATION AND PARTS OF SPEECH TAGGING. Prospects of Uzbek Applied Philology, 1(1). Retrieved from http://compling.navoiy-uni.uz/index.php/conferences3/article/view/272
Abstract
Natural Language Processing (NLP) is one of the fastest growing
fields in the world. Natural language processing (NLP) is a branch of artificial
intelligence that uses computers to intelligently analyze and understand human
language data. A sequenced set of steps to extract the necessary information from
the given data is called a pipeline process in NLP. In this article, we will consider
the processes of tokenization, stemming, lemmatization, and segmentation of
words, which are the stages of initial processing (pipeline process) of text in NLP.
Also, we describe the practical application of these processes using Python
language tools.