Vol. 1 No. 1 (2022): APPLIED UZBEK PHILOLOGY PROSPECTS
Статьи

NLP: TOKENIZATION, STEMMING, LEMMATIZATION AND PARTS OF SPEECH TAGGING

Zilola Xusainova
Toshkent davlat o‘zbek tili va adabiyoti universiteti
Bio

Published 2022-11-12

Keywords

  • NLP,
  • artificial intelligence,
  • tokenization,
  • stemming,
  • lemmatization,
  • categorization,
  • NLTK,
  • tagging
  • ...More
    Less

How to Cite

Xusainova, Z. (2022). NLP: TOKENIZATION, STEMMING, LEMMATIZATION AND PARTS OF SPEECH TAGGING. Prospects of Uzbek Applied Philology, 1(1). Retrieved from http://compling.navoiy-uni.uz/index.php/conferences3/article/view/272

Abstract

Natural Language Processing (NLP) is one of the fastest growing
fields in the world. Natural language processing (NLP) is a branch of artificial
intelligence that uses computers to intelligently analyze and understand human
language data. A sequenced set of steps to extract the necessary information from
the given data is called a pipeline process in NLP. In this article, we will consider
the processes of tokenization, stemming, lemmatization, and segmentation of
words, which are the stages of initial processing (pipeline process) of text in NLP.
Also, we describe the practical application of these processes using Python
language tools.