What is Text Preprocessing?

Text is a kind of unstructured data. Before doing any NLP modeling, we have to crunch the raw text into a usable form. This is called text preprocessing. Its aim is to turn raw text into a clean, consistent input, so that later steps can extract interesting, non-trivial knowledge from it and retrieve what satisfies a user’s need for information.

My own preferred way to preprocess text covers four general steps: noise removal, text cleansing, text normalization, and tokenization.
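To make the four steps concrete, here is a minimal sketch using only the Python standard library. The function names (remove_noise, cleanse, normalize, tokenize, preprocess) are my own illustrative choices, and what counts as "noise" here (HTML tags, HTML entities, URLs) is an assumption; the right rules depend on your data and your task.

```python
import html
import re
import unicodedata


def remove_noise(text):
    """Noise removal (assumed here to mean HTML tags, entities and URLs)."""
    text = re.sub(r"<[^>]+>", " ", text)       # strip HTML tags
    text = html.unescape(text)                 # decode entities such as &amp;
    text = re.sub(r"https?://\S+", " ", text)  # strip URLs
    return text


def cleanse(text):
    """Text cleansing: drop punctuation and symbols, collapse whitespace."""
    text = re.sub(r"[^\w\s']", " ", text)      # keep word chars, spaces, apostrophes
    return re.sub(r"\s+", " ", text).strip()


def normalize(text):
    """Text normalization: unify Unicode forms and lowercase."""
    return unicodedata.normalize("NFKC", text).lower()


def tokenize(text):
    """Tokenization: a simple whitespace split (a real tokenizer does more)."""
    return text.split()


def preprocess(text):
    """Run the four steps in order and return a list of tokens."""
    return tokenize(normalize(cleanse(remove_noise(text))))


if __name__ == "__main__":
    sample = "<p>Hello, World! Visit https://example.com &amp; enjoy.</p>"
    print(preprocess(sample))  # ['hello', 'world', 'visit', 'enjoy']
```

The order matters in this sketch: noise such as URLs is removed before punctuation is stripped, otherwise a URL would be broken into fragments that survive as tokens. For real projects a library tokenizer is usually a better choice than a whitespace split.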

#text #preprocessing

