Articles → NLP → NLP Pipelines

NLP Pipelines






Purpose





NLP Pipeline Stages








MethodDescription
Tokenization splitting text into words, sub-words, or characters.
Lowercasingmaking text uniform.
Stop-word Removalremoving frequent but uninformative words (the, is, and).
Stemming / Lemmatizationreducing words to their base form.
Handling punctuation, special characters, numbers, emojis
Normalizationexpanding contractions (don'tdo not), correcting spelling






  1. Bag of Words (BoW)
  2. TF-IDF
  3. Word Embeddings (Word2Vec, GloVe, FastText)
  4. Contextual embeddings (BERT, GPT, etc.)









Posted By  -  Karan Gupta
 
Posted On  -  Monday, September 15, 2025

Query/Feedback


Your Email Id
 
Subject
 
Query/FeedbackCharacters remaining 250