Articles → NLP → NLP Pipelines
NLP Pipelines
Purpose
NLP Pipeline Stages
Method | Description |
---|
Tokenization | splitting text into words, sub-words, or characters. |
Lowercasing | making text uniform. |
Stop-word Removal | removing frequent but uninformative words (the, is, and). |
Stemming / Lemmatization | reducing words to their base form. |
Handling punctuation, special characters, numbers, emojis | |
Normalization | expanding contractions (don't → do not), correcting spelling |
- Bag of Words (BoW)
- TF-IDF
- Word Embeddings (Word2Vec, GloVe, FastText)
- Contextual embeddings (BERT, GPT, etc.)
Posted By - | Karan Gupta |
|
Posted On - | Monday, September 15, 2025 |