NLP Pipelines
Purpose
NLP Pipeline Stages
| Method | Description |
|---|---|
| Tokenization | splitting text into words, sub-words, or characters. |
| Lowercasing | making text uniform. |
| Stop-word Removal | removing frequent but uninformative words (the, is, and). |
| Stemming / Lemmatization | reducing words to their base form. |
| Cleaning | handling punctuation, special characters, numbers, and emojis. |
| Normalization | expanding contractions (don't → do not) and correcting spelling. |
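
The stages in the table above can be chained into a single function. The following is a minimal sketch using only the Python standard library, not a production pipeline. The contraction map, the stop-word set, and the suffix list are tiny hypothetical examples, and `crude_stem` is a simplified stand-in for a real stemmer or lemmatizer. Spelling correction is left out.

```python
import re

# Tiny illustrative resources (assumptions); real pipelines use fuller lexicons.
CONTRACTIONS = {"don't": "do not", "can't": "cannot", "it's": "it is"}
STOP_WORDS = {"the", "is", "and", "a", "an", "of", "to"}
SUFFIXES = ("ing", "ed", "s")  # crude suffix stripping, not a real stemmer


def normalize(text):
    """Lowercase the text and expand the known contractions."""
    text = text.lower().replace("\u2019", "'")  # unify curly apostrophes
    for short, full in CONTRACTIONS.items():
        text = text.replace(short, full)
    return text


def tokenize(text):
    """Split into word tokens; punctuation, digits, and emojis are dropped."""
    return re.findall(r"[a-z]+", text)


def remove_stop_words(tokens):
    """Drop frequent but uninformative words."""
    return [t for t in tokens if t not in STOP_WORDS]


def crude_stem(token):
    """Strip one common suffix if at least three characters remain."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Run normalization, tokenization, stop-word removal, and stemming."""
    tokens = remove_stop_words(tokenize(normalize(text)))
    return [crude_stem(t) for t in tokens]


print(preprocess("The cats don't like jumping, and it's raining!!! 123"))
```

Here normalization runs before tokenization so that contractions are expanded while the apostrophe is still present. Whether to discard digits and emojis or map them to placeholder tokens depends on the task; this sketch simply discards them.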
- Bag of Words (BoW)
- TF-IDF
- Word Embeddings (Word2Vec, GloVe, FastText)
- Contextual embeddings (BERT, GPT, etc.)
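
The first two representations in this list can be computed by hand, which helps show what they encode. The sketch below assumes documents are already tokenized into lists of strings. It uses the plain `log(N / df)` form of inverse document frequency, which is one of several common variants; libraries often apply smoothing, so their numbers may differ. The function names `bag_of_words` and `tf_idf` are illustrative.

```python
import math
from collections import Counter


def bag_of_words(docs):
    """Return a sorted vocabulary and one term-count vector per document."""
    vocab = sorted({tok for doc in docs for tok in doc})
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append([counts[term] for term in vocab])
    return vocab, vectors


def tf_idf(docs):
    """Weight term frequency by log(N / df), the unsmoothed IDF variant."""
    vocab, counts = bag_of_words(docs)
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    df = [sum(1 for row in counts if row[j] > 0) for j in range(len(vocab))]
    idf = [math.log(n_docs / d) for d in df]
    weighted = []
    for row, doc in zip(counts, docs):
        total = len(doc) or 1  # avoid division by zero for empty documents
        weighted.append([(c / total) * w for c, w in zip(row, idf)])
    return vocab, weighted


docs = [["cat", "sat", "mat"], ["cat", "ate", "fish"]]
vocab, bow = bag_of_words(docs)
_, weights = tf_idf(docs)
print(vocab)
print(bow)
print(weights)
```

In this toy corpus "cat" appears in both documents, so its TF-IDF weight is zero, while terms unique to one document keep a positive weight. Word embeddings and contextual embeddings are not sketched here because they depend on trained models rather than counting.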
Posted by: Karan Gupta
Posted on: Monday, September 15, 2025