Articles → NLP → One-Hot Encoding Using NLTK

One-Hot Encoding Using NLTK






Code


from sklearn.feature_extraction.text import CountVectorizer# Example documentsdocuments = [    "I like NLP",    "I like machine learning",    "NLP is fun"]# Initialize CountVectorizer with binary=Truevectorizer = CountVectorizer(    lowercase=True,    stop_words='english',    binary=True      # This makes it One-Hot Encoding)# Convert documents to one-hot matrixX = vectorizer.fit_transform(documents)# Vocabularyprint("Vocabulary:", vectorizer.get_feature_names_out())# One-hot encoded matrixprint("\nOne-Hot Encoding Matrix:")print(X.toarray())



Output


Picture showing the output of implementing one-hot encoding in nltk





Posted By  -  Karan Gupta
 
Posted On  -  Tuesday, March 3, 2026

Query/Feedback


Your Email Id
 
Subject
 
Query/FeedbackCharacters remaining 250