Short Notes
ML – Customer Segmentation
Dividing customers into groups based on similar functionality or customer segmentation is based on the problem of clustering which means…
NLP – Bag of Words
Bag of Words A bag of words is a particular representation model used to simplify the contents of a selection…
NLP – Word segmentation
Word segmentation This is the act of taking a string of text and deriving word forms from it. Example: A…
NLP – N-grams
N-grams A combination of multiple words together is called N-Grams. N-grams (N > 1) are generally more informative as compared…
NLP – Tokenization
Tokenization It's the process of breaking down the text into sentences and phrases. The work entails breaking down a text…
NLP – Corpus
Corpus In linguistics and NLP, corpus (literally Latin for body) refers to a collection of texts. Such collections may be formed…
Python – Functions
Functions A function is a rule for taking zero or more inputs and returning a corresponding output. In Python, we…
Linear Regression – Cons[2]
Prone to outliers Linear regression is very sensitive to outliers (anomalies). So, outliers should be analysed and removed before applying…
Linear Regression – Cons[1]
Only Linear Problems Ordinary Least Squares won't work well with non-linear data. If you are not sure about the linearity…