Evaluating the performance of a machine learning model is an important step in the model development process, as it allows us to assess how well the model is able to make predictions on new data. This can be done by …
Machine Learning – Random Forest
Random forests are a powerful machine learning algorithm that can be used for both classification and regression tasks. They are an ensemble learning method, which means they use multiple decision trees to make predictions, and combine the results to improve …
Machine Learning – Model Evaluation
Model evaluation is the process of assessing the performance of a model on a dataset. This is typically done by splitting the original dataset into training and testing sets and using the testing set to evaluate the model’s performance.
The …
Machine Learning – Model Building
Machine learning is a type of artificial intelligence that allows computer programs to learn from data and improve their performance on a specific task without being explicitly programmed. Building a machine learning model involves selecting a model type, training the …
ML – Customer Segmentation
Dividing customers into groups based on similar functionality or customer segmentation is based on the problem of clustering which means finding clusters in a dataset with the same features.
Customer segmentation can help a business focus on marketing strategies to …
NLP – Bag of Words
Bag of Words
A bag of words is a particular representation model used to simplify the contents of a selection of text. The bag of words model omits grammar and word order but is interested in the number of occurrences …
NLP – Word segmentation
Word segmentation
This is the act of taking a string of text and deriving word forms from it. Example: A person scans a handwritten document into a computer. The algorithm would be able to analyze the page and recognize that …
What is Chi-Square Test ?
In the field of statistics Occasionally we heard about categorical data, tests to validate the results, and different approaches to knowing about data. today we are here to learn about one of the statistical tests – the Chi-Square Test.
As …
NLP – N-grams
N-grams
A combination of multiple words together is called N-Grams. N-grams (N > 1) are generally more informative as compared to words and can be used as features for language modeling…
NLP – Tokenization
Tokenization
It’s the process of breaking down the text into sentences and phrases. The work entails breaking down a text into smaller chunks (known as tokens) while discarding some characters, such as punctuation.
Consider the following example:
Text input: Potter …
- « Previous Page
- 1
- 2
- 3
- 4
- …
- 7
- Next Page »