Which Machine learning classifier should I use?
1 view (last 30 days)
Show older comments
Katrina Anderson
on 19 Mar 2019
Commented: Katrina Anderson
on 29 Mar 2019
i am training an algorithm to differientiate between two behavioural variables.
I had 100% accuracy for ensemble bagged trees, and 97.6% for SVM cubic and 97.6% for KNN.
this will be used in unknown data sets. do i go with the most accurate? or are certain classifiers better for different situations.
thanks!
0 Comments
Accepted Answer
Abhishek Singh
on 28 Mar 2019
For your further query you can look into these links:
In short:
Boosting - often effective when a large amount of training data is available.
Random trees - often very effective and can also perform regression.
K-nearest neighbors - simplest thing you can do, often effective but slow and requires lots of memory.
Neural networks - Slow to train but very fast to run, still optimal performer for letter recognition.
SVM - Among the best with limited data, but losing against boosting or random trees only when large data sets are available.
Hope this helps,
More Answers (1)
Abhishek Singh
on 26 Mar 2019
Behavioral variables are just numbers to differentiate between customer or user behaviour
The division of a market into groups according to their knowledge of, and behaviour towards a specific product. Behavioural dimensions commonly used to segment markets include benefits sought, user status, usage rate, loyalty status and buyer readiness stage. Also called Behaviouristic Segmentation.
Certain classifiers are indeed better for different situations.
It depends upon what you want to use the classification for. Since you have not explained your use case, it will be unwise to suggest a classifier based on the accuracy alone. That is unless accuracy is what you care most about.
For example, if you care more about interpretability other classifiers are probably a better choice since they are quite close in terms of accuracy.
Again, it is impossible for anyone to advice you on what classifier to choose without any knowledge of the situation in which it is going to be used. These decisions are better left to data scientists who have in-depth knowledge not only in the business use case but also about the trade-offs between different classifiers.
2 Comments
See Also
Categories
Find more on Statistics and Machine Learning Toolbox in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!