Wednesday, 29 November 2017

Paper Review: A Few Useful Things to Know about Machine Learning – Pedro Domingos

One look at this paper and a reader begins to wonder about the true scope of Machine Learning. For people who have started on their Machine Learning journey, the content of this paper can prove to be quite daunting and many of the terms absolutely alien. So, it is easy for them to miss out the roadmap that Pedro Domingos has laid out for us to the Machine Learning journey.

Pedro points out that there is a lot of “folk knowledge” that is utilized behind building a good Machine Learning model. Knowledge of the different techniques is a good start towards being successful on this journey, however, it is only one of the skills that a Data Scientist needs to possess. 

As many have come to realize (any many more will), the time consumed in actually doing machine learning is very little compared to the time consumed in gathering, integrating, cleaning & pre-processing data, and in performing trial and error on feature design. The paper brings out 1) the aspects necessary to build a model, 2) common traps and pitfalls that modelers should avoid & 3) tips on how to bring out the best from a machine learning exercise.

To give a snapshot of what was covered in the paper, below is a graphical representation of the roadmap presented.