A Course in Machine Learning by Hal Daumé III.
From the webpage:
Machine learning is the study of algorithms that learn from data and experience. It is applied in a vast variety of application areas, from medicine to advertising, from military to pedestrian. Any area in which you need to make sense of data is a potential consumer of machine learning.
CIML is a set of introductory materials that covers most major aspects of modern machine learning (supervised learning, unsupervised learning, large margin methods, probabilistic modeling, learning theory, etc.). It’s focus is on broad applications with a rigorous backbone. A subset can be used for an undergraduate course; a graduate course could probably cover the entire material and then some.
You may obtain the written materials by purchasing a ($55) print copy, by the entire book, or by downloading individual chapters below. If you find the electronic version of the book useful and would like to donate a small amount to support further development, that’s always appreciated! The current version is 0.9 (the “beta” pre-release).
Have you noticed that the quality of materials on the Internet is increasing. At least in some domains?
If you want to look at individual chapters:
- Front Matter
- Decision Trees
- Geometry and Nearest Neighbors
- The Perceptron
- Machine Learning in Practice
- Beyond Binary Classification
- Linear Models
- Probabilistic Modeling
- Neural Networks
- Kernel Methods
- Learning Theory
- Ensemble Methods
- Efficient Learning
- Unsupervised Learning
- Expectation Maximization
- Semi-Supervised Learning
- Graphical Models
- Online Learning
- Structured Learning
- Bayesian Learning
- Back Matter
Code and datasets said to be coming soon.
I first saw this at: A Course in Machine Learning (free book).