paulgorman.org/technical

My Month with Artificial Intelligence

How AI Works: Chapter 1

I started reading Ronald T. Kneusel’s How AI Works.



It’s worth remembering the sage words of British statistician George Box, who said that all models are wrong, but some are useful. At the time, he was referring to other kinds of mathematical models, but the wisdom applies to machine learning.



We have many model types to choose from for our classifier, including decision trees, which generate a series of yes/no questions related to the features used to decide the class label to output for a given input. When the questions are laid out visually, they form a structure reminiscent of an upside-down tree. Think of a decision tree as a computer-generated version of the game 20 Questions.


The MNIST database (Modified National Institute of Standards and Technology database[1]) is a large database of handwritten digits that is commonly used for training various image processing systems.[2][3] The database is also widely used for training and testing in the field of machine learning.

https://en.wikipedia.org/wiki/MNIST_database https://www.nist.gov/itl/products-and-services/emnist-dataset

| 0 | 1 | 3 | 9 –|—–|——-|—–|—- 0 | 978 | 0 | 1 | 1 1 | 2 | 1,128 | 3 | 2 3 | 5 | 0 | 997 | 8 9 | 5 | 1 | 8 | 995

0 1 3 9
19 20 227 762

Another example for interpolation and extrapolation: model the world population between 1950 and 2020.

scikit-learn

https://scikit-learn.org/

Scikit-learn is an open source machine learning library that supports supervised and unsupervised learning. It also provides various tools for model fitting, data preprocessing, model selection, model evaluation, and many other utilities.

Links