(I love the Dog versus Cat problem. I keep bringing it up, and I’m not sure why.)
Imagine you are trying to build a classification model, and you have two classes: Cats and Dogs.
Unfortunately, there are 950 cat pictures and 50 dog pictures. This is a problem, and understanding how to deal with this is a fundamental skill you’ll have to build.
I wrote an article about it. Here it is: