Computer vision system in self-driving car  

Computer Vision

11.0 Overview

Vision is a process that feels relatively effortless to humans but that is very difficult for computers (Ballard et al, 1983). 

This chapter explains the four areas of computer vision technology that have had the most impact on real-world applications. These are illustrated below:

Types of computer vision tasks

The four areas are:

Handwritten Digit Classification: Identifying digits such as handwritten zip codes.

Image Classification: Identifying the primary category (e.g. dog, flower, …) in an image even if the image contains multiple objects.

Object Detection: Localization and classification of multiple objects in an image.

Facial Recognition: Identifying the name of the person in an image.

Note: Localization refers to identifying a bounding box for each object in the image as illustrated below:

Computer vision bounding boxes

Applications of these computer vision technologies are numerous and include autonomous vehicles, traffic management, medical diagnosis, sports analysis, industrial automation, and security to name just a few. Computer vision researchers also work on building systems for many related tasks, most of which make use of methods similar to the ones discussed in this chapter.

Computer vision researchers also break these tasks down into smaller tasks such as image localization and image segmentation. These tasks are often studied separately with the idea that solutions can be used to improve performance on other vision tasks.

11.1 Handwritten digit classification

One of the early successful applications of computer vision technology was the recognition of handwritten numbers. Handwritten digit classification was a major priority of the US Postal Service for many years.

Yann LeCun and colleagues at Bell Labs in New Jersey studied this problem extensively in the 1990s. In 1990, they (LeCun et al, 1990) acquired a database of 9298 handwritten digits from the Buffalo, NY post office plus 3349 printed digits in a variety of fonts. They used a convolutional neural network (CNN) with 3 layers and a total of 4635 neurons and achieved an error rate of 3.4%.

In 1995, they (LeCun et al, 1995) created the MNIST database, which has 60,000 training and 10,000 test images of 28×28 pixel handwritten digits. Their LeNet-5 CNN was a 7-layer network with 2 convolution layers. It achieved an error rate of 0.9%. In 1998, these computer vision researchers (LeCun et al, 1998) increased the size of the training set by adding automatically generated distorted versions of the digits. These distortions included transformations such as horizontal and vertical translation, scaling, squeezing, and shearing. The added training items reduced the test set error rate to 0.7%.

The MNIST database has been used in a great deal of research since then. In 2013, Wan et al (2013) lowered the error rate to 0.21%. The chapter on deep learning explained how a feedforward network could be used to recognize handwritten digits like those from MNIST and that chapter contains sample code for a CNN for recognizing handwritten digits.

Another heavily used digit database is the SVHN (Street View House Numbers) dataset, which contains 600,000 images of digits from house numbers in Google Street View images. It comes in two formats. One format is cropped single digits – like MNIST – although many images contain some distracting digits at the edges. The other format is the original image with a bounding box around each digit so that boundary detection is easier.

11.2 Image classification

Perhaps the most widely studied computer vision subfield is image classification. The goal is to take an image and identify the category of the image (e.g. dog, flower, …). There are many applications of image classification technology from autonomous driving to generating audio captions of images for visually impaired users.

Neural networks are now the dominant technology for image classification. Prior to the advent of neural networks, image classification (and most other image processing tasks) used a pipeline approach that is described below.

11.2.1 Supervised learning using feature extraction methods

The first step in the typical pipeline is image preprocessing. Here, the image is cleaned up and transformed to the same size and shape as all the other images.

One of the earliest and best-known feature extraction techniques is the Scale Invariant Feature Transform (SIFT) (Lowe, 1999). SIFT applies filters to images to extract a set of features that are invariant to image scaling, translation, rotation, reflection, and (to some degree) illumination.

Another heavily-used type of feature is the Histogram of Oriented Gradients (HOG) (Dalal and Triggs, 2005). HOG breaks an image into small regions known as cells and creates a histogram of gradient directions or edge orientations over the pixels of each cell. After some normalization, these histograms can serve as features that are input to a classifier (e.g. an SVM).
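
To make the idea concrete, here is a minimal sketch of HOG feature extraction using the scikit-image library. The specific parameter values below (8×8-pixel cells, 2×2-cell blocks, 9 orientation bins) are illustrative assumptions, not the exact values used by Dalal and Triggs:

```python
from skimage import data
from skimage.color import rgb2gray
from skimage.feature import hog

# Use a built-in sample image and convert it to grayscale.
image = rgb2gray(data.astronaut())

# Compute HOG features: gradient-orientation histograms over 8x8-pixel cells,
# normalized over 2x2 blocks of cells, concatenated into one vector.
features = hog(image,
               orientations=9,
               pixels_per_cell=(8, 8),
               cells_per_block=(2, 2),
               block_norm='L2-Hys',
               feature_vector=True)

print(features.shape)  # a single fixed-length feature vector for the image
# This vector could now be fed to a classifier such as an SVM.
```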

Other feature extraction methodologies include Speeded Up Robust Features (SURF) (Bay et al, 2006) and Haar-like features (Viola and Jones, 2001).

There has been a significant amount of research effort put into enhancing the SIFT and HOG methodologies to develop better image feature detectors using techniques such as deformable part models (e.g. Felzenszwalb et al, 2010) and a high-level representation termed the Object Bank (Li et al, 2010).

Probably the most heavily-used technique for turning these features into inputs for machine learning algorithms is the bag of visual words (BOVW), developed independently by Sivic and Zisserman (2003) and Csurka et al (2004). The idea is to classify images much as text is classified using a bag of words as the input features. The words, in this case, are visual features identified by taking a low-level set of feature vectors (e.g. SIFT features) and clustering them to find the visual words. Histograms of the counts of these clustered features in each image are then used as input to a classification algorithm such as an SVM or k-Nearest Neighbors.
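
The sketch below illustrates this pipeline under some simplifying assumptions: SIFT descriptors via an OpenCV build that includes SIFT, a small 100-word visual vocabulary, and a linear SVM. The `train_images` (grayscale arrays) and `train_labels` variables are hypothetical placeholders:

```python
import numpy as np
import cv2                                # assumes an OpenCV build with SIFT support
from sklearn.cluster import MiniBatchKMeans
from sklearn.svm import LinearSVC

sift = cv2.SIFT_create()

def sift_descriptors(gray_image):
    """Return SIFT descriptors (one 128-d vector per keypoint) for an image."""
    _, descriptors = sift.detectAndCompute(gray_image, None)
    return descriptors if descriptors is not None else np.empty((0, 128), np.float32)

def bovw_histogram(gray_image, vocabulary):
    """Map each descriptor to its nearest 'visual word' and count the words."""
    descriptors = sift_descriptors(gray_image)
    hist = np.zeros(vocabulary.n_clusters)
    if len(descriptors):
        words = vocabulary.predict(descriptors.astype(np.float32))
        hist = np.bincount(words, minlength=vocabulary.n_clusters).astype(float)
    return hist

# train_images / train_labels are placeholders for a labeled image collection.
# 1. Build the visual vocabulary by clustering all training descriptors.
all_descriptors = np.vstack([sift_descriptors(img) for img in train_images])
vocabulary = MiniBatchKMeans(n_clusters=100, random_state=0).fit(all_descriptors)

# 2. Represent each image as a histogram of visual-word counts.
features = np.array([bovw_histogram(img, vocabulary) for img in train_images])

# 3. Train a classifier on the histograms.
classifier = LinearSVC().fit(features, train_labels)
```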

Classifiers based on these types of features have been shown to handle datasets with large numbers of categories. For example, the WSABIE model (Weston et al, 2011) was able to process a dataset with 16,000 ImageNet categories, which approaches the roughly 30,000 categories humans are thought to be able to distinguish (Biederman, 1987). However, only 10% of the images were correctly labeled.

Weston et al also demonstrated the algorithm on web images with 109,000 categories but with much lower performance. Sánchez et al (2011) were able to improve the correctly labeled image percentage to 16.7% and Perronnin et al (2012) to 19.1%. Three issues should be noted for this type of classifier:

  1. As the number of categories in an image classification task increases, performance for most algorithms decreases (Deng et al, 2010). This is not surprising as one would expect performance to be higher when the categories are visually dissimilar and lower when the categories are visually similar. The more categories in a classification task, the higher the likelihood that there will be visually similar categories.
  2. The classifiers that perform best on smaller datasets might not perform best on larger ones. Deng et al (2010) showed that a linear SVM outperformed a k-Nearest Neighbor classifier for a dataset with a smaller number of categories but that k-Nearest Neighbors outperformed the SVM on a large (10,000 category) dataset.
  3. The classifiers are learned specifically for a task. For example, Dalal and Triggs studied pedestrian detection (i.e. pedestrian vs no pedestrian classification). So, for this task, a set of labeled images with pedestrians and a set of images without pedestrians had to be collected. However, this classifier would not help with the detection of pedestrians in crosswalks vs. pedestrians on the sidewalk. This task would require a new set of images labeled with crosswalk vs. sidewalk.

11.2.2  Deep learning

The most widely used dataset for image classification research is ImageNet which was created by Princeton University researchers (Deng et al, 2009). ImageNet is a dataset of images organized according to their WordNet (Miller, 1995) hierarchy.

It has over 14 million images for over 21,000 of the WordNet synsets. There are an average of 650 images per synset. There are 27 high-level categories such as appliance, bird, and flower and each high-level category has numerous subcategories (e.g. the bird category has pigeons and other types of birds). Images can contain multiple objects. Each image contains an average of 1.5 objects.

The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) is an annual image classification and object localization competition started in 2010. The ILSVRC dataset is a subset of ImageNet that contains approximately 1000 images in each of 1000 categories. There are a total of 1.2 million training images, 50,000 validation images, and 150,000 test images.

The primary measure used in the competition is the top-5 error rate, which is the percentage of test images for which the correct label is not found among the model's top five predictions, as illustrated below.

Illustration of top-5 object recognition task
Source:   Krizhevsky et al, 2013.
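
As a simple illustration of the metric (not code from the competition), the top-5 error rate can be computed from a model's class scores as follows:

```python
import numpy as np

def top5_error_rate(scores, true_labels):
    """scores: (num_images, num_classes) array of model scores;
    true_labels: (num_images,) array of correct class indices.
    An image counts as an error if its true class is not among the
    five highest-scoring predictions."""
    top5 = np.argsort(-scores, axis=1)[:, :5]            # indices of the 5 best classes
    correct = (top5 == true_labels[:, None]).any(axis=1)
    return 1.0 - correct.mean()

# Toy example: 3 images, 10 classes, random scores.
rng = np.random.default_rng(0)
print(top5_error_rate(rng.random((3, 10)), np.array([2, 7, 5])))
```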

Many university and commercial teams have participated over the years with 35 teams in the 2010 competition and 172 teams in the 2016 competition. Progress on the ILSVRC image classification task has been dramatic and is illustrated below:

ILSVRC image classification performance

In the first two competitions, the two different winners used different types of feature extraction (e.g. SIFT) plus SVM classification to achieve a 28% top-5 error rate in 2010 and 26% in 2011. The error rate for the top prediction was 47% in 2010.

The 2012 competition woke up the world to the power of neural networks in general and CNNs in particular. AlexNet was created by a team of University of Toronto researchers (Krizhevsky et al, 2012). It won the 2012 challenge by achieving a 15% top-5 error rate. The next best system achieved only 26%! (It should be noted that the AlexNet top-1 error rate was still around 37%).

AlexNet had 5 convolutional layers, 60 million parameters, 650,000 neurons, and a 1000-neuron final layer that did the 1000-category classification. The AlexNet advances included efficient use of GPUs, use of the ReLU activation function, use of dropout layers to reduce overfitting, and use of techniques to add additional images to the training set by transforming existing images (e.g. rotating, translating, and zooming in/out). The AlexNet architecture builds on the LeNet-5 architecture discussed above and is a series of convolution layers followed by max pooling layers. At each step, the network is believed to learn higher and higher-level features. Subsequently, other research teams built CNNs with more layers as illustrated below:

Visualization of layer contents
Source:  Zeiler and Fergus, 2014.

A New York University team (Zeiler and Fergus, 2014) developed a visualization technique termed a deconvolution network. The deconvolution network essentially turns the features in the hidden layers into pictures (pixels) and enables visualization of what is happening internally in the CNN layers.

By using this technique in the AlexNet architecture, the research team was able to fine-tune the architecture and produce a lower error rate. At the time, Matt Zeiler was an NYU Ph.D. student. When he finished his degree, he was reportedly offered millions of dollars by tech giants such as Google, Facebook, and Apple. Instead, he founded a company, Clarifai, which is still independent and had raised about $100 million in capital as of August 2022.

The 2013 competition was won by a Clarifai entry with a top-5 error rate of 11%.

The 2014 competition saw several new architectures that are still used today. Two University of Oxford researchers (Simonyan and Zisserman, 2015) created the VGG16 model, which has 16 weight layers (13 convolutional and 3 fully connected). It achieved a 7.3% top-5 error rate and is still widely used for image classification tasks.

The 2014 competition was won by a team of Google and academic researchers (Szegedy et al 2014b). Their GoogLeNet system (also known as Inception V1) achieved a top-5 error rate of 6.7%. The GoogLeNet architecture has 22 layers. Russakovsky et al (2015) looked at the top-5 performance of two expert human annotators. The better annotator achieved a top-5 error rate of 5.1% which was only slightly better than GoogLeNet.

A blog post by Andrej Karpathy (2014) analyzed the types of errors made by GoogLeNet vs. humans. GoogLeNet makes more mistakes with:

  • Small and/or thin objects
  • Objects in images enhanced with photographic filters
  • Objects in abstract images such as paintings and sketches
  • Extreme closeups
  • Rotated objects
  • Objects with identifying text
  • Occluded objects

In contrast, people made more errors with:

  • Fine-grained sub-categories such as identifying the type of dog
  • Class unawareness. Human annotators weren’t always completely aware of all 200 classes.
  • Insufficient training data. Human annotators saw only 13 examples of each class and sometimes this wasn’t enough information.

In 2015, Microsoft reported that its system achieved a 4.9% top-5 error rate (He et al, 2015a). The authors pointed out that their system was the first to achieve human-level performance, comparing their result to Karpathy’s 5.1% estimate of human performance. It should be noted that the system's top-1 error rate was still over 20%.

The 2015 ImageNet Challenge was won by a Microsoft entry (He et al, 2015b) that was an ensemble of very deep ResNets, with up to 152 layers, and achieved a top-5 error rate of 3.57%. The top-1 error rate was still over 19%.

In 2016, a team from the Chinese Ministry of Public Security achieved a 3% top-5 error rate by using an ensemble of networks including Inception, ResNet, and Wide Residual Networks (Zagoruyko et al, 2017). That entry barely beat a Facebook team that achieved a 3.03% top-5 error rate.

The 2017 (and final) challenge was won by the “Squeeze-and-Excitation” network created by another group of Chinese researchers (Hu et al, 2018) with a top-5 error rate of 2.25%.

Many CNNs were developed post-AlexNet through 2018. A group of researchers (Bianco et al, 2018) summarized the size and performance of many of those systems in the diagram below:

ConvNet architectures for image classification        
Source: Bianco et al (2018)

More recently, researchers have shown that transformer architectures for image classification can be as effective as, or more effective than, CNNs. For example, Google researchers (Dosovitskiy et al, 2020; Zhang et al, 2021; Li et al, 2022) used a Vision Transformer instead of a CNN to successfully classify images. Even more impressive results were obtained with the Swin Transformer (Liu et al, 2021). Other Google researchers have obtained good results with a plain multi-layer perceptron model (Tolstikhin et al, 2021). However, there is some evidence that the superiority of transformer models over CNN models is just due to better training methodology (Liu et al, 2022).

Vision transformers have also been used to convert 2D images to 3D scenes (Božic et al, 2021), to facilitate object detection in 3D point clouds (Wang et al, 2022), and to predict video frames (Gupta et al, 2022).

11.2.3  Self-supervised learning

One challenge with supervised learning is finding/creating sufficient labeled examples. Because images have to be labeled by hand, millions of labeled images are available but not billions.

Self-supervised learning, which doesn’t require labels, can also be used for images. Autoencoders and generative adversarial networks use unsupervised learning techniques that take a sample image as input and reconstruct the image on output. Google DeepMind researchers (van den Oord et al, 2016a) developed the PixelRNN and PixelCNN models that generate images pixel by pixel.

Essentially, the value of each pixel in an image is a distribution based on the pixels that precede it in its row and on pixels in rows above it. By predicting the image pixel-by-pixel, the resulting generated images are sharp and coherent.
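
The key mechanism behind this pixel-by-pixel ordering can be sketched as a masked convolution: the kernel is zeroed out so that each output position depends only on the pixels above it and to its left. This is a minimal illustration, not DeepMind's implementation:

```python
import torch
import torch.nn as nn

class MaskedConv2d(nn.Conv2d):
    """Convolution whose kernel is masked so each pixel only sees the pixels
    above it and to its left (the autoregressive ordering used by PixelCNN)."""
    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)
        k = self.kernel_size[0]
        mask = torch.ones_like(self.weight)
        mask[:, :, k // 2, k // 2:] = 0   # zero out the current pixel and pixels to its right
        mask[:, :, k // 2 + 1:, :] = 0    # zero out all rows below
        self.register_buffer("mask", mask)

    def forward(self, x):
        self.weight.data *= self.mask     # enforce the mask before each convolution
        return super().forward(x)

# Each output position now depends only on previously generated pixels.
layer = MaskedConv2d(1, 16, kernel_size=7, padding=3)
out = layer(torch.randn(1, 1, 28, 28))
print(out.shape)   # torch.Size([1, 16, 28, 28])
```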

OpenAI researchers (M. Chen et al, 2020) showed that transformers using autoregressive modeling learn strong image representations. They pre-trained a GPT-2 model to predict pixels. When the pre-trained model was fine-tuned, the system achieved 99% accuracy on the CIFAR-10 image classification benchmark, which was comparable to systems pre-trained with supervised learning.

Chapter 4 discussed an unsupervised contrastive learning system named SimCLR that was used to pre-train a model; very good results were then achieved on the ImageNet dataset by supervised fine-tuning of the pre-trained model with just 1% of the ImageNet training examples.
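
The heart of a SimCLR-style system is a contrastive loss that pulls together the embeddings of two augmented views of the same image and pushes apart the embeddings of different images. A minimal sketch (the batch size and embedding dimension are hypothetical):

```python
import torch
import torch.nn.functional as F

def contrastive_loss(z1, z2, temperature=0.5):
    """Simplified SimCLR-style (NT-Xent) loss. z1 and z2 are embeddings of two
    augmented views of the same batch of images; matching rows are positives,
    all other pairs in the batch act as negatives."""
    z = torch.cat([z1, z2], dim=0)                    # (2N, d)
    z = F.normalize(z, dim=1)
    sim = z @ z.t() / temperature                     # cosine similarities
    n = z1.size(0)
    mask = torch.eye(2 * n, dtype=torch.bool)
    sim.masked_fill_(mask, float("-inf"))             # a sample is not its own negative
    targets = torch.cat([torch.arange(n, 2 * n), torch.arange(0, n)])
    return F.cross_entropy(sim, targets)

# Embeddings for two augmented views of a batch of 8 images (hypothetical encoder output).
loss = contrastive_loss(torch.randn(8, 128), torch.randn(8, 128))
```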

Another group of researchers (Fernando et al, 2017) presented the system with sequences of frames from a video. In each sequence, one frame was randomly inserted and therefore did not belong.  In the example below, both sequences of frames show a boy jumping into a water hole.  However, the first set of frames is in the correct order, the second set is in the wrong order.

self-supervised learning
Source: Fernando et al (2017)

The system was trained to identify the “odd-one-out” frame.  The learned representation was then fine-tuned with training examples in which the label for each frame sequence was the name of the action portrayed in the frame sequence.  The resulting model outperformed the previous state of the art by 12.7%.

Another SSL approach (Doersch et al, 2015) is to start with a database of images and break each image into 9 parts as illustrated below:

self-supervised learning

Each input was the middle part plus one of the other 8 parts, and the model was trained to predict the location of the part (e.g. top-left, top-middle, …). By learning to do this, the model learned enough about images to be able to group all images of cats into one class of objects, all images of people into another, and the same for other object types.

Companies such as Google (Waymo) and Tesla use cameras and other sensors in cars to correlate driver actions with sensor input. The AI systems learn to produce the actions of the drivers given the sensor input; the driver actions are, in effect, the labels for the data. For more information on this topic, see the chapter on autonomous vehicles in my book or see this blog post.

A group of DeepMind researchers (Grill et al, 2020) created yet another SSL approach they named Bootstrap Your Own Latent (BYOL). BYOL uses two networks, an online network and a target network. The online network is trained to predict the target network's representation of one augmented view of an image from a different augmented view of the same image. The target network's parameters are then updated as a slow-moving average of the online network's parameters. A linear classifier trained on the frozen BYOL representation yielded a 79.6% top-1 accuracy on ImageNet.
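
The target-network update in BYOL is an exponential moving average of the online network's weights. A minimal sketch, using a trivial stand-in encoder (the real networks are deep CNNs such as ResNets):

```python
import copy
import torch

# Stand-in encoder; in practice this would be a deep network such as a ResNet.
online = torch.nn.Linear(512, 128)
target = copy.deepcopy(online)
for p in target.parameters():
    p.requires_grad = False            # the target network receives no gradients

def update_target(online, target, tau=0.996):
    """Move each target parameter slightly toward the corresponding online parameter."""
    with torch.no_grad():
        for p_online, p_target in zip(online.parameters(), target.parameters()):
            p_target.mul_(tau).add_((1.0 - tau) * p_online)

# Called once per training step, after the online network's gradient update.
update_target(online, target)
```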

Barlow Twins (Zbontar et al, 2021) is another SSL dual-network approach.

Contrastive methods such as the one used to train SimCLR are computationally expensive because they require large numbers of pairwise feature comparisons. Facebook researchers (Caron et al, 2021) developed SwAV, which avoids the pairwise comparisons by using a clustering technique that ensures different views of the same image get assigned to the same cluster.

Most SSL research has centered on the ImageNet dataset. To determine whether these SSL techniques work more generally, Facebook researchers (Goyal et al, 2021) used SwAV to pre-train a model, named SEER, on billions of random internet images. When the model was fine-tuned on ImageNet, it achieved a top-1 ImageNet accuracy of 84.2%. With fine-tuning on only 10% of ImageNet, it achieved 77.9% accuracy.

VISSL and Hugging Face both offer open source libraries for training image models via SSL.

11.2.4 Semi-supervised learning

Researchers have had some success with semi-supervised image classification approaches that reduce the need for massive labeled datasets. OpenAI researchers (Salimans et al, 2016) trained a Generative Adversarial Network to generate images from the same distribution as a training set of images and additionally trained the Discriminator to output an image label.

The new images with the labels were then added to the training set. The result was good performance on several different datasets (including ImageNet) with a small set of manually labeled examples. Facebook researchers took a different approach. The first step was to train a model on a limited amount of labeled data. The training examples were then ranked for each concept class and the top-scoring examples selected for each class. Then the top examples were used to train a new model and this new model was fine-tuned on all the existing training data. The resulting model outperformed a model trained solely on the labeled data.

Noisy student training (aka self-training) was used by researchers (Xie et al, 2020) to achieve a state-of-the-art top-1 accuracy of 88.4% on ImageNet. They started with a network trained on ImageNet and used it as a teacher model to predict labels for 300 million unlabeled images, keeping only the predicted labels that had a medium-to-high confidence rating. These pseudo-labeled images were then used to train a student model. Noise was added to the student's training using three techniques: First, they randomly removed layers using a technique known as stochastic depth (Huang et al, 2016). Second, they randomly removed nodes using a technique known as dropout (Srivastava et al, 2014). Third, they used random augmentations of the images using a technique named RandAugment (Cubuk et al, 2019).
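
One round of this kind of self-training can be sketched as follows. The `teacher` and `student` objects and their `predict_proba`/`fit` methods are hypothetical placeholders standing in for large CNNs; the noise techniques listed above would be applied inside the student's training procedure:

```python
import numpy as np

def noisy_student_round(teacher, student, labeled_x, labeled_y, unlabeled_x, threshold=0.7):
    """One sketched round of noisy-student self-training:
    1. The teacher predicts soft labels for the unlabeled images.
    2. Only reasonably confident predictions are kept as pseudo-labels.
    3. The student is trained on labeled + pseudo-labeled data with noise
       (dropout, stochastic depth, strong augmentation) enabled.
    The trained student can then become the teacher for the next round."""
    probs = teacher.predict_proba(unlabeled_x)
    confident = probs.max(axis=1) >= threshold
    pseudo_x = unlabeled_x[confident]
    pseudo_y = probs[confident].argmax(axis=1)

    train_x = np.concatenate([labeled_x, pseudo_x])
    train_y = np.concatenate([labeled_y, pseudo_y])
    student.fit(train_x, train_y)          # noise is applied inside the student's training
    return student
```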

11.2.5 Weakly supervised learning

Facebook researchers (Mahajan et al, 2018) collected billions of images from Instagram labeled with hashtags. They matched 1,500 of the hashtags with ImageNet categories and trained a CNN-based classifier to classify images into their hashtag categories, achieving a state-of-the-art ILSVRC top-5 error rate of 2.4% and a top-1 error rate of 14.6%.

Facebook researchers also developed a classification algorithm that combined semi-supervised and weakly-supervised learning and worked better than either.

11.3 Object detection

Object detection is similar to, but more difficult than, image classification.  In image classification, the goal is to identify the category of the primary object in the image.  In image classification datasets, most images have only a single prominent object. In contrast, the object detection task is to find all the objects in an image and classify them.

Like image classification, earlier object detection architectures typically involved feature extraction plus training on those features to build a model.  More recently, all-in-one deep learning architectures do both at the same time.

11.3.1  Datasets

The ImageNet dataset images on average contain one and a half objects (some contain just one object, some two, some three, and so on).

The PASCAL Visual Object Classes (VOC) dataset (Everingham et al, 2009) was created prior to ImageNet, served as the basis for an annual challenge from 2005 through 2012, and is still commonly used as a benchmark for object detection research. It initially had just 4 classes. The VOC 2012 benchmark has 20 classes, 11,500 images, and 27,500 annotated objects with bounding boxes (of which about 6,900 also have segmentation masks).

The ImageNet competition also included an object detection challenge from 2013 through 2017 that was harder than the VOC object detection task because the ILSVRC dataset has many more categories. The ILSVRC 2013 dataset has 345,000 objects in 395,000 images in 200 categories. It was expanded to include more objects and images in 2014.

The Caltech Pedestrian Dataset (Dollár et al, 2012) includes 350,000 pedestrian bounding boxes labeled in 250,000 frames.

Caltech Pedestrian Dataset
Source: Dollár et al (2012)
There are many other widely used object detection datasets. KITTI (Geiger et al, 2012) is a dataset for detection of objects that might be encountered by autonomous vehicles. MS COCO (Lin et al, 2014) is a dataset that contains objects embedded in complex scenes. Google’s Open Images Dataset V4 (Kuznetsova et al, 2020) contains 9.2M annotated images collected from Flickr. VisDrone2018 (Zhu et al, 2018) contains 2.5 million annotated objects in 179,264 images and video clips captured by drones in China.

One of the primary metrics for object detection datasets is mean average precision (mAP). This metric balances recall (the percentage of objects found) and precision (the percentage of found objects that are labeled correctly).
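
A detection is typically counted as correct when its bounding box overlaps a ground-truth box sufficiently (intersection-over-union, commonly IoU ≥ 0.5) and its label matches. The sketch below shows IoU and a simplified per-class average precision; mAP is this value averaged over classes. This is illustrative and not the exact evaluation protocol of any particular benchmark:

```python
import numpy as np

def iou(box_a, box_b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    x1, y1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    x2, y2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter)

def average_precision(scores, is_true_positive, num_gt):
    """Area under the precision-recall curve for one class. A detection counts
    as a true positive when it matches an unused ground-truth box (e.g. IoU >= 0.5)."""
    order = np.argsort(-np.asarray(scores))          # rank detections by confidence
    tp = np.asarray(is_true_positive)[order]
    cum_tp = np.cumsum(tp)
    precision = cum_tp / (np.arange(len(tp)) + 1)
    recall = cum_tp / num_gt
    return float(np.trapz(precision, recall))        # mAP averages this over classes

print(iou((0, 0, 10, 10), (5, 5, 15, 15)))           # about 0.14
```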

11.3.2 Supervised learning approaches

Supervised learning approaches to object detection use one of two methodologies. In two-stage methodologies, the first stage identifies the bounding boxes and the second stage classifies the images inside the bounding boxes. In one-stage methodologies, both are done at the same time, which makes the resulting systems faster to run in production.

An example of a two-stage method is the R-CNN architecture (Girshick et al, 2014; Girshick, 2015; Ren et al, 2016) which does the following:

  1. Identify the bounding boxes (termed “regions”) for each object. They used a technique known as selective search (Uijlings et al, 2013) that produces a relatively small number (around 2,000) of candidate regions for each image. The selective search technique works better than earlier methods that exhaustively searched each image. It identifies small candidate regions using color and contour similarity metrics and merges them hierarchically into larger and larger candidate regions.
  2. Use a pre-trained image classifier to extract a fixed-length feature vector for the pixels in each region.
  3. Use these features as inputs to an SVM classifier for each category.

They termed this method R-CNN for “regions with convolutional neural network features”. This method achieved an mAP of 54% on the VOC 2010 dataset compared to a previous best of 33% using HOG features in a deformable part model.
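
The core of steps 2 and 3 can be sketched roughly as follows. This is a simplified illustration, not the original implementation: it uses a modern torchvision ResNet-18 as the pre-trained feature extractor (R-CNN used an AlexNet-style CNN), the region boxes are assumed to come from a proposal method such as selective search, and a real pipeline would also normalize crops with the ImageNet mean and standard deviation:

```python
import torch
import torchvision
from torchvision import transforms

# Pre-trained CNN used only as a fixed-length feature extractor.
backbone = torchvision.models.resnet18(weights="IMAGENET1K_V1")
backbone.fc = torch.nn.Identity()        # drop the classification head; keep 512-d features
backbone.eval()

preprocess = transforms.Compose([
    transforms.ToPILImage(),
    transforms.Resize((224, 224)),       # warp each region to the network's input size
    transforms.ToTensor(),
])

def region_features(image, boxes):
    """image: HxWx3 uint8 array; boxes: list of (x1, y1, x2, y2) region proposals."""
    feats = []
    with torch.no_grad():
        for x1, y1, x2, y2 in boxes:
            crop = preprocess(image[y1:y2, x1:x2]).unsqueeze(0)
            feats.append(backbone(crop).squeeze(0))
    return torch.stack(feats)            # one fixed-length feature vector per region

# These vectors would then be used to train one linear SVM per object category.
```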

YOLO (Redmon et al, 2016; Redmon and Farhadi, 2017; Redmon and Farhadi, 2018; Bochkovskiy et al, 2020), which stands for You Only Look Once, is a single-stage method that uses a single CNN to simultaneously identify multiple bounding boxes and classify the objects in those bounding boxes. It does this in one forward pass that predicts bounding box coordinates, box confidences, and class probabilities across the whole image.

Illustration of YOLO object detection
Source:  Redmon et al (2016)

As illustrated above, YOLO works by first imposing a grid on an image. Each grid cell is responsible for learning both the probability that it is inside a bounding box and the probability of the object category within the bounding box.

Historically, CNNs have been the dominant architecture for object detection.  More recently, transformers such as the Swin Transformer (Liu et al, 2021) have shown the ability to outperform CNNs.

11.3.3   Unsupervised object detection

Researchers have found several unsupervised methods of reducing the reliance on labeled images for computer vision tasks such as object detection and image classification.

One interesting approach was devised by two Carnegie Mellon researchers (Wang and Gupta, 2015). They started with 100,000 videos pulled from the web. Their idea was that a patch (a square subset of an image) from one frame of a video is likely to hold the same object or object part in the next frame of the video.

They used a CNN to learn an image similarity model in which the similarity of the patches in two sequential frames is greater than the similarity of patches in two random frames. The final CNN layer is a set of weights that serve as the features of the image.

They then used the final layer weights to create clusters of training images using a nearest neighbor algorithm. Each cluster corresponds to an object category. The learned model can then serve as an image classifier: a test image is run through the network and classified according to its nearest neighbor. So, they effectively built an object detection algorithm with no supervision. Their system performed almost as well as a baseline supervised learning system.

11.3.4   Compositional object detection

Image classification is often viewed as a machine learning problem in which the objective is to find patterns in a set of input examples that can correctly distinguish amongst a finite set of categories.

One problem with this approach, as discussed above, is that adding categories requires many new examples and re-training. In contrast, people can learn to classify new categories with one or relatively few examples.

A second problem is that extending this approach to object detection and scene analysis, where there are multiple objects in a scene, relies on a separate algorithm that finds bounding boxes around the individual objects.

A compositional approach to object detection in scenes is to view each category as composed of parts. For example, a dog has a head, a tail, and four legs. The parts have sub-parts. A head has eyes, ears, a nose, and a mouth. A dog can be distinguished from a horse or a cat by the relative sizes of the parts, the spatial arrangement of the parts, and specific characteristics of the parts. A horse’s head has a different shape and arrangement of eyes, ears, nose, and mouth than does a dog’s or a cat’s head.

The idea of a compositional representation of objects in humans has a long history (e.g. Biederman, 1987). The idea was illustrated by a 2006 Brown University study (Jin and Geman, 2006). These researchers built a system for reading Massachusetts license plates with 98% accuracy. At the lowest level, the system detected predefined parts of characters and parts of plate sides. At the next level up, probabilistic functions were created that mapped the probabilities of each character and plate side to the predefined parts. Then these higher-level features were used to create even higher-level features until, at the highest level, license plates were identified. In this system, the probabilistic maps from lower to higher levels were hand-crafted; however, as the authors noted, they could have been learned.

Historically, even compositional object detection systems have been task-specific, and learning new categories requires many additional examples and re-training. Fidler et al (2008) suggested that one path toward more general object detection is to have object categories that decompose into part hierarchies that have common elements at their leaf levels. Then, to recognize a new category, one would only need to learn how the leaf-level parts can be combined to compose examples of the category.

UCLA researchers (Kokkinos and Yuille, 2011) proposed that the leaf levels are tokens that are a set of straight edge and ridge segments. Tokens combine to form contours that combine to form parts that combine to form objects. Each object can then be defined by a grammar. Object detection is a process of composing lower-level features into higher-level features.

They showed that this methodology can handle deformed, occluded, and poor resolution images by achieving state of the art results on the UIUC car dataset which is a dataset of cars with very poor resolution and the ETHZ shape dataset which has deformed and occluded shapes.

Lake et al (2015) decomposed handwritten symbols into a set of strokes. By recognizing symbols based on strokes, their system was able to demonstrate the learning of new symbols with just a few examples.

A group of Vicarious AI researchers (Stone et al, 2017) looked at compositionality at the object level. They argue that for a CNN to provide a generic object detection capability, the activations of an object should be able to be transferred to a different object detection domain. For example, if one trains a CNN to recognize an airplane in a scene, the activations in the network that correspond to the learned features of the airplane should be the same whether the airplane is occluded or not in the scene.

Using a visualization technique, they were able to show that this is not the case for standard CNNs. Instead, the activations included elements of the occluding objects.

They used a set of CNNs with shared weights. One CNN was fed the original images with multiple objects. There was an additional CNN for each image category. The images fed into these additional CNNs were images with the other objects masked. They used an objective function that both minimizes overall classification loss and additionally penalizes the CNN if the activations of occluded and non-occluded objects are different. They showed that this architecture produces better results than just a single standard CNN.

11.4   Facial recognition

Facial recognition is a computer vision subdiscipline that has a great deal in common with image classification but it has a set of unique challenges.  For example, below are four different images of former US President Ronald Reagan:

Multiple images for facial recognition in computer vision chapter

These images of the same person show some of the challenges specific to facial recognition. Images of the same person can vary in pose (i.e. the 3D rotation of the face), lighting conditions, facial expressions, occlusions (the image on the right has a turkey wing partially obstructing the face), age (Reagan is pictured as both a younger and an older man), resolution, scale, and make-up. Other sources of variation include disguises, orientation, and the presence of glasses, hats, and jewelry.

Additionally, many images (e.g. the middle image above) have multiple faces and/or other objects. Before facial recognition can be applied to these images, the bounding box must first be identified using Face Detection technology.

Face Detection is a special case of object detection discussed earlier in this chapter. Face Detection is a fairly mature technology at least for frontal images. The technology (Viola and Jones, 2001; Osadchy et al, 2007) is the basis for the autofocus feature in point-and-shoot cameras and smartphones.

Facial recognition systems typically operate in one of two modes: identification or verification. In an image identification system, there is a database of labeled images and the task is to determine if an input image is one of the images in the database. This is critical in applications like surveillance where one might want to determine if an input image matches an image in a terrorist database or an image in a mug book.

In contrast, image verification is about determining if two images are of the same person. Applications of image verification include unlocking a smartphone using one’s face, passport control, and verifying identity for access control purposes. Photo apps like the ones offered by Google, Apple, and Facebook use image verification to group photos of the same person.

The development of facial recognition technology has paralleled that of image classification. Prior to the use of deep learning in image classification, most facial recognition systems used a two-stage process (e.g. Chen et al, 2013):

  1. Derive a set of features (e.g. SIFT and HOG features) from the images that are invariant to pose, lighting, expression, and other intra-person variations.
  2. Apply a classification algorithm such as an SVM or linear discriminant analysis to distinguish the pictures of one person from another person.

In 2014, inspired by the success of AlexNet for image classification, Facebook researchers (Taigman et al, 2014) trained a CNN-based system named DeepFace on four million images of four thousand people and raised accuracy on a benchmark dataset named Labeled Faces in the Wild (G. Huang et al, 2007) to 97.35%, which reduced the error rate by 27%.

Google followed shortly thereafter by training its FaceNet system on 200 million images of 3 million people and improved accuracy to 99.67% (Schroff et al, 2015). The CNN algorithms operate without pre-defined features and appear to do a better job of learning a set of features that represent faces in a form that is invariant to pose, illumination, and other sources of intra-person variation.
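
FaceNet learned its face embedding with a triplet loss: an image of a person (the anchor) should be closer in embedding space to another image of the same person (the positive) than to an image of a different person (the negative). A minimal sketch with hypothetical 128-dimensional embeddings:

```python
import torch
import torch.nn.functional as F

def triplet_loss(anchor, positive, negative, margin=0.2):
    """FaceNet-style triplet loss: the anchor-positive distance should be smaller
    than the anchor-negative distance by at least `margin`."""
    pos_dist = (anchor - positive).pow(2).sum(dim=1)
    neg_dist = (anchor - negative).pow(2).sum(dim=1)
    return F.relu(pos_dist - neg_dist + margin).mean()

# Hypothetical 128-dimensional face embeddings for a batch of 16 triplets.
loss = triplet_loss(torch.randn(16, 128), torch.randn(16, 128), torch.randn(16, 128))
```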

There are many commercial and open source face detection and recognition tools on the market such as Amazon’s Rekognition, Google’s Vision AI, and Microsoft Computer Vision product offerings.

11.5   What is being learned by computer vision systems?

A network that can be trained to classify images and exhibits relatively little increase in error from training to test can be said to have learned to classify the images.

But what has been learned? Has the network gained a deep human-like understanding of the different categories?

It is generally thought that the initial layers of CNNs for image processing tasks learn low-level features such as edges, lines, and curves. It turns out that most image processing tasks produce very similar initial layers. A group of vision researchers (Yosinski et al, 2014) noted:

“Modern deep neural networks exhibit a curious phenomenon: when trained on images, they all tend to learn first-layer features that resemble either Gabor filters or color blobs. The appearance of these filters is so common that obtaining anything else on a natural image dataset causes suspicion of poorly chosen hyperparameters or a software bug.”

These researchers also noted that as one progresses through the layers, the features tend to be more specific to the task.

When these early layers are extracted from a trained network and used to initialize a network for a different vision task, performance on the new task increases, learning time decreases, and fewer training examples are required.
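
This reuse of early layers is the basis of transfer learning. A minimal sketch using PyTorch and torchvision, where the 10-class target task is a hypothetical example:

```python
import torch
import torchvision

# Load a network pre-trained on ImageNet and freeze its feature-extraction layers.
model = torchvision.models.resnet18(weights="IMAGENET1K_V1")
for param in model.parameters():
    param.requires_grad = False            # keep the pre-trained features fixed

# Replace the final layer with a new head for the 10-class target task.
model.fc = torch.nn.Linear(model.fc.in_features, 10)

# Only the new layer's weights are updated during training.
optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-3)
# ...the usual training loop over the new task's data is omitted here...
```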

The question remains, however, whether these systems are learning the concepts underlying the image categories or whether they are just learning statistical regularities that are useful in classification.

A group of Google, NYU, and University of Montreal researchers (Szegedy et al, 2014a) set out to study this question. They trained a neural network on an image classification dataset. Performance on the test data showed strong generalization (i.e. very little increase in test error over training error). Then they created a set of adversarial test images with perturbations that were not visible to the human eye and didn’t affect the ability of humans to correctly classify the images. The idea was that, if the network really had a deep understanding of the image categories, the perturbations would not affect its ability to classify the images any more than they affected the ability of humans to classify the images. However, they found that the network was far worse at classifying these adversarial examples.

Some other examples of adversarial attacks on computer vision systems:

  • Adding small amounts of carefully computed noise to images – noise that is nearly invisible to people – can cause the images to be misclassified (Goodfellow et al, 2015). A minimal sketch of this kind of perturbation appears after this list.
  • Placing post-it-like stickers on images of stop signs causes deep learning systems to lose the ability to classify the stop sign, i.e., to not “see” the stop sign. Yet, people have no trouble recognizing the perturbed image as a stop sign with a sticker on it (Eykholt et al, 2018).
  • By modifying just a single pixel in an image, it is possible to alter an object recognition system’s category choice. In one instance, by changing a single pixel in a picture of a deer, the object recognition system was fooled into identifying the image as a car (Su et al, 2019).
  • It is possible to make people invisible to person detection systems (i.e., systems that classify an image as a person but that do not identify the person). One group of researchers started with a full-length photo of a person, an image the system easily identified. Then, in another photo, the person held a colorful patch that just covered their waist but did not come near their face, and the system could no longer identify the person (Thys et al, 2019). A group of Russian researchers developed a card that they placed on a person’s hat that caused a facial recognition system to fail to identify the person wearing the hat. The card didn’t confuse people. University of Chicago researchers have developed a system named Fawkes that can inoculate images of oneself against facial recognition systems (Shan et al, 2020).
  • Researchers have also figured out how to fool deep learning systems into recognizing objects such as cheetahs and peacocks with high confidence in images that contain no object whatsoever (Nguyen et al, 2015).
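
As promised in the first bullet above, here is a minimal sketch of the fast-gradient-sign style of perturbation described by Goodfellow et al (2015). The classifier, the image tensor, and the epsilon value are hypothetical placeholders:

```python
import torch
import torch.nn.functional as F

def fgsm_perturb(model, image, label, epsilon=0.01):
    """Fast gradient sign method, sketched: nudge each pixel slightly in the
    direction that increases the classification loss. The change is typically
    too small for a person to notice but can change the model's prediction."""
    image = image.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(image), label)
    loss.backward()
    adversarial = image + epsilon * image.grad.sign()
    return adversarial.clamp(0, 1).detach()

# Usage (hypothetical classifier and a single normalized RGB image tensor):
# adv = fgsm_perturb(classifier, img.unsqueeze(0), torch.tensor([label_index]))
```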

So what is actually being learned? Two University of Montreal researchers (Jo and Bengio, 2017) created adversarial test images by applying a mathematical transformation that altered the surface statistical regularities of the images but left them entirely classifiable by humans. They found that adversarial test images created in this fashion increased test error by 28%. They concluded that image classification networks do not learn high-level abstract categories. Instead, they learn surface statistical regularities. In the image below, adding a guitar causes the AI system to misclassify the monkey as human:

Adding a guitar causes the computer vision system to misclassify the monkey as human
Reprinted with permission of International Press of Boston, Inc.

Another group of researchers (J. Wang et al, 2017) created adversarial test images by superimposing unlikely content. For example, they superimposed a guitar on a picture of a monkey. Humans have no problem recognizing this as a picture of a monkey with a guitar even though they’ve probably never seen an image of a monkey holding a guitar. However, neural networks classify it as a person not a monkey because all the training images they saw with guitars were labeled as people.

They take this as evidence that people have no problem decomposing the image into subparts but that networks don’t decompose and instead just learn the statistical regularities.

For a review of adversarial attack methods, see this article.

 


© aiperspectives.com, 2020. Unauthorized use and/or duplication of this material without express and written permission from this site’s owner is strictly prohibited. Excerpts and links may be used, provided that full and clear credit is given with appropriate and specific direction to the original content.