Chapter 7: Computer vision
A 224×224 RGB image is 150,528 numbers. A fully connected layer on that input would need tens of millions of parameters for the first layer alone. Convolutional neural networks fix this with local filters, weight sharing, and a hierarchy of features. This chapter moves from the basics—convolutions and receptive fields—through ResNet to modern tasks: detection, segmentation, and diffusion-based generation. PyTorch + torchvision is the practical stack.
Content is available with subscription.
Get full access to all courses on the platform for one year with a single payment.
▼
Unlike other platforms that charge per course, here you get everything for one price, and after one year of use there will be no automatic charge for the following year.