Convolution, receptive fields and padding

Chapter 7: Computer vision

A 224×224 RGB image is 150,528 numbers. A fully connected layer on that input would need tens of millions of parameters for the first layer alone. Convolutional neural networks fix this with local filters, weight sharing, and a hierarchy of features. This chapter moves from the basics—convolutions and receptive fields—through ResNet to modern tasks: detection, segmentation, and diffusion-based generation. PyTorch + torchvision is the practical stack.

Content is available with subscription.

Get full access to all courses on the platform for one year with a single payment.

Unlike other platforms that charge per course, here you get everything for one price, and after one year of use there will be no automatic charge for the following year.