Computer Vision
GenAI
Your computer can understand the visual world around you - or even create a new one. From Face ID and Snapchat filters to Stable Diffusion, visual AI has become part of many everyday applications and it’s easier to integrate than ever. Let’s discover the secrets behind computer vision and visual GenAI!
Starting from the basics of computer vision, we’ll explore how deep learning techniques can be leveraged to process, analyze, and alter visual information.
We’ll kick things off with a brief introduction to how images and videos are processed in computers. After that, we’ll take a deep dive into Convolutional Neural Networks (CNNs), the type of neural network that revolutionized the field of computer vision. We’ll show you how different CNN architectures help master various computer vision tasks such as classification, segmentation, and pose detection.
Finally, we’ll explore GenAI and how diffusion models like Stable Diffusion generate and complete images. Throughout this learning experience, we’ll visualize theoretical concepts with live demos and showcase their applications in real-world business cases.
Can’t wait to get started on your own? We’ll introduce you to some open-source vision frameworks like MediaPipe and YOLO that you can try at home.