top of page

Applications of Transformers
Learn to build multimodal models.

About
Before starting the course, let's go over the content and what you need before jumping in.
This course covers strategies for building multimodal AI models, focusing on image -> text models. 
Is this course for you?
Before jumping into this course, I would recommend understanding the basics of how ChatGPT works as well as the basics of PyTorch. If you want to build side projects and dive deeper into practical applications of transformers, this course is for you.

PyTorch-removebg-preview (1).png
0_px3UZhCxcQ9w7LMV.png
1654634918204.png

Topics

  1. Basics of Multimodal AI

  2. Encoder-Decoder Architectures

  3. Image Captioning Architectures

  4. Cross Attention

  5. Review of Decoder Transformers

  6. Utility Libraries for Building Bigger Models

  7. Inserting Pre-Trained Models into a Trainable Network

  8. Image and Text Processing Techniques

  9. Basics of Machine Translation

Check out the previews below, and we can jump into multimodal AI.

Applications of Transformers

Applications of Transformers

Applications of Transformers
Search video...
Intro to Multimodal AI & Encoder-Decoders

Intro to Multimodal AI & Encoder-Decoders

$
11:33
The Image Captioning Architecture

The Image Captioning Architecture

$
26:13
Setting Up Dependencies

Setting Up Dependencies

$
06:28

Note for Members: The Video Info tab has a description with important clarifications and information for each module.

bottom of page