53. Vision Transformers

Learning objectives

  • Conclude multimodal models

Multimodal Models

CLIP

Contrastive Language Image Pre-Training

CLIP architecture

Vision Neurons

vision neurons

vision neurons

Quo Vadimus?

Presently, here are some more applications of multimodal models.

home

receipt bookkeeping

pedagogy

COPUS

software dev

Vision Question Answering

VQA

comp bio

transfer learning between RNA and ATAC sequencing

scButterfly

medicine

data types