Why Multimodal?

Uniomodal Representation Learning

Self-supervised Representation Learning

Machine Learning

Multimodal Learning

Speech and Audio

Multimodal Learning: Newer domains

Datasets

  • [MINST]
  • [FMINST]
  • [CIFAR]
  • [CANDOR]
  • [Coswara]
  • [ImageNet]
  • [WIT]
  • [Google Audio Dataset]
  • [Vox celeb dataset]