Learning Transferable Visual Models From Natural Language Supervision
Radford, Kim, Hallacy, et al.
What it says
CLIP jointly trains an image encoder (ResNet or ViT variants) and a transformer text encoder on 400M (image, caption) pairs scraped from the web. The loss is contrastive: within a batch of N pairs, the N correct image-text pairings should have the highest cosine similarity among all N² possible pairings, optimized as a symmetric cross-entropy in both the image→text and text→image directions. At inference, zero-shot classification works by encoding the candidate labels as text prompts ("a photo of a {label}") and picking the label whose text embedding is closest to the image embedding.
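A minimal NumPy sketch of the two mechanisms above: the symmetric contrastive loss over a batch, and the zero-shot decision rule. Function names are illustrative, and the temperature of 0.07 is only the paper's initial value (CLIP learns it during training).

```python
import numpy as np

def l2_normalize(x, axis=-1):
    # Project embeddings onto the unit sphere so dot products are cosine similarities.
    return x / np.linalg.norm(x, axis=axis, keepdims=True)

def clip_contrastive_loss(img_emb, txt_emb, temperature=0.07):
    """Symmetric contrastive loss over a batch of N aligned (image, text) pairs.

    img_emb, txt_emb: (N, d) arrays; row i of each comes from the same pair.
    """
    img = l2_normalize(img_emb)
    txt = l2_normalize(txt_emb)
    logits = img @ txt.T / temperature  # (N, N) scaled cosine similarities
    labels = np.arange(len(logits))     # pair i matches pair i (the diagonal)

    def cross_entropy(lg, lb):
        lg = lg - lg.max(axis=1, keepdims=True)  # numerical stability
        logp = lg - np.log(np.exp(lg).sum(axis=1, keepdims=True))
        return -logp[lb, lb].mean()

    # Average of the image->text and text->image cross-entropies.
    return 0.5 * (cross_entropy(logits, labels) + cross_entropy(logits.T, labels))

def zero_shot_classify(img_emb, label_txt_embs):
    # Pick the label whose text embedding is closest (cosine) to the image.
    sims = l2_normalize(label_txt_embs) @ l2_normalize(img_emb)
    return int(np.argmax(sims))
```

Correctly matched batches score a much lower loss than mismatched ones, which is the gradient signal that pulls paired embeddings together and pushes everything else apart.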
Why it matters
CLIP made zero-shot image classification work at a useful level of quality and produced encoders whose features generalize to nearly any vision task. Its text encoder conditions Stable Diffusion's image generation, its image encoder is the standard vision backbone in multimodal LLMs, and its joint embedding space is the foundation for open-vocabulary detection and segmentation systems. For most "I have images and text" problems, CLIP (or SigLIP, its sigmoid-loss successor) is the first thing to try.
Read next
- ALIGN (Jia et al., 2021) — a concurrent paper with a similar recipe at larger scale, using noisier web data.
- SigLIP (Zhai et al., 2023) — replaces the softmax contrastive loss with a pairwise sigmoid loss, which scales better.
- OpenCLIP (Ilharco et al., 2022) — the open-weights reproduction that made CLIP broadly usable.