Vision Foundation Models

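Vision Foundation Models (VFMs) are large, general-purpose computer vision models pretrained on very large image datasets, often with self-supervised or weakly supervised objectives rather than manual labels. Their learned features can be reused across many downstream tasks, such as segmentation, depth estimation, and retrieval, often with little or no task-specific fine-tuning.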
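To make "reusable features" concrete, here is a minimal sketch of the retrieval case: images are mapped to embedding vectors once, and a query is answered by ranking stored embeddings by cosine similarity. The embeddings, file names, and the `retrieve` helper below are hypothetical toy values for illustration; in practice the vectors would come from a frozen VFM backbone, and this is not any particular model's API.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def retrieve(query, gallery, top_k=2):
    """Return the top_k (name, score) pairs most similar to the query."""
    scored = [(name, cosine_similarity(query, emb)) for name, emb in gallery.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Hypothetical 3-dimensional embeddings standing in for VFM features.
# Real models produce vectors with hundreds or thousands of dimensions.
gallery = {
    "cat_1.jpg": [0.9, 0.1, 0.0],
    "cat_2.jpg": [0.8, 0.2, 0.1],
    "car_1.jpg": [0.0, 0.1, 0.9],
}

query_embedding = [1.0, 0.0, 0.0]  # assumed embedding of a query cat image
results = retrieve(query_embedding, gallery, top_k=2)

for name, score in results:
    print(f"{name}: {score:.3f}")
```

The same stored embeddings could feed other downstream heads (for example, a small classifier) without rerunning or retraining the backbone, which is the sense in which the features are reusable.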

Notes

Future Topics

  • SAM (Segment Anything Model)
  • CLIP
  • Others...