#206 – Ishan Misra: Self-Supervised Deep Learning in Computer Vision

Lex Fridman Podcast - A podcast by Lex Fridman

Ishan Misra is a research scientist at FAIR working on self-supervised visual learning. Please support this podcast by checking out our sponsors:
– Onnit: https://lexfridman.com/onnit to get up to 10% off
– The Information: https://theinformation.com/lex to get 75% off first month
– Grammarly: https://grammarly.com/lex to get 20% off premium
– Athletic Greens: https://athleticgreens.com/lex and use code LEX to get 1 month of fish oil

EPISODE LINKS:
Ishan’s twitter: https://twitter.com/imisra_
Ishan’s website: https://imisra.github.io
Ishan’s FAIR page: https://ai.facebook.com/people/ishan-misra/

PODCAST INFO:
Podcast website: https://lexfridman.com/podcast
Apple Podcasts: https://apple.co/2lwqZIr
Spotify: https://spoti.fi/2nEwCF8
RSS: https://lexfridman.com/feed/podcast/
YouTube Full Episodes: https://youtube.com/lexfridman
YouTube Clips: https://youtube.com/lexclips

SUPPORT & CONNECT:
– Check out the sponsors above, it’s the best way to support this podcast
– Support on Patreon: https://www.patreon.com/lexfridman
– Twitter: https://twitter.com/lexfridman
– Instagram: https://www.instagram.com/lexfridman
– LinkedIn: https://www.linkedin.com/in/lexfridman
– Facebook: https://www.facebook.com/lexfridman
– Medium: https://medium.com/@lexfridman

OUTLINE:
Here are the timestamps for the episode. On some podcast players you should be able to click the timestamp to jump to that time.
(00:00) – Introduction
(07:49) – Self-supervised learning
(16:24) – Self-supervised learning is the dark matter of intelligence
(20:17) – Categorization
(28:50) – Is computer vision still really hard?
(32:35) – Understanding Language
(42:14) – Harder to solve: vision or language
(48:59) – Contrastive learning & energy-based models
(52:59) – Data augmentation
(57:19) – Fixed audio spike by lowering sound with pen tool
(1:05:33) – Real data vs. augmented data
(1:09:16) – Non-contrastive learning energy-based self-supervised learning methods
(1:12:54) – Unsupervised learning (SwAV)
(1:15:37) – Self-supervised Pretraining (SEER)
(1:20:44) – Self-supervised learning (SSL) architectures
(1:26:43) – VISSL PyTorch-based SSL library
(1:29:38) – Multi-modal
(1:37:06) – Active learning
(1:42:45) – Autonomous driving
(1:54:12) – Limits of deep learning
(1:58:19) – Difference between learning and reasoning
(2:03:26) – Building super-human AI
(2:11:14) – Most beautiful idea in self-supervised learning
(2:15:02) – Simulation for training AI
(2:18:27) – Video games replacing reality
(2:19:40) – How to write a good research paper
(2:24:08) – Best programming language for beginners
(2:25:01) – PyTorch vs TensorFlow
(2:28:26) – Advice for getting into machine learning
(2:30:31) – Advice for young people
(2:32:58) – Meaning of life