Stephan Fabel — Efficient Supercomputing with NVIDIA's Base Command Platform

Gradient Dissent: Exploring Machine Learning, AI, Deep Learning, Computer Vision - A podcast by Lukas Biewald

Categories:

Stephan Fabel is Senior Director of Infrastructure Systems & Software at NVIDIA, where he works on Base Command, a software platform to coordinate access to NVIDIA's DGX SuperPOD infrastructure.Lukas and Stephan talk about why having a supercomputer is one thing but using it effectively is another, why a deeper understanding of hardware on the practitioner level is becoming more advantageous, and which areas of the ML tech stack NVIDIA is looking to expand into.The complete show notes (transcript and links) can be found here: http://wandb.me/gd-stephan-fabel---Timestamps: 0:00 Intro1:09 NVIDIA Base Command and DGX SuperPOD10:33 The challenges of multi-node processing at scale18:35 Why it's hard to use a supercomputer effectively25:14 The advantages of de-abstracting hardware29:09 Understanding Base Command's product-market fit36:59 Data center infrastructure as a value center42:13 Base Command's role in tech stacks47:16 Why crowdsourcing is underrated49:24 The challenges of scaling beyond a POC51:39 Outro---Subscribe and listen to our podcast today!👉 Apple Podcasts: http://wandb.me/apple-podcasts​​👉 Google Podcasts: http://wandb.me/google-podcasts​👉 Spotify: http://wandb.me/spotify​