Stephan Fabel — Efficient Supercomputing with NVIDIA's Base Command Platform
Stephan Fabel is Senior Director of Infrastructure Systems & Software at NVIDIA, where he works on Base Command, a software platform to coordinate access to NVIDIA's DGX SuperPOD infrastructure.
Lukas and Stephan talk about why having a supercomputer is one thing but using it effectively is another, why a deeper understanding of hardware on the practitioner level is becoming more advantageous, and which areas of the ML tech stack NVIDIA is looking to expand into.
The complete show notes (transcript and links) can be found here: http://wandb.me/gd-stephan-fabel
---
Timestamps:
0:00 Intro
1:09 NVIDIA Base Command and DGX SuperPOD
10:33 The challenges of multi-node processing at scale
18:35 Why it's hard to use a supercomputer effectively
25:14 The advantages of de-abstracting hardware
29:09 Understanding Base Command's product-market fit
36:59 Data center infrastructure as a value center
42:13 Base Command's role in tech stacks
47:16 Why crowdsourcing is underrated
49:24 The challenges of scaling beyond a POC
51:39 Outro
---
Subscribe and listen to our podcast today!
👉 Apple Podcasts: http://wandb.me/apple-podcasts
👉 Google Podcasts: http://wandb.me/google-podcasts
👉 Spotify: http://wandb.me/spotify
126 episodes
All episodes
GitHub CEO Thomas Dohmke on Copilot and the Future of Software Development (1:09:44)
From Pharma to AGI Hype, and Developing AI in Finance: Martin Shkreli’s Journey (1:30:19)
AI, autonomy, and the future of naval warfare with Captain Jon Haase, United States Navy (1:01:32)
R1, OpenAI’s o3, and the ARC-AGI Benchmark: Insights from Mike Knoop (1:12:01)
Vercel’s CEO & Founder Guillermo Rauch on the impact of AI on Web Development and Front End Engineering (56:57)
From No-Code to AI-Powered Apps with Airtable’s Howie Liu (1:12:57)
Harnessing AI for legal practice with CoCounsel’s Jake Heller (1:04:16)
Accelerating drug discovery with AI: Insights from Isomorphic Labs (1:10:23)
Navigating the Vector Database Landscape with Pinecone's Edo Liberty (1:06:05)
Upgrading Your Health: Navigating AI's Future In Healthcare with John Halamka of Mayo Clinic Platform (1:04:24)
The Power of AI in Search with You.com's Richard Socher (1:08:26)
AI’s Future: Investment & Impact with Sarah Guo and Elad Gil (1:04:14)
Bridging AI and Science: The Impact of Machine Learning on Material Innovation with Joe Spisak of Meta (1:14:44)
Providing Greater Access to LLMs with Brandon Duderstadt, Co-Founder and CEO of Nomic AI (1:01:25)
Exploring PyTorch and Open-Source Communities with Soumith Chintala, VP/Fellow of Meta, Co-Creator of PyTorch (1:08:35)
Advanced AI Accelerators and Processors with Andrew Feldman of Cerebras Systems (1:00:10)
Neural Network Pruning and Training with Jonathan Frankle at MosaicML (1:02:00)
Sarah Catanzaro — Remembering the Lessons of the Last AI Renaissance (1:16:24)
Jeremy Howard — The Simple but Profound Insight Behind Diffusion (1:12:57)
D. Sculley — Technical Debt, Trade-offs, and Kaggle (1:00:26)
Emad Mostaque — Stable Diffusion, Stability AI, and What’s Next (1:10:29)
Jehan Wickramasuriya — AI in High-Stress Scenarios (1:00:02)
Drago Anguelov — Robustness, Safety, and Scalability at Waymo (1:09:01)
James Cham — Investing in the Intersection of Business and Technology (1:06:11)
Tristan Handy — The Work Behind the Data Work (1:00:48)
