Artwork

محتوای ارائه شده توسط Hugo Bowne-Anderson. تمام محتوای پادکست شامل قسمت‌ها، گرافیک‌ها و توضیحات پادکست مستقیماً توسط Hugo Bowne-Anderson یا شریک پلتفرم پادکست آن‌ها آپلود و ارائه می‌شوند. اگر فکر می‌کنید شخصی بدون اجازه شما از اثر دارای حق نسخه‌برداری شما استفاده می‌کند، می‌توانید روندی که در اینجا شرح داده شده است را دنبال کنید.https://fa.player.fm/legal
Player FM - برنامه پادکست
با برنامه Player FM !

Episode 63: Why Gemini 3 Will Change How You Build AI Agents with Ravin Kumar (Google DeepMind)

1:00:12
 
اشتراک گذاری
 

Manage episode 520612127 series 3317544
محتوای ارائه شده توسط Hugo Bowne-Anderson. تمام محتوای پادکست شامل قسمت‌ها، گرافیک‌ها و توضیحات پادکست مستقیماً توسط Hugo Bowne-Anderson یا شریک پلتفرم پادکست آن‌ها آپلود و ارائه می‌شوند. اگر فکر می‌کنید شخصی بدون اجازه شما از اثر دارای حق نسخه‌برداری شما استفاده می‌کند، می‌توانید روندی که در اینجا شرح داده شده است را دنبال کنید.https://fa.player.fm/legal

Gemini 3 is a few days old and the massive leap in performance and model reasoning has big implications for builders: as models begin to self-heal, builders are literally tearing out the functionality they built just months ago... ripping out the defensive coding and reshipping their agent harnesses entirely.

Ravin Kumar (Google DeepMind) joins Hugo to breaks down exactly why the rapid evolution of models like Gemini 3 is changing how we build software. They detail the shift from simple tool calling to building reliable "Agent Harnesses", explore the architectural tradeoffs between deterministic workflows and high-agency systems, the nuance of preventing context rot in massive windows, and why proper evaluation infrastructure is the only way to manage the chaos of autonomous loops.

They talk through:

  • The implications of models that can "self-heal" and fix their own code
  • The two cultures of agents: LLM workflows with a few tools versus when you should unleash high-agency, autonomous systems.
  • Inside NotebookLM: moving from prototypes to viral production features like Audio Overviews
  • Why Needle in a Haystack benchmarks often fail to predict real-world performance
  • How to build agent harnesses that turn model capabilities into product velocity
  • The shift from measuring latency to managing time-to-compute for reasoning tasks

LINKS

Join the final cohort of our Building AI Applications course starting Jan 12, 2026: https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles?promoCode=vgrav

  continue reading

64 قسمت

Artwork
iconاشتراک گذاری
 
Manage episode 520612127 series 3317544
محتوای ارائه شده توسط Hugo Bowne-Anderson. تمام محتوای پادکست شامل قسمت‌ها، گرافیک‌ها و توضیحات پادکست مستقیماً توسط Hugo Bowne-Anderson یا شریک پلتفرم پادکست آن‌ها آپلود و ارائه می‌شوند. اگر فکر می‌کنید شخصی بدون اجازه شما از اثر دارای حق نسخه‌برداری شما استفاده می‌کند، می‌توانید روندی که در اینجا شرح داده شده است را دنبال کنید.https://fa.player.fm/legal

Gemini 3 is a few days old and the massive leap in performance and model reasoning has big implications for builders: as models begin to self-heal, builders are literally tearing out the functionality they built just months ago... ripping out the defensive coding and reshipping their agent harnesses entirely.

Ravin Kumar (Google DeepMind) joins Hugo to breaks down exactly why the rapid evolution of models like Gemini 3 is changing how we build software. They detail the shift from simple tool calling to building reliable "Agent Harnesses", explore the architectural tradeoffs between deterministic workflows and high-agency systems, the nuance of preventing context rot in massive windows, and why proper evaluation infrastructure is the only way to manage the chaos of autonomous loops.

They talk through:

  • The implications of models that can "self-heal" and fix their own code
  • The two cultures of agents: LLM workflows with a few tools versus when you should unleash high-agency, autonomous systems.
  • Inside NotebookLM: moving from prototypes to viral production features like Audio Overviews
  • Why Needle in a Haystack benchmarks often fail to predict real-world performance
  • How to build agent harnesses that turn model capabilities into product velocity
  • The shift from measuring latency to managing time-to-compute for reasoning tasks

LINKS

Join the final cohort of our Building AI Applications course starting Jan 12, 2026: https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles?promoCode=vgrav

  continue reading

64 قسمت

همه قسمت ها

×
 
Loading …

به Player FM خوش آمدید!

Player FM در سراسر وب را برای یافتن پادکست های با کیفیت اسکن می کند تا همین الان لذت ببرید. این بهترین برنامه ی پادکست است که در اندروید، آیفون و وب کار می کند. ثبت نام کنید تا اشتراک های شما در بین دستگاه های مختلف همگام سازی شود.

 

راهنمای مرجع سریع

در حین کاوش به این نمایش گوش دهید
پخش