Reward Models | Data Brew | Episode 40

Data Brew by Databricks

Player FM - Internet Radio Done Right

72 subscribers

Data Science

Artificial Intelligence

اضافه شده در five سال پیش

محتوای ارائه شده توسط Databricks. تمام محتوای پادکست شامل قسمت‌ها، گرافیک‌ها و توضیحات پادکست مستقیماً توسط Databricks یا شریک پلتفرم پادکست آن‌ها آپلود و ارائه می‌شوند. اگر فکر می‌کنید شخصی بدون اجازه شما از اثر دارای حق نسخه‌برداری شما استفاده می‌کند، می‌توانید روندی که در اینجا شرح داده شده است را دنبال کنید.https://fa.player.fm/legal

<div class="span index">1</div> <span><a class="" data-remote="true" data-type="html" href="/series/ask-grumpy">Ask Grumpy</a></span>

1
Ask Grumpy

لغو اشتراک

24 hours پیش24h ago

لغو اشتراک

هفتگی

Ask Grumpy, a podcast featuring Steve Bender, AKA Southern Living’s Grumpy Gardener is back for Season 3. For more than 30 years, Grumpy has been sharing advice on what to grow, when to plant, and how to manage just about anything in your garden. Tune in for short episodes every Wednesday and Saturday as Grumpy answers reader questions, solves seasonal conundrums, and provides need-to-know advice for gardeners with his very Grumpy sense of humor. Be sure to follow Ask Grumpy wherever you listen so you don't miss an episode.

حدود یک سال پیش 39:58

MP3•خانه قسمت

In this episode, Brandon Cui, Research Scientist at MosaicML and Databricks, dives into cutting-edge advancements in AI model optimization, focusing on Reward Models and Reinforcement Learning from Human Feedback (RLHF).
Highlights include:
- How synthetic data and RLHF enable fine-tuning models to generate preferred outcomes.
- Techniques like Policy Proximal Optimization (PPO) and Direct Preference
Optimization (DPO) for enhancing response quality.
- The role of reward models in improving coding, math, reasoning, and other NLP tasks.
Connect with Brandon Cui:
https://www.linkedin.com/in/bcui19/

43 قسمت

#Databricks #Data Analytics #Apache Spark #Delta Lake #Machine Learning #Data Engineering #Artificial Intelligence #Tech #Data Science #Science #Lifestyle #Podcasting Education