274: AI testing AI? A look at CriticGPT
In this episode, we speak with Rob Whiteley, CEO of Coder, about OpenAI's recent announcement of CriticGPT, a new AI model that critiques ChatGPT responses to help the humans training GPT models better evaluate outputs during reinforcement learning from human feedback (RLHF). According to OpenAI, CriticGPT isn't perfect, but it does help trainers catch more problems than they would on their own.
Key talking points include:
- The downsides of having AI testing the quality of other AI models
- Why it's important to be specific about what types of errors the model is allowed to look for
- Is this another example of rushing into AI?
320 episodes
All episodes
- 321: Bridging the gap between AI tools and Kubernetes with kagent (with solo.io's Idit Levine) (14:16)
- 317: Using HOPrS to verify that images and videos are real and unedited (with OpenOrigins' Manny Ahmed) (9:31)
- 309: How Chase is building the next generation of expert engineers with its E2 program (with Chase's Priti Naik) (14:04)
- 305: Why PostgreSQL became the database of choice for cloud native development (with Neon's Heikki Linnakangas) (11:06)
- 298: Building a prompt engineering playground for faster company-wide innovation (with LinkedIn's Lukasz Karolewski and Ajay Prakash) (19:01)
- 297: Why a clean codebase is key when using AI-assisted coding tools (with Tabnine's Eran Yahav) (12:31)
- 295: How middle code is bridging the gap between low-code and traditional programming (with OpsMill's Damien Garros) (10:10)
- 292: Software Engineering Intelligence gains traction as Value Stream Management loses it (with Digital.ai's Derek Holt) (16:11)