Artwork

محتوای ارائه شده توسط Craig S. Smith. تمام محتوای پادکست شامل قسمت‌ها، گرافیک‌ها و توضیحات پادکست مستقیماً توسط Craig S. Smith یا شریک پلتفرم پادکست آن‌ها آپلود و ارائه می‌شوند. اگر فکر می‌کنید شخصی بدون اجازه شما از اثر دارای حق نسخه‌برداری شما استفاده می‌کند، می‌توانید روندی که در اینجا شرح داده شده است را دنبال کنید.https://fa.player.fm/legal
Player FM - برنامه پادکست
با برنامه Player FM !

#200 Trevor Back: How Speechmatics is Shaping the Future of Conversational AI

56:24
 
اشتراک گذاری
 

Manage episode 431852981 series 2455219
محتوای ارائه شده توسط Craig S. Smith. تمام محتوای پادکست شامل قسمت‌ها، گرافیک‌ها و توضیحات پادکست مستقیماً توسط Craig S. Smith یا شریک پلتفرم پادکست آن‌ها آپلود و ارائه می‌شوند. اگر فکر می‌کنید شخصی بدون اجازه شما از اثر دارای حق نسخه‌برداری شما استفاده می‌کند، می‌توانید روندی که در اینجا شرح داده شده است را دنبال کنید.https://fa.player.fm/legal

In this episode of the Eye on AI podcast, we explore the forefront of voice-powered AI technology with Trevor Back, Chief Product Officer at Speechmatics. Discover how Speechmatics is pushing the boundaries of speech recognition and conversational AI with their latest innovation, Flow.

Trevor shares his journey from a background in computational astrophysics to becoming a key figure in AI at DeepMind and now Speechmatics. He delves into the development and potential of Flow, a groundbreaking tool combining automatic speech recognition (ASR), large language models (LLMs), and text-to-speech synthesis, aimed at creating seamless and responsive voice interactions.

We explore the wide-ranging applications of Speechmatics' technology across industries, including media, call centers, and education. Trevor discusses the challenges of achieving high accuracy in speech recognition, especially in diverse and noisy environments, and how Speechmatics addresses these challenges with their unique approach to training models.

Listen in as we uncover the intricacies of handling multiple languages, improving diarization, and the future goals of understanding complex audio cues like emotion and sarcasm. Learn about the company's vision for integrating voice technology into everyday products, making technology more accessible and user-friendly.

Don't miss this insightful conversation on the future of voice technology, AI in business, and its role in the evolving landscape of AI. Like, subscribe, and hit the notification bell for more expert discussions on cutting-edge advancements in AI.

This episode is sponsored by Shopify.

Shopify is a commerce platform that allows anyone to set up an online store and sell their products. Whether you’re selling online, on social media, or in person, Shopify has you covered on every base. With Shopify you can sell physical and digital products. You can sell services, memberships, ticketed events, rentals and even classes and lessons.

Sign up for a $1 per month trial period at http://shopify.com/eyeonai

Checkout Speechmatics, the most accurate AI speech technology - with AI transcription & real-time translation components.: https://www.speechmatics.com/

Stay Updated:

Craig Smith Twitter: https://twitter.com/craigss

Eye on A.I. Twitter: https://twitter.com/EyeOn_AI

(00:00) Introduction and Background

(01:49) Trevor Back's Journey into AI

(04:02) DeepMind and Early AI Applications

(07:30) Speechmatics' Mission and Focus

(12:06) Key Applications of Speechmatics Technology

(14:25) Achieving High Accuracy and Low Latency

(17:52) Language Coverage and Challenges

(21:27) Future of Voice Technology and AGI

(24:52) Integrating Large Language Models

(27:31) Handling Multiple Voices and Diarization

(29:32) Real-world Applications and Challenges

(35:20) Demonstration of Flow and Capabilities

(41:14) Endpoint Prediction and Interruption

(43:53) Real-time Interactions and Future Prospects

(45:34) Launch Event and Future Plans

(50:13) New Language Releases and Compliance

  continue reading

206 قسمت

Artwork
iconاشتراک گذاری
 
Manage episode 431852981 series 2455219
محتوای ارائه شده توسط Craig S. Smith. تمام محتوای پادکست شامل قسمت‌ها، گرافیک‌ها و توضیحات پادکست مستقیماً توسط Craig S. Smith یا شریک پلتفرم پادکست آن‌ها آپلود و ارائه می‌شوند. اگر فکر می‌کنید شخصی بدون اجازه شما از اثر دارای حق نسخه‌برداری شما استفاده می‌کند، می‌توانید روندی که در اینجا شرح داده شده است را دنبال کنید.https://fa.player.fm/legal

In this episode of the Eye on AI podcast, we explore the forefront of voice-powered AI technology with Trevor Back, Chief Product Officer at Speechmatics. Discover how Speechmatics is pushing the boundaries of speech recognition and conversational AI with their latest innovation, Flow.

Trevor shares his journey from a background in computational astrophysics to becoming a key figure in AI at DeepMind and now Speechmatics. He delves into the development and potential of Flow, a groundbreaking tool combining automatic speech recognition (ASR), large language models (LLMs), and text-to-speech synthesis, aimed at creating seamless and responsive voice interactions.

We explore the wide-ranging applications of Speechmatics' technology across industries, including media, call centers, and education. Trevor discusses the challenges of achieving high accuracy in speech recognition, especially in diverse and noisy environments, and how Speechmatics addresses these challenges with their unique approach to training models.

Listen in as we uncover the intricacies of handling multiple languages, improving diarization, and the future goals of understanding complex audio cues like emotion and sarcasm. Learn about the company's vision for integrating voice technology into everyday products, making technology more accessible and user-friendly.

Don't miss this insightful conversation on the future of voice technology, AI in business, and its role in the evolving landscape of AI. Like, subscribe, and hit the notification bell for more expert discussions on cutting-edge advancements in AI.

This episode is sponsored by Shopify.

Shopify is a commerce platform that allows anyone to set up an online store and sell their products. Whether you’re selling online, on social media, or in person, Shopify has you covered on every base. With Shopify you can sell physical and digital products. You can sell services, memberships, ticketed events, rentals and even classes and lessons.

Sign up for a $1 per month trial period at http://shopify.com/eyeonai

Checkout Speechmatics, the most accurate AI speech technology - with AI transcription & real-time translation components.: https://www.speechmatics.com/

Stay Updated:

Craig Smith Twitter: https://twitter.com/craigss

Eye on A.I. Twitter: https://twitter.com/EyeOn_AI

(00:00) Introduction and Background

(01:49) Trevor Back's Journey into AI

(04:02) DeepMind and Early AI Applications

(07:30) Speechmatics' Mission and Focus

(12:06) Key Applications of Speechmatics Technology

(14:25) Achieving High Accuracy and Low Latency

(17:52) Language Coverage and Challenges

(21:27) Future of Voice Technology and AGI

(24:52) Integrating Large Language Models

(27:31) Handling Multiple Voices and Diarization

(29:32) Real-world Applications and Challenges

(35:20) Demonstration of Flow and Capabilities

(41:14) Endpoint Prediction and Interruption

(43:53) Real-time Interactions and Future Prospects

(45:34) Launch Event and Future Plans

(50:13) New Language Releases and Compliance

  continue reading

206 قسمت

همه قسمت ها

×
 
Loading …

به Player FM خوش آمدید!

Player FM در سراسر وب را برای یافتن پادکست های با کیفیت اسکن می کند تا همین الان لذت ببرید. این بهترین برنامه ی پادکست است که در اندروید، آیفون و وب کار می کند. ثبت نام کنید تا اشتراک های شما در بین دستگاه های مختلف همگام سازی شود.

 

راهنمای مرجع سریع