Bringing Whisper and LLaMA to the masses (Interview)

The Changelog: Software Development, Open Source

Content provided by Changelog Media. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by Changelog Media or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here: https://fa.player.fm/legal

2+ years ago · 1:11:56


This week we’re talking with Georgi Gerganov about his work on Whisper.cpp and llama.cpp. Georgi first crossed our radar with whisper.cpp, his port of OpenAI’s Whisper model in C and C++. Whisper is a speech recognition model enabling audio transcription and translation. Something we’re paying close attention to here at Changelog, for obvious reasons. Between the invite and the show’s recording, he had a new hit project on his hands: llama.cpp. This is a port of Facebook’s LLaMA model in C and C++. Whisper.cpp made a splash, but llama.cpp is growing in GitHub stars faster than Stable Diffusion did, which was a rocket ship itself.


Changelog++ members get a bonus 12 minutes at the end of this episode and zero ads. Join today!

Sponsors:

Postman – Build APIs together — More than 20 million developers use Postman for building and using APIs. Postman simplifies each step of the API lifecycle and streamlines collaboration so you can create better APIs—faster.
Sentry – Session Replay! Rewind and replay every step of the user’s journey before and after they encountered an issue. Eliminate the guesswork and get to the root cause of an issue, faster. Use the code CHANGELOG and get the team plan free for three months.
Fastly – Our bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com
Typesense – Lightning fast, globally distributed Search-as-a-Service that runs in memory. You literally can’t get any faster!

Featuring:

Georgi Gerganov – Website, GitHub, Mastodon, X
Adam Stacoviak – Website, GitHub, LinkedIn, Mastodon, X
Jerod Santo – GitHub, LinkedIn, Mastodon, X

Show Notes:

Something missing or broken? PRs welcome!

Chapters

1. This week on The Changelog (00:00:00)

2. Sponsor: Postman (00:01:20)

https://www.postman.com/changelogpod

3. Start the show! (00:04:09)

4. Why is Whisper interesting to us? (00:12:03)

5. What's involved in making a port? (00:17:04)

6. Sponsor: Sentry (00:22:55)

https://sentry.io/for/session-replay/

7. One layer deeper (00:24:51)

8. Examples of Whisper.cpp (00:27:57)

9. Whisper.cpp and speaker detection (00:31:49)

10. What did you learn about Apple Silicon? (00:39:25)

11. Apple's secret M1 coprocessor (00:42:26)

12. GPU support on the roadmap (00:44:56)

13. Cultivating contributions (00:47:06)

14. Ludacris Llama Llama Red Pajama (00:48:49)

15. Why is Llama.cpp so interesting? (00:52:57)

16. Where are you going from here? (00:57:01)

17. How can this be extended? (00:58:22)

18. How did you learn this stuff? (01:01:22)

19. Wrapping up (01:08:48)

20. Outro (01:10:09)

954 episodes

#Open Source #Software Development #Changelog #Software #Development #Code #Programming #Change Log #Software Engineering #Changelog Media #Tech #Hackers