

حمایت شده
In this episode of the Virtually Speaking Podcast, we delve into the world of AI with Justin Murray, Product Marketing Engineer, and Frank Denneman, Chief Technologist for AI at Broadcom. We discuss retrieval augmented generation (RAG), a powerful approach that combines large language models with real-time, trusted data. Learn how RAG pipelines can be architected using Private AI Foundation with NVIDIA, including insights into key components like LLMs, NVIDIA Inference Microservices, and Vector DB. We also explore best practices for GPU sizing and when to use fractional or multiple GPUs for optimal performance. Join us for this fascinating conversation!
111 قسمت
In this episode of the Virtually Speaking Podcast, we delve into the world of AI with Justin Murray, Product Marketing Engineer, and Frank Denneman, Chief Technologist for AI at Broadcom. We discuss retrieval augmented generation (RAG), a powerful approach that combines large language models with real-time, trusted data. Learn how RAG pipelines can be architected using Private AI Foundation with NVIDIA, including insights into key components like LLMs, NVIDIA Inference Microservices, and Vector DB. We also explore best practices for GPU sizing and when to use fractional or multiple GPUs for optimal performance. Join us for this fascinating conversation!
111 قسمت
Player FM در سراسر وب را برای یافتن پادکست های با کیفیت اسکن می کند تا همین الان لذت ببرید. این بهترین برنامه ی پادکست است که در اندروید، آیفون و وب کار می کند. ثبت نام کنید تا اشتراک های شما در بین دستگاه های مختلف همگام سازی شود.