32 subscribers
با برنامه Player FM !
پادکست هایی که ارزش شنیدن دارند
حمایت شده


Common Apache Kafka Mistakes to Avoid
Manage episode 424666747 series 2510642
What are some of the common mistakes that you have seen with Apache Kafka® record production and consumption? Nikoleta Verbeck (Principal Solutions Architect at Professional Services, Confluent) has a role that specifically tasks her with performance tuning as well as troubleshooting Kafka installations of all kinds. Based on her field experience, she put together a comprehensive list of common issues with recommendations for building, maintaining, and improving Kafka systems that are applicable across use cases.
Kris and Nikoleta begin by discussing the fact that it is common for those migrating to Kafka from other message brokers to implement too many producers, rather than the one per service. Kafka is thread safe and one producer instance can talk to multiple topics, unlike with traditional message brokers, where you may tend to use a client per topic.
Monitoring is an unabashed good in any Kafka system. Nikoleta notes that it is better to monitor from the start of your installation as thoroughly as possible, even if you don't think you ultimately will require so much detail, because it will pay off in the long run. A major advantage of monitoring is that it lets you predict your potential resource growth in a more orderly fashion, as well as helps you to use your current resources more efficiently. Nikoleta mentions the many dashboards that have been built out by her team to accommodate leading monitoring platforms such as Prometheus, Grafana, New Relic, Datadog, and Splunk.
They also discuss a number of useful elements that are optional in Kafka so people tend to be unaware of them. Compression is the first of these, and Nikoleta absolutely recommends that you enable it. Another is producer callbacks, which you can use to catch exceptions. A third is setting a `ConsumerRebalanceListener`, which notifies you about rebalancing events, letting you prepare for any issues that may result from them.
Other topics covered in the episode are batching and the `linger.ms` Kafka producer setting, how to figure out your units of scale, and the metrics tool Trogdor.
EPISODE LINKS
- 5 Common Pitfalls when Using Apache Kafka
- Kafka Internals course
- linger.ms producer configs.
- Fault Injection—Trogdor
- From Apache Kafka to Performance in Confluent Cloud
- Kafka Compression
- Interface ConsumerRebalanceListener
- Watch the video version of this podcast
- Nikoleta Verbeck’s Twitter
- Kris Jenkins’ Twitter
- Streaming Audio Playlist
- Join the Confluent Community
- Learn more on Confluent Developer
- Use PODCAST100 to get $100 of free Confluent Cloud usage (details)
فصل ها
1. Intro (00:00:00)
2. What is a Solutions Architect (00:01:17)
3. It's a problem to use multiple producers in a single service (00:02:20)
4. The trade off between throughput and latency with batching (00:06:19)
5. What is linger.ms (00:08:05)
6. Enable compression (00:15:00)
7. Define Producer Callbacks (00:25:19)
8. One consumer per thread in a single service instance (00:33:16)
9. Trogdor (00:41:45)
10. Over Committing (00:43:37)
11. Provide a `ConsumerRebalanceListener` (00:55:48)
12. Undersized per Kafka Consumer instances (01:00:16)
13. It's a wrap (01:07:28)
265 قسمت
Manage episode 424666747 series 2510642
What are some of the common mistakes that you have seen with Apache Kafka® record production and consumption? Nikoleta Verbeck (Principal Solutions Architect at Professional Services, Confluent) has a role that specifically tasks her with performance tuning as well as troubleshooting Kafka installations of all kinds. Based on her field experience, she put together a comprehensive list of common issues with recommendations for building, maintaining, and improving Kafka systems that are applicable across use cases.
Kris and Nikoleta begin by discussing the fact that it is common for those migrating to Kafka from other message brokers to implement too many producers, rather than the one per service. Kafka is thread safe and one producer instance can talk to multiple topics, unlike with traditional message brokers, where you may tend to use a client per topic.
Monitoring is an unabashed good in any Kafka system. Nikoleta notes that it is better to monitor from the start of your installation as thoroughly as possible, even if you don't think you ultimately will require so much detail, because it will pay off in the long run. A major advantage of monitoring is that it lets you predict your potential resource growth in a more orderly fashion, as well as helps you to use your current resources more efficiently. Nikoleta mentions the many dashboards that have been built out by her team to accommodate leading monitoring platforms such as Prometheus, Grafana, New Relic, Datadog, and Splunk.
They also discuss a number of useful elements that are optional in Kafka so people tend to be unaware of them. Compression is the first of these, and Nikoleta absolutely recommends that you enable it. Another is producer callbacks, which you can use to catch exceptions. A third is setting a `ConsumerRebalanceListener`, which notifies you about rebalancing events, letting you prepare for any issues that may result from them.
Other topics covered in the episode are batching and the `linger.ms` Kafka producer setting, how to figure out your units of scale, and the metrics tool Trogdor.
EPISODE LINKS
- 5 Common Pitfalls when Using Apache Kafka
- Kafka Internals course
- linger.ms producer configs.
- Fault Injection—Trogdor
- From Apache Kafka to Performance in Confluent Cloud
- Kafka Compression
- Interface ConsumerRebalanceListener
- Watch the video version of this podcast
- Nikoleta Verbeck’s Twitter
- Kris Jenkins’ Twitter
- Streaming Audio Playlist
- Join the Confluent Community
- Learn more on Confluent Developer
- Use PODCAST100 to get $100 of free Confluent Cloud usage (details)
فصل ها
1. Intro (00:00:00)
2. What is a Solutions Architect (00:01:17)
3. It's a problem to use multiple producers in a single service (00:02:20)
4. The trade off between throughput and latency with batching (00:06:19)
5. What is linger.ms (00:08:05)
6. Enable compression (00:15:00)
7. Define Producer Callbacks (00:25:19)
8. One consumer per thread in a single service instance (00:33:16)
9. Trogdor (00:41:45)
10. Over Committing (00:43:37)
11. Provide a `ConsumerRebalanceListener` (00:55:48)
12. Undersized per Kafka Consumer instances (01:00:16)
13. It's a wrap (01:07:28)
265 قسمت
همه قسمت ها
×
1 Migrate Your Kafka Cluster with Minimal Downtime 1:01:30

1 Top 6 Worst Apache Kafka JIRA Bugs 1:10:58









1 Optimizing Apache JVMs for Apache Kafka 1:11:42



1 International Podcast Day - Apache Kafka Edition | Streaming Audio Special 1:02:22




1 Capacity Planning Your Apache Kafka Cluster 1:01:54




1 Streaming Analytics and Real-Time Signal Processing with Apache Kafka 1:06:33



1 Common Apache Kafka Mistakes to Avoid 1:09:43




1 The Evolution of Apache Kafka: From In-House Infrastructure to Managed Cloud Service ft. Jay Kreps 46:32



1 Expanding Apache Kafka Multi-Tenancy for Cloud-Native Systems ft. Anna Povzner and Anastasia Vela 31:01



1 From Batch to Real-Time: Tips for Streaming Data Pipelines with Apache Kafka ft. Danica Fine 29:50
















1 Scaling an Apache Kafka Based Architecture at Therapie Clinic 1:10:56
















1 How to Build a Strong Developer Community with Global Engagement ft. Robin Moffatt and Ale Murray 35:18
به Player FM خوش آمدید!
Player FM در سراسر وب را برای یافتن پادکست های با کیفیت اسکن می کند تا همین الان لذت ببرید. این بهترین برنامه ی پادکست است که در اندروید، آیفون و وب کار می کند. ثبت نام کنید تا اشتراک های شما در بین دستگاه های مختلف همگام سازی شود.