Support kafka 2.1, and misc improvements. #6

Fiveside · 2019-05-17T21:13:47Z

Greetings. This is a bit of a beefy pull request, so I'd like to outline the changes that we made and why, and hopefully we can discuss any of the issues in detail.

Upgraded the supported Kafka version to 2.1. This is the biggest change and the driver for most of the rest of the work. Kafka 2 no longer stores consumer group information in zookeeper. Instead it stores it in Kafka proper. With this, users no longer need to provide zookeeper info, as everything is queryable from Kafka itself. Right now its hardcoded to expect kafka 2.1, but that's really because that's the only version I tested it on (and coded against).
Swapped out the statsd library. The statsd format for tagged stats differs between influxdb and datadog. The tagging format used previously ended up sending garbage to datadog. The new statsd library properly serializes tagged stats for those 2 providers and the tagless fallback.
Removed the onbuild docker dependency. Onbuild images have been deprecated (see [Proposal] Deprecate onbuild tags docker-library/official-images#2076). Instead we're using a normal golang image and a multi-stage build to keep the image size and layer count down.
Exposed cli options as environment variables. This was specifically an issue because we launched this statsd daemon on kubernetes. In our kubernetes cluster, each worker has a datadog agent running locally. We wanted this kafka-statsd to talk to the local datadog agent, and you do this in kubernetes via an environment variable definition. This isn't possible when specifying parameters as command line options. After that fix it only made sense to expose the rest of the options as environment variables.
Recording absolute partition and consumer group offsets. In addition to the consumer lag, we're now also reporting partition and group offsets. It was useful for debugging purposes. In theory we don't actually need to record consumer group lag anymore since that can be calculated from the partition and group offsets, however that would be breaking backwards compatibility.
Kafka calls made concurrent. We need to make a lot of calls to kafka to pull stats information. We just made them concurrent to reduce the chance of falling behind the sampling frequency.

[KAFKA-6] Revamp to remove zookeeper usage

…variable overriding.

Switch back to kingpin for arg parsing

…group position, report latest partition offsets, only collect stats for topics that a consumer group is actually tracking

Report absolute consumer and topic offsets

Erich Healy and others added 15 commits April 1, 2019 10:04

Define dependencies as go modules

d38961f

Update dockerfile for deprecated onbuild

dc3af9e

Begin rewriting to collect info without zookeeper

33d1078

Update deps

04161aa

add topics and partitions loops

7a57328

daemonize the script

7c472a6

Merge pull request #1 from greenbits/onbuild-deprecated

74aa609

[KAFKA-6] Revamp to remove zookeeper usage

Switch back to kingpin for arg parsing because it allows environment …

cc363ed

…variable overriding.

Split up statsd config into address and port

5e3a092

Replace statsd library with one that can format tags

5c95d81

Fix issue with optional prefix and bad itoa

ebca579

Merge pull request #2 from greenbits/add-envar-overrides

66ede95

Switch back to kingpin for arg parsing

Thread the last bit of the info collection, report absolute consumer …

5790dbe

…group position, report latest partition offsets, only collect stats for topics that a consumer group is actually tracking

Fix issue where no consumer groups had information collected

42af27d

Merge pull request #4 from greenbits/limit-groups

132fff6

Report absolute consumer and topic offsets

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support kafka 2.1, and misc improvements. #6

Support kafka 2.1, and misc improvements. #6

Fiveside commented May 17, 2019 •

edited

Loading

Support kafka 2.1, and misc improvements. #6

Are you sure you want to change the base?

Support kafka 2.1, and misc improvements. #6

Conversation

Fiveside commented May 17, 2019 • edited Loading

Fiveside commented May 17, 2019 •

edited

Loading