A whirlwind tour of Kafka Summit SF 2018

2018-11-10Home

Kafka Summit San Francisco 2018 took place last month with +1200 attendees from +350 companies. All 60 talks with videos and slides side by side have been posted on Confluent's website and nicely organized in keynotes and four tracks (Pipeline, Streams, Internals and Business).

I wasn't there but I've glanced through all the slides and lingered over those I found more interesting. I'd like to share my impressions and provide a whirlwind tour of the conference. Here's an official wrap-up, by the way.

Is Kafka a Database

As you know, Kafka was born a message queue and has grown into a full-fledged streaming platform with Kafka Connect, Kafka Streams and KSQL. What else can Kafka be ? How about a database ! Martin Kleppmann argues that Kafka is a database and achieves ACID properties as in relational databases. This is mind-boggling since relation databases don't provide ACID at scale. Note that you can't set up a Kafka cluster and get ACID for free but need to carefully design your Kafka topics and streaming applications as demonstrated by the author. Still, I have some doubt over whether it can only ensure eventual consistency.

Kafka for IoT

Global Kafka

Streaming platform

When Kafka Streams first came out, I was wondering why I would need another Streaming platform given the place had already been crowded with Flink, Spark Streaming, Storm, Gearpump, etc. Today it strikes me that why I would need another Streaming platform and all the workloads to set up a cluster and maintain when I can do streaming with my Kafka already there. Futhurmore, Confluent adds KSQL, a SQL interface, to relieve you of writing cumbersome codes.

Kafka on Cloud

Internals

As a developer, I love nearly every deck of the Internals track.

Wrap-up

Kafka is definitely the backbone of companies' data architectures and serves as many as billions of messages per day. More and more streaming applications are built with Kafka Connect, Kafka Streams and KSQL. Meanwhile, quite a few companies are managing and mirroring Kafka at global scale. Finally, Kafka on Kubernetes is the new fashion.