Flink vs Kafka Streams/ksqlDB: Comparing Stream Processing Tools

Streaming Audio: Apache Kafka® & Real-Time Data

Streaming Audio: Apache Kafka® & Real-Time Data

Player FM - Internet Radio Done Right

32 subscribers

הוסף לפני six שנים

תוכן מסופק על ידי Confluent, founded by the original creators of Apache Kafka® and Founded by the original creators of Apache Kafka®. כל תוכן הפודקאסטים כולל פרקים, גרפיקה ותיאורי פודקאסטים מועלים ומסופקים ישירות על ידי Confluent, founded by the original creators of Apache Kafka® and Founded by the original creators of Apache Kafka® או שותף פלטפורמת הפודקאסט שלהם. אם אתה מאמין שמישהו משתמש ביצירה שלך המוגנת בזכויות יוצרים ללא רשותך, אתה יכול לעקוב אחר התהליך המתואר כאן https://he.player.fm/legal.

<div class="span index">1</div> <span><a class="" data-remote="true" data-type="html" href="/series/everyday-ai-podcast-an-ai-and-chatgpt-podcast">Everyday AI Podcast – An AI and ChatGPT Podcast</a></span>

1
Everyday AI Podcast – An AI and ChatGPT Podcast

בטל רישום

לפני 14 hoursלפני 14h ago

בטל רישום

יומי

The Everyday AI podcast is a daily livestream, podcast and free newsletter where we help everyday people grow their careers with AI. The Everyday AI podcast is hosted by Jordan Wilson, a former journalist who's now the owner of a boutique digital strategy company with 20 years of martech experience. Our main focus is to help you keep up with AI trends to make your job easier. Get your work done faster. Increase your output. - Sign up for our free Prime Prompt Polish ChatGPT course: https://podPPP.com - Make sure to sign up for our daily newsletter at: https://youreverydayai.com - Email us: info@youreverydayai.com - Connect with Jordan on LinkedIn: https://www.linkedin.com/in/jordanwilson04/ In the Everyday AI podcast, we'll cover all things artificial intelligence, machine learning, and practical tips on how to use both in your daily life. We'll include a touch on a variety of topics, software and applications. We may be covering the latest AI news from Microsoft, Google, Facebook, Adobe and social channels like Snapchat, Tiktok, and Instagram. Or, we may be diving into software like ChatGPT, Midjourney, Bard, or Runway ML.

לפני 3 שנים 55:55

MP3•בית הפרקים

The best stream processing tools they consider are Flink along with the options from the Kafka ecosystem: Java-based Kafka Streams and its SQL-wrapped variant—ksqlDB. Flink and ksqlDB tend to be used by divergent types of teams, since they differ in terms of both design and philosophy.

Why Use Apache Flink?

The teams using Flink are often highly specialized, with deep expertise, and with an absolute focus on stream processing. They tend to be responsible for unusually large, industry-outlying amounts of both state and scale, and they usually require complex aggregations. Flink can excel in these use cases, which potentially makes the difficulty of its learning curve and implementation worthwhile.

Why use ksqlDB/Kafka Streams?

Conversely, teams employing ksqlDB/Kafka Streams require less expertise to get started and also less expertise and time to manage their solutions. Jeff notes that the skills of a developer may not even be needed in some cases—those of a data analyst may suffice. ksqlDB and Kafka Streams seamlessly integrate with Kafka itself, as well as with external systems through the use of Kafka Connect. In addition to being easy to adopt, ksqlDB is also deployed on production stream processing applications requiring large scale and state.

There are also other considerations beyond the strictly architectural. Local support availability, the administrative overhead of using a library versus a separate framework, and the availability of stream processing as a fully managed service all matter.

Choosing a stream processing tool is a fraught decision partially because switching between them isn't trivial: the frameworks are different, the APIs are different, and the interfaces are different. In addition to the high-level discussion, Jeff and Matthias also share lots of details you can use to understand the options, covering employment models, transactions, batching, and parallelism, as well as a few interesting tangential topics along the way such as the tyranny of state and the Turing completeness of SQL.

EPISODE LINKS

פרקים

1. Intro (00:00:00)

2. The world of stream processing (00:02:06)

3. Flink vs ksqlDB (00:06:26)

4. Example use case (00:18:34)

5. SQL was built for static data (00:20:03)

6. Concept of event time (00:25:51)

7. Session based window joins (00:29:30)

8. Processing streaming data with SQL (00:35:47)

9. Scaling Kafka Streams/ksqlDB (00:39:47)

10. Exactly-once semantics (00:45:39)

11. Choosing stream processing tools (00:48:15)

12. It's a wrap (00:53:52)

265 פרקים

#Tech #Tech News #News #Confluent #Event Stream Processing #Data #Event Driven Architecture #Open Source #Data In Motion #Kafka Cloud Native #Data Mesh #Data Pipeline #Serverless Kafka #Podcasting Education #Confluent, original creators of Apache Kafka® #original creators of Apache Kafka® #Apache Kafka® #Cloud IT #Real Time

Streaming Audio: Apache Kafka® & Real-Time Data