Systems | Development | Analytics | API | Testing

Kafka

Amazon Kinesis vs. Kafka: A Detailed Comparison of Data Stream Services

The key differences between Amazon Kinesis and Kafka are: Introducing data streamers! These services validate and route messages from one application to another, managing workload and message queues effectively. The result? Users process messages through a centralized processor and handle large data streams more efficiently. Amazon Kinesis and Apache Kafka are two data stream services.

Geospatial data processing with streaming SQL for Apache Kafka

An old airport customer of mine (whilst I worked for another company) used to pop someone next to a busy runway with a stopwatch strapped round their neck. The unfortunate person had to manually log the time aircrafts spent on the runway to measure the runway occupancy. All very archaic. Even in those days.

Lenses upgrade: powerful SQL, Connect multi-tenancy & message headers

We’re continuously releasing new capabilities to enable more people to be productive and compliant when working with Apache Kafka. Our engineers have been hard at work on a great Christmas present for our amazing community of Kafka users. Before unwrapping our latest and greatest release, here’s a sneak peak of what’s inside.

Exploding arrays in Kafka with lateral joins

In this article we are going to explore lateral joins. "What is a lateral join?" you may ask. It's a new kind of join that allows to extract and work with the single elements found inside an array, as if the array was a normal table. Lenses 4.1 comes with a lot of new features that make your life easier when working with arrays: we introduced 6 new functions to work with arrays, better support for array literals, and lateral joins.

How to configure clients to connect to Apache Kafka Clusters securely - Part 2: LDAP

In the previous post, we talked about Kerberos authentication and explained how to configure a Kafka client to authenticate using Kerberos credentials. In this post we will look into how to configure a Kafka client to authenticate using LDAP, instead of Kerberos. We will not cover the server-side configuration in this article but will add some references to it when required to make the examples clearer.

Kafka Is Not a Database

It's important to understand the uses and abuses of streaming infrastructure. Apache Kafka is a message broker that has rapidly grown in popularity in the last few years. Message brokers have been around for a long time; they're a type of datastore specialized for "buffering" messages between producer and consumer systems. Kafka has become popular because it's open-source and capable of scaling to very large numbers of messages.

Life of PII for Apache Kafka

Several years ago when I was working on a big data project, I saw something a data engineer shouldn’t see. Curious to understand the level of detail in a new credit score dataset we’d received in our data lake, I queried it. I was surprised at how easily and suddenly my screen was flooded with the mortgage history, overdraft limits and year-end financial statements of my colleagues, and I felt deeply uneasy.

Black Friday deal: $350 free Managed Kafka credits

Thanksgiving holiday is upon us. For many of our customers, this is one of the most important periods of the year, with more than 189.6 million U.S. shoppers buying up bargains from Thanksgiving day through Cyber Monday last year. For them and for us, it’s crucial that internal systems can handle high traffic volume without downtime or performance degradation.