Engineering - WarpStream - Stream More, Manage Less

Caleb Grillo

June 10, 2024

Secure by default: How WarpStream’s BYOC deployment model secures the most sensitive workloads

WarpStream's Zero Disk / Diskless Architecture enables a BYOC deployment model that is secure by default and does not require any external access to the customer's environment.

Engineering

Richard Artoul

May 31, 2024

Announcing Bento, the open source fork of the project formerly known as Benthos

Announcing Bento, the open source fork of the project formerly known as Benthos.

Engineering

Richard Artoul

May 23, 2024

Zero Disks is Better (for Kafka) - Diskless Kafka

Follow up to "Tiered Storage Won't Fix Kafka", this post covers all the different advantages that WarpStream's Zero Disk / Diskless Architecture provides over Apache Kafka.

Engineering

Richard Artoul

April 28, 2024

Tiered Storage Won’t Fix Kafka

Tiered storage is a hot topic in the world of data streaming systems, and for good reason. Cloud disks are (really) expensive, object storage is cheap, and in most cases, live consumers are just reading the most recently written data. Paying for expensive cloud disks to store historical data isn’t cost-effective, so historical data should be moved (tiered) to object storage. On paper, it makes all the sense in the world.

Engineering

Richard Artoul

April 20, 2024

Cloud Disks are (Really!) Expensive

Cloud disks are expensive. Really expensive. Most engineers intuitively understand this, but the magnitudes are worth considering.

Engineering

Richard Artoul

March 14, 2024

The Original Sin of Cloud Infrastructure

Many of today's most highly adopted open source “big data” infrastructure projects – like Cassandra, Kafka, Hadoop, etc. – follow a common story. A large company, startup or otherwise, faces a unique, high scale infrastructure challenge that's poorly supported by existing tools. They create an internal solution for their specific needs, and then later (kindly) open source it for the greater community to use. Now, even smaller startups can benefit from the work and expertise of these seasoned engineering teams. Great, right?

Engineering

Richard Artoul

March 12, 2024

Deterministic Simulation Testing for Our Entire SaaS

How we leverage Antithesis to deterministically simulate our entire SaaS platform and verify its correctness, all the way from signup to running entire Kafka workloads.

Engineering

Manu Cupcic

March 4, 2024

Kafka as a KV Store: deduplicating millions of keys with just 128 MiB of RAM

A huge part of building a drop-in replacement for Apache Kafka® was implementing support for compacted topics. The primary difference between a “regular” topic in Kafka and a “compacted” topic is that Kafka will asynchronously delete records from compacted topics that are not the latest record for a specific key within a given partition.