Taking out the Trash: Garbage Collection of Object Storage at Massive Scale

Richard Artoul
April 10, 2025
Distributed systems built on object storage all have one common problem: removing files that have been logically deleted either due to data expiry or compaction. We review the pros and cons of five ways to solve this problem.
Read More

A Trip Down Memory Lane: How We Resolved a Memory Leak When pprof Failed Us

Ella Chao
April 3, 2025
Pprof is an amazing tool for debugging memory leaks, but what about when it's not enough? Read about how we used gcore and viewcore to hunt a particularly nasty memory leak in a large distributed system.
Read More

Zero Ops Schema Migration: WarpStream Schema Linking

Brian Shih
March 25, 2025
Today, we're excited to announce WarpStream Schema Linking, a tool to continuously migrate any Confluent-compatible schema registry into a WarpStream BYOC Schema Registry. WarpStream now has a comprehensive Data Governance suite to handle schema needs.
Read More

WarpStream Diagnostics: Keep Your Data Stream Clean and Cost-Effective

Aratz Manterola Lasa
March 18, 2025
We’ve released Diagnostics, a new feature for WarpStream clusters! Diagnostics continuously analyzes your clusters to identify potential problems, cost inefficiencies, and ways to make things better. It looks at the health and cost of your cluster and gives detailed explanations on how to fix and improve them.
Read More

Kafka Transactions Explained (Twice!)

Manu Cupcic
January 13, 2025
In this blog post we'll explain how transactions work in Kafka by comparing and contrasting the implementations of transactions in two different Kafka implementations: the official Apache Kafka project, and WarpStream.
Read More

Getting Rid of (Kafka) Noisy Neighbors Without Having to Buy a Mansion

Aratz Manterola Lasa
December 3, 2024
In this post, we’ll look at what noisy neighbors are, the current ways to handle them (cluster quotas and mirroring clusters), and how WarpStream’s solution compares in terms of elasticity, operational simplicity, and cost efficiency.
Read More

Introducing WarpStream BYOC Schema Registry

Brian Shih
November 25, 2024
WarpStream BYOC reimplements the Kafka protocol with a stateless, zero-disk cloud-native architecture, replacing Kafka brokers with WarpStream Agents to simplify operations. But data streaming extends beyond Kafka clusters alone.
Read More

The Case for Shared Storage

Richard Artoul
November 19, 2024
In this post, I’ll start off with a brief overview of “shared nothing” vs. “shared storage” architectures in general. This discussion will be a bit abstract and high-level, but the goal is to share with you some of the guiding philosophy that ultimately led to WarpStream’s architecture.
Read More