Tags:#software_engineering

Kafka is not an event-driven platform : Why Transport is not Meaning

Apache Kafka has become one of those technologies that is so successful it now suffers from conceptual inflation. It began as a brutally effective answer to a real systems problem: how do you move large volumes of data, durably and efficiently, between many producers and many consumers?

Over time, because Kafka sits at the center of so many modern architectures, people began to assign it a larger meaning. It stopped being described merely as infrastructure and started being treated as destiny.

That is the mistake.

Kafka is not an event-driven platform. It is not even, by itself, an event-driven architecture. Kafka is a sophisticated message bus, a distributed log, and a remarkably powerful substrate for streaming data. Those are major achievements. But they are not the same thing as a platform for event-driven business systems.

This distinction matters because language shapes architecture. When teams say “we have Kafka, therefore we are event-driven,” they often smuggle in an assumption that the hard part has already been solved. It has not.

In many organizations, Kafka has not enabled event-driven architecture so much as disguised the absence of one. The company has topics, partitions, offsets, and consumer groups. What it does not have is coherent event semantics, process awareness, or a shared model of how the business actually changes over time.

That is not a trivial gap. It is the whole game.

Why Kafka gets mistaken for the whole platform

The confusion is understandable. Kafka is central, visible, and technically impressive. It handles throughput at enormous scale. It persists streams. It enables replay. It decouples producers from consumers. Its ecosystem includes connectors, stream processing libraries, schema tooling, and operational dashboards. If you stand back and look at the architecture diagram, Kafka often appears to be the beating heart of the enterprise.

And because the messages flowing through Kafka are commonly called “events,” people begin to assume that the presence of an event transport implies the presence of an event-driven system.

But transporting events is not the same as understanding them.

A road network is not a city. A nervous system is not a mind. A bloodstream is not metabolism. In each case, the infrastructure is essential. In none of those cases is the infrastructure the whole organism.

Kafka occupies the same position in modern enterprise architecture. It is vital connective tissue. But the thing people actually want from an event-driven platform is not simply movement. They want meaning, coordination, responsiveness, and the ability to model business change as it unfolds.

Kafka does not do that on its own.

What Kafka actually is

At its core, Kafka is a distributed append-only log with publish-subscribe semantics and strong operational characteristics for large-scale streaming systems. It is exceptionally good at a few things:

moving records reliably between systems
retaining those records for replay
partitioning workload for scale
enabling asynchronous decoupling
supporting stream processing on top of ordered logs

That is already a profound contribution. In many organizations, Kafka replaces brittle point-to-point integrations, polling architectures, and overloaded transactional databases used as accidental integration hubs. It gives teams a durable event backbone where multiple systems can consume change streams independently.

But notice what this description contains and what it does not.

Kafka handles records. It does not define business truth.
Kafka handles ordering within partitions. It does not define process meaning.
Kafka enables subscriptions. It does not define choreography.
Kafka stores facts-in-motion. It does not determine which facts matter.
Kafka can replay history. It does not explain causality.

Those missing layers are exactly what make an event-driven platform more than a message bus.

The category error at the heart of many “event-driven” programs

The common category error is to confuse infrastructure with architecture.

Kafka is infrastructure. Event-driven architecture is a design discipline. An event-driven platform is an organisational and technical capability stack. These are not interchangeable.

A real event-driven platform requires that events are not just emitted, but modeled. They need to correspond to meaningful business state transitions. They need ownership. They need contracts. They need lineage. They need discoverability. They need policies around compatibility and evolution. They need to fit into bounded contexts.

Most importantly, they need to support how the enterprise coordinates work across domains without collapsing everything back into hidden orchestration or shared database coupling.

Without that, what companies often build is not event-driven architecture but topic-driven integration.

That distinction is devastating.

In topic-driven integration, teams create Kafka topics for whatever data happens to be available: customer-updates, order-events, status-changes, inventory-sync, notification-messages. The names sound eventful. The reality is often much less elegant. The payloads are overloaded. Some messages are commands pretending to be events. Others are database change feeds pretending to be domain signals. Still others are generic integration blobs whose semantics are understood only by the two teams currently fighting over them.

Everything is decoupled mechanically, but nothing is decoupled conceptually.

So the enterprise gets asynchronous confusion at scale.

An event-driven platform requires event semantics, not just event transport

The first thing Kafka lacks, by itself, is semantic discipline.

In a mature event-driven platform, an event is not merely “something published to a topic.” It is a representation of something that happened in the domain. That sounds obvious, but it is where most implementations quietly fail. The critical question is not whether a message exists. The critical question is whether the message corresponds to a meaningful, stable, well-governed business fact.

“OrderPlaced” is potentially a domain event.
“orders_topic_v7 payload with status=3” is not a domain event. It is an integration artifact.

The difference is everything.

Domain-driven design matters here because it forces the enterprise to define bounded contexts, ubiquitous language, and explicit ownership of concepts. Without those disciplines, event streams become a semantic junk drawer. Teams publish whatever they can, other teams subscribe defensively, and over time the architecture accumulates accidental dependencies disguised as flexibility.

Kafka cannot solve this because Kafka is intentionally agnostic about meaning. It should be.

A transport layer should not impose business semantics. But the price of that neutrality is that someone else must provide the semantic layer. If nobody does, the organization mistakes motion for architecture.

Choreography is not the same as “many consumers”

Another thing people over-ascribe to Kafka is event choreography.

In a genuine event-driven platform, choreography means that multiple services and domains coordinate behaviour through shared recognition of domain events, without a central orchestrator dictating every step. This is not just fan-out. It is not merely several consumers reading the same topic. It is a disciplined pattern of autonomous responses to meaningful state transitions.

That only works when events are well defined, when service boundaries are clear, and when downstream actions are designed to remain locally coherent even as they participate in larger business processes.

Kafka helps messages flow among participants. But it does not define the choreography itself. It does not model the dance.

Indeed, many Kafka deployments hide badly designed orchestration under the banner of choreography. A topic becomes an implicit workflow engine. Services react to fragments of state without clear understanding of process context. Retries create duplicate downstream actions. Dead-letter queues become cemeteries for business ambiguity. Eventually someone asks the uncomfortable question: where, exactly, is the actual process model?

There often isn’t one.

A real platform needs explicit thinking about cross-domain process coordination. Which events are authoritative? Which are merely informative? Which actions are compensating? Which are idempotent? Which transitions are observable? Which failures are local versus systemic? None of that emerges automatically from Kafka topics.

Complex event processing is a separate capability, not an automatic consequence

Kafka also gets mistaken for a complex event processing engine simply because it deals with streams.

But complex event processing is not just streaming. It is the detection of higher-order situations from patterns across many events, often across time windows, entities, and contexts. It involves correlation, aggregation, temporal logic, thresholds, sequence detection, and inference.

“PaymentReceived” is an event.
“Customer is likely churning because support complaints increased, order frequency declined, and payment delays crossed a threshold over 45 days” is the result of complex event processing.

That second capability is closer to operational intelligence than message distribution. It requires stateful reasoning. It often requires models, rules, temporal joins, or derived views that are carefully governed. Kafka can feed such systems. Kafka Streams, Flink, ksqlDB, and other stream processors can help implement some of this. But again, this proves the opposite of the common claim: the event-driven platform lies in the layered system above and around Kafka, not in Kafka alone.

The moment you need to infer significance from raw streams, you have left the realm of the message bus and entered the realm of platform intelligence.

State is the hidden center of the whole problem

One reason the “Kafka is the platform” myth persists is that engineers like motion more than state. Motion feels dynamic. State feels messy. But business systems are fundamentally about state transitions, not message throughput.

An order is created, allocated, packed, shipped, returned, refunded. A customer is onboarded, verified, approved, suspended, restored. A claim is filed, assessed, escalated, settled. These are not just events flying around. They are journeys through business state, involving obligations, policies, constraints, and time.

An event-driven platform must therefore answer uncomfortable questions that Kafka does not answer:

What is the current state of an entity?
Which event is the event of record?
How are conflicting updates reconciled?
How are materialized views built and governed?
How is business time distinguished from processing time?
How are partial failures surfaced to operators?

These are platform questions because they combine infrastructure, modeling, operations, and governance. They require more than a bus. They require stateful architecture.

This is where many enterprises discover that they do not have an event-driven platform at all. They have a large event exhaust system with no durable understanding of what the events mean in aggregate.

The missing layers: what a real event-driven platform actually contains

If Kafka is the backbone, what else is needed?

First, event modeling and domain design. Teams need clear domain events, bounded contexts, ownership boundaries, and a stable language for change.

Second, contracts and schema governance. Events must be versioned, discoverable, and governed as long-lived interfaces, not ad hoc payloads.

Third, stateful stream processing and materialization. Someone has to turn event flows into usable operational views, aggregate insights, and action-ready state.

Fourth, complex event processing and pattern detection where the business requires higher-order interpretation rather than raw signal transport.

Fifth, observability and lineage. An event-driven platform must reveal how a business outcome emerged across many services, not just whether a broker is healthy.

Sixth, process awareness. This is the most neglected layer. Enterprises do not merely emit events; they execute processes. A mature platform knows how events relate to business journeys, SLAs, exceptions, and interventions.

Seventh, governance and policy enforcement. Security, access control, retention, compliance, and ownership cannot be afterthoughts when events become enterprise interfaces.

Eighth, developer experience and abstraction. If the platform is real, developers should be able to publish, discover, test, validate, and consume events with clear self-service tooling. Otherwise Kafka remains a specialist-operated transport network, not a platform.

Notice how little of this is provided by Kafka itself. That is not criticism. It is clarity.

Kafka’s proper place is still extremely important

To say Kafka is not an event-driven platform is not to diminish Kafka. If anything, it is to respect it enough to describe it accurately.

Kafka is one of the most important data infrastructure technologies of the last decade because it solved the transport and durability layer exceptionally well. It gave enterprises a shared, scalable medium for data in motion. It made asynchronous architectures far more practical. It enabled replayable histories and real-time pipelines that were previously painful or brittle.

But a medium is not a model.

The industry’s bad habit is to take a powerful enabling technology and let it absorb concepts that belong to adjacent layers. Databases become “AI strategies.” APIs become “platforms.” Message buses become “event-driven operating models.” Each inflation creates confusion, and eventually disappointment.

Kafka should be understood as the event backbone, not the full event civilization.

The deeper implication

The real promise of event-driven architecture was never simply that systems would send messages to one another. We already knew how to do that. The deeper promise was that enterprises could become more responsive, composable, and aware of change as it happens. That requires systems that can recognize business facts, coordinate across domains, infer patterns, maintain state, and expose process health in near real time.

That is a much larger ambition than log-based messaging.

So when someone says, “We have Kafka, therefore we have an event-driven platform,” the right response is: no, you have a highly capable transport substrate. That is necessary, valuable, and often foundational. But it is only one layer.

The platform begins where the semantics begin.

The platform begins where events stop being raw messages and start becoming shared, governed, domain-meaningful facts.

The platform begins where streams are connected to state, state to process, and process to action.

Kafka can carry the pulse of the enterprise. But it cannot, by itself, tell you whether the enterprise has a nervous system, a memory, or a brain. Those must be designed. Those must be layered. Those must be owned.

And until they are, you do not have an event-driven platform.

You have a very good bus.