HorizonFlow

by Crowell Digital Marketplace

Scalable Open-Source Data Pipeline Service

Data Processed: 2.4TB
Messages/sec: 15,432
Active Streams: 8
Query Latency: 45ms
System Uptime: 99.97%

Apache Kafka (running)

High-throughput distributed streaming platform

throughput: 15.4K msg/s
partitions: 24
replicas: 3

Apache Flink (running)

Stream processing framework for real-time analytics

jobs: 8
checkpoints: 99.9%
latency: 12ms

ClickHouse (running)

Columnar database for analytical queries

queries: 1.2K/min
storage: 450GB
compression: 8.2x

Apache Cassandra (running)

Distributed NoSQL database for operational data

nodes: 6
writes: 8.5K/s
reads: 12.3K/s

PostgreSQL (running)

Relational database for metadata and configuration

connections: 45
size: 12GB
queries: 450/min

Trino (running)

Distributed SQL query engine

workers: 12
queries: 89/min
data: 2.1TB scanned

Apache Airflow (running)

Workflow orchestration platform

dags: 23
tasks: 156 running
success: 98.5%

Apache Superset (running)

Business intelligence and data visualization

dashboards: 34
charts: 127
users: 89

Recent Activity

Data ingestion pipeline started (2 min ago)
ClickHouse cluster scaled up (5 min ago)
Flink job completed successfully (12 min ago)
New data source connected (18 min ago)
Backup completed (25 min ago)

System Architecture

Data Ingestion

Kafka producers collect data from multiple sources, including APIs, databases, IoT devices, and file systems.
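
As a rough illustration of the ingestion step, the sketch below wraps records from different sources in a common envelope and maps each record key to one of the 24 partitions listed on the Kafka card. It is plain Python, not Kafka client code: the `Record` type, the JSON envelope, and the `choose_partition` helper are hypothetical, and CRC32 stands in for the murmur2 hash that Kafka's default partitioner applies to keyed records.

```python
import json
import zlib
from dataclasses import dataclass

NUM_PARTITIONS = 24  # partition count taken from the Kafka card


@dataclass(frozen=True)
class Record:
    """Hypothetical envelope for data arriving from any source."""
    source: str    # e.g. "api", "database", "iot", "file"
    key: str       # assumed routing key, such as a device or table id
    payload: dict


def encode(record: Record) -> bytes:
    """Serialize a record as UTF-8 JSON (an assumed wire format)."""
    body = {"source": record.source, "key": record.key, "payload": record.payload}
    return json.dumps(body, sort_keys=True).encode("utf-8")


def choose_partition(key: str, num_partitions: int = NUM_PARTITIONS) -> int:
    """Map a key to a partition so that equal keys always land together.

    CRC32 is only a stand-in; Kafka's default partitioner uses murmur2.
    """
    return zlib.crc32(key.encode("utf-8")) % num_partitions
```

Because the partition depends only on the key, all records for one device or table stay in order within a single partition, which is the property keyed producers rely on.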

Stream Processing

Flink cleans, enriches, and aggregates real-time streams before the results are written to the target systems.
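
The three stages named above (cleaning, enrichment, aggregation) can be sketched in plain Python. This is not Flink API code: a bounded list stands in for an unbounded stream, and the event fields, the `DEVICE_REGION` lookup table, and the 60-second tumbling window are all assumptions made for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, Tuple

WINDOW_SECONDS = 60  # assumed tumbling-window size

# Hypothetical reference data used for enrichment.
DEVICE_REGION = {"sensor-1": "eu", "sensor-2": "us"}


def clean(events: Iterable[dict]) -> Iterator[dict]:
    """Drop events that lack required fields or carry a non-numeric value."""
    for event in events:
        has_fields = {"device", "ts", "value"} <= event.keys()
        if has_fields and isinstance(event["value"], (int, float)):
            yield event


def enrich(events: Iterable[dict]) -> Iterator[dict]:
    """Attach a region to each event, defaulting to "unknown"."""
    for event in events:
        yield {**event, "region": DEVICE_REGION.get(event["device"], "unknown")}


def aggregate(events: Iterable[dict]) -> Dict[Tuple[str, int], float]:
    """Sum values per (region, window start) using tumbling windows."""
    totals: Dict[Tuple[str, int], float] = defaultdict(float)
    for event in events:
        window_start = event["ts"] - event["ts"] % WINDOW_SECONDS
        totals[(event["region"], window_start)] += event["value"]
    return dict(totals)


def run_pipeline(events: Iterable[dict]) -> Dict[Tuple[str, int], float]:
    """Chain the stages: clean, then enrich, then aggregate."""
    return aggregate(enrich(clean(events)))
```

Each stage is a generator, so events flow through one at a time instead of being materialized between steps, which loosely mirrors how operators are chained in a streaming job.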

Analytics & Serving

Trino runs federated queries across all data stores, while Superset provides interactive dashboards.
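
A federated query in Trino addresses each table by its fully qualified `catalog.schema.table` name, so a single statement can join data that lives in different stores. The sketch below only builds such a statement as a string. The catalog names (`clickhouse`, `postgresql`), schemas, tables, and columns are hypothetical, since catalog names depend on how the cluster is configured.

```python
def qualified(catalog: str, schema: str, table: str) -> str:
    """Return a quoted, fully qualified name: "catalog"."schema"."table"."""
    return ".".join(f'"{part}"' for part in (catalog, schema, table))


def build_federated_query(limit: int = 10) -> str:
    """Join a ClickHouse events table with PostgreSQL metadata.

    All table and column names here are illustrative assumptions.
    """
    events = qualified("clickhouse", "analytics", "events")
    sources = qualified("postgresql", "public", "data_sources")
    return (
        "SELECT s.name, count(*) AS event_count\n"
        f"FROM {events} AS e\n"
        f"JOIN {sources} AS s ON e.source_id = s.id\n"
        "GROUP BY s.name\n"
        "ORDER BY event_count DESC\n"
        f"LIMIT {int(limit)}"
    )
```

The resulting string could be submitted through any Trino client or used as the basis of a Superset dataset. The join runs inside Trino, so neither store needs to know about the other.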