update lists

2025-07-18 22:22:32 +02:00
parent 55bed3b4a1
commit 5916c5c074
3078 changed files with 331679 additions and 357255 deletions

@@ -55,10 +55,23 @@ applications.</li>
<li><a href="https://github.com/apache/storm">Apache Storm</a>
[Clojure/Java] - distributed real-time computation system. Storm is to
stream processing what Hadoop is to batch processing.</li>
<li><a href="https://github.com/arkflow-rs/arkflow">ArkFlow</a> [Rust] -
High-performance Rust stream processing engine, providing powerful data
stream processing capabilities, supporting multiple input/output sources
and processors.</li>
<li><a href="https://github.com/ArroyoSystems/arroyo">Arroyo</a> [Rust]
- a distributed stream processing engine. Supports SQL and Rust
pipelines. Scales up to millions of events per second. Supports stateful
operations like windows and joins, state checkpointing for
fault-tolerance and recovery of pipelines. Uses the Timely Dataflow
model.</li>
<li><a href="https://github.com/uber/AthenaX">AthenaX</a> [Java] -
Uber's stream analytics framework, used in production.</li>
<li><a href="https://github.com/bytewax/bytewax">Bytewax</a> [Python] -
data parallel, distributed, stateful stream processing framework.</li>
<li><a href="https://github.com/cocoindex-io/cocoindex">CocoIndex</a>
[Rust/Python] - ETL framework for building fresh indexes for AI, with
real-time incremental updates.</li>
<li><a href="https://github.com/robinhood/faust">Faust</a> [Python] -
stream processing library, porting the ideas from Kafka Streams to
Python</li>
@@ -128,7 +141,16 @@ Golang, and it can be run at all kinds of resource-constrained edge
devices.</li>
<li><a href="https://paragroup.github.io/WindFlow">WindFlow</a> [C++] -
A C++17 Data Stream Processing Parallel Library for Multicores and
GPUs.</li>
<li><a
href="https://github.com/risingwavelabs/risingwave">RisingWave</a>
[Rust] - A PostgreSQL-compatible streaming database designed for
building event-driven applications, real-time ETL pipelines, continuous
analytics services, and feature stores for AI applications. It extracts
fresh, consistent insights from real-time event streams, database CDC,
and time series data in under a second. It unifies streaming and batch
processing, letting users ingest, join, and analyze both live and
historical data at cloud scale.</li>
</ul>
<h3 id="streaming-library">Streaming Library</h3>
<ul>
@@ -196,6 +218,9 @@ API for Java.</li>
<li><a href="https://github.com/brexhq/substation">Substation</a> [Go] -
Substation is a cloud-native data pipeline and transformation toolkit
written in Go.</li>
<li><a href="https://github.com/swimos/swim-rust">SwimOS</a> [Rust] - A
framework for building real-time streaming data processing applications
written in Rust.</li>
<li><a href="https://github.com/timkpaine/tributary">Tributary</a>
[Python] - A Python library for constructing dataflow graphs. Supports
synchronous, reactive data streams built using Python generators that
@@ -278,6 +303,10 @@ flexible messaging model and an intuitive client API.</li>
[Java] - distributed messaging and streaming platform with low latency,
high performance and reliability, trillion-level capacity and flexible
scalability.</li>
<li><a href="https://github.com/AutoMQ/automq">AutoMQ</a> [Scala/Java] -
cloud-first alternative to Kafka that offloads durability to S3 and EBS.
100% Kafka compatible. 10x more cost-effective. Autoscales in seconds.
Single-digit ms latency.</li>
<li><a href="https://github.com/linkedin/Brooklin/">brooklin</a> [Java]
- a distributed system intended for streaming data between various
heterogeneous source and destination systems with high reliability and
@@ -480,3 +509,5 @@ alt="Creative Commons License" />
<p>Licensed under a <a
href="http://creativecommons.org/licenses/by-sa/4.0/">Creative Commons
Attribution-ShareAlike 4.0 International License</a></p>
<p><a href="https://github.com/manuzhang/awesome-streaming">streaming.md
GitHub</a></p>