- San Francisco, CA
Stars
🌊 Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, with Estuary Flow. 🌊
Flexible development framework for building streaming data applications in SQL with Kafka, Flink, Postgres, GraphQL, and more.
A reproduce for the issue I'm facing with PyO3 and Sub Interpreter.
A realtime serving engine for Data-Intensive Generative AI Applications
An open-source, cloud-native, unified time series database for metrics, logs and events with SQL/PromQL supported. Available on GreptimeCloud.
Apache DataFusion Python Bindings
TypeScript notebook for rapid prototyping
embeddable cloud-native storage for events and time-series data
Public Fused UDFs. Build any scale workflows with the Fused Python SDK and Workbench webapp, and integrate them into your stack with the Fused Hosted API.
Embeddable stream processing engine based on Apache DataFusion
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…
Scalable datastore for metrics, events, and real-time analytics
Feathr – A scalable, unified data and AI engineering platform for enterprise
An idiomatic implementation of serde/avro (de)serialization
Textual apps and libraries
High-performance diffing of large datasets across databases
Apache Arrow DataFusion SQL Query Engine
Embeddable Aggregate Management System for Streams and Queries.
Infrastructure for generating fake data to a kafka cluster
Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...
NAND is a logic simulator suite made entirely from NAND gates