Lists (1)
Sort Name ascending (A-Z)
Stars
Apache Flink shaded hadoop artifacts repository
Sampling CPU and HEAP profiler for Java featuring AsyncGetCallTrace + perf_events
A library that provides an embeddable, persistent key-value store for fast storage.
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Apache Spark - A unified analytics engine for large-scale data processing
LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as Delta Lake, Apache Hudi, and Apache Iceberg.
Know your build - so you can make it faster
"The mother of all demo apps" — Exemplary fullstack Medium.com clone powered by React, Angular, Node, Django, and many more
q - Run SQL directly on delimited files and multi-file sqlite databases
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Qt based cross-platform GUI proxy configuration manager (backend: sing-box)
Xray panel supporting multi-protocol multi-user expire day & traffic & ip limit (Vmess & Vless & Trojan & ShadowSocks & Wireguard)
Upserts, Deletes And Incremental Processing on Big Data.
Showcase your skills on your Github readme or resumé with ease ✨
Streamlit — A faster way to build and share data apps.
Implementation of Ag-Grid component for Streamlit
Telegram bot for botanim.to.digital
🦉 ML Experiments and Data Management with Git