![python logo](https://raw.githubusercontent.com/github/explore/80688e429a7d4ef2fca1e82350fe8e3517d3494d/topics/python/python.png)
-
eBay
- Shanghai
Block or Report
Block or report HuanjieGuo
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLanguage
Sort by: Recently starred
Starred repositories
10 Weeks, 20 Lessons, Data Science for All!
Curated list of resources about Apache Airflow
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
Protocol Buffers - Google's data interchange format
Redis is an in-memory database that persists on disk. The data model is key-value, but many different kind of values are supported: Strings, Lists, Sets, Sorted Sets, Hashes, Streams, HyperLogLogs,…
Supervisor process control system for Unix (supervisord)
Most popular Mocking framework for unit tests written in Java
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
The Java gRPC implementation. HTTP/2 based RPC
Alluxio, data orchestration for analytics and machine learning in the cloud
ClickHouse® is a real-time analytics DBMS
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
Apache Drill is a distributed MPP query layer for self describing data
Fault tolerant job scheduler for Mesos which handles dependencies and ISO8601 based schedules
A damn simple library for building production-ready RESTful web services.
Toy single-machine implementation of the Pregel graph-based framework
LinkedIn's previous generation Kafka to HDFS pipeline.
Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-like data
Code repository for O'Reilly Hadoop Application Architectures book