asdf2014
😃
To keep learning

Organizations

@apache @aliyun @theme-next @yuzhouwan-Official

Starred repositories

19 stars written in Scala

Source code for Twitter's Recommendation Algorithm

Scala 62,136 12,153 Updated Jul 10, 2024

CMAK is a tool for managing Apache Kafka clusters

Scala 11,813 2,504 Updated Aug 2, 2023

Snowflake is a network service for generating unique ID numbers at high scale with some simple guarantees.

Scala 7,660 1,137 Updated Jul 22, 2020

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,493 1,683 Updated Oct 2, 2024

Apache OpenWhisk is an open source serverless cloud platform

Scala 6,502 1,163 Updated Sep 24, 2024

Coolplay Spark: Spark source code analysis, Spark libraries, and more

Scala 3,465 1,410 Updated May 18, 2022

In-memory dimensional time series database.

Scala 3,441 303 Updated Sep 27, 2024

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,079 907 Updated Oct 2, 2024

Distributed Prometheus time series database

Scala 1,428 225 Updated Oct 2, 2024

GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.

Scala 1,423 431 Updated Oct 7, 2024

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,163 423 Updated Oct 3, 2024

Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster

Scala 1,039 201 Updated Nov 21, 2022

A collection of open source Apache 2.0 Kafka Connectors maintained by Lenses.io.

Scala 1,004 363 Updated Oct 7, 2024

A Time Series Library for Apache Spark

Scala 1,003 184 Updated Jul 3, 2020

Avro schema generation and serialization / deserialization for Scala

Scala 719 237 Updated Aug 13, 2024

Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover, seamlessly and without downtime.

Scala 516 230 Updated Jan 13, 2020

Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform (http://bit.ly/2oBJSpP), an integrated BI platform on Apache Spark.

Scala 284 92 Updated Aug 3, 2018

Druid indexing plugin for using Spark in batch jobs

Scala 101 56 Updated Oct 21, 2021

A library for querying Druid data sources with Apache Spark

Scala 23 14 Updated Oct 28, 2020