Starred repositories

This repository contains the dbt-glue adapter

Python 97 68 Updated Oct 7, 2024

Apache Flink

Java 23,896 13,274 Updated Oct 7, 2024

PartiQL libraries and tools in Kotlin.

Kotlin 539 60 Updated Oct 7, 2024

Apache Polaris, the interoperable, open source catalog for Apache Iceberg

Python 1,058 110 Updated Oct 7, 2024

Distributed data engine for Python/SQL designed for the cloud, powered by Rust

Rust 2,175 146 Updated Oct 7, 2024

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…

Rust 3,836 212 Updated Oct 5, 2024

Open, Multi-modal Catalog for Data & AI

Java 2,281 361 Updated Oct 7, 2024

DuckDB is an analytical in-process SQL database management system

C++ 23,239 1,852 Updated Oct 7, 2024

A practical hands-on for people who want to start learning Apache Iceberg. You can get started on a single machine that can run containers.

Jupyter Notebook 38 6 Updated Oct 2, 2024

Fancy stream processing made operationally mundane

Go 8,100 826 Updated Oct 7, 2024

Awaitility is a small Java DSL for synchronizing asynchronous operations

Java 3,812 241 Updated Aug 7, 2024

Specification for storing geospatial vector data (point, line, polygon) in Parquet

Python 814 56 Updated Aug 25, 2024

Repository for the book "Crafting Interpreters"

HTML 8,880 1,038 Updated Aug 7, 2024

Unix-like OS in Rust inspired by xv6-riscv

Rust 1,398 56 Updated Jul 31, 2024

Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.

Java 859 143 Updated Oct 6, 2024

ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.

Java 17,047 3,270 Updated Oct 6, 2024

Apache Iceberg

Rust 622 140 Updated Oct 6, 2024

RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.

Python 309 68 Updated Jul 31, 2024

Brotli compression format

TypeScript 13,502 1,240 Updated Oct 7, 2024

Extendable version manager with support for Ruby, Node.js, Elixir, Erlang & more

Shell 21,775 774 Updated Oct 7, 2024

Your favorite language gets closer to bare metal.

Scala 4,478 363 Updated Oct 7, 2024

JSON library

Scala 1,485 327 Updated Oct 7, 2024

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Rust 29,608 1,891 Updated Oct 7, 2024

A C++ vectorized database acceleration library aimed at optimizing query engines and data processing systems.

C++ 3,458 1,128 Updated Oct 7, 2024

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,163 423 Updated Oct 3, 2024

Apache Iceberg Documentation Site

SCSS 42 98 Updated Feb 5, 2024

Serverless ETL and Analytics with AWS Glue, published by Packt

Python 45 32 Updated Oct 2, 2023

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (…

Python 3,906 697 Updated Oct 7, 2024

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 36,581 14,166 Updated Oct 7, 2024