Skip to content
View ewhitley's full-sized avatar

Block or report ewhitley

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
59 stars written in Python
Clear filter

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Python 15,977 1,570 Updated Oct 4, 2024

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

Python 12,426 1,674 Updated Oct 4, 2024

Turns Data and AI algorithms into production-ready web applications in no time.

Python 12,341 910 Updated Oct 4, 2024

Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…

Python 9,882 895 Updated Oct 4, 2024

A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques

Python 8,482 1,363 Updated Sep 6, 2024

🧙 Build, run, and manage data pipelines for integrating and transforming data.

Python 7,791 744 Updated Oct 1, 2024

A modular SQL linter and auto-formatter with support for multiple dialects and templated code.

Python 7,749 709 Updated Oct 4, 2024

the portable Python dataframe library

Python 5,147 590 Updated Oct 4, 2024

VizTracer is a low-overhead logging/debugging/profiling tool that can trace and visualize your python code execution.

Python 4,931 368 Updated Aug 29, 2024

GitPython is a python library used to interact with Git repositories.

Python 4,596 905 Updated Sep 14, 2024

Configuration Management for Python ⚙

Python 3,729 289 Updated Oct 1, 2024

A GUI for Pandas DataFrames

Python 3,182 232 Updated Dec 7, 2023

An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks

Python 3,036 425 Updated Jan 12, 2024

Operate and manipulate physical quantities in Python

Python 2,381 466 Updated Aug 19, 2024

Gin provides a lightweight configuration framework for Python

Python 2,050 120 Updated Aug 19, 2024

A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

Python 1,978 94 Updated Sep 21, 2024

A full spaCy pipeline and models for scientific/biomedical documents.

Python 1,688 226 Updated Sep 15, 2024

Efficient data transformation and modeling framework that is backwards compatible with dbt.

Python 1,663 150 Updated Oct 4, 2024

Graph the import dependancies in an Objective-C project

Python 1,346 144 Updated Jan 19, 2024

SQL Lineage Analysis Tool powered by Python

Python 1,298 235 Updated Sep 11, 2024

Document, sample code and other materials for SQLFlow

Python 900 168 Updated Oct 3, 2024

Uses tokenized query returned by python-sqlparse and generates query metadata

Python 799 125 Updated Sep 11, 2024

A Custom Jupyter Widget Library for Power BI

Python 464 149 Updated Sep 26, 2023

Medical Concept Annotation Tool

Python 436 102 Updated Sep 30, 2024

🏥 Medical Text Mining and Information Extraction with spaCy

Python 429 91 Updated Nov 1, 2022

(Legacy) Command Line Interface for Databricks

Python 384 234 Updated Oct 5, 2023

Handle, manipulate, and convert data with units in Python

Python 364 48 Updated Oct 4, 2024

Databricks SDK for Python (Beta)

Python 352 117 Updated Sep 26, 2024

Minimalist Python library for building static websites with Jinja

Python 316 50 Updated Mar 4, 2024
Next