Lists (13)
Sort Oldest
Stars
Data Engineering with Databricks Cookbook, published by Packt
Caching Terraform providers within a GitHub Actions Workflow run to improve execution times.
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.
Example project for building scalable data pipelines with Kedro and Ibis.
Code for my "Efficient Data Processing in SQL" book.
dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)
A playground for running duckdb as a stateless query engine over a data lake
Nicely modeled data built on the Github Archive.
intentionally vuln web Application Security in django
LLM Zoomcamp - a free online course about building a Q&A system
A minimalist yet highly performant, lightweight, lightning fast, multisource, multimodal and local embedding solution, built in rust.
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
Interactive chat application leveraging OpenAI's GPT-4 for real-time conversation simulations. Built with Flask, this project showcases streaming LLM responses in a user-friendly web interface.
A full-stack Webui implementation of Large Language model, such as ChatGPT or LLaMA.
Turns Data and AI algorithms into production-ready web applications in no time.
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines
Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & C…