Skip to content
View bmedi's full-sized avatar

Highlights

  • Pro
Block or Report

Block or report bmedi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

Data Exploration

35 repositories

Awesome list for data journalists and future data journalists

175 21 Updated Jan 3, 2022

The 4CAT Capture and Analysis Toolkit provides modular data capture & analysis for a variety of social media platforms.

Python 237 54 Updated Jul 10, 2024

Search and browse documents and data; find the people and companies you look for.

JavaScript 1,971 261 Updated Jul 10, 2024

Toxic Span Detection with spaCy

Python 4 2 Updated May 6, 2022

news-please - an integrated web crawler and information extractor for news that just works

Python 1,985 416 Updated Jul 9, 2024

Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?

HTML 503 86 Updated Sep 1, 2023

Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k sentences and a state-of-the-art classification model.

Python 135 20 Updated Dec 20, 2023

An open-source intelligence (OSINT) analysis tool leveraging GPT-powered embeddings and vector search engines for efficient data processing

Python 344 52 Updated Dec 11, 2023

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/

Python 10,964 1,199 Updated Jul 10, 2024

🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your d…

Python 14,586 1,709 Updated Jul 10, 2024

AMITT (Adversarial Misinformation and Influence Tactics and Techniques) framework for describing disinformation incidents. Includes TTPs and countermeasures.

Jupyter Notebook 168 33 Updated Jul 3, 2022

An unobtrusive and user-friendly desktop application for IPFS on Windows, Mac and Linux.

JavaScript 5,866 851 Updated Jun 13, 2024

Context-aware knowledge-graph based chatbot using GPT4 and Neo4j

Python 142 50 Updated Jan 7, 2024

Accumulated knowledge and experience in the field of Data Engineering

830 96 Updated Nov 22, 2022
Java 9 3 Updated Sep 12, 2023
JavaScript 832 84 Updated Jun 30, 2024

A community-supported supercharged version of paperless: scan, index and archive all your physical documents

Python 18,084 981 Updated Jul 10, 2024

✨ Awesome - A curated list of amazing Topic Models (implementations, libraries, and resources)

80 6 Updated Jul 11, 2022

An open source framework to crawl data sources and ingest into Vectara

Python 104 48 Updated Jul 10, 2024

LLM-powered Conversational AI experience using Vectara

TypeScript 223 71 Updated Jun 18, 2024

Graph Neural Network Library for PyTorch

Python 20,590 3,573 Updated Jul 10, 2024

Python package built to ease deep learning on graph, on top of existing DL frameworks.

Python 13,200 2,989 Updated Jul 10, 2024

Graph Neural Networks with Keras and Tensorflow 2.

Python 2,353 335 Updated Jan 21, 2024

Network Analysis in Python

Python 14,466 3,183 Updated Jul 10, 2024

PyTrial: A Comprehensive Platform for Artificial Intelligence for Drug Development

Python 73 13 Updated Jan 8, 2024

💫 Industrial-strength Natural Language Processing (NLP) in Python

Python 29,248 4,321 Updated Jul 9, 2024

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 7,544 595 Updated Jul 10, 2024

A free & open tool for transcribing audio interviews

JavaScript 901 185 Updated May 11, 2024

Carefully curated list of awesome digital preservation resources.

JavaScript 1 Updated Sep 28, 2023

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 4,800 921 Updated Jul 10, 2024