
Official implementation of "OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association" in PyTorch.

Python 1,146 247 Updated May 30, 2024

This project uses a variety of advanced voiceprint recognition models, such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++. More models may be supported in the future. At the …

Python 693 111 Updated Jul 9, 2024

LLM Finetuning with peft

Jupyter Notebook 1,824 498 Updated Jul 8, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal chat model approaching GPT-4V performance.

Python 4,014 305 Updated Jul 9, 2024

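Llama Chinese community: the Llama3 online demo and fine-tuned models are now available, the latest Llama3 learning resources are collected in real time, and all code has been updated for Llama3. Building the best Chinese Llama large model, fully open source and commercially usable.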

Python 12,866 1,173 Updated Jul 2, 2024

[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective

Python 150 8 Updated Nov 1, 2023

DataComp: In search of the next generation of multimodal datasets

Python 601 49 Updated Jan 2, 2024

👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Python 1,634 153 Updated Jun 27, 2024

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and…

Python 6,277 1,226 Updated Jul 9, 2024

A pure JavaScript QR code decoding library that accepts an image File object, an image URL, or a base64-encoded image.

TypeScript 73 8 Updated Jun 14, 2024

[CVPR2023] Blur Interpolation Transformer for Real-World Motion from Blur

Python 205 8 Updated Mar 28, 2024

Python implementation of the paper "Spatially-Varying Blur Detection Based on Multiscale Fused and Sorted Transform Coefficients of Gradient Magnitudes" (CVPR 2017)

Python 144 21 Updated Jan 27, 2023

Robust Python implementation for detecting blurry images using ROI estimation and DCT analysis.

Python 41 2 Updated Jan 19, 2022

I know what your pet is thinking - gemini

JavaScript 307 42 Updated Jun 17, 2024

Transformer-based OCR, with an extended application to official seal (stamp) recognition.

Python 93 17 Updated Jun 20, 2024

[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"

Python 150 4 Updated Jun 9, 2024

Code examples and resources for DBRX, a large language model developed by Databricks

Python 2,476 231 Updated May 1, 2024

A parser, editor and profiler tool for ONNX models.

Python 343 44 Updated Mar 18, 2024
Python 79 6 Updated Oct 28, 2023

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Python 219 7 Updated Jun 25, 2024

Large Model Fundamentals: the basics of large models in a single article

1,901 173 Updated Jul 3, 2024

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 21,692 2,217 Updated Jul 9, 2024

Building a quick conversation-based search demo with Lepton AI.

TypeScript 7,526 965 Updated Jun 22, 2024

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 10,450 758 Updated Jul 4, 2024

Next generation face swapper and enhancer

Python 16,519 2,416 Updated Jul 9, 2024

one-click face swap

Python 25,675 6,264 Updated Jul 5, 2024

Set of Python bindings to C++ libraries which provides full HW acceleration for video decoding, encoding and GPU-accelerated color space and pixel format conversions

C++ 1,291 231 Updated Jun 10, 2024

Compare NVIDIA Video Codec SDK's, PyAV's, and OpenCV's performance on video decoding.

C++ 9 2 Updated Dec 18, 2022