Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Recent Advances in Vision-Language Pre-training!
A treasure chest for visual classification and recognition powered by PaddlePaddle
Collection of AWESOME vision-language models for vision tasks
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high …
LAVIS - A One-stop Library for Language-Vision Intelligence
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
一些经典的CTR算法的复现; LR, FM, FFM, AFM, DeepFM, xDeepFM, PNN, DCN, DCNv2, DIFM, AutoInt, FiBiNet,AFN,ONN,DIN, DIEN ... (pytorch, tf2.0)
搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
Transformer: PyTorch Implementation of "Attention Is All You Need"
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
TensorFlow code and pre-trained models for BERT
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
An open source implementation of CLIP.
坚持分享 GitHub 上高质量、有趣实用的开源技术教程、开发者工具、编程网站、技术资讯。A list cool, interesting projects of GitHub.
Official PyTorch implementation of SynDiff described in the paper (https://arxiv.org/abs/2207.08208).
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
☁️ Build multimodal AI applications with cloud-native stack
Official implementation for "EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations" (NIPS 2022)
Official Repository of "Unpaired Image-to-Image Translation via Neural Schrödinger Bridge" (ICLR 2024)
Official repo for consistency models.
Contrastive Model Adaptation for Cross-Condition Robustness in Semantic Segmentation [ICCV 2023]
[CVPR' 22 ORAL] SIGMA: Semantic-complete Graph Matching for Domain Adaptative Object Detection
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io
推荐系统论文算法实现,包括序列推荐,多任务学习,元学习等。 Recommendation system papers implementations, including sequence recommendation, multi-task learning, meta-learning, etc.
IJCAI 2023 accepted paper for unsupervised nighttime semantic segmentation