  • Sun Yat-sen University
  • Guangzhou, Guangdong, China

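PDF Guru Anki is a PDF-centric, multi-purpose toolbox for office work and study. It has four main modules: a PDF utility toolbox, an Anki card-making tool, an Anki assistant, and a video note-taking tool. The software offers many powerful features, and using it well can greatly improve work and study efficiency. Life is short, I use Guru!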

Vue 1,958 167 Updated Jul 10, 2024

The Paper List of Large Multi-Modality Model, Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.

373 48 Updated Jul 11, 2024

🦜🔗 Build context-aware reasoning applications

Python 89,470 14,119 Updated Jul 20, 2024

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 6,368 364 Updated Jul 18, 2024

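A visual no-code/code-free web crawler/spider. 易采集: a visual tool for browser automation testing, data collection, and web crawling that lets you design and run crawler tasks graphically without writing code. Also known as ServiceWrapper, an intelligent service-wrapping system for web applications.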

JavaScript 30,384 3,583 Updated Jul 13, 2024

Search all Chinese NLP datasets, with commonly used English NLP datasets included.

Python 4,023 603 Updated Nov 21, 2022

Open-source RAG Framework for building GenAI Second Brains 🧠 Build productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic…

Python 34,265 3,355 Updated Jul 20, 2024

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Python 8,015 562 Updated Jul 19, 2024

An implementation of the Prompt-to-Prompt paper for the SDXL architecture

Jupyter Notebook 76 5 Updated Jun 9, 2024

HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing

Python 61 3 Updated Apr 18, 2024

Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.

Python 1,994 620 Updated Aug 9, 2023

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 3,860 294 Updated Jul 16, 2024

CLIP4IDC: CLIP for Image Difference Captioning (AACL 2022)

Python 27 1 Updated Nov 12, 2022

Data of ACL 2019 Paper "Expressing Visual Relationships via Language".

Jupyter Notebook 61 7 Updated Sep 30, 2020
Python 255 7 Updated Jan 27, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,102 277 Updated May 4, 2024

[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"

Python 493 37 Updated Jan 27, 2024

Data repository for the VALSE benchmark.

Python 33 3 Updated Feb 15, 2024

Situation With Groundings (SWiG) dataset and Joint Situation Localizer (JSL)

Python 60 13 Updated Mar 19, 2021

EMNLP 2018. Learning to Describe Differences Between Pairs of Similar Images. Harsh Jhamtani, Taylor Berg-Kirkpatrick.

Jupyter Notebook 48 8 Updated Mar 29, 2020

Code and dataset release for Park et al., Robust Change Captioning (ICCV 2019)

Python 42 10 Updated Dec 8, 2022

Image Hosting solution, Flickr/imgur alternative, make it easy for users to share their images. Using Cloudflare Pages and Telegraph.

HTML 2,805 5,308 Updated Jul 17, 2024

awesome grounding: A curated list of research papers in visual grounding

988 97 Updated Apr 9, 2023

Github for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"

OpenEdge ABL 188 44 Updated Oct 31, 2020

LaVIT: Empower the Large Language Model to Understand and Generate Visual Content

Jupyter Notebook 452 22 Updated Jul 1, 2024

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,620 366 Updated Mar 14, 2024

Transparent Image Layer Diffusion using Latent Transparency

1,905 22 Updated Jun 16, 2024

LaTeX template for Sun Yat-sen University undergraduate theses

TeX 22 2 Updated May 26, 2024

[CVPR 2024] LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge

Jupyter Notebook 112 2 Updated Jul 18, 2024

The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".

Python 1,292 75 Updated Jan 17, 2024