FreddyBanana

Follow

FreddyBanana

Follow

A student majoring in artificial intelligence.

2 followers · 4 following

Sun Yat-sen University
Guangzhou, Guangdong, China

Block or Report

Block or report FreddyBanana

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Lists (1)

Sort

🔮 Future ideas

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

kevin2li / PDF-Guru

PDF Guru Anki是一款以PDF为中心的多功能办公学习工具箱软件，包含四大板块功能：PDF实用工具箱、Anki制卡神器、Anki最强辅助、视频笔记神器，软件功能众多且强大，熟练运用可以大幅提高办公和学习效率，绝对是您不可多得的效率神器。人生苦短，我用Guru!

Vue 1,958 167 Updated Jul 10, 2024

Paranioar / Awesome_Matching_Pretraining_Transfering

The Paper List of Large Multi-Modality Model, Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.

373 48 Updated Jul 11, 2024

langchain-ai / langchain

🦜🔗 Build context-aware reasoning applications

Python 89,470 14,119 Updated Jul 20, 2024

QwenLM / Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 6,368 364 Updated Jul 18, 2024

NaiboWang / EasySpider

A visual no-code/code-free web crawler/spider易采集：一个可视化浏览器自动化测试/数据采集/爬虫软件，可以无代码图形化的设计和执行爬虫任务。别名：ServiceWrapper面向Web应用的智能化服务封装系统。

JavaScript 30,384 3,583 Updated Jul 13, 2024

CLUEbenchmark / CLUEDatasetSearch

搜索所有中文NLP数据集，附常用英文NLP数据集

Python 4,023 603 Updated Nov 21, 2022

QuivrHQ / quivr

Open-source RAG Framework for building GenAI Second Brains 🧠 Build productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic…

Python 34,265 3,355 Updated Jul 20, 2024

OpenBMB / MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Python 8,015 562 Updated Jul 19, 2024

RoyiRa / prompt-to-prompt-with-sdxl

An implementation of the Prompt-to-Prompt paper for the SDXL architecture

Jupyter Notebook 76 5 Updated Jun 9, 2024

UCSC-VLAA / HQ-Edit

HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing

Python 61 3 Updated Apr 18, 2024

ankush-me / SynthText

Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.

Python 1,994 620 Updated Aug 9, 2023

FoundationVision / VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 3,860 294 Updated Jul 16, 2024

sushizixin / CLIP4IDC

CLIP4IDC: CLIP for Image Difference Captioning (AACL 2022)

Python 27 1 Updated Nov 12, 2022

airsplay / VisualRelationships

Data of ACL 2019 Paper "Expressing Visual Relationships via Language".

Jupyter Notebook 61 7 Updated Sep 30, 2020

tsb0601 / MMVP

Python 255 7 Updated Jan 27, 2024

dvlab-research / MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,102 277 Updated May 4, 2024

luogen1996 / LaVIN

[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"

Python 493 37 Updated Jan 27, 2024

Heidelberg-NLP / VALSE

Data repository for the VALSE benchmark.

Python 33 3 Updated Feb 15, 2024

allenai / swig

Situation With Groundings (SWiG) dataset and Joint Situation Localizer (JSL)

Python 60 13 Updated Mar 19, 2021

harsh19 / spot-the-diff

EMNLP 2018. Learning to Describe Differences Between Pairs of Similar Images. Harsh Jhamtani, Taylor Berg-Kirkpatrick.

Jupyter Notebook 48 8 Updated Mar 29, 2020

Seth-Park / RobustChangeCaptioning

Code and dataset release for Park et al., Robust Change Captioning (ICCV 2019)

Python 42 10 Updated Dec 8, 2022

cf-pages / Telegraph-Image

Image Hosting solution, Flickr/imgur alternative, make it easy for users to share their images. Using Cloudflare Pages and Telegraph.

HTML 2,805 5,308 Updated Jul 17, 2024

TheShadow29 / awesome-grounding

awesome grounding: A curated list of research papers in visual grounding

988 97 Updated Apr 9, 2023

LisaAnne / LocalizingMoments

Github for my ICCV 2017 paper: "Localizing Moments in Video with Natural Language"

OpenEdge ABL 188 44 Updated Oct 31, 2020

jy0205 / LaVIT

LaVIT: Empower the Large Language Model to Understand and Generate Visual Content

Jupyter Notebook 452 22 Updated Jul 1, 2024

OpenGVLab / LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,620 366 Updated Mar 14, 2024

layerdiffusion / LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

1,905 22 Updated Jun 16, 2024

irenier / sysuthesis

中山大学本科生毕业论文 LaTeX 模板

TeX 22 2 Updated May 26, 2024

JiuTian-VL / JiuTian-LION

[CVPR 2024] LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge

Jupyter Notebook 112 2 Updated Jul 18, 2024

NVlabs / prismer

The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".

Python 1,292 75 Updated Jan 17, 2024