Highlights
- Pro
Starred repositories
Refine high-quality datasets and visual AI models
[ECCV24] 3D Single-object Tracking in Point Clouds with High Temporal Variation
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation
Open-source simulator for autonomous driving research.
AGI资料汇总学习(主要包括LLM和AIGC),持续更新......
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
3D detection and tracking viewer (visualization) for kitti & waymo dataset
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
Official code base of the BEVDet series .
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
A project demonstrating Lidar related AI solutions, including three GPU accelerated Lidar/camera DL networks (PointPillars, CenterPoint, BEVFusion) and the related libs (cuPCL, 3D SparseConvolution…
An Efficient, Flexible, and General deep learning framework that retains minimal.
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
Official Repo For IROS 2023 Accepted Paper "Poly-MOT"
[NeurIPS 2024] PointMamba: A Simple State Space Model for Point Cloud Analysis
Automatic Labeling to Generate Training Data for Online LiDAR-based Moving Object Segmentation
Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)
Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research
XuyangBai / TransFusion
Forked from open-mmlab/mmdetection3d[PyTorch] Official implementation of CVPR2022 paper "TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers". https://arxiv.org/abs/2203.11496
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
[ICCV2023] MBPTrack: Improving 3D Point Cloud Tracking with Memory Networks and Box Priors
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
🧑🍳 This repository contains the source code for the website https://emojikitchen.dev and allows for quick and easy browsing of the over 100,000 supported emoji mashups as part of Google's Emoji Ki…
This repo contains the official implementation of ICCV 2023 paper "Keep It SimPool: Who Said Supervised Transformers Suffer from Attention Deficit?"