Stars
world modeling challenge for humanoid robots
Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots
🦩 Visual Instruction Tuning with Polite Flamingo - training multi-modal LLMs to be both clever and polite! (AAAI-24 Oral)
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
Pyhton script for generating zoom in/out videos from a set of images
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
code for the paper Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation
Official codebase for "Any-point Trajectory Modeling for Policy Learning"
Dexterous teleoperation for the Stretch mobile manipulators from Hello Robot Inc., for CMU MCDS Capstone ARVR robotics project
Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.
Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset
A Versatile Teleoperation framework for Robotic Manipulation using Meta Quest3
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
A high-throughput and memory-efficient inference and serving engine for LLMs
Python library for loading and using triangular meshes.
A CLI for processing composite Wavefront OBJ files for use in MuJoCo.
Python package for importing and loading external assets into AI2THOR
🪐 Objaverse-XL is a Universe of 10M+ 3D Objects. Contains API Scripts for Downloading and Processing!
ROS 2 packages for the Stretch mobile manipulators from Hello Robot Inc.
ROS Wrapper for Intel(R) RealSense(TM) Cameras