-
Take-D
- Missouri City, TX
- unsetopt.co
Stars
Universal Pasteboard Across Devices
A multimodal agent framework for solving complex tasks [EMNLP'2024]
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-spee…
Awesome LLMs on Device: A Comprehensive Survey
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
A Python library for converting images into FPGA-displayable pixel art.
Accelerate your Stable Diffusion inference with the library's universal C/C++ framework design, powered by ONNXRuntime & across platforms.
Next-Generation Interactive Intelligent Programming Assistant
Cocos simplifies game creation and distribution with Cocos Creator, a free, open-source, cross-platform game engine. Empowering millions of developers to create high-performance, engaging 2D/3D gam…
A lightweight general game development framework for Unity.
[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert
PyTorch Implementation of AudioLCM (ACM-MM'24): a efficient and high-quality text-to-audio generation with latent consistency model.
Bayesian optimisation & Reinforcement Learning library developped by Huawei Noah's Ark Lab
Fullstack engineer's checklist for your cybersecurity.
A curated list of papers, code and resources pertaining to image composition/compositing or object insertion, which aims to generate realistic composite image.
A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency a…
Unofficial Implementation of ReplaceAnything: https://aigcdesigngroup.github.io/replace-anything/
[NeurIPS 2024 D&B Track] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Official implementation of "Generating images with 3D annotations using diffusion models".
CSGHub is an open-source large model platform just like on-premise version of Hugging Face. You can easily manage models and datasets, deploy model applications and setup model finetune or inferenc…
Real-time and accurate open-vocabulary end-to-end object detection