- Singapore
Stars
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
Open Source framework for voice and multimodal conversational AI
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
A generative speech model for daily dialogue.
基于PlayWright和xvfb实现对js渲染的动态网页进行抓取,包含网页源码、截图、网站入口发现、网页交互过程、Web 指纹信息等等,支持优先级任务调度。
基于python的网页自动化工具。既能控制浏览器,也能收发数据包。可兼顾浏览器自动化的便利性和requests的高效率。功能强大,内置无数人性化设计和便捷功能。语法简洁而优雅,代码量少。
Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTORCH
[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation
curl-impersonate: A special build of curl that can impersonate Chrome & Firefox
Python binding for curl-impersonate via cffi. A http client that can impersonate browser tls/ja3/http2 fingerprints.
PRAW, an acronym for "Python Reddit API Wrapper", is a python package that allows for simple access to Reddit's API.
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…
Automate webpages at scale, scrape web data completely and accurately with high performance, distributed AI-RPA.
[ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".
Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust
SimXNS is a research project for information retrieval. This repo contains official implementations by MSRA NLC team.
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Set up your GitHub Actions workflow with ffmpeg
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
q - Run SQL directly on delimited files and multi-file sqlite databases