Stars
A comprehensive benchmark of deepfake detection
Prompting Large Language Models with Audio for General-Purpose Speech Summarization
This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio".
Awesome speech/audio LLMs, representation learning, and codec models
The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"
Implementation of the paper "Improved DeepFake Detection Using Whisper Features"
Metadata and versioning details for the Common Voice dataset
RGB-T Fusion, RGB-T SOD, RGB-T Vehicle Detection, RGB-T Crowd Counting, RGB-T Pedestrian Detection, RGB-T Semantic Segmentation, RGB-T Tracking
RGB-T Crowd Counting from Drone: A Benchmark and MMCCN Network
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
AI powered speech denoising and enhancement
SpeechIO Leaderboard: a large, robust, comprehensive benchmarking platform for Automatic Speech Recognition.
ASR 2Pass onnxruntime and websocket server, based on FunASR (https://github.com/alibaba-damo-academy/FunASR).
AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data
Adjusts the srt files generated by Whisper transcription so that a sentence is not split across two lines and no sentence is too short.
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Extensions to YAML syntax for better python interaction
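Provides a practical interaction interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design; supports custom shortcut buttons and function plugins; supports analysis and self-explanation of projects in Python, C++, and other languages; PDF/LaTeX paper translation and summarization; supports querying multiple LLMs in parallel; supports local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…
Code maintenance for the book 《Python预测之美:数据分析与算法实战》 (The Beauty of Prediction with Python: Data Analysis and Algorithms in Practice)
Robot vision, mobile robots, VS-SLAM, ORB-SLAM2, deep learning object detection, yolov3, behavior detection, opencv, PCL, machine learning, autonomous driving
Undergraduate graduation project: an algorithm for extracting facial detail features from Deepfake fake-face videos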
Classifies videos into various classes using the Keras library with TensorFlow as the back end.
3D ResNets for Action Recognition (CVPR 2018)
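POCs, EXPs, scripts, privilege-escalation tools, and small utilities related to penetration testing---About penetration-testing python-script poc getshell csrf xss cms php-getshell domainmod-xss csrf-webshell cobub-razor cve rce sql sql-poc poc-exp bypass oa-getshell cve…
Interview notes compiled by Datawhale members, covering machine learning, CV, NLP, recommendation, development, and more. Stars are welcome.
《代码随想录》 LeetCode problem-solving guide: a recommended order for 200 classic problems, about 600,000 characters of detailed illustrated explanations, video breakdowns of the difficult points, and more than 50 mind maps, with versions in C++, Java, Python, Go, JavaScript, and other languages, so that studying algorithms is no longer bewildering! 🔥🔥 Take a look and you will wish you had found it sooner! 🚀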
PyTorch implemented C3D, R3D, R2Plus1D models for video activity recognition.