A comprehensive benchmark of deepfake detection

Python 375 52 Updated Jun 29, 2024

Prompting Large Language Models with Audio for General-Purpose Speech Summarization

Python 2 1 Updated Jun 18, 2024

This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio".

Python 16 2 Updated May 16, 2024

Awesome speech/audio LLMs, representation learning, and codec models

495 24 Updated May 29, 2024

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

Python 330 59 Updated Jul 27, 2022

Implementation of the paper "Improved DeepFake Detection Using Whisper Features"

Python 74 4 Updated Apr 28, 2024

Metadata and versioning details for the Common Voice dataset

JavaScript 132 15 Updated Jul 1, 2024

RGB-T Fusion, RGB-T SOD, RGB-T Vehicle Detection, RGB-T Crowd Counting, RGB-T Pedestrian Detection, RGB-T Semantic Segmentation, RGB-T Tracking

28 1 Updated Apr 7, 2024

RGB-T Crowd Counting from Drone: A Benchmark and MMCCN Network

30 3 Updated Apr 27, 2024

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Python 1,013 401 Updated Jun 6, 2024

AI powered speech denoising and enhancement

Python 1,084 103 Updated Jun 21, 2024

Preprocess Audio for training

Python 177 31 Updated Jun 3, 2024

SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.

Python 411 59 Updated Apr 30, 2024

ASR 2Pass onnxruntime and websocket server, based on FunASR (https://github.com/alibaba-damo-academy/FunASR).

HTML 35 6 Updated May 9, 2024

AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data

SCSS 26 2 Updated Dec 31, 2023
Jupyter Notebook 6,942 504 Updated Jun 16, 2024

Adjusts the srt files generated by Whisper transcription so that a sentence is not split across two lines and sentences are not too short.

Python 22 1 Updated May 23, 2024

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,503 223 Updated Jun 25, 2024

Extensions to YAML syntax for better python interaction

Python 49 18 Updated Dec 13, 2023

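Provides a practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins, project analysis and self-explanation for Python, C++, and other projects, PDF/LaTeX paper translation and summarization, parallel querying of multiple LLM models, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…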

Python 61,333 7,649 Updated Jul 4, 2024

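Maintained code for the book 《Python预测之美:数据分析与算法实战》 ("The Beauty of Prediction with Python: Data Analysis and Algorithms in Practice")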

Jupyter Notebook 63 31 Updated Mar 25, 2023

Robot vision, mobile robots, VS-SLAM, ORB-SLAM2, deep learning object detection, yolov3, behavior detection, opencv, PCL, machine learning, autonomous driving

C++ 7,814 2,757 Updated Jul 29, 2021

:trollface: Undergraduate graduation project: an algorithm for extracting fine facial detail features from Deepfake fake-face videos

Jupyter Notebook 48 7 Updated May 12, 2022

Classifies videos into various classes using the Keras library with TensorFlow as the back end.

Python 272 116 Updated Oct 18, 2020

3D ResNets for Action Recognition (CVPR 2018)

Python 3,841 931 Updated Jan 20, 2021

POCs, EXPs, scripts, privilege escalation, small tools, and more for penetration testing---About penetration-testing python-script poc getshell csrf xss cms php-getshell domainmod-xss csrf-webshell cobub-razor cve rce sql sql-poc poc-exp bypass oa-getshell cve…

HTML 6,308 1,924 Updated Jun 27, 2024

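Interview experiences compiled by Datawhale members, covering machine learning, CV, NLP, recommendation systems, development, and more. Stars are welcome.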

HTML 2,389 417 Updated Apr 6, 2024

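《代码随想录》 LeetCode problem-solving guide: a recommended order for 200 classic problems, 600,000 characters of detailed illustrated explanations, video walkthroughs of the hard parts, more than 50 mind maps, and versions in C++, Java, Python, Go, JavaScript, and other languages, so learning algorithms is no longer bewildering! 🔥🔥 Take a look and you will wish you had found it sooner! 🚀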

Shell 48,792 11,053 Updated Jul 5, 2024

PyTorch implementations of the C3D, R3D, and R2Plus1D models for video activity recognition.

Python 1,163 250 Updated Dec 27, 2023