This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the …

Python 748 119 Updated Sep 9, 2024

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 1,066 93 Updated Aug 18, 2024

YuanGongND / psla

Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".

Python 137 16 Updated Jul 13, 2023

makobouzu / FSD50KLabelClassification

Python 1 Updated Oct 31, 2020

RetroCirce / HTS-Audio-Transformer

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

Python 342 62 Updated Aug 16, 2024

yeyupiaoling / Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…

C 805 127 Updated Jul 18, 2024

qiuqiangkong / audioset_tagging_cnn

Python 1,315 249 Updated Jul 25, 2024

lihanghang / CASR-DEMO

基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。

CSS 155 28 Updated Mar 31, 2024

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 5,876 752 Updated Aug 19, 2024

clusterzx / pr0lator

Video to Text Translation + VTT Subtitle Generation + WebService

CSS 8 Updated Feb 11, 2024

Audio-WestlakeU / ATST-SED

This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".

Jupyter Notebook 80 11 Updated Aug 17, 2024

qiuqiangkong / panns_inference

Python 190 29 Updated Mar 5, 2024

leiurayer / downkyi

哔哩下载姬downkyi，哔哩哔哩网站视频下载工具，支持批量下载，支持8K、HDR、杜比视界，提供工具箱（音视频提取、去水印等）。

C# 20,598 2,267 Updated Aug 14, 2024

yeyupiaoling / AudioClassification-Pytorch

The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a variety of preprocessing methods.

Python 380 79 Updated Sep 4, 2024

xiayongtao / aidatatang_1505zh

Shell 27 19 Updated Jul 9, 2019

openvpi / audio-slicer

Python script that slices audio with silence detection

Python 747 265 Updated Jun 8, 2024

zmeet-ai / asr_demo

语音识别API，分实时语音和长语音离线上传识别，支持中英文等多达100个国家的语言实时转写和同声传译

Java 54 5 Updated Jul 11, 2023

DIYgod / DPlayer

🍭 Wow, such a lovely HTML5 danmaku video player

JavaScript 15,384 2,403 Updated Mar 24, 2024

open-mmlab / mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,154 1,218 Updated Aug 14, 2024

mokohe / Deep-Learning-Framework

Forked from Karenina-na/Deep-Learning-Framework

深度学习脚手架

Jupyter Notebook 9 1 Updated Aug 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ning-Lorraine

Achievements

Achievements

Block or report Ning-Lorraine

Stars

qinL-cdy / auto_ai_subtitle

X-LANCE / MSDWILD

shichaog / WebRTC-audio-processing

wiseman / py-webrtcvad

lovemefan / fsmn-vad

yeyupiaoling / VoiceprintRecognition-Pytorch