🏠
Working from home
Undergraduate student at the University of Chinese Academy of Sciences
University of Chinese Academy of Sciences
Stars
Awesome speech/audio LLMs, representation learning, and codec models
A playbook for systematically maximizing the performance of deep learning models.
A latent text-to-image diffusion model
LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks, and then be trained for the task at hand, while using a ve…
PyTorch implementation of time-domain filterbanks
Finding the genre of a song with Deep Learning