ReconVAT: a semi-supervised automatic music transcription (AMT) model
Statistics for this project are still being loaded, please check back later.
AeCC: Autoencoders for Compressed Communication
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition syst...
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Easiest way of fine-tuning HuggingFace video classification models
The training code for the 4th place model at MDX 2021 leaderboard A.
Code for "V1T: Large-scale mouse V1 response prediction using a Vision Transformer"
Audio processing by using pytorch 1D convolution network
My research code to setup and train PyTorch deep nets.
Code for the paper "Jukebox: A Generative Model for Music"
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Sy...
Deciphering 3'UTR mediated gene regulation using interpretable deep representation learning
PyTorch Dataset for Speech and Music audio
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
GUI for a Vocal Remover that uses Deep Neural Networks.