Some PyTorch code for the Kaggle Speech Recognition Challenge
CTC end-to-end ASR for the TIMIT and 863 corpora.
Exploration of Self-Semi-Supervised learning for handling unlabeled data
Unofficial PyTorch implementation of Google AI's VoiceFilter system
Experiments with PyTorch SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Sy...
A simplified implementation of Faster R-CNN that replicates the performance of the original paper
My solution to Kaggle challenge "IEEE Camera Model Identification" [top 3%]
PyTorch implementation of Tacotron, an end-to-end generative text-to-speech (TTS) model.
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Amazon Forest Computer Vision: Satellite Image tagging code using PyTorch / Keras with lots of Py...
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
State Representation Learning (SRL) zoo with PyTorch - Part of S-RL Toolbox
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Modeling Temporal Dynamics and Spatial Configurations of Actions Using Two-Stream Recurrent Neural...
Kaggle | 1st place solution for Freesound Audio Tagging 2019