Variational auto-encoders for audio
MIT License
UPDATE (20.5.20): I decided to isolate the code for reproducing the paper "Learning Disentangled Representations of Timbre and Pitch for Musical Instrument Sounds Using Gaussian Mixture Variational Autoencoders" (up from here) from this repo.
For variational auto-encoders (VAEs) and audio/music lovers, based on PyTorch.
The repo is under construction.
The project is built to facilitate research on using VAEs to model audio, and its structure is based on PyTorch Template. It provides
Dataset classes in dataset/datasets.py
Run python dataset/audio_transform.py -c your_config_of_audio_transform.json to compute audio features (e.g., spectrograms)
DataLoader classes in data_loader/data_loaders.py
Run python train.py -c your_config_of_model_train.json
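
Both commands above take a JSON config file through the -c option. The following is a minimal sketch of that pattern using only the Python standard library. It is not the code in train.py or dataset/audio_transform.py: the helper names parse_args and load_config are hypothetical, and the config keys (sample_rate, n_fft, hop_length) are made up for illustration, so check the repo's own config files for the fields it actually expects.

```python
import argparse
import json
import os
import tempfile


def parse_args(argv=None):
    """Parse a -c option holding the path to a JSON config file."""
    parser = argparse.ArgumentParser()
    parser.add_argument("-c", dest="config", required=True,
                        help="path to a JSON config file")
    return parser.parse_args(argv)


def load_config(path):
    """Read a JSON config file into a plain dict."""
    with open(path, "r", encoding="utf-8") as f:
        return json.load(f)


# Hypothetical feature-extraction settings; the key names are assumptions
# and are not taken from the repo.
example = {"sample_rate": 22050, "n_fft": 1024, "hop_length": 256}

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "audio_transform.json")
    with open(path, "w", encoding="utf-8") as f:
        json.dump(example, f)

    # Equivalent to passing `-c <path>` on the command line.
    args = parse_args(["-c", path])
    config = load_config(args.config)

print(config)
```

Keeping the settings in a JSON file rather than in command-line flags means a single file records everything needed to repeat a feature-extraction or training run.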