Code for searching the www.xeno-canto.org bird sound database and training a machine learning model to classify birds by their sounds.
Convert text and audio to facial expressions
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Official code of CVPR 2021's PLOP: Learning without Forgetting for Continual Semantic Segmentation
Audio super resolution using neural networks
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
A TensorFlow implementation of DeepMind's WaveNet paper
PyTorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TT...
Audio Classification using Image Classification
This project is built on the PaddleDetection object detection development kit, uses the 1.3M ultra-lightweight PPYOLO tiny model, and is deployed on Windows.
Multilingual Voice Understanding Model
Easily turn large sets of audio URLs into an audio dataset.
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unoffic...
Simple GUI application to help record audio dictated from given text prompts, for use with traini...