audio2numpy

Load audio files into NumPy arrays.

MIT License

Description

audio2numpy loads an audio file and returns the audio data as a NumPy array together with its sampling rate. It supports .wav and .aiff via Python's standard library, and .mp3 via ffmpeg.

Installation

Using pip:

pip install audio2numpy

FFmpeg for decoding mp3

audio2numpy requires ffmpeg to decode mp3 files. Install ffmpeg separately if you need mp3 support; .wav and .aiff files load without it.

macOS

brew install ffmpeg

Linux

sudo apt-get install ffmpeg

Check here for other installation methods for different Linux distributions.

Windows

Colab

!apt install ffmpeg
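
Once ffmpeg is installed, it can help to confirm that Python can find it before loading an mp3. The sketch below is not part of audio2numpy: the helper names (ffmpeg_available, needs_ffmpeg, check_can_decode) are hypothetical, and it assumes the ffmpeg backend is located through the system PATH.

```python
import os
import shutil


def ffmpeg_available():
    """Return True if an `ffmpeg` executable is found on the PATH."""
    return shutil.which("ffmpeg") is not None


def needs_ffmpeg(path):
    """Return True for .mp3 files, the only format the README says needs ffmpeg."""
    return os.path.splitext(path)[1].lower() == ".mp3"


def check_can_decode(path):
    """Raise a readable error early if an mp3 is requested but ffmpeg is missing."""
    if needs_ffmpeg(path) and not ffmpeg_available():
        raise RuntimeError(
            "ffmpeg was not found on the PATH; install it to decode " + path
        )
```

Calling check_can_decode(fp) before opening a file gives a clear message instead of a backend error from deeper in the stack.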

Usage

from audio2numpy import open_audio
fp = "./examples/word.mp3"  # change this to the path of your audio file
signal, sampling_rate = open_audio(fp)
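
As a follow-up to the usage above, here is a minimal sketch of working with the two returned values. The helpers duration_seconds and describe_audio are hypothetical names, not part of audio2numpy, and the sketch assumes samples run along the first axis of the returned array.

```python
def duration_seconds(signal, sampling_rate):
    """Clip length in seconds, assuming samples run along the first axis."""
    if sampling_rate <= 0:
        raise ValueError("sampling_rate must be positive")
    return len(signal) / float(sampling_rate)


def describe_audio(path):
    """Load a file with audio2numpy and return (shape, sampling_rate, duration)."""
    # Imported here so duration_seconds stays usable without the package installed.
    from audio2numpy import open_audio

    signal, sampling_rate = open_audio(path)
    return signal.shape, sampling_rate, duration_seconds(signal, sampling_rate)
```

For example, describe_audio("./examples/word.mp3") would report the array shape, the sampling rate, and the clip length in seconds.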

Version History

0.1.3 (17.10.2019)

  • Add Colab ffmpeg installation guide.
  • Remove ffmpeg from requirements.txt. Users should install it separately.

0.1.2 (20.08.2019)

  • Add instructions for installing ffmpeg when loading an mp3 fails because the ffmpeg backend is not available.

0.1.1 (14.08.2019)

  • Initial release.