GitHub - fluxid/audio2png: Python script to generate waveform images from audio files.

fluxid / audio2png Public

Notifications You must be signed in to change notification settings
Fork 0
Star 11

Python script to generate waveform images from audio files.

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
rendered_examples		rendered_examples
.gitignore		.gitignore
README		README
audio2png.py		audio2png.py

Repository files navigation

Generates waveform images from audio file.
Requires pyffmpeg (beta version, available on GitHub), numpy and PIL.

Supports:
- Waveform view with left and right channels
- Waveform view with maximum and minimum level of all averaged channels
- Antialiasing (oversampling)
In rendered_examples directory you'll find rendered images using this script.

Will support:
- Spectral view

Won't support:
- Colors

Drawbacks:
- Needs to read whole audio file at least twice

TODO:
- Document the code (docstrings, comments)
- Implement spectral view
- Create nice command line utility

About rendered examples:
Track is "Unsquare Dance" by Dave Brubeck.
unsquare1* are stereo waveforms: above is left channel, below is right channel (absolute maximal levels).
unsquare2* are mono waveforms: above are maximum levels, and below minimum.
Files without sufix are normal waveforms with no filtering.
Files with "_aa" suffix are antialiased: It's almost the same as rendering waveform with width 20 times larger than intended, and downscaling it with bilinear filtering.
Files with "_avg" suffix are "averaged": it's roughly the same as antialiased but with applied contrast.
Files with "_hyb" suffix are "hybrid": a little averaged first, and then antialiased...