Audio-based gender recognition

This was a college project.

We extract features from every recording and build a dataset with the schema audio-based features --> label (M or F). No deep learning is used.
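A minimal sketch of the features --> label schema. The README does not list the actual features, so the two used here (a pitch estimate and a zero-crossing rate) are assumptions chosen for illustration, as are the function names `estimate_pitch`, `zero_crossing_rate` and `make_row`. The input is a synthetic tone rather than a LibriSpeech recording, and its label is a placeholder.

```python
import math


def estimate_pitch(frame, sample_rate, f_min=70.0, f_max=400.0):
    """Estimate the fundamental frequency (Hz) of one frame.

    Picks the lag that minimises the squared difference between the
    frame and a shifted copy of itself, searching only lags that
    correspond to pitches between f_min and f_max.
    """
    min_lag = int(sample_rate / f_max)
    max_lag = int(sample_rate / f_min)
    window = len(frame) - max_lag
    if window <= 0:
        raise ValueError("frame too short for the requested pitch range")

    def difference(lag):
        return sum((frame[n] - frame[n + lag]) ** 2 for n in range(window))

    best_lag = min(range(min_lag, max_lag + 1), key=difference)
    return sample_rate / best_lag


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
    return crossings / (len(frame) - 1)


def make_row(frame, sample_rate, label):
    """Build one dataset row: hypothetical features plus the M/F label."""
    return {
        "pitch_hz": estimate_pitch(frame, sample_rate),
        "zcr": zero_crossing_rate(frame),
        "label": label,
    }


# Synthetic 125 Hz tone standing in for a recording (16 kHz sampling).
sample_rate = 16000
tone = [math.sin(2 * math.pi * 125.0 * n / sample_rate) for n in range(1024)]
row = make_row(tone, sample_rate, "M")  # placeholder label
print(row)
```

Rows of this shape can be collected into a table and passed to any classical classifier. Real speech would need framing and voiced-segment selection before a pitch estimate like this is meaningful.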

The term "gender recognition" is unfortunately inaccurate. Every speaker in the LibriSpeech dataset has an M or F label assigned: an M voice is considered to have male characteristics and an F voice female characteristics. We follow this notation and focus only on the technical aspects of classification. Voice timbre can be very misleading, and its subjective perception is outside the scope of this project.

Installation

Install the dependencies:

pip install -r requirements.txt

Running

Download the LibriSpeech data and unpack it directly into the data/raw/ folder, so that SPEAKERS.txt, BOOKS.txt and the other files end up in data/raw/LibriSpeech/.
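To check that the data landed in the right place, the M/F labels can be read from SPEAKERS.txt. The sketch below assumes the usual LibriSpeech layout for that file: comment lines starting with `;` and pipe-separated fields in the order ID, sex, subset, minutes, name. The helper names `parse_speakers` and `load_speakers` are hypothetical and not part of this repository, and the sample lines are made up to mimic the layout.

```python
def parse_speakers(lines):
    """Map speaker ID -> (sex, subset) from SPEAKERS.txt-style lines."""
    speakers = {}
    for line in lines:
        line = line.strip()
        if not line or line.startswith(";"):
            continue  # skip blank lines and comments
        # Split at most 4 times so a "|" inside the name field is harmless.
        fields = [field.strip() for field in line.split("|", 4)]
        if len(fields) < 3 or not fields[0].isdigit():
            continue  # skip anything that does not look like a speaker row
        speakers[int(fields[0])] = (fields[1], fields[2])
    return speakers


def load_speakers(path="data/raw/LibriSpeech/SPEAKERS.txt"):
    """Read the speaker table from the location described above."""
    with open(path, encoding="utf-8") as handle:
        return parse_speakers(handle)


# Illustrative lines only, not copied from the real file.
sample = [
    "; ID | SEX | SUBSET | MINUTES | NAME",
    "1    | F | train-clean-100 | 25.00 | Example Reader A",
    "2    | M | dev-clean       | 8.00  | Example | Reader B",
]
print(parse_speakers(sample))
```

Once the dataset is unpacked, `load_speakers()` should return one entry per speaker. An empty result or a `FileNotFoundError` suggests the archive was extracted to a different folder.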

The project uses only the clean subsets (train-clean-100, train-clean-360, dev-clean and test-clean).

Run:

python extract_features

Results

Trained on signals from the train-clean-100 set and tested on signals from dev-clean.

Dev F1 score: 0.8861251457442676
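For reference, the F1 score is the harmonic mean of precision and recall for one class. The README does not say which label was treated as positive or whether the score was averaged over both classes, so the choice of "M" below is an assumption, and the labels are toy values, not project output.

```python
def f1_score(y_true, y_pred, positive):
    """F1 for one class: 2*TP / (2*TP + FP + FN)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    denominator = 2 * tp + fp + fn
    return 2 * tp / denominator if denominator else 0.0


# Toy labels: one M recording is misclassified as F.
y_true = ["M", "M", "F", "F", "M"]
y_pred = ["M", "F", "F", "F", "M"]
score = f1_score(y_true, y_pred, positive="M")
print(score)
```

Here there are 2 true positives, 0 false positives and 1 false negative, so the score is 4 / 5.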

