The codes ainvolve speech processing, dedicated to assignments and final project at Izmir Democracy Uniersity.
The code shows how to run simple audio analyses on a speech signal- a wav sound of 44100 Hz sample rate. The signal is a mono speech file named 'speech_signal'. Windowing, Fast Fourier Transformation and Spectrogram were applied. The audio file can be found in the 'data' folder.
The algorithm calculates the edit distance between threee different words sources and their corresponding words targets using the recursive method.
The code creates a function to copute LPC coeffecients using Durbin’s recursion and then apply it to a wave sound of a male speaker to calculate coeffecinets of the sound signal and plot the spectrum. The code plots the spectrum using DFT or mainly FFT. Finally, both fft then LPC functions are applied on the wav sound.
The code compares three different machine learning algoritms aiming at training an annotatoed data for speech gender classification purposes, male and female speech classification. The training data is at http://www.openslr.org/12/ (development set, "clean" speech ). Examples of female and male speech in the dataset are inside 'data' folder '.flac' formats. The compared models all classify the same training and testing data based on Mel-frequency cepstrum coefficients extraction from the audio files.