R2DNet Project Documentation

R2DNet (Reverberation to Decay Net) uses machine learning to offer an approach to estimating room acoustic decay parameters and noise floor estimation from the reverberant speech of approximately 1 second. It is trained using energy decay curve loss for more roburst estimation in various acoustic environments.

Directory Structure

This repository is organized into several sections, each dedicated to specific aspects of the acoustic analysis process:

Data Preparation

IR_selection.py: Selects impulse responses for testing and training datasets.
VAD.py: Implements voice activity detection to identify speech segments.
dataset.py: Manages loading of datasets for testing, validation, and room analysis.
reverb_preprocess_validation.py & reverbspeech_preprocess.py: Generate reverberant speech segments for model input.

Model Architecture and Training

model_FiNS.py: Defines the FiNS model architecture.
model_alter.py: Implements an encoder-decoder architecture for the S2IR model.
model_flex.py: Offers a flexible CNN architecture for various dataset complexities.
s2ir.py: Contains the architecture for the Speech-to-Impulse Response (S2IR) model.
training.py: Facilitates model training with comprehensive configuration options.

Testing and Analysis

test.py: Executes model testing against predefined datasets.
process_utils.py: Provides essential functions and loss calculations for model evaluation.
Analysis_plot_aggregate_analysis.py & Analysis_aggregate.py: Perform detailed error and performance analysis.

Utilities and Miscellaneous

hyperparameter_tuning.py: Optimizes model hyperparameters using Optuna.
organise_test_data.py & organize_test_data_position.py: Prepare test data, including positional information.

Comprehensive MATLAB Script

speech_processing_individual_loc.m: Consolidates reverberant speech data, labels, and other relevant information into single MATLAB files for each room/location, streamlining the dataset preparation process.

Prerequisites

To work with the R2DNet framework, ensure you have Python 3.6+ and the following packages installed:

pip install numpy torch librosa optuna matplotlib seaborn

Usage

Each script can be executed individually to perform its designated function. For example, to generate reverberant speech data for validation:

python reverb_preprocess_validation.py

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
A_script_summary.py		A_script_summary.py
Analysis_aggregate.py		Analysis_aggregate.py
Analysis_plot_per_epoch.py		Analysis_plot_per_epoch.py
Analysis_plot_tb.py		Analysis_plot_tb.py
Analysis_validation.py		Analysis_validation.py
Anaysis_plot_testing_per_room.py		Anaysis_plot_testing_per_room.py
IR_selection.py		IR_selection.py
LICENSE.md		LICENSE.md
README.md		README.md
VAD.py		VAD.py
check_synthetic_plot.py		check_synthetic_plot.py
data_error_freq_analysis.py		data_error_freq_analysis.py
dataset.py		dataset.py
dataset.sh		dataset.sh
dataset_loader_s2ir.py		dataset_loader_s2ir.py
error_data.pickle		error_data.pickle
gird_all_in_one_buffer.txt		gird_all_in_one_buffer.txt
grid_all_in_one.py		grid_all_in_one.py
grid_color.py		grid_color.py
hilbert_lstm.py		hilbert_lstm.py
hilbert_trasnform.py		hilbert_trasnform.py
histogram_data.pickle		histogram_data.pickle
hyperparameter_study.py		hyperparameter_study.py
hyperparameter_tuning.py		hyperparameter_tuning.py
hyperparameter_tuning.sh		hyperparameter_tuning.sh
log_spectral_feature_extractor.py		log_spectral_feature_extractor.py
log_spectral_features.py		log_spectral_features.py
lundeby.py		lundeby.py
make_noisy_data.py		make_noisy_data.py
misc_plots.py		misc_plots.py
model.py		model.py
model_FiNS.py		model_FiNS.py
model_alter.py		model_alter.py
model_flex.py		model_flex.py
model_log_spectral_CNN.py		model_log_spectral_CNN.py
model_visvualize.py		model_visvualize.py
movie_maker.py		movie_maker.py
noise_roburstness_test.py		noise_roburstness_test.py
normalizer.py		normalizer.py
organise_test_data.py		organise_test_data.py
organize_test_data_position.py		organize_test_data_position.py
preprocessing.py		preprocessing.py
process_utils.py		process_utils.py
re_plotting_curves.py		re_plotting_curves.py
readmatfile.py		readmatfile.py
reverb_preprocess_noise_roburstness.py		reverb_preprocess_noise_roburstness.py
reverb_preprocess_validation.py		reverb_preprocess_validation.py
reverbspeech_preprocess.py		reverbspeech_preprocess.py
s2ir.py		s2ir.py
s2ir_training_plots.py		s2ir_training_plots.py
speech_selection.py		speech_selection.py
table_plot.py		table_plot.py
table_plot_heatmap.py		table_plot_heatmap.py
test.py		test.py
test_dataset_prep.py		test_dataset_prep.py
training.py		training.py
training_mod.py		training_mod.py
training_plot.py		training_plot.py
truncation.py		truncation.py
violin_plot.py		violin_plot.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

R2DNet Project Documentation

Directory Structure

Data Preparation

Model Architecture and Training

Testing and Analysis

Utilities and Miscellaneous

Comprehensive MATLAB Script

Prerequisites

Usage

About

Releases

Packages

Languages

License

Pshar10/Blind-Estimation-of-EDC-from-Live-Signals

Folders and files

Latest commit

History

Repository files navigation

R2DNet Project Documentation

Directory Structure

Data Preparation

Model Architecture and Training

Testing and Analysis

Utilities and Miscellaneous

Comprehensive MATLAB Script

Prerequisites

Usage

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages