Structure from sound - estimate microphone and sound source positions from sound recordings
Code for the Structure from Sound project (SFS) The aim of the project is to develop tools for estimating microphone positions and sound source movements using a set of audio files as input only.
MATLAB with Signal Processing Toolbox
- Clone this repository
git clone --recurse-submodules [email protected]:kalleastrom/StructureFromSound2.git
- If you did not use
--recurse-submodules
, update the submodules
git submodule init
git submodule update
- Download data from here
and put the files in the
data
folder - Start matlab and execute the setup script to generate mex files and set paths.
run setup.m
- Open the matlab file StructureFromSound2/matlab/demo_full_system_from_raw_musicfiles.m in the editor. Change line 14 if you put the data somewhere else. Run demo_full_system_from_raw_musicfiles and keep your fingers crossed that it works.
Example data for the system can be downloaded from the StructureFromSoundDatabase (sfsdb)
If you use this code, please cite the three relevant papers that it builds upon
Åström, K., Larsson, M., Flood, G., & Oskarsson, M. (2021, August). Extension of Time-Difference-of-Arrival Self Calibration Solutions Using Robust Multilateration. In 2021 29th European Signal Processing Conference (EUSIPCO) (pp. 870-874). IEEE.
@inproceedings{aastrom2021extension,
title={Extension of Time-Difference-of-Arrival Self Calibration Solutions Using Robust Multilateration},
author={{\AA}str{\"o}m, Kalle and Larsson, Martin and Flood, Gabrielle and Oskarsson, Magnus},
booktitle={2021 29th European Signal Processing Conference (EUSIPCO)},
pages={870--874},
year={2021},
organization={IEEE}
}
Larsson, M., Flood, G., Oskarsson, M., & Åström, K. (2021, June). Fast and robust stratified self-calibration using time-difference-of-arrival measurements. In ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 4640-4644). IEEE.
@inproceedings{larsson2021fast,
title={Fast and robust stratified self-calibration using time-difference-of-arrival measurements},
author={Larsson, Martin and Flood, Gabrielle and Oskarsson, Magnus and {\AA}str{\"o}m, Kalle},
booktitle={ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
pages={4640--4644},
year={2021},
organization={IEEE}
}
Larsson, M., Flood, G., Oskarsson, M., & Åström K. (2020, May). Upgrade methods for stratified sensor network self-calibration. In ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 4851-4855). IEEE.
@inproceedings{larsson2020upgrade,
title={Upgrade methods for stratified sensor network self-calibration},
author={Larsson, Martin and Flood, Gabrielle and Oskarsson, Magnus and {\AA}str{\"o}m, Kalle},
booktitle={ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
pages={4851--4855},
year={2020},
organization={IEEE}
}
Simayijiang Zhayida, Simon Segerblom Rex, Yubin Kuang, Fredrik Andersson, Kalle Åström, An Automatic System for Acoustic Microphone Geometry Calibration based on Minimal Solvers, arXiv preprint arXiv:1610.02392, 2016.
@article{simayijiang2016automatic,
title={An Automatic System for Acoustic Microphone Geometry Calibration based on Minimal Solvers},
author={Simayijiang Zhayida, Simon Segerblom Rex, Yubin Kuang, Fredrik Andersson, Kalle {\AA}str{\"o}m},
journal={arXiv preprint arXiv:1610.02392},
year={2016}
}
Other relevant papers are:
@inproceedings{simayijiang2014automatic,
title={An Automatic System for Microphone Self-Localization Using Ambient Sound},
author={Simayijiang, Zhayida and Andersson, Fredrik and Kuang, Yubin and {\AA}str{\"o}m, Kalle},
booktitle={European Signal Processing Conference (Eusipco 2014)},
pages={5},
year={2014},
organization={EURASIP (European Association for Signal Processing)}
}
@inbook{393831000cb64d7988c470971301d957,
title = "Stratified Sensor Network Self-Calibration From TDOA Measurements",
keywords = "Network Self-calibration, TDOA, TOA, Minimal Problem",
author = "Yubin Kuang and Kalle {\AA}str{\"o}m",
year = "2013",
booktitle = "[European Signal Processing Conference (Eusipco 2014)]",
publisher = "IEEE--Institute of Electrical and Electronics Engineers Inc.",
}
Example data for the system can be downloaded from the StructureFromSoundDatabase (sfsdb) at http://vision.maths.lth.se/sfsdb/
Briefly the datasets are:
- bassh1 - lunch room (3DR, 3DS, Cont, Echo, Single, Distinct)
- bassh2 - lunch room (3DR, 3DS, Cont, Echo, Single, Distinct)
- bassh3 - lunch room (3DR, 3DS, Cont, Echo, Single, Distinct)
- grieg1 - lecture room (3DR, 3DS, Cont, Echo, Single, Distinct)
- grieg2 - lecture room (3DR, 3DS, Cont, Echo, Single, Distinct)
- grieg3 - lecture room (3DR, 3DS, Cont, Echo, Single, Distinct)
- grieg4 - lecture room (3DR, 3DS, Cont, Echo, Single, Distinct)
- grieg5 - lecture room (3DR, 3DS, Cont, Echo, Single, Distinct)
- spacecure1 - anechoic chamber (3DR, 3DS, Cont, NoEcho, Single, Distinct)
- whywereyouawayayearroy - anechoic chamber (3DR, 3DS, Cont, NoEcho, Multiple, NotDistinct)
- axelf - kitchen (2DR, 3DS, Cont, Echo, Single, Distinct)
- gone - kitchen (2DR, 3DS, Cont, Echo, Single, Distinct)
Here
- 3DR - Dimension of affine hull of receivers is 3. (Microphones not in a plane)
- 2DR - Dimension of affine hull of receivers is 2. (Microphones in a plane)
- 1DR - Dimension of affine hull of receivers is 1. (Microphones on a line)
- (0DR - Dimension of affine hull of receivers is 0. (Microphones all in the same position)
- 3DS- Dimension of affine hull of sound events is 3. (Sound events not in a plane)
- 2DS- Dimension of affine hull of sound events is 2. (Sound events in a plane)
- 1DS - Dimension of affine hull of sound events is 1. (Sound events on a line)
- (0DS Dimension of affine hull of sound events is 0. (Sound events all in the same position)
- Claps - Claps, a few distinct sound events
- Cont - Continuous sound events
- Echo - In a normal (reverberant) room
- NoEcho - In anaechoic chamber
- Single - Single sound source
- Multiple - Multiple sound sources
- Distinct - Spatially distinct sound sources
- NotDistinct - Spatially spread out sound sources
The microphone setup is illustrated in the following figure.
We made 3 experiments (bassh1, bassh2, bassh3). There are several still images of the microphone setup and one stationary few film recording of the experiments.
The microphone setup is illustrated in the following figure.
We made 5 experiments (greig1 to grieg5). There are several still images of the microphone setup and a few film recordings of the experiment. During theses film recordings, the camera was moving.