The cues used by human observers to predict the emotional state of an interlocutor are ambiguous, and this is particularly true for vocal emotion (Atias et al., 2019). Efforts to predict the negative-to-positive degree of “pleasantness” of emotional speech (termed “valence”) have been especially fraught (Busso & Rahman, 2012). According to prior work using time-series models (Ong et al., 2021), the perception of speaker valence by human listeners in an auditory-only modality is predominantly based on signal semantics, with acoustic features such as prosodic contour and voice quality demonstrating weaker explanatory power. Here, I investigate several linear regression models to compare the explanatory power of semantic and acoustic information, both alone and in combination. Furthermore, I explore the extent to which a fine-tuned, self-supervised transformer model is able to simulate human behavior in valence ratings of natural, spoken narratives.
This project analyzes data provided in the first release of the Stanford Emotional Narratives Dataset (SEND). For results of analyses to date, please see the results folder of this repository, which includes results both as a README (for easy online viewing) and as a .docx file.
Repository License: CC BY-SA 4.0
acousticsExtractor.py
inputs: audio files in .wav format
outputs: for each input .wav file, a .csv file with 88 extracted acoustic features (eGeMAPS) for every five-second window in the file {data/egemaps}
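For illustration, a minimal sketch of this kind of windowed extraction, assuming the opensmile Python package; the function and file names below are hypothetical, and the actual extraction code in acousticsExtractor.py may differ:

```python
import opensmile
import pandas as pd
import soundfile as sf

# eGeMAPSv02 functionals yield 88 summary features per analyzed segment
smile = opensmile.Smile(
    feature_set=opensmile.FeatureSet.eGeMAPSv02,
    feature_level=opensmile.FeatureLevel.Functionals,
)

def extract_windows(wav_path, window_s=5.0):
    """Extract eGeMAPS functionals for each non-overlapping 5-second window."""
    info = sf.info(wav_path)
    duration = info.frames / info.samplerate
    rows, start = [], 0.0
    while start < duration:
        end = min(start + window_s, duration)
        # start/end offsets are given in seconds
        feats = smile.process_file(wav_path, start=start, end=end)
        rows.append(feats.assign(window_start_s=start))
        start += window_s
    return pd.concat(rows).reset_index(drop=True)

# e.g. extract_windows("example.wav").to_csv("data/egemaps/example.csv", index=False)
```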
inputs:
- time-aligned transcription files from the SEND dataset
- time-aligned, aggregated (Evaluator Weighted Estimator) human valence ratings from the SEND dataset
- acoustic features extracted (via acousticsExtractor.py) from .wav files exported from SEND videos
- lexical valence and arousal norms collected by Warriner et al. (2013)
outputs: a dataframe in which each row represents a five-second window, with columns identifying the window and providing its human rating, lexical/semantic features, and acoustic features {"data.csv"}
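As an illustration of the lexical/semantic step, a sketch that averages Warriner et al. (2013) valence norms over the words transcribed in each window; all file and column names here are hypothetical stand-ins, not necessarily those used by the build script:

```python
import pandas as pd

# Hypothetical file and column names, for illustration only
norms = pd.read_csv("warriner_norms.csv")  # e.g. columns: Word, V.Mean.Sum
lex_valence = dict(zip(norms["Word"].str.lower(), norms["V.Mean.Sum"]))

def window_lexical_valence(transcript):
    """Mean Warriner valence norm over a window's transcribed words;
    words with no norm entry are skipped."""
    words = str(transcript).lower().split()
    scores = [lex_valence[w] for w in words if w in lex_valence]
    return sum(scores) / len(scores) if scores else float("nan")

windows = pd.read_csv("windows.csv")  # hypothetical: one row per 5-second window
windows["lexical_valence"] = windows["transcript"].map(window_lexical_valence)
```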
inputs:
- audio files in .wav format
- data.csv
outputs: a dataframe in which each row represents a five-second window, with columns identifying the window and providing its human rating and model-predicted rating {"llm_prediction_data.csv"}
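A minimal sketch of one way to obtain per-window valence predictions from a fine-tuned self-supervised transformer, here using a wav2vec 2.0 audio backbone with a single-output head; the checkpoint, head, and preprocessing are assumptions, not necessarily the model used in this project:

```python
import soundfile as sf
import torch
from transformers import AutoFeatureExtractor, AutoModelForAudioClassification

# num_labels=1 gives a single-output head usable for valence regression;
# the checkpoint below is an assumed backbone and must be fine-tuned first
checkpoint = "facebook/wav2vec2-base"
extractor = AutoFeatureExtractor.from_pretrained(checkpoint)
model = AutoModelForAudioClassification.from_pretrained(checkpoint, num_labels=1)
model.eval()

def predict_window_valence(wav_path, start_s, sr=16000):
    """Predict a valence score for one 5-second window of a .wav file."""
    audio, file_sr = sf.read(wav_path)
    assert file_sr == sr, "resample to 16 kHz before inference"
    window = audio[int(start_s * sr): int((start_s + 5.0) * sr)]
    inputs = extractor(window, sampling_rate=sr, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits  # shape (1, 1)
    return logits.item()
```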
inputs: data.csv
outputs: for each of four models, a plot comparing the human gold-standard valence rating for each five-second window against the model-predicted rating, annotated with the coefficient of determination, Pearson correlation coefficient, and concordance correlation coefficient; the script also contains inline code for exploring the beta weights of statistically significant predictors in each model {figs}
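For reference, the three reported agreement metrics can be computed as follows; scikit-learn and SciPy cover the first two, and the concordance correlation coefficient follows directly from its definition:

```python
import numpy as np
from scipy.stats import pearsonr
from sklearn.metrics import r2_score

def concordance_cc(y_true, y_pred):
    """Lin's concordance correlation coefficient:
    2*cov(x, y) / (var(x) + var(y) + (mean(x) - mean(y))**2)."""
    cov = np.cov(y_true, y_pred, bias=True)[0, 1]
    return 2 * cov / (
        np.var(y_true) + np.var(y_pred) + (np.mean(y_true) - np.mean(y_pred)) ** 2
    )

human = np.array([0.10, 0.40, 0.35, 0.80])  # toy gold-standard ratings
model = np.array([0.00, 0.50, 0.30, 0.90])  # toy model predictions
print(r2_score(human, model))        # coefficient of determination
print(pearsonr(human, model)[0])     # Pearson correlation coefficient
print(concordance_cc(human, model))  # concordance correlation coefficient
```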
inputs: llm_prediction_data.csv
outputs: a plot comparing the human gold-standard valence rating for each five-second window against the model-predicted rating, annotated with the coefficient of determination, Pearson correlation coefficient, and concordance correlation coefficient {figs}
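One plausible rendering of such a comparison plot, as a scatter of model against human ratings; the repository's actual figures may instead overlay the two time series, and the column names below are hypothetical:

```python
import matplotlib.pyplot as plt
import pandas as pd

df = pd.read_csv("llm_prediction_data.csv")  # assumed columns: human_valence, model_valence
lo = df[["human_valence", "model_valence"]].min().min()
hi = df[["human_valence", "model_valence"]].max().max()

fig, ax = plt.subplots()
ax.scatter(df["human_valence"], df["model_valence"], s=8, alpha=0.5)
ax.plot([lo, hi], [lo, hi], linestyle="--")  # identity line: perfect agreement
ax.set_xlabel("Human (EWE) valence rating")
ax.set_ylabel("Model-predicted valence rating")
fig.savefig("figs/model_vs_human_valence.png", dpi=200)
```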