Hierarchical organisation of voice and voice gender perception

Charest, Ian (2010) Hierarchical organisation of voice and voice gender perception. PhD thesis, University of Glasgow.

Due to Embargo and/or Third Party Copyright restrictions, this thesis is not available in this service.
Printed Thesis Information: https://eleanor.lib.gla.ac.uk/record=b2714439


The most important sound in our auditory environment is the human voice. Voice professionals, whether they are teachers, radio hosts, sport coaches,
use their voice on a everyday basis to earn their living and communicate information and knowledge. We grow up spending most of our time everyday listening to voices in school, at the sports club, on t.v., etc. So much that by the time we are adults, voice plays a major role in our everyday social interactions. Yet, while extensive research has been conducted on speech perception voice alone has only just started generating more and more interest in the cognitive neuroscience research community. Voice is not "just" a speech carrier, it conveys rich paralinguistic information such as gender, age, identity or affective state. A theoretical model which emphasises the similarities between face and voice processing was recently introduced, suggesting a serial and parallel processing pathway of voice information leading to high level cognitive processes like person identification. Globally this model of voice processing suggests an extraction of low-level acoustic features, followed by a voice structural encoding leading to parallel pathways for the recognition of speech, affect and identity related information. Furthermore, this model suggested potential interactions with face perception pathways. In this thesis, I investigated two different stages of this voice perception model.

First, little is known about the speed at which the distinction between vocal and non-vocal sounds is performed, i.e. is there a time-frame where the "voice structural analysis" would occur. Using electroencephalography, we conducted an experiment in order to delineate this voice vs. non-voice perception time-frame. I observed an early electro-physiological response preferential to voice stimuli, emerging around 164 ms on fronto-temporal electrodes FC5 and FC6 which was termed the "fronto-temporal positivity to voice".

Second, little is known about the neural basis behind the perception of paralinguistic information such as identity, gender or affective state contained in the human voice. I used voice gender as a tool to investigated the "voice recognition units" stage of the voice perception model. The cognitive processes behind voice gender perception are still under debate, and more precisely, the nature of the representation of voice gender, whether it is organised around low level acoustical discriminants, or relies on high level categorical representations still remains unclear. Voice gender continua can be created in order to parametrically control the degree of gender contained in voice.

I investigated the importance of low level acoustic features using the recently developed auditory morphing algorithms. I averaged 32 male and 32 female voices in order to "approximate" a prototypical voice for each gender. From those prototypes, I generated caricatures by exaggerating the acoustical properties of the male prototype in reference to the female prototype. Those voice composites were included along with 3 pairs of male and female voice exemplars in a voice gender adaptation experiment. I observed significantly stronger perceptual after-effects caused by adaptation from the voice gender caricatures. This result provides evidence for a determinant role of the low level acoustical features in our ability to perceive the gender of a voice.

Finally, using functional magnetic resonance imaging (fMRI), I investigated whether brain regions of the
auditory cortex are sensitive to voice gender, voice gender adaptation, and whether a dissociation between extraction of acoustical features and higher level, perceptual representations could be achieved. I used voice gender continua and an event-related fMRI design called the continuous carry-over design to assess these working hypotheses. I observed a covariation between BOLD signal and the degree of acoustical differences in consecutive voices in the anterior part of the right superior temporal sulcus, where the extraction of voice gender related acoustical features occurs. Furthermore, I observed a higher level network involving the bilateral inferior frontal gyrus, the insula and the anterior cingulate cortex where a summary of acoustical features would be input from auditory areas enabling a voice gender categorisation.

Item Type: Thesis (PhD)
Qualification Level: Doctoral
Additional Information: Due to copyright restrictions the full text of this thesis cannot be made available online. Access to the printed version is available.
Keywords: Psychology, Auditory cognition, Voice, Voice gender, Neuroimaging, EEG, fMRI
Subjects: B Philosophy. Psychology. Religion > BF Psychology
Colleges/Schools: College of Science and Engineering > School of Psychology
Supervisor's Name: Belin, Prof. Pascal
Date of Award: 2010
Depositing User: Mr Ian Charest
Unique ID: glathesis:2010-1681
Copyright: Copyright of this thesis is held by the author.
Date Deposited: 13 Apr 2010
Last Modified: 10 Dec 2012 13:44
URI: https://theses.gla.ac.uk/id/eprint/1681

Actions (login required)

View Item View Item


Downloads per month over past year