Sparse Coding of Speech Data Predicts Properties of the Early Auditory System

Posted on:2013-01-08

Degree:Ph.D

Type:Dissertation

University:University of California, Berkeley

Candidate:Carlson, Nicole Liu

Full Text:PDF

GTID:1458390008488440

Subject:Biology

Abstract/Summary:

I have developed a sparse mathematical representation of speech that minimizes the number of active model neurons needed to represent typical speech sounds. The model learns several well-known acoustic features of speech such as harmonic stacks, formants, onsets and terminations, but I also find more exotic structures in the spectrogram representation of sound such as localized checkerboard patterns and frequency-modulated excitatory subregions flanked by suppressive sidebands. Moreover, several of these novel features resemble neuronal receptive fields reported in the Inferior Colliculus (IC), as well as auditory thalamus and cortex, and my model neurons exhibit the same tradeoff in spectrotemporal resolution as has been observed in IC. To my knowledge, this is the first demonstration that receptive fields of neurons in the ascending mammalian auditory pathway beyond the auditory nerve can be predicted based on coding principles and the statistical properties of recorded sounds. In my second study, I look at linear filter estimation by creating spike-triggered averages for my model neurons. Surprisingly, whitening does not remove the effect of choosing different probe stimulus sets. This suggests that the type of probe stimulus is very important for uncovering the true receptive field of a neuron.

Keywords/Search Tags:

Speech, Model neurons, Auditory

Related items

1	Research On Artificial Auditory Neurons Based On Biological Memristor
2	Cortical dynamics of auditory-visual speech: A forward model of multisensory integration
3	An auditory feedback-based model of speech production in the developing child
4	Computational Auditory Scene Analysis Based Voice Pretreatment System
5	Invariant speech recognition and auditory object formation: Neural models and psychophysics
6	Speech Signals's Analysis And Application Based On The Auditory Model Inversion
7	Speech Enhancement Based On Auditory Masking And Auditory Wavelet Packet Decomposition
8	Monaural Speech Segregation Based On Computational Auditory Scene Analysis
9	Speech-coding and training-induced plasticity in auditory cortex of normal and dyslexia model rats
10	Research On Auditory Characteristics And Robust Speech Recognition Algorithms