Font Size: a A A

Research On Nonlinear Filtering With Application To Speaker Tracking

Posted on:2009-01-04Degree:DoctorType:Dissertation
Country:ChinaCandidate:D W HouFull Text:PDF
GTID:1118360275457912Subject:Signal and Information Processing
Abstract/Summary:PDF Full Text Request
Speaker localization is one of the important techniques in acoustic signal processing, which have found many applications in fields such as speech enhancement, video-conferencing,human-computer interface,robot navigation,et al.Traditional approaches to speaker localization,which collect data from several microphones and exploit a frame of data merely at the current time to estimate the current source location,may causes wrong results since noise and reverberation cause spurious peaks to occur in the localization function.Therefore,it is necessary to track the location utilizing state space approach to improve localization accuracy.Speaker tracking is a typical nonlinear filtering problem.Taking the posterior probability density function as a clue,the nonlinear filtering methods,including Gaussian filters and non-Gaussian filters,are studied and improved in accuracy,robustness and computational complexity in this dissertation.Moreover,some improving measures are proposed purposely when applying the nonlinear filter to speaker tracking problem.The main contributions of this dissertation are as follows:(1) Under Gaussian assumption,an iterated sigma point Kalman filter(ISPKF) is proposed to improve estimation accuracy by using the new measurement iteratively.In the new method,the iteration is formulated as a nonlinear optimization process and a new update process is proposed based on the Levenberg-Marquardt algorithm,which insures convergence of the estimation and improves mean square estimation error.(2) Traditionally,Bayesian estimator has been based on the minimization of the H2 -norm of the corresponding estimation error.This type of estimator assumes the message model and the noise descriptions have known statistical properties.Unfortunately,accurate system models and statistical nature of the noise processes are not readily available.Here a H∞sigma point Kalman filter is presented under H∞performance criterion.Since sigma point transformation technique is used instead of Taylor series expansion,linearization error of the nonlinear system is decreased.Moreover,the noise uncertainty problem is solved utilizing H∞filtering method,which enhance robustness of the system.(3) In the framework of particle filtering,a mean shift quasi-Monte Carlo(MS-QMC) method is proposed in which low-discrepancy sequences are exploited instead of random draws according to a quasi-Monte Carlo integration rule.The idea of using deterministic points suggests that we can choose the points that provide the best-possible spread in the sample space and attain low integration error.Additionally,the mean shift technique is used to estimates the gradient of the approximated density and moves particles toward the modes of the posterior,leading to a more effective allocation of particles thereupon fewer particles are needed and the computational demand is reduced.(4) A sufficient statistics based particle filter is put forward to deal with sample impoverishment and large computational complexity problems in particle filtering.If the posterior density function depends on the observed data only through a set of sufficient statistics,which is straightforward to update,both of the problems mentioned above can be mitigated utilizing the proposed method by propagating the sufficient statistics instead of the posterior density function.As new samples are drawn from continuous density rather than discrete one,resampling is not required in the new filter,which results in a reduced complexity compared with the sequential importance sampling filter with resampling procedure.(5) A new interacting multiple model(IMM) particle filtering algorithm based on sampling interaction is developed to track a randomly moving speaker according to dynamic characteristics of the speaker.The interacting process in the new algorithm is accomplished by properly selecting the sampling region.Thus,not only the number of particles in each mode can be controlled so that the degeneracy problem around mode transition is avoided,but also the Gaussian assumption of posterior density function of the state is abandoned so that the filter can adapt to all distribution and the robustness of the speaker tracking system is enhanced.(6) Applying information fusion techniques to speaker tracking,both direction of arrival (DOA) and time difference of arrival(TDOA) of speech source are used to localize the speaker.Since the measurement modalities differ in the level of information they provide about the state,layered sampling method is employed in the proposed filter.Posterior density function of the DOA based filter is used as proposal distribution of the TDOA based filter,so that the new filter exploits the information in the most recent observation and guides the search in the state space effectively.Thus,the speaker localization error is decreased.
Keywords/Search Tags:Nonlinear Filtering, Speaker Tracking, Microphone Array, Bayesian Estimation, Kalman Filter, Particle filter, Monte Carlo, Unscented Transformation
PDF Full Text Request
Related items