Parametric classification with non-normal data

Posted on:2000-08-03

Degree:Ph.D

Type:Thesis

University:Montana State University

Candidate:Willse, Alan Ray

Full Text:PDF

GTID:2468390014966122

Subject:Statistics

Abstract/Summary:

This thesis is concerned with parametric classification of non-standard data. Specifically, methods are developed for classifying two of the most common types of non-Gaussian distributed data: data with mixed categorical and continuous variables (often called mixed-mode data), and sparse count data. Both supervised and unsupervised methods are described. First, a promising, recently proposed method that uses finite mixtures of homogeneous conditional Gaussian distributions (Lawrence and Krzanowski, 1996) is shown to be non-identifiable. Identifiable finite mixtures of homogeneous conditional Gaussian distributions are obtained by imposing constraints on some of the model parameters. Then, in contrast, it is shown that supervised classification of mixed-mode data using the homogeneous conditional Gaussian model can sometimes be improved by relaxing parameter constraints in the model; specifically, certain features of the continuous variable covariance matrix---such as volume, shape or orientation---are allowed to differ between groups. In addition, the use of latent class and latent profile models in supervised mixed-mode classification is investigated. Finally, mixtures of over-dispersed Poisson latent variable models are developed for unsupervised classification of sparse count data. Simulation studies suggest that for non-Gaussian data these methods can significantly outperform methods based in Gaussian theory.

Keywords/Search Tags:

Data, Classification, Methods, Homogeneous conditional gaussian

Related items

1	Research Of Ensemble Super Parent One-Dependence Estimator By Maximizing Conditional Likelihood
2	Gaussian Processes Based Classification For Hyper-spectral Imagery
3	Predictive Gaussian Classification of Functional MRI Data
4	Research And Implementation Of Social Robot Detection Technology Based On Improved Conditional Generation Adversarial Network
5	Gaussian conditional random fields for regression in remote sensing
6	Gaussian Process Classification Algorithm And Application Research
7	Data Augmentation Model Based On Deep Generative Model
8	A Large-scale Experimental Evaluation And Analysis Of Data Classification Algorithms
9	Research On Data Driven Fault Diagnosis Methods For Non-Gaussian Process
10	Classification methods for high-dimensional sparse data