Classification Trees with Synthetic Feature

Posted on:2019-07-05

Degree:M.S.

Type:Thesis

University:Texas A&M University - Commerce

Candidate:Msabaeka, Tsitsi

GTID:2478390017485110

Subject:Mathematics

Abstract/Summary:

Trained synthetic features were used with classification and regression trees (CART) and boosting methods to predict categorical response variables. The trained synthetic features considered were synthetic features (Zieba, Tomczak, & Tomczak, 2016), principal component analysis (PCA), zero-one regression (ZO), logistic regression (LS), linear discriminant analysis (LDA), robust fitting of linear models (RLM), least trimmed squares (LTS), naive Bayes (NBAY), and univariate spline (SPL), all computed with the statistical software R. To illustrate the trained synthetic features, they were applied to Polish companies' financial data, Fisher's Iris data, and skin lesion data. The objective of the research was to apply trained synthetic features to three methods: CART; a stock boosting method, in which the synthetic features were fitted once at the root node; and a synthetic boosting method, in which the synthetic features were reweighted and refitted at each iteration. The aim was to improve predictive accuracy for the classes in a given data set over random guessing based on the prior probabilities.
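
The general idea of a trained synthetic feature can be sketched as follows. This is only an illustration under stated assumptions, not the thesis's implementation: the thesis used R, whereas this sketch uses plain Python; the toy data are made up; the helper names `lda_direction` and `best_stump` are hypothetical; only an LDA-style feature is shown; and the single split is chosen by training accuracy rather than by the Gini criterion that CART normally uses.

```python
# Toy two-class data with two features (made up for illustration only).
X = [(1, 3), (2, 3), (3, 5), (4, 5), (2, 1), (3, 2), (4, 2), (5, 4)]
y = [0, 0, 0, 0, 1, 1, 1, 1]


def lda_direction(X, y):
    """Fisher discriminant direction w = S^-1 (m1 - m0) for 2 features, 2 classes."""
    groups = {c: [x for x, label in zip(X, y) if label == c] for c in (0, 1)}
    means = {
        c: [sum(r[j] for r in rows) / len(rows) for j in range(2)]
        for c, rows in groups.items()
    }
    # Pooled within-class scatter matrix S (2x2).
    s = [[0.0, 0.0], [0.0, 0.0]]
    for c, rows in groups.items():
        for r in rows:
            d = [r[0] - means[c][0], r[1] - means[c][1]]
            for i in range(2):
                for j in range(2):
                    s[i][j] += d[i] * d[j]
    det = s[0][0] * s[1][1] - s[0][1] * s[1][0]
    diff = [means[1][0] - means[0][0], means[1][1] - means[0][1]]
    # Closed-form inverse of a 2x2 matrix applied to the mean difference.
    return [
        (s[1][1] * diff[0] - s[0][1] * diff[1]) / det,
        (-s[1][0] * diff[0] + s[0][0] * diff[1]) / det,
    ]


def best_stump(rows, y):
    """Search every (feature, threshold) pair for the single split with the
    highest training accuracy, i.e. a one-split classification tree."""
    best = (0.0, None, None)  # (accuracy, feature index, threshold)
    for j in range(len(rows[0])):
        values = sorted(set(r[j] for r in rows))
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            correct = 0
            for side in (True, False):
                labels = [lab for r, lab in zip(rows, y) if (r[j] <= t) == side]
                # Each leaf predicts its majority class.
                correct += max(labels.count(0), labels.count(1))
            acc = correct / len(y)
            if acc > best[0]:
                best = (acc, j, t)
    return best


# Train the synthetic feature, then append it as a third column.
w = lda_direction(X, y)
X_aug = [(x1, x2, w[0] * x1 + w[1] * x2) for x1, x2 in X]

acc_orig, feat_orig, _ = best_stump(X, y)
acc_aug, feat_aug, _ = best_stump(X_aug, y)
print(acc_orig, feat_orig)  # 0.875 1
print(acc_aug, feat_aug)    # 1.0 2
```

On this toy data the classes are separated along a diagonal, so no single axis-aligned split on the original features classifies every point, while one split on the appended linear combination does. The accuracies are computed on the training points themselves and say nothing about the results reported in the thesis.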

Keywords/Search Tags:

Synthetic, Data

Related items

1	The Research And Application Of Synthetic Clock Synchronization By Wire And Wireless In Data Acquisition
2	Automated Synthetic Feasibility Assessment: A Data-driven Derivation of Computational Tools for Medicinal Chemistry
3	Classification Trees with Synthetic Feature
4	Using multiply imputed, synthetic data to facilitate data sharing
5	The safe use of synthetic data in classification
6	The Study Of Synthetic Data In Differential Privacy
7	Study On Methods Of Synthetic Aperture Radar Target Echoed Data Simulation
8	Trademark Detection In Natural Scene Base On Synthetic Data
9	Optimization Control Of Synthetic Ammonia Based On Data Driven
10	Generating synthetic space-time paths using a cloning algorithm on activity behavior data