Statistical and machine learning techniques for dealing with missing data in criminal justice: A simulation and comparison of missing data methods

Posted on: 2013-04-28

Degree: Ph.D.

Type: Dissertation

University: Sam Houston State University

Candidate: Hill, Joshua

GTID: 1450390008466901

Subject: Sociology

Abstract/Summary:

Missing data has been a persistent problem in the social sciences and, more specifically, in criminal justice. Though rarely discussed, missing data can bias results and affect model efficiency. Currently, there is only a very small body of criminal justice-specific research on missing data. The goal of this dissertation is to remedy, in part, this lack of attention to an important topic. The analysis examines eleven frequently used imputation techniques, including both classical statistical techniques and newer, algorithmic techniques. Using an advanced simulation methodology, the dissertation examines both the imputation of missing values and the impact of those imputed data on substantive analysis. Additionally, it seeks to develop a user-friendly package for the program R to assist researchers with the imputation of missing data.

Key words: Machine learning, Missing data, Listwise deletion, Random Forests, Hot deck imputation, Multiple imputation.

Keywords/Search Tags:

Missing data, Machine learning, Criminal justice, Techniques, Imputation

Related items

1	Prospectivity Mapping For Seafloor Massive Sulfide Based On Machine Learning And Missing Value Imputation Techniques
2	Comparison And Empirical Analysis Of Imputation Methods For Missing Data
3	Maximum likelihood estimation and multiple imputation: A Monte Carlo comparison of modern missing data techniques for multilevel data
4	Missing covariates in causal inference matching: Statistical imputation using machine learning and evolutionary search algorithm
5	Improved Algorithms Based On Extreme Learning Machine For Handing With Missing Data And Application
6	Imputation Methods Of Missing Values For Compositional Data
7	Missing Value Imputation Study For Typical High-throughput Omics Data
8	Incomplete Data Filled
9	Research On Integrating Soil Moisture Based On Machine Learning To Improve Satellite Precipitation Accuracy
10	Imputation For Missing Value Of Compositional Data Based On Biclustering Algorithm