Research On Neural Networks Producing High-confidence Predictions For Unknown Class Samples

Posted on:2022-11-12

Degree:Master

Type:Thesis

Country:China

Candidate:D R B Miao

Full Text:PDF

GTID:2518306755995849

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

With the explosive development of artificial intelligence in recent years,neural networks have obtained unprecedented performance in areas such as computer vision and have become the state-of-the-art standard for many learning tasks.Despite their great success,neural networks currently have some inherent limitations.For example,after training a neural network with normal supervision methods,neural network models usually perform well when the training and test data are independently sampled from the same distribution.However,when the distribution of the test data is different from the distribution of the training data,a neural network model may incorrectly classify an unknown class of samples not in the training set as a known class with high confidence,which may result in an extremely high error cost.Currently,it is difficult to collect training samples covering all the classes when training neural networks due to various limitations of uncontrollable reasons,so unknown class samples may be submitted to the model during testing,which requires that the model should not make high confidence predictions for samples far from the training data,which is especially important in safety-critical applications.In recent years,researchers have done a lot of research on this problem.One of the major approaches is to set up a binary detector to detect whether a sample is an unknown category,but this approach does not fundamentally solve the problem of neural networks generating high confidence in such unknown samples.Another method is to construct a subset of out-of-distribution samples from extra datasets and suppress the confidence of these out-of-distribution samples to achieve the purpose of suppressing the confidence of the prediction of other out-of-distribution samples.Combining the above problems,the main research work of this thesis is as follows.（1）In this thesis,a novel generative adversarial network is designed and trained to generate a subset of high-quality out-of-distribution unknown class samples without using extra data.（2）By suppressing the confidence of the out-of-distribution unknown class samples generated in this thesis during training,the neural network is able to make the network significantly alleviate the problem of high confidence of errors on many unknown samples and does not affect the performance of the original task.（3）The combination of the neural network trained by suppressing the confidence of the out-of-distribution unknown category samples generated in this thesis and the current state-of-the-art unknown class sample detection methods can further improve the detection performance.（4）In this thesis,we design a novel model training architecture that can effectively improve the accuracy of the original classification task by combining the generated out-of-distribution unknown classes samples.

Keywords/Search Tags:

Neural Networks, Model Robustness, Open-set Identification, Unknown Class

PDF Full Text Request

Related items

1	Out-of-Distribution Sample Detection Technology Based On Sample Enhancement And Gradient Distribution Matching
2	On-line modeling and inverse optimal control of a class of unknown nonlinear systems using dynamic neural networks
3	Research On Unknown Traffic Identification Based On Self Encoder And Attention Mechanism
4	Based On The Complex System Of Neural Network Model Identification Technology And Its Application
5	Research On Identification Of Unknown Radio Emitters Based On Deep Learning
6	Research On The Multi-Classification Algorithm For Security Data Based On Siamese Neural Network
7	Structural Optimization And Robustness Enhancement Of CNNs
8	Modeling And Identification Of Brain Motor Neural System
9	The Pertubation Properties And Applications Of A Class Of Fuzzy Hopfield Neural Networks
10	A Study For Identification And Control Of Nonlinear Systems Using Neural Networks