Font Size: a A A

Deep Learning Robust Perception Paradigm for Autonomous Vehicle

Posted on:2018-08-05Degree:Ph.DType:Dissertation
University:University of Louisiana at LafayetteCandidate:Ali, Alaa Mohamed LeithyFull Text:PDF
GTID:1472390020956014Subject:Computer Engineering
Abstract/Summary:
Robust and reliable environment perception is an essential challenge to realize artificial intelligence robotics, smart traffic management, driver assistance, and autonomous driving. The present research focuses on high-performance vision based perception modules in terms of speed and precision, such as traffic sign recognition and multi-target object detection in the context of vehicle and pedestrian detection for autonomous driving. Several detection techniques are developed based on deformable part models (DPMs) and Convolutional Neural Network (CNN) for its foreseen precision. Specifically, we propose multiple approaches to improve the performance of DPM and a more accurate DeepPyramid DPM model which maps DPM into multiple CNN layers as an assembly of the two concepts with intensive computations. Multiple techniques are employed to build our frameworks, including but not limited to Fast Fourier transform, feature pyramid approximation, early classification, edge boxes region proposals, neighborhood aware cascade, and region proposal networks. In addition, our implementation employs low-level optimization techniques to achieve a faster and real-time detector, including Single Instruction Multiple Data (SIMD) instructions, multiple cores, assembly programming and, cache management. Our experimental results demonstrate that our fast DPM can process Pascal VOC 2010 at 47HZ and VGA image at 40HZ on CPU with the loss of 2% in precision, while fast DeepPyramid DPM processes PASCAL VOC at 17 fps and VGA images at 10 fps on a GPU (about 11x faster than DeepPyramid DPM). Furthermore, we introduce a programmable, low-power hardware implementation of DPM-based object detection for real-time applications. The proposed hardware circuit uses 65nm CMOS technology and consumes only 36.5mW (0.81 nJ/pixel) based on the post-layout simulation and uses the area of 3362 kgates. Finally, we present a robust end-to-end approach based on the aggregate channel features and CNN to simultaneously and effectively detect and classify traffic signs.
Keywords/Search Tags:Perception, Deeppyramid DPM, Traffic, CNN, Autonomous
Related items