Sensor Fusion with Conditional Random Fields

Posted on:2017-01-21

Degree:Ph.D

Type:Dissertation

University:University of California, Santa Barbara

Candidate:Rahimi, Amir Mohaymen

Full Text:PDF

GTID:1448390005474011

Subject:Electrical engineering

Abstract/Summary:

Visual and multimodal sensors are playing an increasingly important role in applications such as object recognition, activity analysis, agriculture monitoring, autonomous navigation and assisted healthcare. Typical features computed from the raw data/ measurements have significant mutual information that can be exploited towards improving the overall pattern recognition performance. In this dissertation we introduce principled methods to aggregate such information using conditional random field (CRF) models. Our aggregation is built on top of a large number of classifiers working independently on features computed from the raw data. The statistical dependencies among classifier outputs are formulated in a factorized form to characterize each class label. This technique enables the consistent prediction patterns to unfold a stronger discriminative power. In the first part of the presentation we demonstrate the effectiveness of this approach in combining information from a large number of visual features to improve the classification accuracy. Our experimental results show a significant improvement over the state-of-the-art aggregation methods on diverse image datasets. In the second application, we integrate the proposed CRF-based approach to combine multimodal measurements in a task involving a quadcopter UAV with autonomous path planning to detect and recognize the frontal view of a person. In this scenario, we utilize geographic location, time of day, view angle of the camera, altitude and magnetic heading, in addition to the video data, to estimate the body orientation in real time and automatically maneuver the UAV to maintain the frontal view of a person. We demonstrate that the contextual information implicit in the multimodal measurements can significantly reduce the vision-based computations for such real-time tasks.

Keywords/Search Tags:

Multimodal, Information

Related items

1	Multimodal Sentiment Analysis For Text,Audio And Video
2	Study Of Multimodal Integration And Based On Intelligent Information Processing Methods
3	Research On 3G Mobile Voice Control Of A Multimodal Health Information Web Portal
4	Study On The Multimodal Interaction And The Information Intergration
5	Study On Multimodal Emotion Recognition Based On Deep Learning
6	Multimodal Information Retrieval and Classification
7	Multimodal Interaction Technology And Application For Elderly
8	Research On Emotion Recognition Based On Multimodal Fusion
9	Research And Application Of Video Retrieval Technology Based On Multimodal Feature
10	Design And Implementation Of A Multimodal And Device-free System For Person Identification