Pattern extraction from the World Wide Web

Posted on: 2011-07-28
Degree: M.S.C.S
Type: Thesis
University: University of Nevada, Las Vegas
Candidate: Mettu, Praveena
Full Text: PDF
GTID: 2448390002960580
Subject: Computer Science
Abstract/Summary:
The World Wide Web is a source of a huge amount of unlabeled information, spread across different sources in varied formats. This presents both opportunities and challenges in leveraging such a large amount of unstructured data to build knowledge bases and to extract relevant information.

As part of this thesis, a semi-supervised technique called "Dual Iterative Pattern Relation Extraction" (DIPRE), proposed by Sergey Brin, is selected for further investigation. DIPRE exploits the duality between sets of patterns and relations to grow the target relation starting from a small sample: known pairs are used to find patterns in the text where they occur, and those patterns are then used to find new pairs.

This project, built in Java using the "Google AJAX Search API", includes designing, implementing and testing the DIPRE approach in extracting various relationships from the web.

Keywords: Pattern Extraction, Machine Learning, DIPRE...
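
The pattern/relation duality described in the abstract can be sketched in a few lines of Java. This is a minimal illustration under stated assumptions, not the thesis implementation: an in-memory list of strings stands in for web search results (no Google AJAX Search API calls are made), a pattern is reduced to a (prefix, middle, suffix) triple with a fixed context width, and the class name `DipreSketch`, its methods, and the `CONTEXT` constant are hypothetical. The URL prefixes and pattern-specificity checks from Brin's original formulation are left out.

```java
import java.util.*;
import java.util.regex.*;

class DipreSketch {
    // Assumed context width (in characters) kept on each side of an occurrence.
    static final int CONTEXT = 3;

    // Step 1: turn occurrences of known pairs into (prefix, middle, suffix) patterns.
    static Set<List<String>> findPatterns(List<String> corpus, Set<List<String>> relation) {
        Set<List<String>> patterns = new LinkedHashSet<>();
        for (String doc : corpus) {
            for (List<String> pair : relation) {
                int a = doc.indexOf(pair.get(0));
                if (a < 0) continue;
                int aEnd = a + pair.get(0).length();
                int b = doc.indexOf(pair.get(1), aEnd); // second item must follow the first
                if (b < 0) continue;
                int bEnd = b + pair.get(1).length();
                String prefix = doc.substring(Math.max(0, a - CONTEXT), a);
                String middle = doc.substring(aEnd, b);
                String suffix = doc.substring(bEnd, Math.min(doc.length(), bEnd + CONTEXT));
                // Skip patterns with an empty part; they would match almost anything.
                if (prefix.isEmpty() || middle.isEmpty() || suffix.isEmpty()) continue;
                patterns.add(Arrays.asList(prefix, middle, suffix));
            }
        }
        return patterns;
    }

    // Step 2: apply each pattern to the corpus to extract candidate pairs.
    static Set<List<String>> applyPatterns(List<String> corpus, Set<List<String>> patterns) {
        Set<List<String>> found = new LinkedHashSet<>();
        for (List<String> p : patterns) {
            Pattern regex = Pattern.compile(Pattern.quote(p.get(0)) + "(.+?)"
                    + Pattern.quote(p.get(1)) + "(.+?)" + Pattern.quote(p.get(2)));
            for (String doc : corpus) {
                Matcher m = regex.matcher(doc);
                while (m.find()) {
                    found.add(Arrays.asList(m.group(1), m.group(2)));
                }
            }
        }
        return found;
    }

    // Alternate the two steps until no new pairs appear or the iteration cap is hit.
    static Set<List<String>> run(List<String> corpus, Set<List<String>> seeds, int maxIterations) {
        Set<List<String>> relation = new LinkedHashSet<>(seeds);
        for (int i = 0; i < maxIterations; i++) {
            Set<List<String>> patterns = findPatterns(corpus, relation);
            if (!relation.addAll(applyPatterns(corpus, patterns))) break;
        }
        return relation;
    }

    public static void main(String[] args) {
        // Toy corpus (made up for illustration): three list items sharing one markup layout.
        List<String> corpus = Arrays.asList(
                "<li><b>Isaac Asimov</b>, <i>Foundation</i></li>",
                "<li><b>Frank Herbert</b>, <i>Dune</i></li>",
                "<li><b>Ursula K. Le Guin</b>, <i>The Dispossessed</i></li>");
        Set<List<String>> seeds = new LinkedHashSet<>();
        seeds.add(Arrays.asList("Isaac Asimov", "Foundation"));
        for (List<String> pair : run(corpus, seeds, 5)) {
            System.out.println(pair.get(0) + " | " + pair.get(1));
        }
    }
}
```

On this toy corpus the single seed pair yields one pattern, and that pattern recovers the other two (author, title) pairs. Without specificity checks a loose pattern can quickly pollute the relation, so a practical implementation would filter patterns before applying them.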
Keywords/Search Tags:Pattern, Extraction, DIPRE