Please join us for an upcoming talk.
Lunch will be provided by Yahoo!
Linguistic Pattern Learning for Web Information Extraction
Who: Justin Betteridge
When: Friday, May 23rd, 12:00pm
Where: NSH 3002
Most approaches to automatically extracting structured information from the web
rely on surface text patterns. However, the manner in which such patterns are
defined, learned, and employed in the larger system varies with each case. In
this talk, I will outline the spectrum of previous work in this area and argue
for a linguistically-motivated definition, a hybrid heuristic/classifier-based
assessment, and a multi-purpose employment of textual patterns in the context of
Web Information Extraction (WIE). I will also give preliminary results from
adopting such an approach in our WIE system.