Use of ontologies in information extraction

dc.contributor.authorWimalasuriya, Daya Chinthana
dc.date.accessioned2011-06-07T23:54:44Z
dc.date.available2011-06-07T23:54:44Z
dc.date.issued2011-03
dc.descriptionxiii, 149 p. : ill. (some col.)en_US
dc.description.abstractInformation extraction (IE) aims to recognize and retrieve certain types of information from natural language text. For instance, an information extraction system may extract key geopolitical indicators about countries from a set of web pages while ignoring other types of information. IE has existed as a research field for a few decades, and ontology-based information extraction (OBIE) has recently emerged as one of its subfields. Here, the general idea is to use ontologies--which provide formal and explicit specifications of shared conceptualizations--to guide the information extraction process. This dissertation presents two novel directions for ontology-based information extraction in which ontologies are used to improve the information extraction process. First, I describe how a component-based approach for information extraction can be designed through the use of ontologies in information extraction. A key idea in this approach is identifying components of information extraction systems which make extractions with respect to specific ontological concepts. These components are termed "information extractors". The component-based approach explores how information extractors as well as other types of components can be used in developing information extraction systems. This approach has the potential to make a significant contribution towards the widespread usage and commercialization of information extraction. Second, I describe how an ontology-based information extraction system can make use of multiple ontologies. Almost all previous systems use a single ontology, although multiple ontologies are available for most domains. Using multiple ontologies in information extraction has the potential to extract more information from text and thus leads to an improvement in performance measures. The concept of information extractor, conceived in the component-based approach for information extraction, is used in designing the principles for accommodating multiple ontologies in an ontology-based information extraction system.en_US
dc.description.sponsorshipCommittee in charge: Dr. Dejing Dou, Chair; Dr. Arthur Farley, Member; Dr. Michal Young, Member; Dr. Monte Westerfield, Outside Memberen_US
dc.identifier.urihttps://hdl.handle.net/1794/11216
dc.language.isoen_USen_US
dc.publisherUniversity of Oregonen_US
dc.relation.ispartofseriesUniversity of Oregon theses, Dept. of Computer and Information Science, Ph. D., 2011;
dc.subjectInformation extractionen_US
dc.subjectOntologies (Information retrieval)en_US
dc.subjectSoftware componentsen_US
dc.subjectComputer scienceen_US
dc.titleUse of ontologies in information extractionen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Wimalasuriya_Daya_Chinthana_phd2011wi.pdf
Size:
1.21 MB
Format:
Adobe Portable Document Format
Description:
thesis
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
2.13 KB
Format:
Item-specific license agreed upon to submission
Description: