Use of ontologies in information extraction

Wimalasuriya, Daya Chinthana

Use of ontologies in information extraction

dc.contributor.author	Wimalasuriya, Daya Chinthana
dc.date.accessioned	2011-06-07T23:54:44Z
dc.date.available	2011-06-07T23:54:44Z
dc.date.issued	2011-03
dc.description	xiii, 149 p. : ill. (some col.)	en_US
dc.description.abstract	Information extraction (IE) aims to recognize and retrieve certain types of information from natural language text. For instance, an information extraction system may extract key geopolitical indicators about countries from a set of web pages while ignoring other types of information. IE has existed as a research field for a few decades, and ontology-based information extraction (OBIE) has recently emerged as one of its subfields. Here, the general idea is to use ontologies--which provide formal and explicit specifications of shared conceptualizations--to guide the information extraction process. This dissertation presents two novel directions for ontology-based information extraction in which ontologies are used to improve the information extraction process. First, I describe how a component-based approach for information extraction can be designed through the use of ontologies in information extraction. A key idea in this approach is identifying components of information extraction systems which make extractions with respect to specific ontological concepts. These components are termed "information extractors". The component-based approach explores how information extractors as well as other types of components can be used in developing information extraction systems. This approach has the potential to make a significant contribution towards the widespread usage and commercialization of information extraction. Second, I describe how an ontology-based information extraction system can make use of multiple ontologies. Almost all previous systems use a single ontology, although multiple ontologies are available for most domains. Using multiple ontologies in information extraction has the potential to extract more information from text and thus leads to an improvement in performance measures. The concept of information extractor, conceived in the component-based approach for information extraction, is used in designing the principles for accommodating multiple ontologies in an ontology-based information extraction system.	en_US
dc.description.sponsorship	Committee in charge: Dr. Dejing Dou, Chair; Dr. Arthur Farley, Member; Dr. Michal Young, Member; Dr. Monte Westerfield, Outside Member	en_US
dc.identifier.uri	https://hdl.handle.net/1794/11216
dc.language.iso	en_US	en_US
dc.publisher	University of Oregon	en_US
dc.relation.ispartofseries	University of Oregon theses, Dept. of Computer and Information Science, Ph. D., 2011;
dc.subject	Information extraction	en_US
dc.subject	Ontologies (Information retrieval)	en_US
dc.subject	Software components	en_US
dc.subject	Computer science	en_US
dc.title	Use of ontologies in information extraction	en_US
dc.type	Thesis	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Wimalasuriya_Daya_Chinthana_phd2011wi.pdf
Size:: 1.21 MB
Format:: Adobe Portable Document Format
Description:: thesis

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 2.13 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Theses and Dissertations
Computer Science Theses and Dissertations