Semantic component selection

Sjachyn, M. 2009. Semantic component selection. PhD thesis University of Westminster School of Electronics and Computer Science https://doi.org/10.34737/90z98

TitleSemantic component selection
TypePhD thesis
AuthorsSjachyn, M.
Abstract

The means of locating information quickly and efficiently is a growing area of research. However the real challenge is not related to locating bits of information, but finding those that are relevant. Relevant information resides within unstructured ‘natural’ text. However, understanding natural text and judging information relevancy is a challenge. The challenge is partially addressed by use of semantic models and reasoning approaches that allow categorisation and (within limited fashion) provide understanding of this information. Nevertheless, many such methods are dependent on expert input and, consequently, are expensive to produce and do not scale.
Although automated solutions exist, thus far, these have not been able to approach accuracy levels achievable through use of expert input. This thesis presents SemaCS - a novel nondomain specific automated framework of categorising and searching natural text. SemaCS does not rely on expert input; it is based on actual data being searched and statistical semantic distances between words. These semantic distances are used to perform basic reasoning and semantic query interpretation. The approach was tested through a feasibility study and two case studies. Based on reasoning and analyses of data collected through these studies, it can be concluded that SemaCS provides a domain independent approach of semantic model generation and query interpretation without expert input. Moreover, SemaCS can be further extended to provide a scalable solution applicable to large datasets (i.e. World Wide Web).
This thesis contributes to the current body of knowledge by establishing, adapting, and using novel techniques to define a generic selection/categorisation framework.
Implementing the framework outlined in the thesis improves an existing algorithm of semantic distance acquisition. Finally, as a novel approach to the extraction of semantic information is proposed, there exists a positive impact on Information Retrieval domain and, specifically, on Natural Language Processing, word disambiguation and Web/Intranet search.

Year2009
File
PublisherUniversity of Westminster
Publication dates
Published2009
Digital Object Identifier (DOI)https://doi.org/10.34737/90z98

Related outputs

Semantic distance acquisition in SemaCS
Sjachyn, M. and Beus-Dukic, L. 2010. Semantic distance acquisition in SemaCS. in: Proceedings of the 4th IEEE international conference on research challenges in information science (RCIS 2010), Nice, France. IEEE . pp. 183-190

Semantic component selection - SemaCS
Sjachyn, M. and Beus-Dukic, L. 2006. Semantic component selection - SemaCS. in: Fifth International Conference on Commercial-off-the-Shelf (COTS)-Based Software Systems, 2006 Los Alamitos, USA IEEE . pp. 83-89

Permalink - https://westminsterresearch.westminster.ac.uk/item/90z98/semantic-component-selection


Share this

Usage statistics

116 total views
264 total downloads
These values cover views and downloads from WestminsterResearch and are for the period from September 2nd 2018, when this repository was created.