Content In Context: A Data-driven Approach
Price
Free (open access)
Volume
25
Pages
5
Published
2000
Size
537 kb
Paper DOI
10.2495/DATA000201
Copyright
WIT Press
Author(s)
Judi Vernau
Abstract
The dramatic increase in document availability via the Internet means that traditional methods of cataloguing, classifying and retrieving information have been unable to cope with the demand. In order to solve this problem, the emphasis has shifted from the previous labour-intensive methods to technical solutions which rely on statistical and algorithmic means of identifying topics and proper names in documents. While these various solutions have provided a great improvement in retrieval of relevant information, it is still not possible to identify the exact nature of particular terms or relationships between terms and topics. However, using a specially-devised ontology covering topic and type, and a vocabulary set classified using this ontology, it is possible to disambiguate between different senses of terms, and therefore to identify the main topic of a document and the nature of the lexical terms and prop
Keywords