Hostname: page-component-745bb68f8f-mzp66 Total loading time: 0 Render date: 2025-02-07T06:37:36.659Z Has data issue: false hasContentIssue false

The Danish SIMPLE lexicon and its application in content-based querying

Published online by Cambridge University Press:  04 May 2004

Bolette Sandford Pedersen
Affiliation:
Center for Sprogteknologi, Københavns Universitet, Njalsgade 80, DK-2300 S. E-mail: bolette@cst.dk
Patrizia Paggio
Affiliation:
Center for Sprogteknologi, Københavns Universitet, Njalsgade 80, DK-2300 S. E-mail: patrizia@cst.dk

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

This paper deals with the SIMPLE-DK lexicon, a computational lexicon for Danish developed at the Centre for Language Technology in Copenhagen within the European Union project SIMPLE. The general SIMPLE model, on which the Danish lexicon is based, is presented, and the way in which several specific aspects of Danish, such as nominal compounds and time expressions, are accommodated in this model is then described. Phrasal verbs – in particular phrasal motion verbs – are shown to be a challenging phenomenon since they are difficult to place in the SIMPLE event ontology, and pose problems regarding the interpretation of the directional particle they combine with. The encoding strategy that is proposed here accounts for compositional and non-compositional types of phrasal verb, and captures the relation between act-denoting and transition-denoting senses of the same verb in terms of regular polysemy. The final part of the paper deals with the exploitation of SIMPLE-DK as an ontological and lexical source in the Danish project on content-based querying OntoQuery. In the OntoQuery ontology, the structured concepts in SIMPLE-DK are combined with nutrition concepts, and the resulting ontology is used for matching evaluation. It is also discussed how selectional restrictions and qualia roles from SIMPLE-DK can be included in a conceptual grammar to be used for query and text analysis.

Type
Research Article
Copyright
© 2004 Cambridge University Press