Specification of Requirements on Terminological Analysis
The ontology-lexicon model must be able to represent the lexico-syntactic and -semantic structure of nominal compounds and terms.
The lexico-syntactic structure comprises i) the constituency structure as well as ii) the dependency structure between the components of the nominal compound. The semantic structure comprises a specification of the classes and relations that each component refers to. The ontology-lexicon model should thus allow to represent the correct syntactic and semantic analysis of a term in the domain described by the ontology.
Take the example of the term small appliance industry. This term could have one of the following readings:
[small/ADJ [appliance/NN industry/NN] ]
or
[ [small/ADJ appliance/NN] industry/NN]
Further, the ontology-lexicon model should allow to specify that the right interpretation of such a compound with respect to a given OWL ontology could be:
Industry ⊓ ∀manufactures.(Appliance ⊓ ∀size.{small})
Real-life examples:
- xEBR Core Reference Taxonomy (http://www.xbrleurope.org/working-groups/xebr-wg/xebr-taxonomy) defined by the XBRL Europe Business Registers Working Group
EN term/label: tangible fixed assets
DE term/label: Sachanlagen
xEBR does not have URIs
- STW Thesaurus for Economics (http://zbw.eu/stw/versions/latest/about.en.html)
also in German: Standard-Thesaurus Wirtschaft (http://zbw.eu/stw/versions/latest/about.de.html)
DE term/label: Immaterielles Anlagevermögen (http://zbw.eu/stw/descriptor/12376-2 -- URI for concept NOT for DE term)
EN term/label: Intangible assets (http://zbw.eu/stw/descriptor/12376-2 -- URI for concept NOT for EN term)
# Examples: Intangible assets
@prefix lemon: <http://www.monnet-project.eu/lemon#> .
@prefix zbw: <http://zbw.eu/stw/descriptor/> .
@prefix dbpedia: <http://dbpedia.org/page/> .
:lexicon a lemon:Lexicon ;
lemon:language "en" ;
lemon:entry :intangible,
:asset.
:Intangible assets
lemon:canonicalForm [ lemon:writtenRep "Intangible assets" ] ;
lemon:sense [ lemon:ref zbw:12376-2 ] .
:intangible
lemon:canonicalForm [ lemon:writtenRep "intangible"] ;
lexinfo:partOfSpeech lexinfo:adjective . # partOfSpeech=adjective
:asset
lemon:canonicalForm [ lemon:writtenRep "asset" ;
isocat:DC-1298 isocat:DC-1387 # number=singular] ;
lemon:altForm [ lemon:writtenRep "assets" ;
isocat:DC-1298 isocat:DC-1354 # number=plural] ;
lemon:sense [ lemon:ref dbpedia:Asset ] .
lexinfo:partOfSpeech lexinfo:noun . # partOfSpeech=noun
:Intangible Assets lemon:Phrase;
lemon:decomposition (
[ lemon:element :intangible ]
[ lemon:element :asset ] ) .
# Examples: Immaterielles Anlagevermögen & Intangible assets
@prefix lemon: <http://www.monnet-project.eu/lemon#> .
@prefix zbw: <http://zbw.eu/stw/descriptor/> .
@prefix dbpedia: <http://dbpedia.org/page/> .
:lexicon a lemon:Lexicon ;
lemon:language "de" ;
lemon:entry :immateriell,
:Anlagevermögen.
:Immaterielles Anlagevermögen
lemon:canonicalForm [ lemon:writtenRep "Immaterielles Anlagevermögen" ] ;
lemon:sense [ lemon:ref zbw:12376-2 ] .
:immateriell
lemon:canonicalForm [ lemon:writtenRep "immateriell" ;
lemon:altForm [ lemon:writtenRep "immaterieller" ;
isocat:DC-1297 isocat:DC-1883 # gender=masculine
isocat:DC-1298 isocat:DC-1387 # number=singular] ;
[ lemon:writtenRep "immaterielle" ;
isocat:DC-1297 isocat:DC-1880 # gender=feminine
isocat:DC-1298 isocat:DC-1387 # number=singular] ;
[ lemon:writtenRep "immaterielles" ;
isocat:DC-1297 isocat:DC-1884 # gender=neuter
isocat:DC-1298 isocat:DC-1387 # number=singular] ;
lexinfo:partOfSpeech lexinfo:adjective . # partOfSpeech=adjective
:Anlagevermögen
lemon:canonicalForm [ lemon:writtenRep "Anlagevermögen" ;
isocat:DC-1297 isocat:DC-1884 # gender=neuter
isocat:DC-1298 isocat:DC-1387 # number=singular] ;
lemon:altForm [ lemon:writtenRep "Anlagevermögen" ;
isocat:DC-1297 isocat:DC-1884 # gender=neuter
isocat:DC-1298 isocat:DC-1354 # number=plural] ;
lemon:sense [ lemon:ref dbpedia:Asset ]
lexinfo:partOfSpeech lexinfo:noun . # partOfSpeech=noun
:Immaterielles Anlagevermögen lemon:Phrase;
lemon:decomposition (
[ lemon:element :immateriell ]
[ lemon:element :Anlagevermögen ] ) .
- RadLex (http://rsna.org/RadLex.aspx, http://www.radlex.org/) defined by the Radiological Society of North America (RSNA®)
EN term/label: free lower limb segment (URI http://www.radlex.org/RID/RID34535)
EN term/label: left upper lobe posterior segment artery (URI http://www.radlex.org/RID/RID35837)
#Example: left upper lobe posterior segment artery
@prefix lemon: <http://www.monnet-project.eu/lemon#> .
@prefix radlex: <http://http://www.radlex.org/RID/> .
:left upper lobe posterior segment artery
lemon:canonicalForm [ lemon:writtenRep "left upper lobe posterior segment artery" ] ;
lemon:sense [ lemon:ref radlex:RID35837 ] .
:left upper lobe artery
lemon:canonicalForm [ lemon:writtenRep "left upper lobe artery" ] ;
lemon:sense [ lemon:ref radlex:RID994 ] .
:posterior segmental artery
lemon:canonicalForm [ lemon:writtenRep "posterior segmental artery" ] ;
lemon:sense [ lemon:ref radlex: RID35836 ] .
:lexicon a lemon:Lexicon ;
lemon:language "en" ;
lemon:entry :left,
:upper,
:lobe,
:posterior,
:segment,
:artery.
:left
lemon:canonicalForm [ lemon:writtenRep "left" ;
lexinfo:partOfSpeech lexinfo:adjective .
:upper
lemon:canonicalForm [ lemon:writtenRep "upper" ;
lexinfo:partOfSpeech lexinfo:adjective .
:lobe
lemon:canonicalForm [ lemon:writtenRep "lobe" ;
isocat:DC-1298 isocat:DC-1387 # number=singular] ;
lemon:altForm [ lemon:writtenRep "lobes" ;
isocat:DC-1298 isocat:DC-1354 # number=plural] ;
lexinfo:partOfSpeech lexinfo:noun .
:posterior
lemon:canonicalForm [ lemon:writtenRep "posterior" ;
isocat:DC-1298 isocat:DC-1387 # number=singular] ;
lemon:altForm [ lemon:writtenRep "posteriors" ;
isocat:DC-1298 isocat:DC-1354 # number=plural] ;
lexinfo:partOfSpeech lexinfo:noun .
:segment
lemon:canonicalForm [ lemon:writtenRep "segment" ;
isocat:DC-1298 isocat:DC-1387 # number=singular] ;
lemon:altForm [ lemon:writtenRep "segments" ;
isocat:DC-1298 isocat:DC-1354 # number=plural] ;
lexinfo:partOfSpeech lexinfo:noun .
:artery
lemon:canonicalForm [ lemon:writtenRep "artery" ;
isocat:DC-1298 isocat:DC-1387 # number=singular] ;
lemon:altForm [ lemon:writtenRep "arteries" ;
isocat:DC-1298 isocat:DC-1354 # number=plural] ;
lemon:sense [ lemon:ref radlex:RID478 ] .
lexinfo:partOfSpeech lexinfo:noun .
:left upper lobe posterior segment artery:Phrase ;
lemon:decomposition (
[ lemon:element :artery ]
[ lemon:element :left upper lobe artery ]
[ lemon:element :posterior segment artery ] ) .
It is not the goal of the ontology-lexicon model to specify how the compositional structure of the term relates to the composition of the complex concept it refers to.