Links in the Japanese WordNet

I looked up which conceptual relations (links) are actually defined, and how many rows each one has.

sqlite> select link,count(*) from synlink group by link;
also|2692
attr|1278
caus|220
dmnc|6643
dmnr|1345
dmnu|967
dmtc|6643
dmtr|1345
dmtu|967
enta|408
hasi|8577
hmem|12293
hprt|9097
hsub|797
hype|89089
hypo|89089
inst|8577
mmem|12293
mprt|9097
msub|797
sim|21386
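
Most of the counts appear exactly twice (hype and hypo both have 89089 rows, for example), which suggests that many links are stored in both directions as inverse pairs. Here is a minimal Ruby sketch that groups the counts above by value to surface the candidate pairs. Pairing by equal count is only a heuristic assumed here, not something the database itself states, and `COUNTS` is just the query output copied by hand.

```ruby
# Row counts copied from the query output above.
COUNTS = {
  "also" => 2692, "attr" => 1278, "caus" => 220,
  "dmnc" => 6643, "dmnr" => 1345, "dmnu" => 967,
  "dmtc" => 6643, "dmtr" => 1345, "dmtu" => 967,
  "enta" => 408,  "hasi" => 8577, "hmem" => 12293,
  "hprt" => 9097, "hsub" => 797,  "hype" => 89089,
  "hypo" => 89089, "inst" => 8577, "mmem" => 12293,
  "mprt" => 9097, "msub" => 797,  "sim" => 21386
}.freeze

# Group link codes that share the same row count
# (a heuristic for spotting inverse pairs, nothing more).
def group_by_count(counts)
  counts.group_by { |_link, n| n }
        .transform_values { |pairs| pairs.map(&:first).sort }
end

groups   = group_by_count(COUNTS)
paired   = groups.values.select { |links| links.size == 2 }
unpaired = groups.values.select { |links| links.size == 1 }.flatten

paired.each { |a, b| puts "#{a} <-> #{b}" }
puts "no partner: #{unpaired.join(', ')}"
```

The codes left without a partner (also, attr, caus, enta, sim) may be symmetric or stored in one direction only; the counts alone cannot tell which.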

This differs quite a bit from the usage text in wn.rb. Rebuilt from the relations that are actually registered, it looks something like this.

  def self.print_usage
    puts <<-EOS
usage: wn.rb word link [lang]
    word      word to investigate

    link
      hype - Hypernyms
      inst - Instances
      hypo - Hyponyms
      hasi - Has Instances
      mmem - Meronyms --- Member
      msub - Meronyms --- Substance
      mprt - Meronyms --- Part
      hmem - Holonyms --- Member
      hsub - Holonyms --- Substance
      hprt - Holonyms --- Part
      attr - Attributes
      sim  - Similar to
      also - Also
      enta - Entails
      caus - Causes
      dmnc - Domain --- Category
      dmnu - Domain --- Usage
      dmnr - Domain --- Region
      dmtc - In Domain --- Category
      dmtu - In Domain --- Usage
      dmtr - In Domain --- Region

    lang (default: jpn)
      jpn - Japanese
      eng - English
    EOS
  end
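
The same list could also be kept as data, so that the usage text and the argument check cannot drift apart again. This is a rough sketch only: the `LINKS` hash and the `valid_link?` and `link_usage` helpers are hypothetical names that do not exist in the original wn.rb, and the descriptions are adapted from the usage text above.

```ruby
# Hypothetical table of the link codes found in synlink.
# Descriptions are adapted from the usage text above.
LINKS = {
  "hype" => "Hypernyms",
  "inst" => "Instances",
  "hypo" => "Hyponyms",
  "hasi" => "Has Instances",
  "mmem" => "Meronyms --- Member",
  "msub" => "Meronyms --- Substance",
  "mprt" => "Meronyms --- Part",
  "hmem" => "Holonyms --- Member",
  "hsub" => "Holonyms --- Substance",
  "hprt" => "Holonyms --- Part",
  "attr" => "Attributes",
  "sim"  => "Similar to",
  "also" => "Also",
  "enta" => "Entails",
  "caus" => "Causes",
  "dmnc" => "Domain --- Category",
  "dmnu" => "Domain --- Usage",
  "dmnr" => "Domain --- Region",
  "dmtc" => "In Domain --- Category",
  "dmtu" => "In Domain --- Usage",
  "dmtr" => "In Domain --- Region"
}.freeze

# True when the code is one of the links listed in the table.
def valid_link?(code)
  LINKS.key?(code)
end

# Render the "link" part of the usage text from the table,
# padding each code to four characters so the dashes line up.
def link_usage
  LINKS.map { |code, desc| format("      %-4s - %s", code, desc) }.join("\n")
end

puts link_usage
```

With a table like this, an unknown code such as one carrying a trailing g could be rejected up front with `valid_link?` before it ever reaches the database.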

In the wn.rb usage, the codes from enta through dmtr all had a trailing g, so they may be meant as groups.

sqlite> select s1.name,slink.link,s2.name
 from synlink slink, synset s1, synset s2
 where slink.link='enta'
 and slink.synset1=s1.synset
 and slink.synset2=s2.synset limit 10;

queue_up|enta|wait
harp|enta|pick_off
licence|enta|o.k.
rout_out|enta|look
exercise|enta|do_work
exercise|enta|move
asphyxiate|enta|cover
address|enta|speak
immigrate|enta|come
share|enta|part
sqlite>
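
The join above works for any link code, not just enta. Below is a small sketch of how it might be parameterized from Ruby. `link_pairs_sql` is a hypothetical helper, and actually running the query assumes the sqlite3 gem and a local copy of the database (the file name in the comment is an assumption), so that part is shown only as a comment.

```ruby
# Hypothetical helper: SQL that lists synset-name pairs for one link type.
# The link code and the row limit are bound as parameters (the two "?")
# instead of being interpolated into the string.
def link_pairs_sql
  <<~SQL
    select s1.name, slink.link, s2.name
      from synlink slink, synset s1, synset s2
     where slink.link = ?
       and slink.synset1 = s1.synset
       and slink.synset2 = s2.synset
     limit ?
  SQL
end

puts link_pairs_sql

# Assumed usage with the sqlite3 gem (not executed here):
#   require "sqlite3"
#   db = SQLite3::Database.new("wnjpn.db")   # file name is an assumption
#   db.execute(link_pairs_sql, ["caus", 10]).each { |row| puts row.join("|") }
```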