Document (#36741)

Author
Bouramoul, A.
Title
¬The semantic dimension in information retrieval, from document indexing to query reformulation
Source
Knowledge organization. 38(2011) no.5, pp. 425-437
Year
2011
Abstract
In the context of this research, we present an approach for representing the semantic content of documents and guiding automatic query reformulation using a domain ontology. The aim is to improve the performance of information retrieval systems. To operationalize our proposal, a set of external resources had to be developed, so we constructed the 'AnimOnto' domain ontology for the animal domain and a document base covering the same field. Both were used to test and validate our proposal. Specifically, we propose in this paper a general architecture based on three complementary processes; this architecture uses the ontology during the semantic indexing stage and in the query reformulation phase. We also describe the 'AnimeSe Finder' tool (Animal Semantic Finder), which has the advantage of being generic and adaptable to other search types: another ontology with another document base can be used for a new domain while still exploiting the general functionalities offered by the 'AnimeSe Finder' tool.
Content
Contribution within a special section: Knowledge Organization, Competitive Intelligence, and Information Systems - Papers from 4th International Conference on "Information Systems & Economic Intelligence," February 17-19th, 2011. Marrakech - Morocco. Cf.: http://www.ergon-verlag.de/isko_ko/downloads/ko_38_2011_5f.pdf.
Theme
Knowledge representation
Field
Biology
Object
AnimSe Finder

Similar documents (content)

  1. Burke, R.D.: Question answering from frequently asked question files : experiences with the FAQ Finder System (1997) 0.39
    0.3946839 = sum of:
      0.3946839 = product of:
        1.6445162 = sum of:
          0.020800496 = weight(abstract_txt:retrieval in 1191) [ClassicSimilarity], result of:
            0.020800496 = score(doc=1191,freq=1.0), product of:
              0.0547247 = queryWeight, product of:
                1.0515922 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014974895 = queryNorm
              0.38009337 = fieldWeight in 1191, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.109375 = fieldNorm(doc=1191)
          0.12528953 = weight(abstract_txt:base in 1191) [ClassicSimilarity], result of:
            0.12528953 = score(doc=1191,freq=2.0), product of:
              0.1437918 = queryWeight, product of:
                1.7046009 = boost
                5.633102 = idf(docFreq=429, maxDocs=44218)
                0.014974895 = queryNorm
              0.871326 = fieldWeight in 1191, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.633102 = idf(docFreq=429, maxDocs=44218)
                0.109375 = fieldNorm(doc=1191)
          0.05880469 = weight(abstract_txt:document in 1191) [ClassicSimilarity], result of:
            0.05880469 = score(doc=1191,freq=1.0), product of:
              0.12524854 = queryWeight, product of:
                1.9484427 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.014974895 = queryNorm
              0.46950403 = fieldWeight in 1191, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.109375 = fieldNorm(doc=1191)
          0.07986587 = weight(abstract_txt:query in 1191) [ClassicSimilarity], result of:
            0.07986587 = score(doc=1191,freq=1.0), product of:
              0.15360506 = queryWeight, product of:
                2.1577644 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.014974895 = queryNorm
              0.519943 = fieldWeight in 1191, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.109375 = fieldNorm(doc=1191)
          0.1255693 = weight(abstract_txt:semantic in 1191) [ClassicSimilarity], result of:
            0.1255693 = score(doc=1191,freq=2.0), product of:
              0.18143591 = queryWeight, product of:
                2.707898 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.014974895 = queryNorm
              0.6920863 = fieldWeight in 1191, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.109375 = fieldNorm(doc=1191)
          1.2341863 = weight(title_txt:finder in 1191) [ClassicSimilarity], result of:
            1.2341863 = score(doc=1191,freq=1.0), product of:
              0.5492084 = queryWeight, product of:
                4.0800915 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.014974895 = queryNorm
              2.2472093 = fieldWeight in 1191, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.25 = fieldNorm(doc=1191)
        0.24 = coord(6/25)
    
  2. Stebelman, S.: Analysis of retrieval performance in four cross-disciplinary databases : Article1st, Faxon Finder, UnCover, and a locally mounted database (1994) 0.19
    0.18649782 = sum of:
      0.18649782 = product of:
        1.1656114 = sum of:
          0.014857496 = weight(abstract_txt:retrieval in 952) [ClassicSimilarity], result of:
            0.014857496 = score(doc=952,freq=1.0), product of:
              0.0547247 = queryWeight, product of:
                1.0515922 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014974895 = queryNorm
              0.27149525 = fieldWeight in 952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=952)
          0.028837603 = weight(abstract_txt:general in 952) [ClassicSimilarity], result of:
            0.028837603 = score(doc=952,freq=1.0), product of:
              0.085151516 = queryWeight, product of:
                1.3117523 = boost
                4.3348765 = idf(docFreq=1574, maxDocs=44218)
                0.014974895 = queryNorm
              0.33866224 = fieldWeight in 952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3348765 = idf(docFreq=1574, maxDocs=44218)
                0.078125 = fieldNorm(doc=952)
          0.042003352 = weight(abstract_txt:document in 952) [ClassicSimilarity], result of:
            0.042003352 = score(doc=952,freq=1.0), product of:
              0.12524854 = queryWeight, product of:
                1.9484427 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.014974895 = queryNorm
              0.33536002 = fieldWeight in 952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=952)
          1.0799129 = weight(title_txt:finder in 952) [ClassicSimilarity], result of:
            1.0799129 = score(doc=952,freq=1.0), product of:
              0.5492084 = queryWeight, product of:
                4.0800915 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.014974895 = queryNorm
              1.9663081 = fieldWeight in 952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.21875 = fieldNorm(doc=952)
        0.16 = coord(4/25)
    
  3. Kiryakov, A.; Popov, B.; Terziev, I.; Manov, D.; Ognyanoff, D.: Semantic annotation, indexing, and retrieval (2004) 0.17
    0.17425404 = sum of:
      0.17425404 = product of:
        0.484039 = sum of:
          0.015440364 = weight(abstract_txt:retrieval in 700) [ClassicSimilarity], result of:
            0.015440364 = score(doc=700,freq=3.0), product of:
              0.0547247 = queryWeight, product of:
                1.0515922 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014974895 = queryNorm
              0.28214616 = fieldWeight in 700, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.046875 = fieldNorm(doc=700)
          0.0077537806 = weight(abstract_txt:this in 700) [ClassicSimilarity], result of:
            0.0077537806 = score(doc=700,freq=3.0), product of:
              0.039577752 = queryWeight, product of:
                1.0952843 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.014974895 = queryNorm
              0.1959126 = fieldWeight in 700, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.046875 = fieldNorm(doc=700)
          0.024719482 = weight(abstract_txt:indexing in 700) [ClassicSimilarity], result of:
            0.024719482 = score(doc=700,freq=2.0), product of:
              0.08573043 = queryWeight, product of:
                1.316204 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.014974895 = queryNorm
              0.28833964 = fieldWeight in 700, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.046875 = fieldNorm(doc=700)
          0.03796846 = weight(abstract_txt:base in 700) [ClassicSimilarity], result of:
            0.03796846 = score(doc=700,freq=1.0), product of:
              0.1437918 = queryWeight, product of:
                1.7046009 = boost
                5.633102 = idf(docFreq=429, maxDocs=44218)
                0.014974895 = queryNorm
              0.26405165 = fieldWeight in 700, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.633102 = idf(docFreq=429, maxDocs=44218)
                0.046875 = fieldNorm(doc=700)
          0.039141666 = weight(abstract_txt:architecture in 700) [ClassicSimilarity], result of:
            0.039141666 = score(doc=700,freq=1.0), product of:
              0.14673881 = queryWeight, product of:
                1.7219802 = boost
                5.690534 = idf(docFreq=405, maxDocs=44218)
                0.014974895 = queryNorm
              0.26674378 = fieldWeight in 700, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.690534 = idf(docFreq=405, maxDocs=44218)
                0.046875 = fieldNorm(doc=700)
          0.025202012 = weight(abstract_txt:document in 700) [ClassicSimilarity], result of:
            0.025202012 = score(doc=700,freq=1.0), product of:
              0.12524854 = queryWeight, product of:
                1.9484427 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.014974895 = queryNorm
              0.20121601 = fieldWeight in 700, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.046875 = fieldNorm(doc=700)
          0.12620834 = weight(abstract_txt:semantic in 700) [ClassicSimilarity], result of:
            0.12620834 = score(doc=700,freq=11.0), product of:
              0.18143591 = queryWeight, product of:
                2.707898 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.014974895 = queryNorm
              0.6956084 = fieldWeight in 700, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.046875 = fieldNorm(doc=700)
          0.063803986 = weight(abstract_txt:domain in 700) [ClassicSimilarity], result of:
            0.063803986 = score(doc=700,freq=2.0), product of:
              0.2032438 = queryWeight, product of:
                2.8660207 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.014974895 = queryNorm
              0.3139283 = fieldWeight in 700, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.046875 = fieldNorm(doc=700)
          0.14380091 = weight(abstract_txt:ontology in 700) [ClassicSimilarity], result of:
            0.14380091 = score(doc=700,freq=4.0), product of:
              0.2773 = queryWeight, product of:
                3.347693 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.014974895 = queryNorm
              0.51857525 = fieldWeight in 700, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.046875 = fieldNorm(doc=700)
        0.36 = coord(9/25)
    
  4. Mestrovic, A.; Cali, A.: ¬An ontology-based approach to information retrieval (2017) 0.17
    0.17425019 = sum of:
      0.17425019 = product of:
        0.5445318 = sum of:
          0.014857496 = weight(abstract_txt:retrieval in 3489) [ClassicSimilarity], result of:
            0.014857496 = score(doc=3489,freq=1.0), product of:
              0.0547247 = queryWeight, product of:
                1.0515922 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014974895 = queryNorm
              0.27149525 = fieldWeight in 3489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=3489)
          0.010551559 = weight(abstract_txt:this in 3489) [ClassicSimilarity], result of:
            0.010551559 = score(doc=3489,freq=2.0), product of:
              0.039577752 = queryWeight, product of:
                1.0952843 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.014974895 = queryNorm
              0.2666033 = fieldWeight in 3489, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=3489)
          0.04078253 = weight(abstract_txt:general in 3489) [ClassicSimilarity], result of:
            0.04078253 = score(doc=3489,freq=2.0), product of:
              0.085151516 = queryWeight, product of:
                1.3117523 = boost
                4.3348765 = idf(docFreq=1574, maxDocs=44218)
                0.014974895 = queryNorm
              0.47894073 = fieldWeight in 3489, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3348765 = idf(docFreq=1574, maxDocs=44218)
                0.078125 = fieldNorm(doc=3489)
          0.1550056 = weight(abstract_txt:base in 3489) [ClassicSimilarity], result of:
            0.1550056 = score(doc=3489,freq=6.0), product of:
              0.1437918 = queryWeight, product of:
                1.7046009 = boost
                5.633102 = idf(docFreq=429, maxDocs=44218)
                0.014974895 = queryNorm
              1.0779865 = fieldWeight in 3489, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.633102 = idf(docFreq=429, maxDocs=44218)
                0.078125 = fieldNorm(doc=3489)
          0.05940171 = weight(abstract_txt:document in 3489) [ClassicSimilarity], result of:
            0.05940171 = score(doc=3489,freq=2.0), product of:
              0.12524854 = queryWeight, product of:
                1.9484427 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.014974895 = queryNorm
              0.4742707 = fieldWeight in 3489, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=3489)
          0.08067672 = weight(abstract_txt:query in 3489) [ClassicSimilarity], result of:
            0.08067672 = score(doc=3489,freq=2.0), product of:
              0.15360506 = queryWeight, product of:
                2.1577644 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.014974895 = queryNorm
              0.52522177 = fieldWeight in 3489, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=3489)
          0.06342208 = weight(abstract_txt:semantic in 3489) [ClassicSimilarity], result of:
            0.06342208 = score(doc=3489,freq=1.0), product of:
              0.18143591 = queryWeight, product of:
                2.707898 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.014974895 = queryNorm
              0.34955636 = fieldWeight in 3489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.078125 = fieldNorm(doc=3489)
          0.119834095 = weight(abstract_txt:ontology in 3489) [ClassicSimilarity], result of:
            0.119834095 = score(doc=3489,freq=1.0), product of:
              0.2773 = queryWeight, product of:
                3.347693 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.014974895 = queryNorm
              0.43214604 = fieldWeight in 3489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.078125 = fieldNorm(doc=3489)
        0.32 = coord(8/25)
    
  5. Kara, S.: ¬An ontology-based retrieval system using semantic indexing (2012) 0.17
    0.17190285 = sum of:
      0.17190285 = product of:
        0.5371964 = sum of:
          0.025733938 = weight(abstract_txt:retrieval in 3829) [ClassicSimilarity], result of:
            0.025733938 = score(doc=3829,freq=3.0), product of:
              0.0547247 = queryWeight, product of:
                1.0515922 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.014974895 = queryNorm
              0.47024357 = fieldWeight in 3829, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=3829)
          0.007461079 = weight(abstract_txt:this in 3829) [ClassicSimilarity], result of:
            0.007461079 = score(doc=3829,freq=1.0), product of:
              0.039577752 = queryWeight, product of:
                1.0952843 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.014974895 = queryNorm
              0.18851699 = fieldWeight in 3829, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=3829)
          0.028837603 = weight(abstract_txt:general in 3829) [ClassicSimilarity], result of:
            0.028837603 = score(doc=3829,freq=1.0), product of:
              0.085151516 = queryWeight, product of:
                1.3117523 = boost
                4.3348765 = idf(docFreq=1574, maxDocs=44218)
                0.014974895 = queryNorm
              0.33866224 = fieldWeight in 3829, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3348765 = idf(docFreq=1574, maxDocs=44218)
                0.078125 = fieldNorm(doc=3829)
          0.041199137 = weight(abstract_txt:indexing in 3829) [ClassicSimilarity], result of:
            0.041199137 = score(doc=3829,freq=2.0), product of:
              0.08573043 = queryWeight, product of:
                1.316204 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.014974895 = queryNorm
              0.48056605 = fieldWeight in 3829, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.078125 = fieldNorm(doc=3829)
          0.057047054 = weight(abstract_txt:query in 3829) [ClassicSimilarity], result of:
            0.057047054 = score(doc=3829,freq=1.0), product of:
              0.15360506 = queryWeight, product of:
                2.1577644 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.014974895 = queryNorm
              0.37138787 = fieldWeight in 3829, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=3829)
          0.12684415 = weight(abstract_txt:semantic in 3829) [ClassicSimilarity], result of:
            0.12684415 = score(doc=3829,freq=4.0), product of:
              0.18143591 = queryWeight, product of:
                2.707898 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.014974895 = queryNorm
              0.6991127 = fieldWeight in 3829, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.078125 = fieldNorm(doc=3829)
          0.13023935 = weight(abstract_txt:domain in 3829) [ClassicSimilarity], result of:
            0.13023935 = score(doc=3829,freq=3.0), product of:
              0.2032438 = queryWeight, product of:
                2.8660207 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.014974895 = queryNorm
              0.6408035 = fieldWeight in 3829, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.078125 = fieldNorm(doc=3829)
          0.119834095 = weight(abstract_txt:ontology in 3829) [ClassicSimilarity], result of:
            0.119834095 = score(doc=3829,freq=1.0), product of:
              0.2773 = queryWeight, product of:
                3.347693 = boost
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.014974895 = queryNorm
              0.43214604 = fieldWeight in 3829, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5314693 = idf(docFreq=475, maxDocs=44218)
                0.078125 = fieldNorm(doc=3829)
        0.32 = coord(8/25)
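
The score trees above follow the arithmetic of Lucene's ClassicSimilarity: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and the final score is the sum of the per-term products multiplied by the coord factor (matched terms / query terms). As a minimal sketch, the Python below recomputes the score of entry 5 (doc=3829) from the parameters printed in its tree. The function and variable names are illustrative assumptions, not part of Lucene or of this catalog, and the results agree with the listing only up to rounding, since Lucene works in 32-bit floats.

```python
import math

# Constants copied from the explain output above.
MAX_DOCS = 44218
QUERY_NORM = 0.014974895
FIELD_NORM = 0.078125      # fieldNorm(doc=3829) for abstract_txt
COORD = 8 / 25             # 8 of the 25 query terms matched


def classic_idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def term_score(freq, doc_freq, boost):
    """Score of one matching term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight


# (term, freq in doc 3829, docFreq, boost), read off the tree for entry 5.
matched_terms = [
    ("retrieval", 3.0, 3720, 1.0515922),
    ("this", 1.0, 10762, 1.0952843),
    ("general", 1.0, 1574, 1.3117523),
    ("indexing", 2.0, 1551, 1.316204),
    ("query", 1.0, 1035, 2.1577644),
    ("semantic", 4.0, 1369, 2.707898),
    ("domain", 3.0, 1054, 2.8660207),
    ("ontology", 1.0, 475, 3.347693),
]

scores = {term: term_score(freq, df, boost)
          for term, freq, df, boost in matched_terms}
total = COORD * sum(scores.values())

for term, value in scores.items():
    print(f"{term:10s} {value:.9f}")
print(f"total      {total:.8f}")
```

The same function reproduces the other entries once FIELD_NORM, COORD, and the term list are replaced with the values shown in the corresponding tree; title_txt matches such as "finder" use their own fieldNorm.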