Document (#32581)

Author
Niemi, T.
Jämsen, J.
Title
¬A query language for discovering semantic associations, part II : sample queries and query evaluation
Source
Journal of the American Society for Information Science and Technology. 58(2007) no.11, S.1686-1700
Year
2007
Abstract
In our query language introduced in Part I (Journal of the American Society for Information Science and Technology. 58(2007) no.11, S.1559-1568) the user can formulate queries to find out (possibly complex) semantic relationships among entities. In this article we demonstrate the usage of our query language and discuss the new applications that it supports. We categorize several query types and give sample queries. The query types are categorized based on whether the entities specified in a query are known or unknown to the user in advance, and whether text information in documents is utilized. Natural language is used to represent the results of queries in order to facilitate correct interpretation by the user. We discuss briefly the issues related to the prototype implementation of the query language and show that an independent operation like Rho (Sheth et al., 2005; Anyanwu & Sheth, 2002, 2003), which presupposes entities of interest to be known in advance, is exceedingly inefficient in emulating the behavior of our query language. The discussion also covers potential problems, and challenges for future work.
Theme
Computerlinguistik
Semantisches Umfeld in Indexierung u. Retrieval

Similar documents (author)

  1. Niemi, T.; Jämsen , J.: ¬A query language for discovering semantic associations, part I : approach and formal definition of query primitives (2007) 4.70
    4.697151 = sum of:
      4.697151 = weight(author_txt:niemi in 591) [ClassicSimilarity], result of:
        4.697151 = fieldWeight in 591, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.5 = fieldNorm(doc=591)
    
  2. Järvelin, K.; Niemi, T.: Deductive information retrieval based on classifications (1993) 4.70
    4.697151 = sum of:
      4.697151 = weight(author_txt:niemi in 2229) [ClassicSimilarity], result of:
        4.697151 = fieldWeight in 2229, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.5 = fieldNorm(doc=2229)
    
  3. Niemi, T.; Hirvonen, L.; Järvelin, K.: Multidimensional data model and query language for informetrics (2003) 3.52
    3.5228634 = sum of:
      3.5228634 = weight(author_txt:niemi in 1753) [ClassicSimilarity], result of:
        3.5228634 = fieldWeight in 1753, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.375 = fieldNorm(doc=1753)
    
  4. Järvelin, K.; Ingwersen, P.; Niemi, T.: ¬A user-oriented interface for generalised informetric analysis based on applying advanced data modelling techniques (2000) 3.52
    3.5228634 = sum of:
      3.5228634 = weight(author_txt:niemi in 4545) [ClassicSimilarity], result of:
        3.5228634 = fieldWeight in 4545, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.375 = fieldNorm(doc=4545)
    
  5. Näppilä, T.; Järvelin, K.; Niemi, T.: ¬A tool for data cube construction from structurally heterogeneous XML documents (2008) 3.52
    3.5228634 = sum of:
      3.5228634 = weight(author_txt:niemi in 1369) [ClassicSimilarity], result of:
        3.5228634 = fieldWeight in 1369, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.375 = fieldNorm(doc=1369)
    

Similar documents (content)

  1. Niemi, T.; Jämsen , J.: ¬A query language for discovering semantic associations, part I : approach and formal definition of query primitives (2007) 0.31
    0.30589962 = sum of:
      0.30589962 = product of:
        0.9559363 = sum of:
          0.08568289 = weight(abstract_txt:unknown in 591) [ClassicSimilarity], result of:
            0.08568289 = score(doc=591,freq=2.0), product of:
              0.13427307 = queryWeight, product of:
                1.0765078 = boost
                7.2195506 = idf(docFreq=87, maxDocs=44218)
                0.01727673 = queryNorm
              0.63812417 = fieldWeight in 591, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2195506 = idf(docFreq=87, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.06615347 = weight(abstract_txt:discovering in 591) [ClassicSimilarity], result of:
            0.06615347 = score(doc=591,freq=1.0), product of:
              0.14237636 = queryWeight, product of:
                1.1085153 = boost
                7.4342074 = idf(docFreq=70, maxDocs=44218)
                0.01727673 = queryNorm
              0.46463796 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4342074 = idf(docFreq=70, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.07065383 = weight(abstract_txt:semantic in 591) [ClassicSimilarity], result of:
            0.07065383 = score(doc=591,freq=6.0), product of:
              0.10314612 = queryWeight, product of:
                1.3343328 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.01727673 = queryNorm
              0.6849878 = fieldWeight in 591, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.04551898 = weight(abstract_txt:known in 591) [ClassicSimilarity], result of:
            0.04551898 = score(doc=591,freq=1.0), product of:
              0.13981095 = queryWeight, product of:
                1.5534894 = boost
                5.2092032 = idf(docFreq=656, maxDocs=44218)
                0.01727673 = queryNorm
              0.3255752 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2092032 = idf(docFreq=656, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.041814614 = weight(abstract_txt:user in 591) [ClassicSimilarity], result of:
            0.041814614 = score(doc=591,freq=3.0), product of:
              0.10486283 = queryWeight, product of:
                1.6477606 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.01727673 = queryNorm
              0.39875534 = fieldWeight in 591, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.23484354 = weight(abstract_txt:entities in 591) [ClassicSimilarity], result of:
            0.23484354 = score(doc=591,freq=6.0), product of:
              0.26297346 = queryWeight, product of:
                2.609392 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.01727673 = queryNorm
              0.89303136 = fieldWeight in 591, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.0999296 = weight(abstract_txt:language in 591) [ClassicSimilarity], result of:
            0.0999296 = score(doc=591,freq=2.0), product of:
              0.27033734 = queryWeight, product of:
                3.7415485 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.01727673 = queryNorm
              0.3696478 = fieldWeight in 591, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
          0.3113394 = weight(abstract_txt:query in 591) [ClassicSimilarity], result of:
            0.3113394 = score(doc=591,freq=4.0), product of:
              0.52394587 = queryWeight, product of:
                6.379508 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.01727673 = queryNorm
              0.5942206 = fieldWeight in 591, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=591)
        0.32 = coord(8/25)
    
  2. Owei, V.; Higa, K.: ¬A paradigm for natural language explanation of database queries : a semantic data model approach (1994) 0.24
    0.24367644 = sum of:
      0.24367644 = product of:
        1.0153185 = sum of:
          0.17049308 = weight(abstract_txt:specified in 8189) [ClassicSimilarity], result of:
            0.17049308 = score(doc=8189,freq=3.0), product of:
              0.12778354 = queryWeight, product of:
                1.0501714 = boost
                7.042927 = idf(docFreq=104, maxDocs=44218)
                0.01727673 = queryNorm
              1.3342335 = fieldWeight in 8189, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.042927 = idf(docFreq=104, maxDocs=44218)
                0.109375 = fieldNorm(doc=8189)
          0.050477535 = weight(abstract_txt:semantic in 8189) [ClassicSimilarity], result of:
            0.050477535 = score(doc=8189,freq=1.0), product of:
              0.10314612 = queryWeight, product of:
                1.3343328 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.01727673 = queryNorm
              0.4893789 = fieldWeight in 8189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.109375 = fieldNorm(doc=8189)
          0.07317558 = weight(abstract_txt:user in 8189) [ClassicSimilarity], result of:
            0.07317558 = score(doc=8189,freq=3.0), product of:
              0.10486283 = queryWeight, product of:
                1.6477606 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.01727673 = queryNorm
              0.69782186 = fieldWeight in 8189, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.109375 = fieldNorm(doc=8189)
          0.21225289 = weight(abstract_txt:queries in 8189) [ClassicSimilarity], result of:
            0.21225289 = score(doc=8189,freq=2.0), product of:
              0.26871374 = queryWeight, product of:
                3.0457737 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.01727673 = queryNorm
              0.78988475 = fieldWeight in 8189, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.109375 = fieldNorm(doc=8189)
          0.12365658 = weight(abstract_txt:language in 8189) [ClassicSimilarity], result of:
            0.12365658 = score(doc=8189,freq=1.0), product of:
              0.27033734 = queryWeight, product of:
                3.7415485 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.01727673 = queryNorm
              0.45741582 = fieldWeight in 8189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.109375 = fieldNorm(doc=8189)
          0.38526288 = weight(abstract_txt:query in 8189) [ClassicSimilarity], result of:
            0.38526288 = score(doc=8189,freq=2.0), product of:
              0.52394587 = queryWeight, product of:
                6.379508 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.01727673 = queryNorm
              0.73531044 = fieldWeight in 8189, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.109375 = fieldNorm(doc=8189)
        0.24 = coord(6/25)
    
  3. Airio, E.: Who benefits from CLIR in web retrieval? (2008) 0.21
    0.20934401 = sum of:
      0.20934401 = product of:
        0.87226677 = sum of:
          0.052917935 = weight(abstract_txt:utilized in 2342) [ClassicSimilarity], result of:
            0.052917935 = score(doc=2342,freq=1.0), product of:
              0.122688755 = queryWeight, product of:
                1.029023 = boost
                6.901097 = idf(docFreq=120, maxDocs=44218)
                0.01727673 = queryNorm
              0.43131855 = fieldWeight in 2342, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.901097 = idf(docFreq=120, maxDocs=44218)
                0.0625 = fieldNorm(doc=2342)
          0.0626978 = weight(abstract_txt:formulate in 2342) [ClassicSimilarity], result of:
            0.0626978 = score(doc=2342,freq=1.0), product of:
              0.13737394 = queryWeight, product of:
                1.0888672 = boost
                7.3024383 = idf(docFreq=80, maxDocs=44218)
                0.01727673 = queryNorm
              0.4564024 = fieldWeight in 2342, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3024383 = idf(docFreq=80, maxDocs=44218)
                0.0625 = fieldNorm(doc=2342)
          0.03656758 = weight(abstract_txt:whether in 2342) [ClassicSimilarity], result of:
            0.03656758 = score(doc=2342,freq=1.0), product of:
              0.12082134 = queryWeight, product of:
                1.4441408 = boost
                4.8425326 = idf(docFreq=947, maxDocs=44218)
                0.01727673 = queryNorm
              0.3026583 = fieldWeight in 2342, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8425326 = idf(docFreq=947, maxDocs=44218)
                0.0625 = fieldNorm(doc=2342)
          0.14854605 = weight(abstract_txt:queries in 2342) [ClassicSimilarity], result of:
            0.14854605 = score(doc=2342,freq=3.0), product of:
              0.26871374 = queryWeight, product of:
                3.0457737 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.01727673 = queryNorm
              0.5528041 = fieldWeight in 2342, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.0625 = fieldNorm(doc=2342)
          0.2234494 = weight(abstract_txt:language in 2342) [ClassicSimilarity], result of:
            0.2234494 = score(doc=2342,freq=10.0), product of:
              0.27033734 = queryWeight, product of:
                3.7415485 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.01727673 = queryNorm
              0.82655764 = fieldWeight in 2342, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0625 = fieldNorm(doc=2342)
          0.34808806 = weight(abstract_txt:query in 2342) [ClassicSimilarity], result of:
            0.34808806 = score(doc=2342,freq=5.0), product of:
              0.52394587 = queryWeight, product of:
                6.379508 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.01727673 = queryNorm
              0.6643588 = fieldWeight in 2342, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=2342)
        0.24 = coord(6/25)
    
  4. Lewandowski, D.: Evaluating the retrieval effectiveness of web search engines using a representative query sample (2015) 0.21
    0.20837133 = sum of:
      0.20837133 = product of:
        0.8682139 = sum of:
          0.10514732 = weight(abstract_txt:correct in 2157) [ClassicSimilarity], result of:
            0.10514732 = score(doc=2157,freq=3.0), product of:
              0.11586561 = queryWeight, product of:
                6.7064548 = idf(docFreq=146, maxDocs=44218)
                0.01727673 = queryNorm
              0.90749377 = fieldWeight in 2157, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.7064548 = idf(docFreq=146, maxDocs=44218)
                0.078125 = fieldNorm(doc=2157)
          0.035914805 = weight(abstract_txt:types in 2157) [ClassicSimilarity], result of:
            0.035914805 = score(doc=2157,freq=1.0), product of:
              0.10287784 = queryWeight, product of:
                1.3325964 = boost
                4.4684987 = idf(docFreq=1377, maxDocs=44218)
                0.01727673 = queryNorm
              0.34910145 = fieldWeight in 2157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4684987 = idf(docFreq=1377, maxDocs=44218)
                0.078125 = fieldNorm(doc=2157)
          0.0301771 = weight(abstract_txt:user in 2157) [ClassicSimilarity], result of:
            0.0301771 = score(doc=2157,freq=1.0), product of:
              0.10486283 = queryWeight, product of:
                1.6477606 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.01727673 = queryNorm
              0.2877769 = fieldWeight in 2157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.078125 = fieldNorm(doc=2157)
          0.09734497 = weight(abstract_txt:sample in 2157) [ClassicSimilarity], result of:
            0.09734497 = score(doc=2157,freq=2.0), product of:
              0.1587347 = queryWeight, product of:
                1.6552883 = boost
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.01727673 = queryNorm
              0.6132558 = fieldWeight in 2157, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.078125 = fieldNorm(doc=2157)
          0.26259485 = weight(abstract_txt:queries in 2157) [ClassicSimilarity], result of:
            0.26259485 = score(doc=2157,freq=6.0), product of:
              0.26871374 = queryWeight, product of:
                3.0457737 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.01727673 = queryNorm
              0.97722894 = fieldWeight in 2157, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.078125 = fieldNorm(doc=2157)
          0.3370348 = weight(abstract_txt:query in 2157) [ClassicSimilarity], result of:
            0.3370348 = score(doc=2157,freq=3.0), product of:
              0.52394587 = queryWeight, product of:
                6.379508 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.01727673 = queryNorm
              0.6432626 = fieldWeight in 2157, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=2157)
        0.24 = coord(6/25)
    
  5. Rozinajová, V.; Macko, P.: Using natural language to search linked data (2017) 0.20
    0.20254935 = sum of:
      0.20254935 = product of:
        0.84395564 = sum of:
          0.035914805 = weight(abstract_txt:types in 3488) [ClassicSimilarity], result of:
            0.035914805 = score(doc=3488,freq=1.0), product of:
              0.10287784 = queryWeight, product of:
                1.3325964 = boost
                4.4684987 = idf(docFreq=1377, maxDocs=44218)
                0.01727673 = queryNorm
              0.34910145 = fieldWeight in 3488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4684987 = idf(docFreq=1377, maxDocs=44218)
                0.078125 = fieldNorm(doc=3488)
          0.036055382 = weight(abstract_txt:semantic in 3488) [ClassicSimilarity], result of:
            0.036055382 = score(doc=3488,freq=1.0), product of:
              0.10314612 = queryWeight, product of:
                1.3343328 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.01727673 = queryNorm
              0.34955636 = fieldWeight in 3488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.078125 = fieldNorm(doc=3488)
          0.0603542 = weight(abstract_txt:user in 3488) [ClassicSimilarity], result of:
            0.0603542 = score(doc=3488,freq=4.0), product of:
              0.10486283 = queryWeight, product of:
                1.6477606 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.01727673 = queryNorm
              0.5755538 = fieldWeight in 3488, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.078125 = fieldNorm(doc=3488)
          0.1516092 = weight(abstract_txt:queries in 3488) [ClassicSimilarity], result of:
            0.1516092 = score(doc=3488,freq=2.0), product of:
              0.26871374 = queryWeight, product of:
                3.0457737 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.01727673 = queryNorm
              0.5642034 = fieldWeight in 3488, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.078125 = fieldNorm(doc=3488)
          0.124912 = weight(abstract_txt:language in 3488) [ClassicSimilarity], result of:
            0.124912 = score(doc=3488,freq=2.0), product of:
              0.27033734 = queryWeight, product of:
                3.7415485 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.01727673 = queryNorm
              0.46205974 = fieldWeight in 3488, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=3488)
          0.43511006 = weight(abstract_txt:query in 3488) [ClassicSimilarity], result of:
            0.43511006 = score(doc=3488,freq=5.0), product of:
              0.52394587 = queryWeight, product of:
                6.379508 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.01727673 = queryNorm
              0.8304485 = fieldWeight in 3488, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=3488)
        0.24 = coord(6/25)