Document (#32582)

Author
Niemi, T.
Jämsen, J.
Title
¬A query language for discovering semantic associations, part II : sample queries and query evaluation
Source
Journal of the American Society for Information Science and Technology. 58(2007) no.11, pp. 1686-1700
Year
2007
Abstract
In our query language introduced in Part I (Journal of the American Society for Information Science and Technology. 58(2007) no.11, pp. 1559-1568), the user can formulate queries to discover (possibly complex) semantic relationships among entities. In this article we demonstrate the usage of our query language and discuss the new applications that it supports. We categorize several query types and give sample queries. The query types are categorized based on whether the entities specified in a query are known or unknown to the user in advance, and whether text information in documents is utilized. Natural language is used to represent the results of queries in order to facilitate correct interpretation by the user. We briefly discuss issues related to the prototype implementation of the query language and show that an independent operation like Rho (Sheth et al., 2005; Anyanwu & Sheth, 2002, 2003), which presupposes that the entities of interest are known in advance, is exceedingly inefficient in emulating the behavior of our query language. The discussion also covers potential problems and challenges for future work.
Theme
Computational linguistics
Semantic context in indexing and retrieval

Similar documents (author)

  1. Niemi, T.; Jämsen, J.: ¬A query language for discovering semantic associations, part I : approach and formal definition of query primitives (2007) 4.67
    4.670967 = sum of:
      4.670967 = weight(author_txt:niemi in 2592) [ClassicSimilarity], result of:
        4.670967 = fieldWeight in 2592, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.341934 = idf(docFreq=9, maxDocs=41962)
          0.5 = fieldNorm(doc=2592)
    
  2. Järvelin, K.; Niemi, T.: Deductive information retrieval based on classifications (1993) 4.67
    4.670967 = sum of:
      4.670967 = weight(author_txt:niemi in 4230) [ClassicSimilarity], result of:
        4.670967 = fieldWeight in 4230, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.341934 = idf(docFreq=9, maxDocs=41962)
          0.5 = fieldNorm(doc=4230)
    
  3. Niemi, T.; Hirvonen, L.; Järvelin, K.: Multidimensional data model and query language for informetrics (2003) 3.50
    3.5032253 = sum of:
      3.5032253 = weight(author_txt:niemi in 2754) [ClassicSimilarity], result of:
        3.5032253 = fieldWeight in 2754, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.341934 = idf(docFreq=9, maxDocs=41962)
          0.375 = fieldNorm(doc=2754)
    
  4. Järvelin, K.; Ingwersen, P.; Niemi, T.: ¬A user-oriented interface for generalised informetric analysis based on applying advanced data modelling techniques (2000) 3.50
    3.5032253 = sum of:
      3.5032253 = weight(author_txt:niemi in 546) [ClassicSimilarity], result of:
        3.5032253 = fieldWeight in 546, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.341934 = idf(docFreq=9, maxDocs=41962)
          0.375 = fieldNorm(doc=546)
    
  5. Näppilä, T.; Järvelin, K.; Niemi, T.: ¬A tool for data cube construction from structurally heterogeneous XML documents (2008) 3.50
    3.5032253 = sum of:
      3.5032253 = weight(author_txt:niemi in 3370) [ClassicSimilarity], result of:
        3.5032253 = fieldWeight in 3370, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.341934 = idf(docFreq=9, maxDocs=41962)
          0.375 = fieldNorm(doc=3370)
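
The author scores above all follow the same pattern: fieldWeight = tf * idf * fieldNorm. The following is a minimal Python sketch of that arithmetic, not the search engine's own code. The function names are hypothetical, and the idf form (1 + ln(maxDocs / (docFreq + 1))) is an assumption chosen because it reproduces the figures listed here; other Lucene versions use a slightly different expression.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, in the form that matches the listing
    (assumed: 1 + ln(maxDocs / (docFreq + 1)))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explanation trees."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


if __name__ == "__main__":
    # Values taken from the listing: docFreq=9, maxDocs=41962, termFreq=1.0.
    print(field_weight(1.0, 9, 41962, 0.5))    # compare with 4.670967 (hits 1 and 2)
    print(field_weight(1.0, 9, 41962, 0.375))  # compare with 3.5032253 (hits 3 to 5)
```

Only fieldNorm differs between the hits, which is why the five author matches fall into just two score levels.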
    

Similar documents (content)

  1. Niemi, T.; Jämsen, J.: ¬A query language for discovering semantic associations, part I : approach and formal definition of query primitives (2007) 0.31
    0.30814728 = sum of:
      0.30814728 = product of:
        0.96296024 = sum of:
          0.08739015 = weight(abstract_txt:unknown in 2592) [ClassicSimilarity], result of:
            0.08739015 = score(doc=2592,freq=2.0), product of:
              0.1356658 = queryWeight, product of:
                1.0884218 = boost
                7.287811 = idf(docFreq=77, maxDocs=41962)
                0.017103149 = queryNorm
              0.6441575 = fieldWeight in 2592, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.287811 = idf(docFreq=77, maxDocs=41962)
                0.0625 = fieldNorm(doc=2592)
          0.06738771 = weight(abstract_txt:discovering in 2592) [ClassicSimilarity], result of:
            0.06738771 = score(doc=2592,freq=1.0), product of:
              0.14373389 = queryWeight, product of:
                1.1203188 = boost
                7.501385 = idf(docFreq=62, maxDocs=41962)
                0.017103149 = queryNorm
              0.46883658 = fieldWeight in 2592, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.501385 = idf(docFreq=62, maxDocs=41962)
                0.0625 = fieldNorm(doc=2592)
          0.07172726 = weight(abstract_txt:semantic in 2592) [ClassicSimilarity], result of:
            0.07172726 = score(doc=2592,freq=6.0), product of:
              0.10389336 = queryWeight, product of:
                1.3470104 = boost
                4.509629 = idf(docFreq=1254, maxDocs=41962)
                0.017103149 = queryNorm
              0.6903931 = fieldWeight in 2592, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.509629 = idf(docFreq=1254, maxDocs=41962)
                0.0625 = fieldNorm(doc=2592)
          0.046315424 = weight(abstract_txt:known in 2592) [ClassicSimilarity], result of:
            0.046315424 = score(doc=2592,freq=1.0), product of:
              0.14103681 = queryWeight, product of:
                1.5694348 = boost
                5.254279 = idf(docFreq=595, maxDocs=41962)
                0.017103149 = queryNorm
              0.32839245 = fieldWeight in 2592, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.254279 = idf(docFreq=595, maxDocs=41962)
                0.0625 = fieldNorm(doc=2592)
          0.04137677 = weight(abstract_txt:user in 2592) [ClassicSimilarity], result of:
            0.04137677 = score(doc=2592,freq=3.0), product of:
              0.103834845 = queryWeight, product of:
                1.6492794 = boost
                3.6810596 = idf(docFreq=2873, maxDocs=41962)
                0.017103149 = queryNorm
              0.39848638 = fieldWeight in 2592, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6810596 = idf(docFreq=2873, maxDocs=41962)
                0.0625 = fieldNorm(doc=2592)
          0.24231224 = weight(abstract_txt:entities in 2592) [ClassicSimilarity], result of:
            0.24231224 = score(doc=2592,freq=6.0), product of:
              0.26775995 = queryWeight, product of:
                2.6484725 = boost
                5.9111786 = idf(docFreq=308, maxDocs=41962)
                0.017103149 = queryNorm
              0.90496075 = fieldWeight in 2592, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.9111786 = idf(docFreq=308, maxDocs=41962)
                0.0625 = fieldNorm(doc=2592)
          0.1004656 = weight(abstract_txt:language in 2592) [ClassicSimilarity], result of:
            0.1004656 = score(doc=2592,freq=2.0), product of:
              0.27053538 = queryWeight, product of:
                3.7648675 = boost
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.017103149 = queryNorm
              0.37135845 = fieldWeight in 2592, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.0625 = fieldNorm(doc=2592)
          0.3059851 = weight(abstract_txt:query in 2592) [ClassicSimilarity], result of:
            0.3059851 = score(doc=2592,freq=4.0), product of:
              0.5164557 = queryWeight, product of:
                6.370886 = boost
                4.739769 = idf(docFreq=996, maxDocs=41962)
                0.017103149 = queryNorm
              0.5924711 = fieldWeight in 2592, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.739769 = idf(docFreq=996, maxDocs=41962)
                0.0625 = fieldNorm(doc=2592)
        0.32 = coord(8/25)
    
  2. Owei, V.; Higa, K.: ¬A paradigm for natural language explanation of database queries : a semantic data model approach (1994) 0.24
    0.24144304 = sum of:
      0.24144304 = product of:
        1.0060127 = sum of:
          0.16879222 = weight(abstract_txt:specified in 189) [ClassicSimilarity], result of:
            0.16879222 = score(doc=189,freq=3.0), product of:
              0.12657304 = queryWeight, product of:
                1.0513145 = boost
                7.0393496 = idf(docFreq=99, maxDocs=41962)
                0.017103149 = queryNorm
              1.3335558 = fieldWeight in 189, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.0393496 = idf(docFreq=99, maxDocs=41962)
                0.109375 = fieldNorm(doc=189)
          0.05124443 = weight(abstract_txt:semantic in 189) [ClassicSimilarity], result of:
            0.05124443 = score(doc=189,freq=1.0), product of:
              0.10389336 = queryWeight, product of:
                1.3470104 = boost
                4.509629 = idf(docFreq=1254, maxDocs=41962)
                0.017103149 = queryNorm
              0.49324065 = fieldWeight in 189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.509629 = idf(docFreq=1254, maxDocs=41962)
                0.109375 = fieldNorm(doc=189)
          0.07240935 = weight(abstract_txt:user in 189) [ClassicSimilarity], result of:
            0.07240935 = score(doc=189,freq=3.0), product of:
              0.103834845 = queryWeight, product of:
                1.6492794 = boost
                3.6810596 = idf(docFreq=2873, maxDocs=41962)
                0.017103149 = queryNorm
              0.69735116 = fieldWeight in 189, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6810596 = idf(docFreq=2873, maxDocs=41962)
                0.109375 = fieldNorm(doc=189)
          0.21060957 = weight(abstract_txt:queries in 189) [ClassicSimilarity], result of:
            0.21060957 = score(doc=189,freq=2.0), product of:
              0.26656845 = queryWeight, product of:
                3.0513809 = boost
                5.107828 = idf(docFreq=689, maxDocs=41962)
                0.017103149 = queryNorm
              0.79007685 = fieldWeight in 189, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.107828 = idf(docFreq=689, maxDocs=41962)
                0.109375 = fieldNorm(doc=189)
          0.12431984 = weight(abstract_txt:language in 189) [ClassicSimilarity], result of:
            0.12431984 = score(doc=189,freq=1.0), product of:
              0.27053538 = queryWeight, product of:
                3.7648675 = boost
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.017103149 = queryNorm
              0.45953265 = fieldWeight in 189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.109375 = fieldNorm(doc=189)
          0.37863722 = weight(abstract_txt:query in 189) [ClassicSimilarity], result of:
            0.37863722 = score(doc=189,freq=2.0), product of:
              0.5164557 = queryWeight, product of:
                6.370886 = boost
                4.739769 = idf(docFreq=996, maxDocs=41962)
                0.017103149 = queryNorm
              0.7331456 = fieldWeight in 189, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.739769 = idf(docFreq=996, maxDocs=41962)
                0.109375 = fieldNorm(doc=189)
        0.24 = coord(6/25)
    
  3. Airio, E.: Who benefits from CLIR in web retrieval? (2008) 0.21
    0.20891805 = sum of:
      0.20891805 = product of:
        0.8704919 = sum of:
          0.053666726 = weight(abstract_txt:utilized in 4343) [ClassicSimilarity], result of:
            0.053666726 = score(doc=4343,freq=1.0), product of:
              0.12349294 = queryWeight, product of:
                1.038444 = boost
                6.9531717 = idf(docFreq=108, maxDocs=41962)
                0.017103149 = queryNorm
              0.43457323 = fieldWeight in 4343, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9531717 = idf(docFreq=108, maxDocs=41962)
                0.0625 = fieldNorm(doc=4343)
          0.06574228 = weight(abstract_txt:formulate in 4343) [ClassicSimilarity], result of:
            0.06574228 = score(doc=4343,freq=1.0), product of:
              0.14138453 = queryWeight, product of:
                1.1111251 = boost
                7.439827 = idf(docFreq=66, maxDocs=41962)
                0.017103149 = queryNorm
              0.4649892 = fieldWeight in 4343, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.439827 = idf(docFreq=66, maxDocs=41962)
                0.0625 = fieldNorm(doc=4343)
          0.036937226 = weight(abstract_txt:whether in 4343) [ClassicSimilarity], result of:
            0.036937226 = score(doc=4343,freq=1.0), product of:
              0.121289976 = queryWeight, product of:
                1.4554238 = boost
                4.8725843 = idf(docFreq=872, maxDocs=41962)
                0.017103149 = queryNorm
              0.30453652 = fieldWeight in 4343, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8725843 = idf(docFreq=872, maxDocs=41962)
                0.0625 = fieldNorm(doc=4343)
          0.14739598 = weight(abstract_txt:queries in 4343) [ClassicSimilarity], result of:
            0.14739598 = score(doc=4343,freq=3.0), product of:
              0.26656845 = queryWeight, product of:
                3.0513809 = boost
                5.107828 = idf(docFreq=689, maxDocs=41962)
                0.017103149 = queryNorm
              0.5529386 = fieldWeight in 4343, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.107828 = idf(docFreq=689, maxDocs=41962)
                0.0625 = fieldNorm(doc=4343)
          0.22464791 = weight(abstract_txt:language in 4343) [ClassicSimilarity], result of:
            0.22464791 = score(doc=4343,freq=10.0), product of:
              0.27053538 = queryWeight, product of:
                3.7648675 = boost
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.017103149 = queryNorm
              0.83038276 = fieldWeight in 4343, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.0625 = fieldNorm(doc=4343)
          0.34210175 = weight(abstract_txt:query in 4343) [ClassicSimilarity], result of:
            0.34210175 = score(doc=4343,freq=5.0), product of:
              0.5164557 = queryWeight, product of:
                6.370886 = boost
                4.739769 = idf(docFreq=996, maxDocs=41962)
                0.017103149 = queryNorm
              0.66240287 = fieldWeight in 4343, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.739769 = idf(docFreq=996, maxDocs=41962)
                0.0625 = fieldNorm(doc=4343)
        0.24 = coord(6/25)
    
  4. Lewandowski, D.: Evaluating the retrieval effectiveness of web search engines using a representative query sample (2015) 0.21
    0.20647404 = sum of:
      0.20647404 = product of:
        0.8603085 = sum of:
          0.10375916 = weight(abstract_txt:correct in 4158) [ClassicSimilarity], result of:
            0.10375916 = score(doc=4158,freq=3.0), product of:
              0.114518575 = queryWeight, product of:
                6.69576 = idf(docFreq=140, maxDocs=41962)
                0.017103149 = queryNorm
              0.90604657 = fieldWeight in 4158, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.69576 = idf(docFreq=140, maxDocs=41962)
                0.078125 = fieldNorm(doc=4158)
          0.03666147 = weight(abstract_txt:types in 4158) [ClassicSimilarity], result of:
            0.03666147 = score(doc=4158,freq=1.0), product of:
              0.10400366 = queryWeight, product of:
                1.3477252 = boost
                4.512022 = idf(docFreq=1251, maxDocs=41962)
                0.017103149 = queryNorm
              0.35250172 = fieldWeight in 4158, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.512022 = idf(docFreq=1251, maxDocs=41962)
                0.078125 = fieldNorm(doc=4158)
          0.029861113 = weight(abstract_txt:user in 4158) [ClassicSimilarity], result of:
            0.029861113 = score(doc=4158,freq=1.0), product of:
              0.103834845 = queryWeight, product of:
                1.6492794 = boost
                3.6810596 = idf(docFreq=2873, maxDocs=41962)
                0.017103149 = queryNorm
              0.28758278 = fieldWeight in 4158, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6810596 = idf(docFreq=2873, maxDocs=41962)
                0.078125 = fieldNorm(doc=4158)
          0.09822652 = weight(abstract_txt:sample in 4158) [ClassicSimilarity], result of:
            0.09822652 = score(doc=4158,freq=2.0), product of:
              0.15923965 = queryWeight, product of:
                1.6676413 = boost
                5.5830626 = idf(docFreq=428, maxDocs=41962)
                0.017103149 = queryNorm
              0.6168471 = fieldWeight in 4158, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5830626 = idf(docFreq=428, maxDocs=41962)
                0.078125 = fieldNorm(doc=4158)
          0.26056176 = weight(abstract_txt:queries in 4158) [ClassicSimilarity], result of:
            0.26056176 = score(doc=4158,freq=6.0), product of:
              0.26656845 = queryWeight, product of:
                3.0513809 = boost
                5.107828 = idf(docFreq=689, maxDocs=41962)
                0.017103149 = queryNorm
              0.97746664 = fieldWeight in 4158, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.107828 = idf(docFreq=689, maxDocs=41962)
                0.078125 = fieldNorm(doc=4158)
          0.33123854 = weight(abstract_txt:query in 4158) [ClassicSimilarity], result of:
            0.33123854 = score(doc=4158,freq=3.0), product of:
              0.5164557 = queryWeight, product of:
                6.370886 = boost
                4.739769 = idf(docFreq=996, maxDocs=41962)
                0.017103149 = queryNorm
              0.64136875 = fieldWeight in 4158, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.739769 = idf(docFreq=996, maxDocs=41962)
                0.078125 = fieldNorm(doc=4158)
        0.24 = coord(6/25)
    
  5. Rozinajová, V.; Macko, P.: Using natural language to search linked data (2017) 0.20
    0.20079154 = sum of:
      0.20079154 = product of:
        0.8366314 = sum of:
          0.036603164 = weight(abstract_txt:semantic in 53) [ClassicSimilarity], result of:
            0.036603164 = score(doc=53,freq=1.0), product of:
              0.10389336 = queryWeight, product of:
                1.3470104 = boost
                4.509629 = idf(docFreq=1254, maxDocs=41962)
                0.017103149 = queryNorm
              0.35231474 = fieldWeight in 53, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.509629 = idf(docFreq=1254, maxDocs=41962)
                0.078125 = fieldNorm(doc=53)
          0.03666147 = weight(abstract_txt:types in 53) [ClassicSimilarity], result of:
            0.03666147 = score(doc=53,freq=1.0), product of:
              0.10400366 = queryWeight, product of:
                1.3477252 = boost
                4.512022 = idf(docFreq=1251, maxDocs=41962)
                0.017103149 = queryNorm
              0.35250172 = fieldWeight in 53, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.512022 = idf(docFreq=1251, maxDocs=41962)
                0.078125 = fieldNorm(doc=53)
          0.059722226 = weight(abstract_txt:user in 53) [ClassicSimilarity], result of:
            0.059722226 = score(doc=53,freq=4.0), product of:
              0.103834845 = queryWeight, product of:
                1.6492794 = boost
                3.6810596 = idf(docFreq=2873, maxDocs=41962)
                0.017103149 = queryNorm
              0.57516557 = fieldWeight in 53, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6810596 = idf(docFreq=2873, maxDocs=41962)
                0.078125 = fieldNorm(doc=53)
          0.1504354 = weight(abstract_txt:queries in 53) [ClassicSimilarity], result of:
            0.1504354 = score(doc=53,freq=2.0), product of:
              0.26656845 = queryWeight, product of:
                3.0513809 = boost
                5.107828 = idf(docFreq=689, maxDocs=41962)
                0.017103149 = queryNorm
              0.5643406 = fieldWeight in 53, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.107828 = idf(docFreq=689, maxDocs=41962)
                0.078125 = fieldNorm(doc=53)
          0.125582 = weight(abstract_txt:language in 53) [ClassicSimilarity], result of:
            0.125582 = score(doc=53,freq=2.0), product of:
              0.27053538 = queryWeight, product of:
                3.7648675 = boost
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.017103149 = queryNorm
              0.46419805 = fieldWeight in 53, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2014413 = idf(docFreq=1707, maxDocs=41962)
                0.078125 = fieldNorm(doc=53)
          0.42762718 = weight(abstract_txt:query in 53) [ClassicSimilarity], result of:
            0.42762718 = score(doc=53,freq=5.0), product of:
              0.5164557 = queryWeight, product of:
                6.370886 = boost
                4.739769 = idf(docFreq=996, maxDocs=41962)
                0.017103149 = queryNorm
              0.8280036 = fieldWeight in 53, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.739769 = idf(docFreq=996, maxDocs=41962)
                0.078125 = fieldNorm(doc=53)
        0.24 = coord(6/25)
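
The content scores above combine per-term scores (queryWeight * fieldWeight) into a sum and then scale it by coord, the fraction of query terms that matched. The sketch below recomputes the first content hit (doc 2592) from the boost, docFreq, freq and fieldNorm values shown in its tree. It is an illustration under assumptions, not the engine's implementation: the helper names are hypothetical and the idf form is the one that reproduces the listed figures.

```python
import math

MAX_DOCS = 41962
QUERY_NORM = 0.017103149  # as listed in every term explanation


def classic_idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Assumed idf form: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    """score = queryWeight * fieldWeight for a single matching term."""
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


def document_score(terms, matched: int, total: int) -> float:
    """Sum the term scores, then apply coord = matched / total."""
    return sum(term_score(*t) for t in terms) * (matched / total)


# (boost, docFreq, freq, fieldNorm) for the eight terms matched in doc 2592.
DOC_2592_TERMS = [
    (1.0884218, 77, 2.0, 0.0625),    # unknown
    (1.1203188, 62, 1.0, 0.0625),    # discovering
    (1.3470104, 1254, 6.0, 0.0625),  # semantic
    (1.5694348, 595, 1.0, 0.0625),   # known
    (1.6492794, 2873, 3.0, 0.0625),  # user
    (2.6484725, 308, 6.0, 0.0625),   # entities
    (3.7648675, 1707, 2.0, 0.0625),  # language
    (6.370886, 996, 4.0, 0.0625),    # query
]

if __name__ == "__main__":
    # Compare with the listed total of 0.30814728 (coord(8/25) = 0.32).
    print(document_score(DOC_2592_TERMS, matched=8, total=25))
```

Because coord rewards the number of distinct query terms matched, the first hit (8 of 25 terms) outranks the others (6 of 25 each) even though its raw term sum, 0.96296024, is lower than that of the second hit, 1.0060127.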