Document (#33344)

Author
Klein, S.T.
Title
Processing queries with metrical constraints in XML-based IR systems
Source
Journal of the American Society for Information Science and Technology. 59(2008) no.1, S.86-97
Year
2008
Abstract
XML documents combine features from classical IR systems allowing free text, with explicit structures as in databases. Many query languages have been specially designed for IR applications on XML documents. This work concentrates on a special type of language for which the problem of processing queries including metrical constraints is investigated. The main question is how to define the distance between terms in different locations of the XML tree in an intuitively justifiable way, without jeopardizing the ability to get good retrieval results in terms of recall and precision. A new definition is given and its usefulness is shown on several examples from the INEX collection.
Object
XML

Similar documents (author)

  1. Klein, W.: Organisation des Wissens durch Sprache : Konsequenzen für die maschinelle Sprachanalyse (1977) 4.96
    4.961945 = sum of:
      4.961945 = weight(author_txt:klein in 1748) [ClassicSimilarity], result of:
        4.961945 = fieldWeight in 1748, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.939112 = idf(docFreq=40, maxDocs=42306)
          0.625 = fieldNorm(doc=1748)
    
  2. Klein, H.: GENIOS jetzt mit Thesaurus-Suche (1993) 4.96
    4.961945 = sum of:
      4.961945 = weight(author_txt:klein in 7537) [ClassicSimilarity], result of:
        4.961945 = fieldWeight in 7537, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.939112 = idf(docFreq=40, maxDocs=42306)
          0.625 = fieldNorm(doc=7537)
    
  3. Klein, R.D.: ¬The problem of cataloguing world literature using the Nippon Decimal Classification (1994) 4.96
    4.961945 = sum of:
      4.961945 = weight(author_txt:klein in 936) [ClassicSimilarity], result of:
        4.961945 = fieldWeight in 936, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.939112 = idf(docFreq=40, maxDocs=42306)
          0.625 = fieldNorm(doc=936)
    
  4. Klein, G.M.: Is there a standard default keyword operator? : a bibliometric analysis of processing options chosen by libraries to execute keyword searches in online public access catalogs (1994) 4.96
    4.961945 = sum of:
      4.961945 = weight(author_txt:klein in 2269) [ClassicSimilarity], result of:
        4.961945 = fieldWeight in 2269, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.939112 = idf(docFreq=40, maxDocs=42306)
          0.625 = fieldNorm(doc=2269)
    
  5. Klein, J.T.: Interdisciplinary needs : the current context (1996) 4.96
    4.961945 = sum of:
      4.961945 = weight(author_txt:klein in 246) [ClassicSimilarity], result of:
        4.961945 = fieldWeight in 246, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.939112 = idf(docFreq=40, maxDocs=42306)
          0.625 = fieldNorm(doc=246)
    

Similar documents (content)

  1. Klein, S.T.: On the use of negation in Boolean IR queries. (2009) 0.31
    0.3081708 = sum of:
      0.3081708 = product of:
        1.10061 = sum of:
          0.050924115 = weight(abstract_txt:shown in 928) [ClassicSimilarity], result of:
            0.050924115 = score(doc=928,freq=1.0), product of:
              0.096988015 = queryWeight, product of:
                5.600595 = idf(docFreq=424, maxDocs=42306)
                0.017317448 = queryNorm
              0.52505577 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.600595 = idf(docFreq=424, maxDocs=42306)
                0.09375 = fieldNorm(doc=928)
          0.058232255 = weight(abstract_txt:investigated in 928) [ClassicSimilarity], result of:
            0.058232255 = score(doc=928,freq=1.0), product of:
              0.10605834 = queryWeight, product of:
                1.0457151 = boost
                5.8566265 = idf(docFreq=328, maxDocs=42306)
                0.017317448 = queryNorm
              0.54905874 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8566265 = idf(docFreq=328, maxDocs=42306)
                0.09375 = fieldNorm(doc=928)
          0.064926356 = weight(abstract_txt:usefulness in 928) [ClassicSimilarity], result of:
            0.064926356 = score(doc=928,freq=1.0), product of:
              0.114038035 = queryWeight, product of:
                1.0843409 = boost
                6.072954 = idf(docFreq=264, maxDocs=42306)
                0.017317448 = queryNorm
              0.56933945 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.072954 = idf(docFreq=264, maxDocs=42306)
                0.09375 = fieldNorm(doc=928)
          0.054863617 = weight(abstract_txt:terms in 928) [ClassicSimilarity], result of:
            0.054863617 = score(doc=928,freq=2.0), product of:
              0.101927646 = queryWeight, product of:
                1.4497795 = boost
                4.059814 = idf(docFreq=1983, maxDocs=42306)
                0.017317448 = queryNorm
              0.5382604 = fieldWeight in 928, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.059814 = idf(docFreq=1983, maxDocs=42306)
                0.09375 = fieldNorm(doc=928)
          0.13277392 = weight(abstract_txt:queries in 928) [ClassicSimilarity], result of:
            0.13277392 = score(doc=928,freq=3.0), product of:
              0.16050202 = queryWeight, product of:
                1.8192661 = boost
                5.094486 = idf(docFreq=704, maxDocs=42306)
                0.017317448 = queryNorm
              0.8272414 = fieldWeight in 928, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.094486 = idf(docFreq=704, maxDocs=42306)
                0.09375 = fieldNorm(doc=928)
          0.18298046 = weight(abstract_txt:constraints in 928) [ClassicSimilarity], result of:
            0.18298046 = score(doc=928,freq=1.0), product of:
              0.28666997 = queryWeight, product of:
                2.4313476 = boost
                6.808497 = idf(docFreq=126, maxDocs=42306)
                0.017317448 = queryNorm
              0.6382966 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.808497 = idf(docFreq=126, maxDocs=42306)
                0.09375 = fieldNorm(doc=928)
          0.55590934 = weight(abstract_txt:metrical in 928) [ClassicSimilarity], result of:
            0.55590934 = score(doc=928,freq=1.0), product of:
              0.60133296 = queryWeight, product of:
                3.5213847 = boost
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.017317448 = queryNorm
              0.9244617 = fieldWeight in 928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.860925 = idf(docFreq=5, maxDocs=42306)
                0.09375 = fieldNorm(doc=928)
        0.28 = coord(7/25)
    
  2. Schlieder, T.; Meuss, H.: Querying and ranking XML documents (2002) 0.09
    0.09226938 = sum of:
      0.09226938 = product of:
        0.38445577 = sum of:
          0.053342946 = weight(abstract_txt:classical in 1460) [ClassicSimilarity], result of:
            0.053342946 = score(doc=1460,freq=1.0), product of:
              0.13108346 = queryWeight, product of:
                1.1625588 = boost
                6.5110207 = idf(docFreq=170, maxDocs=42306)
                0.017317448 = queryNorm
              0.4069388 = fieldWeight in 1460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5110207 = idf(docFreq=170, maxDocs=42306)
                0.0625 = fieldNorm(doc=1460)
          0.057859786 = weight(abstract_txt:combine in 1460) [ClassicSimilarity], result of:
            0.057859786 = score(doc=1460,freq=1.0), product of:
              0.13838248 = queryWeight, product of:
                1.1944872 = boost
                6.6898394 = idf(docFreq=142, maxDocs=42306)
                0.017317448 = queryNorm
              0.41811496 = fieldWeight in 1460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6898394 = idf(docFreq=142, maxDocs=42306)
                0.0625 = fieldNorm(doc=1460)
          0.08779346 = weight(abstract_txt:tree in 1460) [ClassicSimilarity], result of:
            0.08779346 = score(doc=1460,freq=2.0), product of:
              0.14503117 = queryWeight, product of:
                1.2228457 = boost
                6.8486633 = idf(docFreq=121, maxDocs=42306)
                0.017317448 = queryNorm
              0.60534203 = fieldWeight in 1460, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8486633 = idf(docFreq=121, maxDocs=42306)
                0.0625 = fieldNorm(doc=1460)
          0.036575742 = weight(abstract_txt:terms in 1460) [ClassicSimilarity], result of:
            0.036575742 = score(doc=1460,freq=2.0), product of:
              0.101927646 = queryWeight, product of:
                1.4497795 = boost
                4.059814 = idf(docFreq=1983, maxDocs=42306)
                0.017317448 = queryNorm
              0.35884026 = fieldWeight in 1460, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.059814 = idf(docFreq=1983, maxDocs=42306)
                0.0625 = fieldNorm(doc=1460)
          0.046674434 = weight(abstract_txt:documents in 1460) [ClassicSimilarity], result of:
            0.046674434 = score(doc=1460,freq=3.0), product of:
              0.10475759 = queryWeight, product of:
                1.4697678 = boost
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.017317448 = queryNorm
              0.445547 = fieldWeight in 1460, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.0625 = fieldNorm(doc=1460)
          0.10220941 = weight(abstract_txt:queries in 1460) [ClassicSimilarity], result of:
            0.10220941 = score(doc=1460,freq=4.0), product of:
              0.16050202 = queryWeight, product of:
                1.8192661 = boost
                5.094486 = idf(docFreq=704, maxDocs=42306)
                0.017317448 = queryNorm
              0.6368108 = fieldWeight in 1460, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.094486 = idf(docFreq=704, maxDocs=42306)
                0.0625 = fieldNorm(doc=1460)
        0.24 = coord(6/25)
    
  3. Vilares, J.; Alonso, M.A.; Vilares, M.: Extraction of complex index terms in non-English IR : a shallow parsing based approach (2008) 0.08
    0.08302136 = sum of:
      0.08302136 = product of:
        0.29650486 = sum of:
          0.033949412 = weight(abstract_txt:shown in 4108) [ClassicSimilarity], result of:
            0.033949412 = score(doc=4108,freq=1.0), product of:
              0.096988015 = queryWeight, product of:
                5.600595 = idf(docFreq=424, maxDocs=42306)
                0.017317448 = queryNorm
              0.3500372 = fieldWeight in 4108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.600595 = idf(docFreq=424, maxDocs=42306)
                0.0625 = fieldNorm(doc=4108)
          0.053342946 = weight(abstract_txt:classical in 4108) [ClassicSimilarity], result of:
            0.053342946 = score(doc=4108,freq=1.0), product of:
              0.13108346 = queryWeight, product of:
                1.1625588 = boost
                6.5110207 = idf(docFreq=170, maxDocs=42306)
                0.017317448 = queryNorm
              0.4069388 = fieldWeight in 4108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5110207 = idf(docFreq=170, maxDocs=42306)
                0.0625 = fieldNorm(doc=4108)
          0.015424745 = weight(abstract_txt:systems in 4108) [ClassicSimilarity], result of:
            0.015424745 = score(doc=4108,freq=1.0), product of:
              0.07221907 = queryWeight, product of:
                1.220343 = boost
                3.4173236 = idf(docFreq=3771, maxDocs=42306)
                0.017317448 = queryNorm
              0.21358272 = fieldWeight in 4108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4173236 = idf(docFreq=3771, maxDocs=42306)
                0.0625 = fieldNorm(doc=4108)
          0.036575742 = weight(abstract_txt:terms in 4108) [ClassicSimilarity], result of:
            0.036575742 = score(doc=4108,freq=2.0), product of:
              0.101927646 = queryWeight, product of:
                1.4497795 = boost
                4.059814 = idf(docFreq=1983, maxDocs=42306)
                0.017317448 = queryNorm
              0.35884026 = fieldWeight in 4108, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.059814 = idf(docFreq=1983, maxDocs=42306)
                0.0625 = fieldNorm(doc=4108)
          0.038109515 = weight(abstract_txt:documents in 4108) [ClassicSimilarity], result of:
            0.038109515 = score(doc=4108,freq=2.0), product of:
              0.10475759 = queryWeight, product of:
                1.4697678 = boost
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.017317448 = queryNorm
              0.36378762 = fieldWeight in 4108, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.0625 = fieldNorm(doc=4108)
          0.046829537 = weight(abstract_txt:processing in 4108) [ClassicSimilarity], result of:
            0.046829537 = score(doc=4108,freq=1.0), product of:
              0.15142113 = queryWeight, product of:
                1.7670515 = boost
                4.94827 = idf(docFreq=815, maxDocs=42306)
                0.017317448 = queryNorm
              0.30926687 = fieldWeight in 4108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.94827 = idf(docFreq=815, maxDocs=42306)
                0.0625 = fieldNorm(doc=4108)
          0.07227297 = weight(abstract_txt:queries in 4108) [ClassicSimilarity], result of:
            0.07227297 = score(doc=4108,freq=2.0), product of:
              0.16050202 = queryWeight, product of:
                1.8192661 = boost
                5.094486 = idf(docFreq=704, maxDocs=42306)
                0.017317448 = queryNorm
              0.4502932 = fieldWeight in 4108, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.094486 = idf(docFreq=704, maxDocs=42306)
                0.0625 = fieldNorm(doc=4108)
        0.28 = coord(7/25)
    
  4. Pal, S.; Mitra, M.; Kamps, J.: Evaluation effort, reliability and reusability in XML retrieval (2011) 0.08
    0.07525353 = sum of:
      0.07525353 = product of:
        0.4703346 = sum of:
          0.044827618 = weight(abstract_txt:recall in 1198) [ClassicSimilarity], result of:
            0.044827618 = score(doc=1198,freq=2.0), product of:
              0.10127719 = queryWeight, product of:
                1.0218726 = boost
                5.723095 = idf(docFreq=375, maxDocs=42306)
                0.017317448 = queryNorm
              0.44262305 = fieldWeight in 1198, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.723095 = idf(docFreq=375, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1198)
          0.026993303 = weight(abstract_txt:systems in 1198) [ClassicSimilarity], result of:
            0.026993303 = score(doc=1198,freq=4.0), product of:
              0.07221907 = queryWeight, product of:
                1.220343 = boost
                3.4173236 = idf(docFreq=3771, maxDocs=42306)
                0.017317448 = queryNorm
              0.37376976 = fieldWeight in 1198, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4173236 = idf(docFreq=3771, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1198)
          0.30908045 = weight(abstract_txt:inex in 1198) [ClassicSimilarity], result of:
            0.30908045 = score(doc=1198,freq=5.0), product of:
              0.2703225 = queryWeight, product of:
                1.6694833 = boost
                9.3501 = idf(docFreq=9, maxDocs=42306)
                0.017317448 = queryNorm
              1.1433767 = fieldWeight in 1198, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                9.3501 = idf(docFreq=9, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1198)
          0.08943324 = weight(abstract_txt:queries in 1198) [ClassicSimilarity], result of:
            0.08943324 = score(doc=1198,freq=4.0), product of:
              0.16050202 = queryWeight, product of:
                1.8192661 = boost
                5.094486 = idf(docFreq=704, maxDocs=42306)
                0.017317448 = queryNorm
              0.55720943 = fieldWeight in 1198, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.094486 = idf(docFreq=704, maxDocs=42306)
                0.0546875 = fieldNorm(doc=1198)
        0.16 = coord(4/25)
    
  5. Billhardt, H.; Borrajo, D.; Maojo, V.: ¬A context vector model for information retrieval (2002) 0.07
    0.066510744 = sum of:
      0.066510744 = product of:
        0.2771281 = sum of:
          0.036226183 = weight(abstract_txt:recall in 1252) [ClassicSimilarity], result of:
            0.036226183 = score(doc=1252,freq=1.0), product of:
              0.10127719 = queryWeight, product of:
                1.0218726 = boost
                5.723095 = idf(docFreq=375, maxDocs=42306)
                0.017317448 = queryNorm
              0.35769343 = fieldWeight in 1252, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.723095 = idf(docFreq=375, maxDocs=42306)
                0.0625 = fieldNorm(doc=1252)
          0.0449839 = weight(abstract_txt:define in 1252) [ClassicSimilarity], result of:
            0.0449839 = score(doc=1252,freq=1.0), product of:
              0.117004156 = queryWeight, product of:
                1.0983522 = boost
                6.151426 = idf(docFreq=244, maxDocs=42306)
                0.017317448 = queryNorm
              0.38446411 = fieldWeight in 1252, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.151426 = idf(docFreq=244, maxDocs=42306)
                0.0625 = fieldNorm(doc=1252)
          0.053342946 = weight(abstract_txt:classical in 1252) [ClassicSimilarity], result of:
            0.053342946 = score(doc=1252,freq=1.0), product of:
              0.13108346 = queryWeight, product of:
                1.1625588 = boost
                6.5110207 = idf(docFreq=170, maxDocs=42306)
                0.017317448 = queryNorm
              0.4069388 = fieldWeight in 1252, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5110207 = idf(docFreq=170, maxDocs=42306)
                0.0625 = fieldNorm(doc=1252)
          0.044795953 = weight(abstract_txt:terms in 1252) [ClassicSimilarity], result of:
            0.044795953 = score(doc=1252,freq=3.0), product of:
              0.101927646 = queryWeight, product of:
                1.4497795 = boost
                4.059814 = idf(docFreq=1983, maxDocs=42306)
                0.017317448 = queryNorm
              0.43948776 = fieldWeight in 1252, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.059814 = idf(docFreq=1983, maxDocs=42306)
                0.0625 = fieldNorm(doc=1252)
          0.046674434 = weight(abstract_txt:documents in 1252) [ClassicSimilarity], result of:
            0.046674434 = score(doc=1252,freq=3.0), product of:
              0.10475759 = queryWeight, product of:
                1.4697678 = boost
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.017317448 = queryNorm
              0.445547 = fieldWeight in 1252, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.115787 = idf(docFreq=1875, maxDocs=42306)
                0.0625 = fieldNorm(doc=1252)
          0.051104706 = weight(abstract_txt:queries in 1252) [ClassicSimilarity], result of:
            0.051104706 = score(doc=1252,freq=1.0), product of:
              0.16050202 = queryWeight, product of:
                1.8192661 = boost
                5.094486 = idf(docFreq=704, maxDocs=42306)
                0.017317448 = queryNorm
              0.3184054 = fieldWeight in 1252, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.094486 = idf(docFreq=704, maxDocs=42306)
                0.0625 = fieldNorm(doc=1252)
        0.24 = coord(6/25)