Document (#33343)

Author
Klein, S.T.
Title
Processing queries with metrical constraints in XML-based IR systems
Source
Journal of the American Society for Information Science and Technology. 59(2008) no.1, S.86-97
Year
2008
Abstract
XML documents combine features from classical IR systems allowing free text, with explicit structures as in databases. Many query languages have been specially designed for IR applications on XML documents. This work concentrates on a special type of language for which the problem of processing queries including metrical constraints is investigated. The main question is how to define the distance between terms in different locations of the XML tree in an intuitively justifiable way, without jeopardizing the ability to get good retrieval results in terms of recall and precision. A new definition is given and its usefulness is shown on several examples from the INEX collection.
Object
XML

Similar documents (author)

  1. Klein, W.: Organisation des Wissens durch Sprache : Konsequenzen für die maschinelle Sprachanalyse (1977) 4.96
    4.9598045 = sum of:
      4.9598045 = weight(author_txt:klein in 1748) [ClassicSimilarity], result of:
        4.9598045 = fieldWeight in 1748, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.935687 = idf(docFreq=42, maxDocs=44218)
          0.625 = fieldNorm(doc=1748)
    
  2. Klein, H.: GENIOS jetzt mit Thesaurus-Suche (1993) 4.96
    4.9598045 = sum of:
      4.9598045 = weight(author_txt:klein in 7537) [ClassicSimilarity], result of:
        4.9598045 = fieldWeight in 7537, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.935687 = idf(docFreq=42, maxDocs=44218)
          0.625 = fieldNorm(doc=7537)
    
  3. Klein, R.D.: ¬The problem of cataloguing world literature using the Nippon Decimal Classification (1994) 4.96
    4.9598045 = sum of:
      4.9598045 = weight(author_txt:klein in 867) [ClassicSimilarity], result of:
        4.9598045 = fieldWeight in 867, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.935687 = idf(docFreq=42, maxDocs=44218)
          0.625 = fieldNorm(doc=867)
    
  4. Klein, G.M.: Is there a standard default keyword operator? : a bibliometric analysis of processing options chosen by libraries to execute keyword searches in online public access catalogs (1994) 4.96
    4.9598045 = sum of:
      4.9598045 = weight(author_txt:klein in 2200) [ClassicSimilarity], result of:
        4.9598045 = fieldWeight in 2200, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.935687 = idf(docFreq=42, maxDocs=44218)
          0.625 = fieldNorm(doc=2200)
    
  5. Klein, J.T.: Interdisciplinary needs : the current context (1996) 4.96
    4.9598045 = sum of:
      4.9598045 = weight(author_txt:klein in 7176) [ClassicSimilarity], result of:
        4.9598045 = fieldWeight in 7176, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.935687 = idf(docFreq=42, maxDocs=44218)
          0.625 = fieldNorm(doc=7176)
    

Similar documents (content)

  1. Klein, S.T.: On the use of negation in Boolean IR queries. (2009) 0.30
    0.30462766 = sum of:
      0.30462766 = product of:
        1.087956 = sum of:
          0.051043253 = weight(abstract_txt:shown in 3927) [ClassicSimilarity], result of:
            0.051043253 = score(doc=3927,freq=1.0), product of:
              0.09744031 = queryWeight, product of:
                5.58764 = idf(docFreq=449, maxDocs=44218)
                0.017438546 = queryNorm
              0.52384126 = fieldWeight in 3927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.58764 = idf(docFreq=449, maxDocs=44218)
                0.09375 = fieldNorm(doc=3927)
          0.05748846 = weight(abstract_txt:investigated in 3927) [ClassicSimilarity], result of:
            0.05748846 = score(doc=3927,freq=1.0), product of:
              0.10547921 = queryWeight, product of:
                1.0404329 = boost
                5.813565 = idf(docFreq=358, maxDocs=44218)
                0.017438546 = queryNorm
              0.5450217 = fieldWeight in 3927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.813565 = idf(docFreq=358, maxDocs=44218)
                0.09375 = fieldNorm(doc=3927)
          0.06506621 = weight(abstract_txt:usefulness in 3927) [ClassicSimilarity], result of:
            0.06506621 = score(doc=3927,freq=1.0), product of:
              0.114555724 = queryWeight, product of:
                1.084274 = boost
                6.0585327 = idf(docFreq=280, maxDocs=44218)
                0.017438546 = queryNorm
              0.56798744 = fieldWeight in 3927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0585327 = idf(docFreq=280, maxDocs=44218)
                0.09375 = fieldNorm(doc=3927)
          0.054725364 = weight(abstract_txt:terms in 3927) [ClassicSimilarity], result of:
            0.054725364 = score(doc=3927,freq=2.0), product of:
              0.10207175 = queryWeight, product of:
                1.4474329 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017438546 = queryNorm
              0.53614604 = fieldWeight in 3927, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=3927)
          0.13496952 = weight(abstract_txt:queries in 3927) [ClassicSimilarity], result of:
            0.13496952 = score(doc=3927,freq=3.0), product of:
              0.16276956 = queryWeight, product of:
                1.827815 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.017438546 = queryNorm
              0.82920617 = fieldWeight in 3927, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.09375 = fieldNorm(doc=3927)
          0.18212835 = weight(abstract_txt:constraints in 3927) [ClassicSimilarity], result of:
            0.18212835 = score(doc=3927,freq=1.0), product of:
              0.28666508 = queryWeight, product of:
                2.4256775 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.017438546 = queryNorm
              0.63533497 = fieldWeight in 3927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.09375 = fieldNorm(doc=3927)
          0.5425348 = weight(abstract_txt:metrical in 3927) [ClassicSimilarity], result of:
            0.5425348 = score(doc=3927,freq=1.0), product of:
              0.59348285 = queryWeight, product of:
                3.4901955 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.017438546 = queryNorm
              0.9141542 = fieldWeight in 3927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.09375 = fieldNorm(doc=3927)
        0.28 = coord(7/25)
    
  2. Schlieder, T.; Meuss, H.: Querying and ranking XML documents (2002) 0.09
    0.09314824 = sum of:
      0.09314824 = product of:
        0.38811767 = sum of:
          0.053664804 = weight(abstract_txt:classical in 459) [ClassicSimilarity], result of:
            0.053664804 = score(doc=459,freq=1.0), product of:
              0.13201815 = queryWeight, product of:
                1.1639853 = boost
                6.5039306 = idf(docFreq=179, maxDocs=44218)
                0.017438546 = queryNorm
              0.40649566 = fieldWeight in 459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5039306 = idf(docFreq=179, maxDocs=44218)
                0.0625 = fieldNorm(doc=459)
          0.058305584 = weight(abstract_txt:combine in 459) [ClassicSimilarity], result of:
            0.058305584 = score(doc=459,freq=1.0), product of:
              0.1395235 = queryWeight, product of:
                1.1966147 = boost
                6.686252 = idf(docFreq=149, maxDocs=44218)
                0.017438546 = queryNorm
              0.41789076 = fieldWeight in 459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.686252 = idf(docFreq=149, maxDocs=44218)
                0.0625 = fieldNorm(doc=459)
          0.088464685 = weight(abstract_txt:tree in 459) [ClassicSimilarity], result of:
            0.088464685 = score(doc=459,freq=2.0), product of:
              0.14622128 = queryWeight, product of:
                1.2249997 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.017438546 = queryNorm
              0.60500556 = fieldWeight in 459, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.0625 = fieldNorm(doc=459)
          0.036483575 = weight(abstract_txt:terms in 459) [ClassicSimilarity], result of:
            0.036483575 = score(doc=459,freq=2.0), product of:
              0.10207175 = queryWeight, product of:
                1.4474329 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017438546 = queryNorm
              0.3574307 = fieldWeight in 459, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=459)
          0.047299437 = weight(abstract_txt:documents in 459) [ClassicSimilarity], result of:
            0.047299437 = score(doc=459,freq=3.0), product of:
              0.106018305 = queryWeight, product of:
                1.4751498 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.017438546 = queryNorm
              0.44614407 = fieldWeight in 459, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=459)
          0.10389959 = weight(abstract_txt:queries in 459) [ClassicSimilarity], result of:
            0.10389959 = score(doc=459,freq=4.0), product of:
              0.16276956 = queryWeight, product of:
                1.827815 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.017438546 = queryNorm
              0.63832325 = fieldWeight in 459, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.0625 = fieldNorm(doc=459)
        0.24 = coord(6/25)
    
  3. Vilares, J.; Alonso, M.A.; Vilares, M.: Extraction of complex index terms in non-English IR : a shallow parsing based approach (2008) 0.08
    0.08359591 = sum of:
      0.08359591 = product of:
        0.2985568 = sum of:
          0.034028836 = weight(abstract_txt:shown in 2107) [ClassicSimilarity], result of:
            0.034028836 = score(doc=2107,freq=1.0), product of:
              0.09744031 = queryWeight, product of:
                5.58764 = idf(docFreq=449, maxDocs=44218)
                0.017438546 = queryNorm
              0.3492275 = fieldWeight in 2107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.58764 = idf(docFreq=449, maxDocs=44218)
                0.0625 = fieldNorm(doc=2107)
          0.053664804 = weight(abstract_txt:classical in 2107) [ClassicSimilarity], result of:
            0.053664804 = score(doc=2107,freq=1.0), product of:
              0.13201815 = queryWeight, product of:
                1.1639853 = boost
                6.5039306 = idf(docFreq=179, maxDocs=44218)
                0.017438546 = queryNorm
              0.40649566 = fieldWeight in 2107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5039306 = idf(docFreq=179, maxDocs=44218)
                0.0625 = fieldNorm(doc=2107)
          0.01549432 = weight(abstract_txt:systems in 2107) [ClassicSimilarity], result of:
            0.01549432 = score(doc=2107,freq=1.0), product of:
              0.072660595 = queryWeight, product of:
                1.2212235 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.017438546 = queryNorm
              0.2132424 = fieldWeight in 2107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.0625 = fieldNorm(doc=2107)
          0.036483575 = weight(abstract_txt:terms in 2107) [ClassicSimilarity], result of:
            0.036483575 = score(doc=2107,freq=2.0), product of:
              0.10207175 = queryWeight, product of:
                1.4474329 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.017438546 = queryNorm
              0.3574307 = fieldWeight in 2107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2107)
          0.03861983 = weight(abstract_txt:documents in 2107) [ClassicSimilarity], result of:
            0.03861983 = score(doc=2107,freq=2.0), product of:
              0.106018305 = queryWeight, product of:
                1.4751498 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.017438546 = queryNorm
              0.36427513 = fieldWeight in 2107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=2107)
          0.046797313 = weight(abstract_txt:processing in 2107) [ClassicSimilarity], result of:
            0.046797313 = score(doc=2107,freq=1.0), product of:
              0.15182078 = queryWeight, product of:
                1.7652706 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.017438546 = queryNorm
              0.3082405 = fieldWeight in 2107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0625 = fieldNorm(doc=2107)
          0.073468104 = weight(abstract_txt:queries in 2107) [ClassicSimilarity], result of:
            0.073468104 = score(doc=2107,freq=2.0), product of:
              0.16276956 = queryWeight, product of:
                1.827815 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.017438546 = queryNorm
              0.4513627 = fieldWeight in 2107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.0625 = fieldNorm(doc=2107)
        0.28 = coord(7/25)
    
  4. Pal, S.; Mitra, M.; Kamps, J.: Evaluation effort, reliability and reusability in XML retrieval (2011) 0.08
    0.07684726 = sum of:
      0.07684726 = product of:
        0.4802954 = sum of:
          0.0458594 = weight(abstract_txt:recall in 4197) [ClassicSimilarity], result of:
            0.0458594 = score(doc=4197,freq=2.0), product of:
              0.10314404 = queryWeight, product of:
                1.0288516 = boost
                5.7488523 = idf(docFreq=382, maxDocs=44218)
                0.017438546 = queryNorm
              0.44461513 = fieldWeight in 4197, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7488523 = idf(docFreq=382, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4197)
          0.027115058 = weight(abstract_txt:systems in 4197) [ClassicSimilarity], result of:
            0.027115058 = score(doc=4197,freq=4.0), product of:
              0.072660595 = queryWeight, product of:
                1.2212235 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.017438546 = queryNorm
              0.3731742 = fieldWeight in 4197, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4197)
          0.31640878 = weight(abstract_txt:inex in 4197) [ClassicSimilarity], result of:
            0.31640878 = score(doc=4197,freq=5.0), product of:
              0.27542982 = queryWeight, product of:
                1.6812649 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.017438546 = queryNorm
              1.1487819 = fieldWeight in 4197, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4197)
          0.09091214 = weight(abstract_txt:queries in 4197) [ClassicSimilarity], result of:
            0.09091214 = score(doc=4197,freq=4.0), product of:
              0.16276956 = queryWeight, product of:
                1.827815 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.017438546 = queryNorm
              0.55853283 = fieldWeight in 4197, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4197)
        0.16 = coord(4/25)
    
  5. Pérez Pozo, Á.; Rosa, J. de la; Ros, S.; González-Blanco, E.; Hernández, L.; Sisto, M. de: ¬A bridge too far for artificial intelligence? : automatic classification of stanzas in Spanish poetry (2022) 0.08
    0.07642341 = sum of:
      0.07642341 = product of:
        0.4776463 = sum of:
          0.053664804 = weight(abstract_txt:classical in 468) [ClassicSimilarity], result of:
            0.053664804 = score(doc=468,freq=1.0), product of:
              0.13201815 = queryWeight, product of:
                1.1639853 = boost
                6.5039306 = idf(docFreq=179, maxDocs=44218)
                0.017438546 = queryNorm
              0.40649566 = fieldWeight in 468, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5039306 = idf(docFreq=179, maxDocs=44218)
                0.0625 = fieldNorm(doc=468)
          0.01549432 = weight(abstract_txt:systems in 468) [ClassicSimilarity], result of:
            0.01549432 = score(doc=468,freq=1.0), product of:
              0.072660595 = queryWeight, product of:
                1.2212235 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.017438546 = queryNorm
              0.2132424 = fieldWeight in 468, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.0625 = fieldNorm(doc=468)
          0.046797313 = weight(abstract_txt:processing in 468) [ClassicSimilarity], result of:
            0.046797313 = score(doc=468,freq=1.0), product of:
              0.15182078 = queryWeight, product of:
                1.7652706 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.017438546 = queryNorm
              0.3082405 = fieldWeight in 468, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0625 = fieldNorm(doc=468)
          0.36168987 = weight(abstract_txt:metrical in 468) [ClassicSimilarity], result of:
            0.36168987 = score(doc=468,freq=1.0), product of:
              0.59348285 = queryWeight, product of:
                3.4901955 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.017438546 = queryNorm
              0.6094361 = fieldWeight in 468, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0625 = fieldNorm(doc=468)
        0.16 = coord(4/25)