Document (#33370)

Author
Näppilä, T.
Järvelin, K.
Niemi, T.
Title
¬A tool for data cube construction from structurally heterogeneous XML documents
Source
Journal of the American Society for Information Science and Technology. 59(2008) no.3, S.435-449
Year
2008
Abstract
Data cubes for OLAP (On-Line Analytical Processing) often need to be constructed from data located in several distributed and autonomous information sources. Such a data integration process is challenging due to semantic, syntactic, and structural heterogeneity among the data. While XML (extensible markup language) is the de facto standard for data exchange, the three types of heterogeneity remain. Moreover, popular path-oriented XML query languages, such as XQuery, require the user to know in much detail the structure of the documents to be processed and are, thus, effectively impractical in many real-world data integration tasks. Several Lowest Common Ancestor (LCA)-based XML query evaluation strategies have recently been introduced to provide a more structure-independent way to access XML documents. We shall, however, show that this approach leads in the context of certain - not uncommon - types of XML documents to undesirable results. This article introduces a novel high-level data extraction primitive that utilizes the purpose-built Smallest Possible Context (SPC) query evaluation strategy. We demonstrate, through a system prototype for OLAP data cube construction and a sample application in informetrics, that our approach has real advantages in data integration.

Similar documents (author)

  1. Järvelin, K.; Niemi, T.: Deductive information retrieval based on classifications (1993) 6.14
    6.144943 = sum of:
      6.144943 = sum of:
        2.3369596 = weight(author_txt:järvelin in 2229) [ClassicSimilarity], result of:
          2.3369596 = score(doc=2229,freq=1.0), product of:
            0.58546096 = queryWeight, product of:
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.07333557 = queryNorm
            3.9916575 = fieldWeight in 2229, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.5 = fieldNorm(doc=2229)
        3.8079836 = weight(author_txt:niemi in 2229) [ClassicSimilarity], result of:
          3.8079836 = score(doc=2229,freq=1.0), product of:
            0.81070065 = queryWeight, product of:
              1.1767421 = boost
              9.394302 = idf(docFreq=9, maxDocs=44218)
              0.07333557 = queryNorm
            4.697151 = fieldWeight in 2229, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.394302 = idf(docFreq=9, maxDocs=44218)
              0.5 = fieldNorm(doc=2229)
    
  2. Niemi, T.; Hirvonen, L.; Järvelin, K.: Multidimensional data model and query language for informetrics (2003) 4.61
    4.6087074 = sum of:
      4.6087074 = sum of:
        1.7527198 = weight(author_txt:järvelin in 1753) [ClassicSimilarity], result of:
          1.7527198 = score(doc=1753,freq=1.0), product of:
            0.58546096 = queryWeight, product of:
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.07333557 = queryNorm
            2.9937432 = fieldWeight in 1753, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.375 = fieldNorm(doc=1753)
        2.8559875 = weight(author_txt:niemi in 1753) [ClassicSimilarity], result of:
          2.8559875 = score(doc=1753,freq=1.0), product of:
            0.81070065 = queryWeight, product of:
              1.1767421 = boost
              9.394302 = idf(docFreq=9, maxDocs=44218)
              0.07333557 = queryNorm
            3.5228634 = fieldWeight in 1753, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.394302 = idf(docFreq=9, maxDocs=44218)
              0.375 = fieldNorm(doc=1753)
    
  3. Järvelin, K.; Ingwersen, P.; Niemi, T.: ¬A user-oriented interface for generalised informetric analysis based on applying advanced data modelling techniques (2000) 4.61
    4.6087074 = sum of:
      4.6087074 = sum of:
        1.7527198 = weight(author_txt:järvelin in 4545) [ClassicSimilarity], result of:
          1.7527198 = score(doc=4545,freq=1.0), product of:
            0.58546096 = queryWeight, product of:
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.07333557 = queryNorm
            2.9937432 = fieldWeight in 4545, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.375 = fieldNorm(doc=4545)
        2.8559875 = weight(author_txt:niemi in 4545) [ClassicSimilarity], result of:
          2.8559875 = score(doc=4545,freq=1.0), product of:
            0.81070065 = queryWeight, product of:
              1.1767421 = boost
              9.394302 = idf(docFreq=9, maxDocs=44218)
              0.07333557 = queryNorm
            3.5228634 = fieldWeight in 4545, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.394302 = idf(docFreq=9, maxDocs=44218)
              0.375 = fieldNorm(doc=4545)
    
  4. Niemi, T.; Junkkari, M.; Järvelin, K.; Viita, S.: Advanced query language for manipulating complex entities (2004) 3.84
    3.8405895 = sum of:
      3.8405895 = sum of:
        1.4605998 = weight(author_txt:järvelin in 4218) [ClassicSimilarity], result of:
          1.4605998 = score(doc=4218,freq=1.0), product of:
            0.58546096 = queryWeight, product of:
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.07333557 = queryNorm
            2.494786 = fieldWeight in 4218, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.3125 = fieldNorm(doc=4218)
        2.3799896 = weight(author_txt:niemi in 4218) [ClassicSimilarity], result of:
          2.3799896 = score(doc=4218,freq=1.0), product of:
            0.81070065 = queryWeight, product of:
              1.1767421 = boost
              9.394302 = idf(docFreq=9, maxDocs=44218)
              0.07333557 = queryNorm
            2.9357195 = fieldWeight in 4218, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.394302 = idf(docFreq=9, maxDocs=44218)
              0.3125 = fieldNorm(doc=4218)
    
  5. Järvelin, K.; Kristensen, J.; Niemi, T.; Sormunen, E.; Keskustalo, H.: ¬A deductive data model for query expansion (1996) 3.84
    3.8405895 = sum of:
      3.8405895 = sum of:
        1.4605998 = weight(author_txt:järvelin in 2230) [ClassicSimilarity], result of:
          1.4605998 = score(doc=2230,freq=1.0), product of:
            0.58546096 = queryWeight, product of:
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.07333557 = queryNorm
            2.494786 = fieldWeight in 2230, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.983315 = idf(docFreq=40, maxDocs=44218)
              0.3125 = fieldNorm(doc=2230)
        2.3799896 = weight(author_txt:niemi in 2230) [ClassicSimilarity], result of:
          2.3799896 = score(doc=2230,freq=1.0), product of:
            0.81070065 = queryWeight, product of:
              1.1767421 = boost
              9.394302 = idf(docFreq=9, maxDocs=44218)
              0.07333557 = queryNorm
            2.9357195 = fieldWeight in 2230, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.394302 = idf(docFreq=9, maxDocs=44218)
              0.3125 = fieldNorm(doc=2230)
    

Similar documents (content)

  1. Chang, Y.; Ounis, I.; Kim, M.: Query reformulation using automatically generated query concepts from a document space (2006) 0.12
    0.11785962 = sum of:
      0.11785962 = product of:
        0.58929807 = sum of:
          0.034091696 = weight(abstract_txt:evaluation in 972) [ClassicSimilarity], result of:
            0.034091696 = score(doc=972,freq=1.0), product of:
              0.09727308 = queryWeight, product of:
                1.1371206 = boost
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.019068662 = queryNorm
              0.35047412 = fieldWeight in 972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.078125 = fieldNorm(doc=972)
          0.27425936 = weight(abstract_txt:primitive in 972) [ClassicSimilarity], result of:
            0.27425936 = score(doc=972,freq=4.0), product of:
              0.19527106 = queryWeight, product of:
                1.1392372 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.019068662 = queryNorm
              1.4045058 = fieldWeight in 972, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.078125 = fieldNorm(doc=972)
          0.13606304 = weight(abstract_txt:query in 972) [ClassicSimilarity], result of:
            0.13606304 = score(doc=972,freq=5.0), product of:
              0.16384284 = queryWeight, product of:
                1.8074635 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.019068662 = queryNorm
              0.8304485 = fieldWeight in 972, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=972)
          0.074765176 = weight(abstract_txt:documents in 972) [ClassicSimilarity], result of:
            0.074765176 = score(doc=972,freq=2.0), product of:
              0.16419496 = queryWeight, product of:
                2.0893207 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.019068662 = queryNorm
              0.4553439 = fieldWeight in 972, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=972)
          0.07011887 = weight(abstract_txt:data in 972) [ClassicSimilarity], result of:
            0.07011887 = score(doc=972,freq=1.0), product of:
              0.26901317 = queryWeight, product of:
                4.2284575 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.019068662 = queryNorm
              0.26065218 = fieldWeight in 972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=972)
        0.2 = coord(5/25)
    
  2. Mayr, P.; Mutschke, P.; Petras, V.: Reducing semantic complexity in distributed digital libraries : Treatment of term vagueness and document re-ranking (2008) 0.10
    0.104440555 = sum of:
      0.104440555 = product of:
        0.43516898 = sum of:
          0.024862241 = weight(abstract_txt:several in 1909) [ClassicSimilarity], result of:
            0.024862241 = score(doc=1909,freq=1.0), product of:
              0.09996664 = queryWeight, product of:
                1.1527569 = boost
                4.5477557 = idf(docFreq=1272, maxDocs=44218)
                0.019068662 = queryNorm
              0.24870539 = fieldWeight in 1909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5477557 = idf(docFreq=1272, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1909)
          0.042899564 = weight(abstract_txt:construction in 1909) [ClassicSimilarity], result of:
            0.042899564 = score(doc=1909,freq=1.0), product of:
              0.14381255 = queryWeight, product of:
                1.3826383 = boost
                5.4546638 = idf(docFreq=513, maxDocs=44218)
                0.019068662 = queryNorm
              0.29830194 = fieldWeight in 1909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4546638 = idf(docFreq=513, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1909)
          0.060237676 = weight(abstract_txt:query in 1909) [ClassicSimilarity], result of:
            0.060237676 = score(doc=1909,freq=2.0), product of:
              0.16384284 = queryWeight, product of:
                1.8074635 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.019068662 = queryNorm
              0.36765522 = fieldWeight in 1909, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1909)
          0.21119034 = weight(abstract_txt:heterogeneity in 1909) [ClassicSimilarity], result of:
            0.21119034 = score(doc=1909,freq=3.0), product of:
              0.28856072 = queryWeight, product of:
                1.9585235 = boost
                7.7265954 = idf(docFreq=52, maxDocs=44218)
                0.019068662 = queryNorm
              0.7318749 = fieldWeight in 1909, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.7265954 = idf(docFreq=52, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1909)
          0.05897227 = weight(abstract_txt:integration in 1909) [ClassicSimilarity], result of:
            0.05897227 = score(doc=1909,freq=1.0), product of:
              0.20352787 = queryWeight, product of:
                2.0145023 = boost
                5.298292 = idf(docFreq=600, maxDocs=44218)
                0.019068662 = queryNorm
              0.28975034 = fieldWeight in 1909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.298292 = idf(docFreq=600, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1909)
          0.03700687 = weight(abstract_txt:documents in 1909) [ClassicSimilarity], result of:
            0.03700687 = score(doc=1909,freq=1.0), product of:
              0.16419496 = queryWeight, product of:
                2.0893207 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.019068662 = queryNorm
              0.22538373 = fieldWeight in 1909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1909)
        0.24 = coord(6/25)
    
  3. Aldana, J.F.; Gómez, A.C.; Moreno, N.; Nebro, A.J.; Roldán, M.M.: Metadata functionality for semantic Web integration (2003) 0.10
    0.1000621 = sum of:
      0.1000621 = product of:
        0.31269407 = sum of:
          0.055646807 = weight(abstract_txt:autonomous in 2731) [ClassicSimilarity], result of:
            0.055646807 = score(doc=2731,freq=1.0), product of:
              0.15045603 = queryWeight, product of:
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.019068662 = queryNorm
              0.3698543 = fieldWeight in 2731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.046875 = fieldNorm(doc=2731)
          0.026192503 = weight(abstract_txt:context in 2731) [ClassicSimilarity], result of:
            0.026192503 = score(doc=2731,freq=2.0), product of:
              0.09104039 = queryWeight, product of:
                1.1000875 = boost
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.019068662 = queryNorm
              0.28770202 = fieldWeight in 2731, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.046875 = fieldNorm(doc=2731)
          0.018752689 = weight(abstract_txt:structure in 2731) [ClassicSimilarity], result of:
            0.018752689 = score(doc=2731,freq=1.0), product of:
              0.0917984 = queryWeight, product of:
                1.1046578 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.019068662 = queryNorm
              0.20428121 = fieldWeight in 2731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.046875 = fieldNorm(doc=2731)
          0.020455018 = weight(abstract_txt:evaluation in 2731) [ClassicSimilarity], result of:
            0.020455018 = score(doc=2731,freq=1.0), product of:
              0.09727308 = queryWeight, product of:
                1.1371206 = boost
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.019068662 = queryNorm
              0.21028447 = fieldWeight in 2731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.046875 = fieldNorm(doc=2731)
          0.036509544 = weight(abstract_txt:query in 2731) [ClassicSimilarity], result of:
            0.036509544 = score(doc=2731,freq=1.0), product of:
              0.16384284 = queryWeight, product of:
                1.8074635 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.019068662 = queryNorm
              0.22283271 = fieldWeight in 2731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.046875 = fieldNorm(doc=2731)
          0.05054766 = weight(abstract_txt:integration in 2731) [ClassicSimilarity], result of:
            0.05054766 = score(doc=2731,freq=1.0), product of:
              0.20352787 = queryWeight, product of:
                2.0145023 = boost
                5.298292 = idf(docFreq=600, maxDocs=44218)
                0.019068662 = queryNorm
              0.24835745 = fieldWeight in 2731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.298292 = idf(docFreq=600, maxDocs=44218)
                0.046875 = fieldNorm(doc=2731)
          0.031720176 = weight(abstract_txt:documents in 2731) [ClassicSimilarity], result of:
            0.031720176 = score(doc=2731,freq=1.0), product of:
              0.16419496 = queryWeight, product of:
                2.0893207 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.019068662 = queryNorm
              0.19318606 = fieldWeight in 2731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.046875 = fieldNorm(doc=2731)
          0.07286966 = weight(abstract_txt:data in 2731) [ClassicSimilarity], result of:
            0.07286966 = score(doc=2731,freq=3.0), product of:
              0.26901317 = queryWeight, product of:
                4.2284575 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.019068662 = queryNorm
              0.27087766 = fieldWeight in 2731, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.046875 = fieldNorm(doc=2731)
        0.32 = coord(8/25)
    
  4. Mothe, J.; Chrisment, C.; Dousset, B.; Alaux, J.: DocCube : Multi-dimensional visualisation and exploration of large document sets (2003) 0.10
    0.09925852 = sum of:
      0.09925852 = product of:
        0.62036574 = sum of:
          0.037505377 = weight(abstract_txt:structure in 1613) [ClassicSimilarity], result of:
            0.037505377 = score(doc=1613,freq=1.0), product of:
              0.0917984 = queryWeight, product of:
                1.1046578 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.019068662 = queryNorm
              0.40856242 = fieldWeight in 1613, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.09375 = fieldNorm(doc=1613)
          0.07301909 = weight(abstract_txt:query in 1613) [ClassicSimilarity], result of:
            0.07301909 = score(doc=1613,freq=1.0), product of:
              0.16384284 = queryWeight, product of:
                1.8074635 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.019068662 = queryNorm
              0.44566542 = fieldWeight in 1613, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.09375 = fieldNorm(doc=1613)
          0.08971821 = weight(abstract_txt:documents in 1613) [ClassicSimilarity], result of:
            0.08971821 = score(doc=1613,freq=2.0), product of:
              0.16419496 = queryWeight, product of:
                2.0893207 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.019068662 = queryNorm
              0.5464127 = fieldWeight in 1613, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.09375 = fieldNorm(doc=1613)
          0.4201231 = weight(abstract_txt:olap in 1613) [ClassicSimilarity], result of:
            0.4201231 = score(doc=1613,freq=1.0), product of:
              0.45957577 = queryWeight, product of:
                2.4716601 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.019068662 = queryNorm
              0.9141542 = fieldWeight in 1613, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.09375 = fieldNorm(doc=1613)
        0.16 = coord(4/25)
    
  5. Special topic issue: XML (2002) 0.10
    0.09744676 = sum of:
      0.09744676 = product of:
        0.4872338 = sum of:
          0.025003586 = weight(abstract_txt:structure in 458) [ClassicSimilarity], result of:
            0.025003586 = score(doc=458,freq=1.0), product of:
              0.0917984 = queryWeight, product of:
                1.1046578 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.019068662 = queryNorm
              0.27237496 = fieldWeight in 458, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0625 = fieldNorm(doc=458)
          0.25424457 = weight(abstract_txt:xquery in 458) [ClassicSimilarity], result of:
            0.25424457 = score(doc=458,freq=3.0), product of:
              0.2371106 = queryWeight, product of:
                1.2553669 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.019068662 = queryNorm
              1.0722615 = fieldWeight in 458, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0625 = fieldNorm(doc=458)
          0.06884306 = weight(abstract_txt:query in 458) [ClassicSimilarity], result of:
            0.06884306 = score(doc=458,freq=2.0), product of:
              0.16384284 = queryWeight, product of:
                1.8074635 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.019068662 = queryNorm
              0.4201774 = fieldWeight in 458, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=458)
          0.05981214 = weight(abstract_txt:documents in 458) [ClassicSimilarity], result of:
            0.05981214 = score(doc=458,freq=2.0), product of:
              0.16419496 = queryWeight, product of:
                2.0893207 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.019068662 = queryNorm
              0.36427513 = fieldWeight in 458, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=458)
          0.079330444 = weight(abstract_txt:data in 458) [ClassicSimilarity], result of:
            0.079330444 = score(doc=458,freq=2.0), product of:
              0.26901317 = queryWeight, product of:
                4.2284575 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.019068662 = queryNorm
              0.29489428 = fieldWeight in 458, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=458)
        0.2 = coord(5/25)