Document (#33368)

Author
Näppilä, T.
Järvelin, K.
Niemi, T.
Title
¬A tool for data cube construction from structurally heterogeneous XML documents
Source
Journal of the American Society for Information Science and Technology. 59(2008) no.3, pp. 435-449
Year
2008
Abstract
Data cubes for OLAP (On-Line Analytical Processing) often need to be constructed from data located in several distributed and autonomous information sources. Such a data integration process is challenging due to semantic, syntactic, and structural heterogeneity among the data. While XML (Extensible Markup Language) is the de facto standard for data exchange, the three types of heterogeneity remain. Moreover, popular path-oriented XML query languages, such as XQuery, require the user to know the structure of the documents to be processed in great detail and are thus impractical in many real-world data integration tasks. Several Lowest Common Ancestor (LCA)-based XML query evaluation strategies have recently been introduced to provide a more structure-independent way to access XML documents. We show, however, that for certain, not uncommon, types of XML documents this approach leads to undesirable results. This article introduces a novel high-level data extraction primitive that utilizes the purpose-built Smallest Possible Context (SPC) query evaluation strategy. We demonstrate, through a system prototype for OLAP data cube construction and a sample application in informetrics, that our approach has real advantages in data integration.

Similar documents (author)

  1. Järvelin, K.; Niemi, T.: Deductive information retrieval based on classifications (1993) 6.16
    6.1609344 = sum of:
      6.1609344 = sum of:
        2.3832746 = weight(author_txt:järvelin in 4227) [ClassicSimilarity], result of:
          2.3832746 = score(doc=4227,freq=1.0), product of:
            0.59254366 = queryWeight, product of:
              8.044216 = idf(docFreq=37, maxDocs=43556)
              0.073660836 = queryNorm
            4.022108 = fieldWeight in 4227, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.044216 = idf(docFreq=37, maxDocs=43556)
              0.5 = fieldNorm(doc=4227)
        3.7776597 = weight(author_txt:niemi in 4227) [ClassicSimilarity], result of:
          3.7776597 = score(doc=4227,freq=1.0), product of:
            0.8055383 = queryWeight, product of:
              1.165958 = boost
              9.379218 = idf(docFreq=9, maxDocs=43556)
              0.073660836 = queryNorm
            4.689609 = fieldWeight in 4227, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.379218 = idf(docFreq=9, maxDocs=43556)
              0.5 = fieldNorm(doc=4227)
    
  2. Niemi, T.; Hirvonen, L.; Järvelin, K.: Multidimensional data model and query language for informetrics (2003) 4.62
    4.620701 = sum of:
      4.620701 = sum of:
        1.787456 = weight(author_txt:järvelin in 2751) [ClassicSimilarity], result of:
          1.787456 = score(doc=2751,freq=1.0), product of:
            0.59254366 = queryWeight, product of:
              8.044216 = idf(docFreq=37, maxDocs=43556)
              0.073660836 = queryNorm
            3.016581 = fieldWeight in 2751, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.044216 = idf(docFreq=37, maxDocs=43556)
              0.375 = fieldNorm(doc=2751)
        2.8332446 = weight(author_txt:niemi in 2751) [ClassicSimilarity], result of:
          2.8332446 = score(doc=2751,freq=1.0), product of:
            0.8055383 = queryWeight, product of:
              1.165958 = boost
              9.379218 = idf(docFreq=9, maxDocs=43556)
              0.073660836 = queryNorm
            3.5172067 = fieldWeight in 2751, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.379218 = idf(docFreq=9, maxDocs=43556)
              0.375 = fieldNorm(doc=2751)
    
  3. Järvelin, K.; Ingwersen, P.; Niemi, T.: ¬A user-oriented interface for generalised informetric analysis based on applying advanced data modelling techniques (2000) 4.62
    4.620701 = sum of:
      4.620701 = sum of:
        1.787456 = weight(author_txt:järvelin in 543) [ClassicSimilarity], result of:
          1.787456 = score(doc=543,freq=1.0), product of:
            0.59254366 = queryWeight, product of:
              8.044216 = idf(docFreq=37, maxDocs=43556)
              0.073660836 = queryNorm
            3.016581 = fieldWeight in 543, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.044216 = idf(docFreq=37, maxDocs=43556)
              0.375 = fieldNorm(doc=543)
        2.8332446 = weight(author_txt:niemi in 543) [ClassicSimilarity], result of:
          2.8332446 = score(doc=543,freq=1.0), product of:
            0.8055383 = queryWeight, product of:
              1.165958 = boost
              9.379218 = idf(docFreq=9, maxDocs=43556)
              0.073660836 = queryNorm
            3.5172067 = fieldWeight in 543, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.379218 = idf(docFreq=9, maxDocs=43556)
              0.375 = fieldNorm(doc=543)
    
  4. Niemi, T.; Junkkari, M.; Järvelin, K.; Viita, S.: Advanced query language for manipulating complex entities (2004) 3.85
    3.850584 = sum of:
      3.850584 = sum of:
        1.4895467 = weight(author_txt:järvelin in 5216) [ClassicSimilarity], result of:
          1.4895467 = score(doc=5216,freq=1.0), product of:
            0.59254366 = queryWeight, product of:
              8.044216 = idf(docFreq=37, maxDocs=43556)
              0.073660836 = queryNorm
            2.5138175 = fieldWeight in 5216, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.044216 = idf(docFreq=37, maxDocs=43556)
              0.3125 = fieldNorm(doc=5216)
        2.3610373 = weight(author_txt:niemi in 5216) [ClassicSimilarity], result of:
          2.3610373 = score(doc=5216,freq=1.0), product of:
            0.8055383 = queryWeight, product of:
              1.165958 = boost
              9.379218 = idf(docFreq=9, maxDocs=43556)
              0.073660836 = queryNorm
            2.9310057 = fieldWeight in 5216, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.379218 = idf(docFreq=9, maxDocs=43556)
              0.3125 = fieldNorm(doc=5216)
    
  5. Järvelin, K.; Kristensen, J.; Niemi, T.; Sormunen, E.; Keskustalo, H.: ¬A deductive data model for query expansion (1996) 3.85
    3.850584 = sum of:
      3.850584 = sum of:
        1.4895467 = weight(author_txt:järvelin in 4228) [ClassicSimilarity], result of:
          1.4895467 = score(doc=4228,freq=1.0), product of:
            0.59254366 = queryWeight, product of:
              8.044216 = idf(docFreq=37, maxDocs=43556)
              0.073660836 = queryNorm
            2.5138175 = fieldWeight in 4228, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.044216 = idf(docFreq=37, maxDocs=43556)
              0.3125 = fieldNorm(doc=4228)
        2.3610373 = weight(author_txt:niemi in 4228) [ClassicSimilarity], result of:
          2.3610373 = score(doc=4228,freq=1.0), product of:
            0.8055383 = queryWeight, product of:
              1.165958 = boost
              9.379218 = idf(docFreq=9, maxDocs=43556)
              0.073660836 = queryNorm
            2.9310057 = fieldWeight in 4228, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.379218 = idf(docFreq=9, maxDocs=43556)
              0.3125 = fieldNorm(doc=4228)
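
The author scores above can be re-derived by hand. The following is a minimal sketch, assuming the trees are explain output from Lucene's ClassicSimilarity (as the labels suggest), with idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq). The function names `idf` and `term_score` are hypothetical helpers, not part of any library. It recomputes the score of the first hit (doc=4227) from the values listed in its tree.

```python
import math

def idf(doc_freq, max_docs):
    # Assumed ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    # score = queryWeight * fieldWeight, as in each "product of" node
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm           # boost * idf * queryNorm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight

# Values copied from the explanation tree of hit 1 (doc=4227)
MAX_DOCS = 43556
QUERY_NORM = 0.073660836

jarvelin = term_score(1.0, 37, MAX_DOCS, QUERY_NORM, 0.5)
niemi = term_score(1.0, 9, MAX_DOCS, QUERY_NORM, 0.5, boost=1.165958)
total = jarvelin + niemi

print(round(jarvelin, 4), round(niemi, 4), round(total, 4))
```

The fieldNorm values (0.5, 0.375, 0.3125) shrink as the author field gets longer, which is why hits with more co-authors rank lower even though the matched terms are the same.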
    

Similar documents (content)

  1. Chang, Y.; Ounis, I.; Kim, M.: Query reformulation using automatically generated query concepts from a document space (2006) 0.12
    0.117436305 = sum of:
      0.117436305 = product of:
        0.5871815 = sum of:
          0.27240813 = weight(abstract_txt:primitive in 2970) [ClassicSimilarity], result of:
            0.27240813 = score(doc=2970,freq=4.0), product of:
              0.19427903 = queryWeight, product of:
                1.1362615 = boost
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.019053446 = queryNorm
              1.402149 = fieldWeight in 2970, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.973753 = idf(docFreq=14, maxDocs=43556)
                0.078125 = fieldNorm(doc=2970)
          0.034199525 = weight(abstract_txt:evaluation in 2970) [ClassicSimilarity], result of:
            0.034199525 = score(doc=2970,freq=1.0), product of:
              0.09742175 = queryWeight, product of:
                1.137911 = boost
                4.49339 = idf(docFreq=1323, maxDocs=43556)
                0.019053446 = queryNorm
              0.3510461 = fieldWeight in 2970, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.49339 = idf(docFreq=1323, maxDocs=43556)
                0.078125 = fieldNorm(doc=2970)
          0.13486816 = weight(abstract_txt:query in 2970) [ClassicSimilarity], result of:
            0.13486816 = score(doc=2970,freq=5.0), product of:
              0.16278808 = queryWeight, product of:
                1.8015122 = boost
                4.742549 = idf(docFreq=1031, maxDocs=43556)
                0.019053446 = queryNorm
              0.8284892 = fieldWeight in 2970, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.742549 = idf(docFreq=1031, maxDocs=43556)
                0.078125 = fieldNorm(doc=2970)
          0.074601874 = weight(abstract_txt:documents in 2970) [ClassicSimilarity], result of:
            0.074601874 = score(doc=2970,freq=2.0), product of:
              0.16386104 = queryWeight, product of:
                2.0870514 = boost
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.019053446 = queryNorm
              0.45527524 = fieldWeight in 2970, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.078125 = fieldNorm(doc=2970)
          0.07110379 = weight(abstract_txt:data in 2970) [ClassicSimilarity], result of:
            0.07110379 = score(doc=2970,freq=1.0), product of:
              0.2713695 = queryWeight, product of:
                4.2466435 = boost
                3.3538349 = idf(docFreq=4137, maxDocs=43556)
                0.019053446 = queryNorm
              0.26201835 = fieldWeight in 2970, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3538349 = idf(docFreq=4137, maxDocs=43556)
                0.078125 = fieldNorm(doc=2970)
        0.2 = coord(5/25)
    
  2. Mayr, P.; Mutschke, P.; Petras, V.: Reducing semantic complexity in distributed digital libraries : Treatment of term vagueness and document re-ranking (2008) 0.11
    0.10551131 = sum of:
      0.10551131 = product of:
        0.43963045 = sum of:
          0.02494971 = weight(abstract_txt:several in 3907) [ClassicSimilarity], result of:
            0.02494971 = score(doc=3907,freq=1.0), product of:
              0.10014306 = queryWeight, product of:
                1.1536943 = boost
                4.5557156 = idf(docFreq=1243, maxDocs=43556)
                0.019053446 = queryNorm
              0.2491407 = fieldWeight in 3907, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5557156 = idf(docFreq=1243, maxDocs=43556)
                0.0546875 = fieldNorm(doc=3907)
          0.043455526 = weight(abstract_txt:construction in 3907) [ClassicSimilarity], result of:
            0.043455526 = score(doc=3907,freq=1.0), product of:
              0.14496857 = queryWeight, product of:
                1.3880887 = boost
                5.4812937 = idf(docFreq=492, maxDocs=43556)
                0.019053446 = queryNorm
              0.29975826 = fieldWeight in 3907, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4812937 = idf(docFreq=492, maxDocs=43556)
                0.0546875 = fieldNorm(doc=3907)
          0.05970868 = weight(abstract_txt:query in 3907) [ClassicSimilarity], result of:
            0.05970868 = score(doc=3907,freq=2.0), product of:
              0.16278808 = queryWeight, product of:
                1.8015122 = boost
                4.742549 = idf(docFreq=1031, maxDocs=43556)
                0.019053446 = queryNorm
              0.3667878 = fieldWeight in 3907, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.742549 = idf(docFreq=1031, maxDocs=43556)
                0.0546875 = fieldNorm(doc=3907)
          0.21605574 = weight(abstract_txt:heterogeneity in 3907) [ClassicSimilarity], result of:
            0.21605574 = score(doc=3907,freq=3.0), product of:
              0.29280645 = queryWeight, product of:
                1.9727435 = boost
                7.7899823 = idf(docFreq=48, maxDocs=43556)
                0.019053446 = queryNorm
              0.73787904 = fieldWeight in 3907, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.7899823 = idf(docFreq=48, maxDocs=43556)
                0.0546875 = fieldNorm(doc=3907)
          0.058534723 = weight(abstract_txt:integration in 3907) [ClassicSimilarity], result of:
            0.058534723 = score(doc=3907,freq=1.0), product of:
              0.20240286 = queryWeight, product of:
                2.0087886 = boost
                5.288212 = idf(docFreq=597, maxDocs=43556)
                0.019053446 = queryNorm
              0.28919908 = fieldWeight in 3907, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.288212 = idf(docFreq=597, maxDocs=43556)
                0.0546875 = fieldNorm(doc=3907)
          0.036926046 = weight(abstract_txt:documents in 3907) [ClassicSimilarity], result of:
            0.036926046 = score(doc=3907,freq=1.0), product of:
              0.16386104 = queryWeight, product of:
                2.0870514 = boost
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.019053446 = queryNorm
              0.22534975 = fieldWeight in 3907, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.0546875 = fieldNorm(doc=3907)
        0.24 = coord(6/25)
    
  3. Aldana, J.F.; Gómez, A.C.; Moreno, N.; Nebro, A.J.; Roldán, M.M.: Metadata functionality for semantic Web integration (2003) 0.10
    0.1002872 = sum of:
      0.1002872 = product of:
        0.3133975 = sum of:
          0.05570658 = weight(abstract_txt:autonomous in 3729) [ClassicSimilarity], result of:
            0.05570658 = score(doc=3729,freq=1.0), product of:
              0.15047674 = queryWeight, product of:
                7.897613 = idf(docFreq=43, maxDocs=43556)
                0.019053446 = queryNorm
              0.3702006 = fieldWeight in 3729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.897613 = idf(docFreq=43, maxDocs=43556)
                0.046875 = fieldNorm(doc=3729)
          0.026425868 = weight(abstract_txt:context in 3729) [ClassicSimilarity], result of:
            0.026425868 = score(doc=3729,freq=2.0), product of:
              0.09152742 = queryWeight, product of:
                1.1029502 = boost
                4.355337 = idf(docFreq=1519, maxDocs=43556)
                0.019053446 = queryNorm
              0.2887208 = fieldWeight in 3729, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.355337 = idf(docFreq=1519, maxDocs=43556)
                0.046875 = fieldNorm(doc=3729)
          0.018839661 = weight(abstract_txt:structure in 3729) [ClassicSimilarity], result of:
            0.018839661 = score(doc=3729,freq=1.0), product of:
              0.092028804 = queryWeight, product of:
                1.105967 = boost
                4.36725 = idf(docFreq=1501, maxDocs=43556)
                0.019053446 = queryNorm
              0.20471483 = fieldWeight in 3729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.36725 = idf(docFreq=1501, maxDocs=43556)
                0.046875 = fieldNorm(doc=3729)
          0.020519715 = weight(abstract_txt:evaluation in 3729) [ClassicSimilarity], result of:
            0.020519715 = score(doc=3729,freq=1.0), product of:
              0.09742175 = queryWeight, product of:
                1.137911 = boost
                4.49339 = idf(docFreq=1323, maxDocs=43556)
                0.019053446 = queryNorm
              0.21062766 = fieldWeight in 3729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.49339 = idf(docFreq=1323, maxDocs=43556)
                0.046875 = fieldNorm(doc=3729)
          0.036188927 = weight(abstract_txt:query in 3729) [ClassicSimilarity], result of:
            0.036188927 = score(doc=3729,freq=1.0), product of:
              0.16278808 = queryWeight, product of:
                1.8015122 = boost
                4.742549 = idf(docFreq=1031, maxDocs=43556)
                0.019053446 = queryNorm
              0.22230698 = fieldWeight in 3729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.742549 = idf(docFreq=1031, maxDocs=43556)
                0.046875 = fieldNorm(doc=3729)
          0.05017262 = weight(abstract_txt:integration in 3729) [ClassicSimilarity], result of:
            0.05017262 = score(doc=3729,freq=1.0), product of:
              0.20240286 = queryWeight, product of:
                2.0087886 = boost
                5.288212 = idf(docFreq=597, maxDocs=43556)
                0.019053446 = queryNorm
              0.24788493 = fieldWeight in 3729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.288212 = idf(docFreq=597, maxDocs=43556)
                0.046875 = fieldNorm(doc=3729)
          0.031650893 = weight(abstract_txt:documents in 3729) [ClassicSimilarity], result of:
            0.031650893 = score(doc=3729,freq=1.0), product of:
              0.16386104 = queryWeight, product of:
                2.0870514 = boost
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.019053446 = queryNorm
              0.19315693 = fieldWeight in 3729, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.046875 = fieldNorm(doc=3729)
          0.07389322 = weight(abstract_txt:data in 3729) [ClassicSimilarity], result of:
            0.07389322 = score(doc=3729,freq=3.0), product of:
              0.2713695 = queryWeight, product of:
                4.2466435 = boost
                3.3538349 = idf(docFreq=4137, maxDocs=43556)
                0.019053446 = queryNorm
              0.27229744 = fieldWeight in 3729, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3538349 = idf(docFreq=4137, maxDocs=43556)
                0.046875 = fieldNorm(doc=3729)
        0.32 = coord(8/25)
    
  4. Mothe, J.; Chrisment, C.; Dousset, B.; Alaux, J.: DocCube : Multi-dimensional visualisation and exploration of large document sets (2003) 0.10
    0.098724976 = sum of:
      0.098724976 = product of:
        0.6170311 = sum of:
          0.037679322 = weight(abstract_txt:structure in 2611) [ClassicSimilarity], result of:
            0.037679322 = score(doc=2611,freq=1.0), product of:
              0.092028804 = queryWeight, product of:
                1.105967 = boost
                4.36725 = idf(docFreq=1501, maxDocs=43556)
                0.019053446 = queryNorm
              0.40942967 = fieldWeight in 2611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.36725 = idf(docFreq=1501, maxDocs=43556)
                0.09375 = fieldNorm(doc=2611)
          0.07237785 = weight(abstract_txt:query in 2611) [ClassicSimilarity], result of:
            0.07237785 = score(doc=2611,freq=1.0), product of:
              0.16278808 = queryWeight, product of:
                1.8015122 = boost
                4.742549 = idf(docFreq=1031, maxDocs=43556)
                0.019053446 = queryNorm
              0.44461396 = fieldWeight in 2611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.742549 = idf(docFreq=1031, maxDocs=43556)
                0.09375 = fieldNorm(doc=2611)
          0.08952226 = weight(abstract_txt:documents in 2611) [ClassicSimilarity], result of:
            0.08952226 = score(doc=2611,freq=2.0), product of:
              0.16386104 = queryWeight, product of:
                2.0870514 = boost
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.019053446 = queryNorm
              0.54633033 = fieldWeight in 2611, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.09375 = fieldNorm(doc=2611)
          0.41745168 = weight(abstract_txt:olap in 2611) [ClassicSimilarity], result of:
            0.41745168 = score(doc=2611,freq=1.0), product of:
              0.45736107 = queryWeight, product of:
                2.4655278 = boost
                9.735892 = idf(docFreq=6, maxDocs=43556)
                0.019053446 = queryNorm
              0.9127399 = fieldWeight in 2611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.735892 = idf(docFreq=6, maxDocs=43556)
                0.09375 = fieldNorm(doc=2611)
        0.16 = coord(4/25)
    
  5. Special topic issue: XML (2002) 0.10
    0.09722611 = sum of:
      0.09722611 = product of:
        0.48613057 = sum of:
          0.025119549 = weight(abstract_txt:structure in 1456) [ClassicSimilarity], result of:
            0.025119549 = score(doc=1456,freq=1.0), product of:
              0.092028804 = queryWeight, product of:
                1.105967 = boost
                4.36725 = idf(docFreq=1501, maxDocs=43556)
                0.019053446 = queryNorm
              0.27295312 = fieldWeight in 1456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.36725 = idf(docFreq=1501, maxDocs=43556)
                0.0625 = fieldNorm(doc=1456)
          0.25264627 = weight(abstract_txt:xquery in 1456) [ClassicSimilarity], result of:
            0.25264627 = score(doc=1456,freq=3.0), product of:
              0.2359794 = queryWeight, product of:
                1.2522826 = boost
                9.890043 = idf(docFreq=5, maxDocs=43556)
                0.019053446 = queryNorm
              1.0706285 = fieldWeight in 1456, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.890043 = idf(docFreq=5, maxDocs=43556)
                0.0625 = fieldNorm(doc=1456)
          0.06823849 = weight(abstract_txt:query in 1456) [ClassicSimilarity], result of:
            0.06823849 = score(doc=1456,freq=2.0), product of:
              0.16278808 = queryWeight, product of:
                1.8015122 = boost
                4.742549 = idf(docFreq=1031, maxDocs=43556)
                0.019053446 = queryNorm
              0.41918606 = fieldWeight in 1456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.742549 = idf(docFreq=1031, maxDocs=43556)
                0.0625 = fieldNorm(doc=1456)
          0.0596815 = weight(abstract_txt:documents in 1456) [ClassicSimilarity], result of:
            0.0596815 = score(doc=1456,freq=2.0), product of:
              0.16386104 = queryWeight, product of:
                2.0870514 = boost
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.019053446 = queryNorm
              0.3642202 = fieldWeight in 1456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1206813 = idf(docFreq=1921, maxDocs=43556)
                0.0625 = fieldNorm(doc=1456)
          0.080444746 = weight(abstract_txt:data in 1456) [ClassicSimilarity], result of:
            0.080444746 = score(doc=1456,freq=2.0), product of:
              0.2713695 = queryWeight, product of:
                4.2466435 = boost
                3.3538349 = idf(docFreq=4137, maxDocs=43556)
                0.019053446 = queryNorm
              0.29643992 = fieldWeight in 1456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3538349 = idf(docFreq=4137, maxDocs=43556)
                0.0625 = fieldNorm(doc=1456)
        0.2 = coord(5/25)
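
The content scores add one step that the author scores lack: the sum of the per-term scores is multiplied by a coordination factor, coord(matched/total), which rewards documents that match more of the 25 query terms. The sketch below, under the same assumption that this is ClassicSimilarity explain output, reproduces the final score of hit 4 (doc=2611) from the per-term scores listed in its tree. The names `coord`, `term_scores`, and `QUERY_TERMS` are illustrative only.

```python
import math

def coord(matched, total):
    # Fraction of query terms that occur in the document
    return matched / total

# Per-term scores copied from the explanation tree of hit 4 (doc=2611)
term_scores = {
    "structure": 0.037679322,
    "query": 0.07237785,
    "documents": 0.08952226,
    "olap": 0.41745168,
}
QUERY_TERMS = 25  # denominator shown in coord(4/25)

raw_sum = sum(term_scores.values())
final_score = raw_sum * coord(len(term_scores), QUERY_TERMS)

# tf is the square root of the raw term frequency, e.g. freq=2.0 for "documents"
tf_documents = math.sqrt(2.0)

print(round(raw_sum, 6), round(final_score, 6), round(tf_documents, 6))
```

In this hit the rare term "olap" (docFreq=6) contributes about two thirds of the raw sum, yet the low coord of 4/25 keeps the final score close to that of the other hits.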