Document (#34626)

Author
Heidorn, P.B.
Wei, Q.
Title
Automatic metadata extraction from museum specimen labels
Source
Metadata for semantic and social applications : proceedings of the International Conference on Dublin Core and Metadata Applications, Berlin, 22 - 26 September 2008, DC 2008: Berlin, Germany / ed. by Jane Greenberg and Wolfgang Klas
Imprint
Göttingen : Univ.-Verl.
Year
2008
Pages
pp. 57-68
Abstract
This paper describes the information properties of museum specimen labels and machine learning tools to automatically extract Darwin Core (DwC) and other metadata from these labels processed through Optical Character Recognition (OCR). The DwC is a metadata profile describing the core set of access points for search and retrieval of natural history collections and observation databases. Using the HERBIS Learning System (HLS), we extract 74 independent elements from these labels. The automated text extraction tools are provided as a web service so that users can reference digital images of specimens and receive back an extended Darwin Core XML representation of the content of the label. This automated extraction task is made more difficult by the high variability of museum label formats, OCR errors, and the open-class nature of some elements. In this paper we introduce our overall system architecture and variability-robust solutions, including the application of Hidden Markov and Naïve Bayes machine learning models, data cleaning, use of field element identifiers, and specialist learning models. The techniques developed here could be adapted to any metadata extraction situation with noisy text and weakly ordered elements.
Content
Cf.: http://dcpapers.dublincore.org/ojs/pubs/article/view/919/915.
Theme
Metadata
Area
Museums

Similar documents (author)

  1. Heidorn, P.B.: The identification of index terms in natural language object descriptions (1999) 5.92
    5.9235125 = sum of:
      5.9235125 = weight(author_txt:heidorn in 1682) [ClassicSimilarity], result of:
        5.9235125 = fieldWeight in 1682, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.47762 = idf(docFreq=8, maxDocs=43254)
          0.625 = fieldNorm(doc=1682)
    
  2. Heidorn, P.B.: Image retrieval as linguistic and nonlinguistic visual model matching (1999) 5.92
    5.9235125 = sum of:
      5.9235125 = weight(author_txt:heidorn in 1967) [ClassicSimilarity], result of:
        5.9235125 = fieldWeight in 1967, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.47762 = idf(docFreq=8, maxDocs=43254)
          0.625 = fieldNorm(doc=1967)
    
  3. Cui, H.; Heidorn, P.B.: The reusability of induced knowledge for the automatic semantic markup of taxonomic descriptions (2007) 4.74
    4.73881 = sum of:
      4.73881 = weight(author_txt:heidorn in 2085) [ClassicSimilarity], result of:
        4.73881 = fieldWeight in 2085, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.47762 = idf(docFreq=8, maxDocs=43254)
          0.5 = fieldNorm(doc=2085)
    
  4. Jensen, K.; Heidorn, G.E.; Richardson, S.D.: Natural language processing : the PLNLP approach (19??) 3.55
    3.5541077 = sum of:
      3.5541077 = weight(author_txt:heidorn in 6364) [ClassicSimilarity], result of:
        3.5541077 = fieldWeight in 6364, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.47762 = idf(docFreq=8, maxDocs=43254)
          0.375 = fieldNorm(doc=6364)
    
  5. Koshman, S.; Heidorn, B.; Kim, H.: ACM SIGIR '93 provides information retrieval roundup (1993) 3.55
    3.5541077 = sum of:
      3.5541077 = weight(author_txt:heidorn in 693) [ClassicSimilarity], result of:
        3.5541077 = fieldWeight in 693, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.47762 = idf(docFreq=8, maxDocs=43254)
          0.375 = fieldNorm(doc=693)
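
The author scores listed above can be reproduced by hand. The following is a minimal sketch, assuming the classic TF-IDF formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); these formulas are an assumption that happens to agree with the displayed values, and the function names are illustrative, not part of any library.

```python
import math

def classic_tf(freq: float) -> float:
    # Assumed term-frequency factor: square root of the raw frequency.
    return math.sqrt(freq)

def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Assumed inverse document frequency, consistent with
    # "9.47762 = idf(docFreq=8, maxDocs=43254)" in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight is the product of tf, idf and fieldNorm.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values taken from entry 1 (doc 1682): freq=1.0, docFreq=8, fieldNorm=0.625.
author_idf = classic_idf(8, 43254)
author_score = field_weight(1.0, 8, 43254, 0.625)
print(f"idf={author_idf:.5f} score={author_score:.4f}")
```

The only factor that differs between the five author entries is fieldNorm (0.625, 0.5, 0.375), which is why entries with longer author fields receive lower scores for the same single match.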
    

Similar documents (content)

  1. Billal, B.; Fonseca, A.; Sadat, F.; Lounis, H.: Semi-supervised learning and social media text analysis towards multi-labeling categorization (2017) 0.21
    0.21282744 = sum of:
      0.21282744 = product of:
        0.6650858 = sum of:
          0.028098311 = weight(abstract_txt:text in 96) [ClassicSimilarity], result of:
            0.028098311 = score(doc=96,freq=4.0), product of:
              0.06343592 = queryWeight, product of:
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.015664203 = queryNorm
              0.4429401 = fieldWeight in 96, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.0546875 = fieldNorm(doc=96)
          0.059191145 = weight(abstract_txt:noisy in 96) [ClassicSimilarity], result of:
            0.059191145 = score(doc=96,freq=1.0), product of:
              0.13133977 = queryWeight, product of:
                1.0174557 = boost
                8.240858 = idf(docFreq=30, maxDocs=43254)
                0.015664203 = queryNorm
              0.4506719 = fieldWeight in 96, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.240858 = idf(docFreq=30, maxDocs=43254)
                0.0546875 = fieldNorm(doc=96)
          0.0068213423 = weight(abstract_txt:from in 96) [ClassicSimilarity], result of:
            0.0068213423 = score(doc=96,freq=1.0), product of:
              0.044858567 = queryWeight, product of:
                1.0299134 = boost
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.015664203 = queryNorm
              0.15206331 = fieldWeight in 96, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.0546875 = fieldNorm(doc=96)
          0.031858157 = weight(abstract_txt:machine in 96) [ClassicSimilarity], result of:
            0.031858157 = score(doc=96,freq=1.0), product of:
              0.10949195 = queryWeight, product of:
                1.3137826 = boost
                5.320475 = idf(docFreq=574, maxDocs=43254)
                0.015664203 = queryNorm
              0.29096347 = fieldWeight in 96, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.320475 = idf(docFreq=574, maxDocs=43254)
                0.0546875 = fieldNorm(doc=96)
          0.16494188 = weight(abstract_txt:label in 96) [ClassicSimilarity], result of:
            0.16494188 = score(doc=96,freq=4.0), product of:
              0.20642821 = queryWeight, product of:
                1.8039185 = boost
                7.305397 = idf(docFreq=78, maxDocs=43254)
                0.015664203 = queryNorm
              0.7990278 = fieldWeight in 96, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.305397 = idf(docFreq=78, maxDocs=43254)
                0.0546875 = fieldNorm(doc=96)
          0.10432377 = weight(abstract_txt:learning in 96) [ClassicSimilarity], result of:
            0.10432377 = score(doc=96,freq=5.0), product of:
              0.17790052 = queryWeight, product of:
                2.3682961 = boost
                4.7954893 = idf(docFreq=971, maxDocs=43254)
                0.015664203 = queryNorm
              0.5864163 = fieldWeight in 96, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7954893 = idf(docFreq=971, maxDocs=43254)
                0.0546875 = fieldNorm(doc=96)
          0.101364724 = weight(abstract_txt:extraction in 96) [ClassicSimilarity], result of:
            0.101364724 = score(doc=96,freq=1.0), product of:
              0.29842576 = queryWeight, product of:
                3.0673656 = boost
                6.2110133 = idf(docFreq=235, maxDocs=43254)
                0.015664203 = queryNorm
              0.3396648 = fieldWeight in 96, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2110133 = idf(docFreq=235, maxDocs=43254)
                0.0546875 = fieldNorm(doc=96)
          0.16848645 = weight(abstract_txt:labels in 96) [ClassicSimilarity], result of:
            0.16848645 = score(doc=96,freq=1.0), product of:
              0.41875026 = queryWeight, product of:
                3.633498 = boost
                7.357357 = idf(docFreq=74, maxDocs=43254)
                0.015664203 = queryNorm
              0.40235546 = fieldWeight in 96, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.357357 = idf(docFreq=74, maxDocs=43254)
                0.0546875 = fieldNorm(doc=96)
        0.32 = coord(8/25)
    
  2. Ru, C.; Tang, J.; Li, S.; Xie, S.; Wang, T.: Using semantic similarity to reduce wrong labels in distant supervision for relation extraction (2018) 0.16
    0.16133073 = sum of:
      0.16133073 = product of:
        0.8066536 = sum of:
          0.011024954 = weight(abstract_txt:from in 56) [ClassicSimilarity], result of:
            0.011024954 = score(doc=56,freq=2.0), product of:
              0.044858567 = queryWeight, product of:
                1.0299134 = boost
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.015664203 = queryNorm
              0.24577142 = fieldWeight in 56, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.0625 = fieldNorm(doc=56)
          0.024779787 = weight(abstract_txt:models in 56) [ClassicSimilarity], result of:
            0.024779787 = score(doc=56,freq=1.0), product of:
              0.08471731 = queryWeight, product of:
                1.1556292 = boost
                4.679995 = idf(docFreq=1090, maxDocs=43254)
                0.015664203 = queryNorm
              0.2924997 = fieldWeight in 56, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.679995 = idf(docFreq=1090, maxDocs=43254)
                0.0625 = fieldNorm(doc=56)
          0.10858989 = weight(abstract_txt:core in 56) [ClassicSimilarity], result of:
            0.10858989 = score(doc=56,freq=4.0), product of:
              0.16359767 = queryWeight, product of:
                1.9668288 = boost
                5.3100944 = idf(docFreq=580, maxDocs=43254)
                0.015664203 = queryNorm
              0.6637618 = fieldWeight in 56, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.3100944 = idf(docFreq=580, maxDocs=43254)
                0.0625 = fieldNorm(doc=56)
          0.2316908 = weight(abstract_txt:extraction in 56) [ClassicSimilarity], result of:
            0.2316908 = score(doc=56,freq=4.0), product of:
              0.29842576 = queryWeight, product of:
                3.0673656 = boost
                6.2110133 = idf(docFreq=235, maxDocs=43254)
                0.015664203 = queryNorm
              0.77637666 = fieldWeight in 56, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2110133 = idf(docFreq=235, maxDocs=43254)
                0.0625 = fieldNorm(doc=56)
          0.43056822 = weight(abstract_txt:labels in 56) [ClassicSimilarity], result of:
            0.43056822 = score(doc=56,freq=5.0), product of:
              0.41875026 = queryWeight, product of:
                3.633498 = boost
                7.357357 = idf(docFreq=74, maxDocs=43254)
                0.015664203 = queryNorm
              1.028222 = fieldWeight in 56, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.357357 = idf(docFreq=74, maxDocs=43254)
                0.0625 = fieldNorm(doc=56)
        0.2 = coord(5/25)
    
  3. Cui, H.: Competency evaluation of plant character ontologies against domain literature (2010) 0.13
    0.13054886 = sum of:
      0.13054886 = product of:
        0.46624592 = sum of:
          0.027810115 = weight(abstract_txt:text in 467) [ClassicSimilarity], result of:
            0.027810115 = score(doc=467,freq=3.0), product of:
              0.06343592 = queryWeight, product of:
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.015664203 = queryNorm
              0.438397 = fieldWeight in 467, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.049738 = idf(docFreq=2048, maxDocs=43254)
                0.0625 = fieldNorm(doc=467)
          0.015591639 = weight(abstract_txt:from in 467) [ClassicSimilarity], result of:
            0.015591639 = score(doc=467,freq=4.0), product of:
              0.044858567 = queryWeight, product of:
                1.0299134 = boost
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.015664203 = queryNorm
              0.34757328 = fieldWeight in 467, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.0625 = fieldNorm(doc=467)
          0.021388391 = weight(abstract_txt:tools in 467) [ClassicSimilarity], result of:
            0.021388391 = score(doc=467,freq=1.0), product of:
              0.076799646 = queryWeight, product of:
                1.1003022 = boost
                4.4559355 = idf(docFreq=1364, maxDocs=43254)
                0.015664203 = queryNorm
              0.27849597 = fieldWeight in 467, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4559355 = idf(docFreq=1364, maxDocs=43254)
                0.0625 = fieldNorm(doc=467)
          0.09950948 = weight(abstract_txt:specimens in 467) [ClassicSimilarity], result of:
            0.09950948 = score(doc=467,freq=1.0), product of:
              0.16987915 = queryWeight, product of:
                1.1571441 = boost
                9.37226 = idf(docFreq=9, maxDocs=43254)
                0.015664203 = queryNorm
              0.58576626 = fieldWeight in 467, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.37226 = idf(docFreq=9, maxDocs=43254)
                0.0625 = fieldNorm(doc=467)
          0.036409326 = weight(abstract_txt:machine in 467) [ClassicSimilarity], result of:
            0.036409326 = score(doc=467,freq=1.0), product of:
              0.10949195 = queryWeight, product of:
                1.3137826 = boost
                5.320475 = idf(docFreq=574, maxDocs=43254)
                0.015664203 = queryNorm
              0.3325297 = fieldWeight in 467, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.320475 = idf(docFreq=574, maxDocs=43254)
                0.0625 = fieldNorm(doc=467)
          0.042920463 = weight(abstract_txt:automated in 467) [ClassicSimilarity], result of:
            0.042920463 = score(doc=467,freq=1.0), product of:
              0.12218467 = queryWeight, product of:
                1.3878443 = boost
                5.6204057 = idf(docFreq=425, maxDocs=43254)
                0.015664203 = queryNorm
              0.35127535 = fieldWeight in 467, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6204057 = idf(docFreq=425, maxDocs=43254)
                0.0625 = fieldNorm(doc=467)
          0.2226165 = weight(abstract_txt:specimen in 467) [ClassicSimilarity], result of:
            0.2226165 = score(doc=467,freq=1.0), product of:
              0.36611035 = queryWeight, product of:
                2.4023616 = boost
                9.728935 = idf(docFreq=6, maxDocs=43254)
                0.015664203 = queryNorm
              0.60805845 = fieldWeight in 467, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.728935 = idf(docFreq=6, maxDocs=43254)
                0.0625 = fieldNorm(doc=467)
        0.28 = coord(7/25)
    
  4. Hooland, S. van; Verborgh, R.: Linked data for libraries, archives and museums : how to clean, link, and publish your metadata (2014) 0.12
    0.12137725 = sum of:
      0.12137725 = product of:
        0.4334902 = sum of:
          0.008268716 = weight(abstract_txt:from in 154) [ClassicSimilarity], result of:
            0.008268716 = score(doc=154,freq=2.0), product of:
              0.044858567 = queryWeight, product of:
                1.0299134 = boost
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.015664203 = queryNorm
              0.18432857 = fieldWeight in 154, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.046875 = fieldNorm(doc=154)
          0.10448564 = weight(abstract_txt:cleaning in 154) [ClassicSimilarity], result of:
            0.10448564 = score(doc=154,freq=3.0), product of:
              0.14740774 = queryWeight, product of:
                1.0778977 = boost
                8.730406 = idf(docFreq=18, maxDocs=43254)
                0.015664203 = queryNorm
              0.7088206 = fieldWeight in 154, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.730406 = idf(docFreq=18, maxDocs=43254)
                0.046875 = fieldNorm(doc=154)
          0.022685817 = weight(abstract_txt:tools in 154) [ClassicSimilarity], result of:
            0.022685817 = score(doc=154,freq=2.0), product of:
              0.076799646 = queryWeight, product of:
                1.1003022 = boost
                4.4559355 = idf(docFreq=1364, maxDocs=43254)
                0.015664203 = queryNorm
              0.2953896 = fieldWeight in 154, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4559355 = idf(docFreq=1364, maxDocs=43254)
                0.046875 = fieldNorm(doc=154)
          0.018584842 = weight(abstract_txt:models in 154) [ClassicSimilarity], result of:
            0.018584842 = score(doc=154,freq=1.0), product of:
              0.08471731 = queryWeight, product of:
                1.1556292 = boost
                4.679995 = idf(docFreq=1090, maxDocs=43254)
                0.015664203 = queryNorm
              0.21937478 = fieldWeight in 154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.679995 = idf(docFreq=1090, maxDocs=43254)
                0.046875 = fieldNorm(doc=154)
          0.03219035 = weight(abstract_txt:automated in 154) [ClassicSimilarity], result of:
            0.03219035 = score(doc=154,freq=1.0), product of:
              0.12218467 = queryWeight, product of:
                1.3878443 = boost
                5.6204057 = idf(docFreq=425, maxDocs=43254)
                0.015664203 = queryNorm
              0.26345652 = fieldWeight in 154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6204057 = idf(docFreq=425, maxDocs=43254)
                0.046875 = fieldNorm(doc=154)
          0.07118493 = weight(abstract_txt:museum in 154) [ClassicSimilarity], result of:
            0.07118493 = score(doc=154,freq=1.0), product of:
              0.23740439 = queryWeight, product of:
                2.3693128 = boost
                6.3967304 = idf(docFreq=195, maxDocs=43254)
                0.015664203 = queryNorm
              0.29984674 = fieldWeight in 154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3967304 = idf(docFreq=195, maxDocs=43254)
                0.046875 = fieldNorm(doc=154)
          0.17608988 = weight(abstract_txt:metadata in 154) [ClassicSimilarity], result of:
            0.17608988 = score(doc=154,freq=17.0), product of:
              0.18587297 = queryWeight, product of:
                2.4207811 = boost
                4.9017644 = idf(docFreq=873, maxDocs=43254)
                0.015664203 = queryNorm
              0.9473668 = fieldWeight in 154, product of:
                4.1231055 = tf(freq=17.0), with freq of:
                  17.0 = termFreq=17.0
                4.9017644 = idf(docFreq=873, maxDocs=43254)
                0.046875 = fieldNorm(doc=154)
        0.28 = coord(7/25)
    
  5. Lagoze, C.: Keeping Dublin Core simple : Cross-domain discovery or resource description? (2001) 0.12
    0.11830213 = sum of:
      0.11830213 = product of:
        0.36969417 = sum of:
          0.012927905 = weight(abstract_txt:from in 3217) [ClassicSimilarity], result of:
            0.012927905 = score(doc=3217,freq=11.0), product of:
              0.044858567 = queryWeight, product of:
                1.0299134 = boost
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.015664203 = queryNorm
              0.28819254 = fieldWeight in 3217, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                2.7805862 = idf(docFreq=7289, maxDocs=43254)
                0.03125 = fieldNorm(doc=3217)
          0.0106941955 = weight(abstract_txt:tools in 3217) [ClassicSimilarity], result of:
            0.0106941955 = score(doc=3217,freq=1.0), product of:
              0.076799646 = queryWeight, product of:
                1.1003022 = boost
                4.4559355 = idf(docFreq=1364, maxDocs=43254)
                0.015664203 = queryNorm
              0.13924798 = fieldWeight in 3217, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4559355 = idf(docFreq=1364, maxDocs=43254)
                0.03125 = fieldNorm(doc=3217)
          0.017521955 = weight(abstract_txt:models in 3217) [ClassicSimilarity], result of:
            0.017521955 = score(doc=3217,freq=2.0), product of:
              0.08471731 = queryWeight, product of:
                1.1556292 = boost
                4.679995 = idf(docFreq=1090, maxDocs=43254)
                0.015664203 = queryNorm
              0.2068285 = fieldWeight in 3217, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.679995 = idf(docFreq=1090, maxDocs=43254)
                0.03125 = fieldNorm(doc=3217)
          0.030349351 = weight(abstract_txt:automated in 3217) [ClassicSimilarity], result of:
            0.030349351 = score(doc=3217,freq=2.0), product of:
              0.12218467 = queryWeight, product of:
                1.3878443 = boost
                5.6204057 = idf(docFreq=425, maxDocs=43254)
                0.015664203 = queryNorm
              0.24838918 = fieldWeight in 3217, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6204057 = idf(docFreq=425, maxDocs=43254)
                0.03125 = fieldNorm(doc=3217)
          0.053162836 = weight(abstract_txt:elements in 3217) [ClassicSimilarity], result of:
            0.053162836 = score(doc=3217,freq=4.0), product of:
              0.16131558 = queryWeight, product of:
                1.9530625 = boost
                5.2729278 = idf(docFreq=602, maxDocs=43254)
                0.015664203 = queryNorm
              0.32955799 = fieldWeight in 3217, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.2729278 = idf(docFreq=602, maxDocs=43254)
                0.03125 = fieldNorm(doc=3217)
          0.07678464 = weight(abstract_txt:core in 3217) [ClassicSimilarity], result of:
            0.07678464 = score(doc=3217,freq=8.0), product of:
              0.16359767 = queryWeight, product of:
                1.9668288 = boost
                5.3100944 = idf(docFreq=580, maxDocs=43254)
                0.015664203 = queryNorm
              0.46935046 = fieldWeight in 3217, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.3100944 = idf(docFreq=580, maxDocs=43254)
                0.03125 = fieldNorm(doc=3217)
          0.047456622 = weight(abstract_txt:museum in 3217) [ClassicSimilarity], result of:
            0.047456622 = score(doc=3217,freq=1.0), product of:
              0.23740439 = queryWeight, product of:
                2.3693128 = boost
                6.3967304 = idf(docFreq=195, maxDocs=43254)
                0.015664203 = queryNorm
              0.19989783 = fieldWeight in 3217, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3967304 = idf(docFreq=195, maxDocs=43254)
                0.03125 = fieldNorm(doc=3217)
          0.120796666 = weight(abstract_txt:metadata in 3217) [ClassicSimilarity], result of:
            0.120796666 = score(doc=3217,freq=18.0), product of:
              0.18587297 = queryWeight, product of:
                2.4207811 = boost
                4.9017644 = idf(docFreq=873, maxDocs=43254)
                0.015664203 = queryNorm
              0.6498883 = fieldWeight in 3217, product of:
                4.2426405 = tf(freq=18.0), with freq of:
                  18.0 = termFreq=18.0
                4.9017644 = idf(docFreq=873, maxDocs=43254)
                0.03125 = fieldNorm(doc=3217)
        0.32 = coord(8/25)
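
The content scores above combine per-term weights with a coordination factor. As a rough sketch, the score of entry 2 (doc 56) can be recomputed from the numbers shown in its explanation tree, assuming that each term contributes queryWeight × fieldWeight, with queryWeight = boost × idf × queryNorm and fieldWeight = sqrt(freq) × idf × fieldNorm, and that the sum is then multiplied by coord = matched terms / query terms. The variable and function names are illustrative only.

```python
import math

MAX_DOCS = 43254          # maxDocs from the listing
QUERY_NORM = 0.015664203  # queryNorm from the listing
QUERY_TERMS = 25          # denominator of coord(5/25)

def idf(doc_freq: int) -> float:
    # Assumed idf formula, consistent with the displayed idf values.
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))

def term_score(freq: float, doc_freq: int, boost: float, field_norm: float) -> float:
    # One "weight(abstract_txt:term in doc)" node: queryWeight * fieldWeight.
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight

# (term, freq, docFreq, boost) copied from entry 2; fieldNorm is 0.0625 for doc 56.
TERMS_DOC_56 = [
    ("from", 2.0, 7289, 1.0299134),
    ("models", 1.0, 1090, 1.1556292),
    ("core", 4.0, 580, 1.9668288),
    ("extraction", 4.0, 235, 3.0673656),
    ("labels", 5.0, 74, 3.633498),
]

term_sum = sum(term_score(f, df, b, 0.0625) for _, f, df, b in TERMS_DOC_56)
coord = len(TERMS_DOC_56) / QUERY_TERMS   # 5 of 25 query terms matched
doc_score = term_sum * coord
print(f"sum={term_sum:.5f} coord={coord:.2f} score={doc_score:.5f}")
```

Because coord scales the whole sum, a document matching few query terms is penalised even when individual terms such as "labels" score highly.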