Document (#34625)

Author
Heidorn, P.B.
Wei, Q.
Title
Automatic metadata extraction from museum specimen labels
Source
Metadata for semantic and social applications : proceedings of the International Conference on Dublin Core and Metadata Applications, Berlin, 22 - 26 September 2008, DC 2008: Berlin, Germany / ed. by Jane Greenberg and Wolfgang Klas
Imprint
Göttingen : Univ.-Verl.
Year
2008
Pages
S.57-68
Abstract
This paper describes the information properties of museum specimen labels and machine learning tools to automatically extract Darwin Core (DwC) and other metadata from these labels processed through Optical Character Recognition (OCR). The DwC is a metadata profile describing the core set of access points for search and retrieval of natural history collections and observation databases. Using the HERBIS Learning System (HLS) we extract 74 independent elements from these labels. The automated text extraction tools are provided as a web service so that users can reference digital images of specimens and receive back an extended Darwin Core XML representation of the content of the label. This automated extraction task is made more difficult by the high variability of museum label formats, OCR errors and the open class nature of some elements. In this paper we introduce our overall system architecture, and variability robust solutions including, the application of Hidden Markov and Naïve Bayes machine learning models, data cleaning, use of field element identifiers, and specialist learning models. The techniques developed here could be adapted to any metadata extraction situation with noisy text and weakly ordered elements.
Content
Vgl. unter: http://dcpapers.dublincore.org/ojs/pubs/article/view/919/915.
Theme
Metadaten
Area
Museen

Similar documents (author)

  1. Heidorn, P.B.: ¬The identification of index terms in natural language object descriptions (1999) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:heidorn in 6681) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 6681, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=6681)
    
  2. Heidorn, P.B.: Image retrieval as linguistic and nonlinguistic visual model matching (1999) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:heidorn in 841) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 841, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=841)
    
  3. Cui, H.; Heidorn, P.B.: ¬The reusability of induced knowledge for the automatic semantic markup of taxonomic descriptions (2007) 4.75
    4.749831 = sum of:
      4.749831 = weight(author_txt:heidorn in 84) [ClassicSimilarity], result of:
        4.749831 = fieldWeight in 84, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.5 = fieldNorm(doc=84)
    
  4. Jensen, K.; Heidorn, G.E.; Richardson, S.D.: Natural language processing : the PLNLP approach (19??) 3.56
    3.5623734 = sum of:
      3.5623734 = weight(author_txt:heidorn in 5295) [ClassicSimilarity], result of:
        3.5623734 = fieldWeight in 5295, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.375 = fieldNorm(doc=5295)
    
  5. Koshman, S.; Heidorn, B.; Kim, H.: ACM SIGIR '93 provides information retrieval roundup (1993) 3.56
    3.5623734 = sum of:
      3.5623734 = weight(author_txt:heidorn in 5692) [ClassicSimilarity], result of:
        3.5623734 = fieldWeight in 5692, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.375 = fieldNorm(doc=5692)
    

Similar documents (content)

  1. Billal, B.; Fonseca, A.; Sadat, F.; Lounis, H.: Semi-supervised learning and social media text analysis towards multi-labeling categorization (2017) 0.21
    0.2091512 = sum of:
      0.2091512 = product of:
        0.6535975 = sum of:
          0.028208889 = weight(abstract_txt:text in 4095) [ClassicSimilarity], result of:
            0.028208889 = score(doc=4095,freq=4.0), product of:
              0.06377803 = queryWeight, product of:
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.015771545 = queryNorm
              0.4422979 = fieldWeight in 4095, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.058808126 = weight(abstract_txt:noisy in 4095) [ClassicSimilarity], result of:
            0.058808126 = score(doc=4095,freq=1.0), product of:
              0.131134 = queryWeight, product of:
                1.013928 = boost
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.015771545 = queryNorm
              0.44845825 = fieldWeight in 4095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.200379 = idf(docFreq=32, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.006754848 = weight(abstract_txt:from in 4095) [ClassicSimilarity], result of:
            0.006754848 = score(doc=4095,freq=1.0), product of:
              0.04468975 = queryWeight, product of:
                1.0252129 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.015771545 = queryNorm
              0.15114984 = fieldWeight in 4095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.03136914 = weight(abstract_txt:machine in 4095) [ClassicSimilarity], result of:
            0.03136914 = score(doc=4095,freq=1.0), product of:
              0.10866812 = queryWeight, product of:
                1.3053156 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.015771545 = queryNorm
              0.2886692 = fieldWeight in 4095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.16128233 = weight(abstract_txt:label in 4095) [ClassicSimilarity], result of:
            0.16128233 = score(doc=4095,freq=4.0), product of:
              0.20392555 = queryWeight, product of:
                1.7881349 = boost
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.015771545 = queryNorm
              0.7908883 = fieldWeight in 4095, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.102282375 = weight(abstract_txt:learning in 4095) [ClassicSimilarity], result of:
            0.102282375 = score(doc=4095,freq=5.0), product of:
              0.1760574 = queryWeight, product of:
                2.3496685 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.015771545 = queryNorm
              0.5809604 = fieldWeight in 4095, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.10124995 = weight(abstract_txt:extraction in 4095) [ClassicSimilarity], result of:
            0.10124995 = score(doc=4095,freq=1.0), product of:
              0.29902464 = queryWeight, product of:
                3.0621958 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.015771545 = queryNorm
              0.3386007 = fieldWeight in 4095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
          0.16364181 = weight(abstract_txt:labels in 4095) [ClassicSimilarity], result of:
            0.16364181 = score(doc=4095,freq=1.0), product of:
              0.41181922 = queryWeight, product of:
                3.593625 = boost
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.015771545 = queryNorm
              0.39736322 = fieldWeight in 4095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4095)
        0.32 = coord(8/25)
    
  2. Ru, C.; Tang, J.; Li, S.; Xie, S.; Wang, T.: Using semantic similarity to reduce wrong labels in distant supervision for relation extraction (2018) 0.16
    0.15873468 = sum of:
      0.15873468 = product of:
        0.7936734 = sum of:
          0.010917483 = weight(abstract_txt:from in 5055) [ClassicSimilarity], result of:
            0.010917483 = score(doc=5055,freq=2.0), product of:
              0.04468975 = queryWeight, product of:
                1.0252129 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.015771545 = queryNorm
              0.24429502 = fieldWeight in 5055, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.024375493 = weight(abstract_txt:models in 5055) [ClassicSimilarity], result of:
            0.024375493 = score(doc=5055,freq=1.0), product of:
              0.08402491 = queryWeight, product of:
                1.147806 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.015771545 = queryNorm
              0.2900984 = fieldWeight in 5055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.108764306 = weight(abstract_txt:core in 5055) [ClassicSimilarity], result of:
            0.108764306 = score(doc=5055,freq=4.0), product of:
              0.16422546 = queryWeight, product of:
                1.9653068 = boost
                5.298292 = idf(docFreq=600, maxDocs=44218)
                0.015771545 = queryNorm
              0.6622865 = fieldWeight in 5055, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.298292 = idf(docFreq=600, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.23142846 = weight(abstract_txt:extraction in 5055) [ClassicSimilarity], result of:
            0.23142846 = score(doc=5055,freq=4.0), product of:
              0.29902464 = queryWeight, product of:
                3.0621958 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.015771545 = queryNorm
              0.77394444 = fieldWeight in 5055, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.41818768 = weight(abstract_txt:labels in 5055) [ClassicSimilarity], result of:
            0.41818768 = score(doc=5055,freq=5.0), product of:
              0.41181922 = queryWeight, product of:
                3.593625 = boost
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.015771545 = queryNorm
              1.0154642 = fieldWeight in 5055, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
        0.2 = coord(5/25)
    
  3. Cui, H.: Competency evaluation of plant character ontologies against domain literature (2010) 0.13
    0.13177764 = sum of:
      0.13177764 = product of:
        0.47063446 = sum of:
          0.027919559 = weight(abstract_txt:text in 3466) [ClassicSimilarity], result of:
            0.027919559 = score(doc=3466,freq=3.0), product of:
              0.06377803 = queryWeight, product of:
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.015771545 = queryNorm
              0.4377614 = fieldWeight in 3466, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=3466)
          0.015439653 = weight(abstract_txt:from in 3466) [ClassicSimilarity], result of:
            0.015439653 = score(doc=3466,freq=4.0), product of:
              0.04468975 = queryWeight, product of:
                1.0252129 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.015771545 = queryNorm
              0.34548533 = fieldWeight in 3466, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=3466)
          0.021498004 = weight(abstract_txt:tools in 3466) [ClassicSimilarity], result of:
            0.021498004 = score(doc=3466,freq=1.0), product of:
              0.0772748 = queryWeight, product of:
                1.1007366 = boost
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.015771545 = queryNorm
              0.278202 = fieldWeight in 3466, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.0625 = fieldNorm(doc=3466)
          0.10104644 = weight(abstract_txt:specimens in 3466) [ClassicSimilarity], result of:
            0.10104644 = score(doc=3466,freq=1.0), product of:
              0.17209826 = queryWeight, product of:
                1.1615494 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.015771545 = queryNorm
              0.5871439 = fieldWeight in 3466, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=3466)
          0.035850443 = weight(abstract_txt:machine in 3466) [ClassicSimilarity], result of:
            0.035850443 = score(doc=3466,freq=1.0), product of:
              0.10866812 = queryWeight, product of:
                1.3053156 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.015771545 = queryNorm
              0.32990766 = fieldWeight in 3466, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.0625 = fieldNorm(doc=3466)
          0.042883784 = weight(abstract_txt:automated in 3466) [ClassicSimilarity], result of:
            0.042883784 = score(doc=3466,freq=1.0), product of:
              0.12245256 = queryWeight, product of:
                1.3856336 = boost
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.015771545 = queryNorm
              0.35020733 = fieldWeight in 3466, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.0625 = fieldNorm(doc=3466)
          0.2259966 = weight(abstract_txt:specimen in 3466) [ClassicSimilarity], result of:
            0.2259966 = score(doc=3466,freq=1.0), product of:
              0.37082905 = queryWeight, product of:
                2.4113004 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.015771545 = queryNorm
              0.6094361 = fieldWeight in 3466, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0625 = fieldNorm(doc=3466)
        0.28 = coord(7/25)
    
  4. Laparra, E.; Binford-Walsh, A.; Emerson, K.; Miller, M.L.; López-Hoffman, L.; Currim, F.; Bethard, S.: Addressing structural hurdles for metadata extraction from environmental impact statements (2023) 0.13
    0.12657677 = sum of:
      0.12657677 = product of:
        0.5274032 = sum of:
          0.013371131 = weight(abstract_txt:from in 1042) [ClassicSimilarity], result of:
            0.013371131 = score(doc=1042,freq=3.0), product of:
              0.04468975 = queryWeight, product of:
                1.0252129 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.015771545 = queryNorm
              0.29919907 = fieldWeight in 1042, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=1042)
          0.050700184 = weight(abstract_txt:machine in 1042) [ClassicSimilarity], result of:
            0.050700184 = score(doc=1042,freq=2.0), product of:
              0.10866812 = queryWeight, product of:
                1.3053156 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.015771545 = queryNorm
              0.4665599 = fieldWeight in 1042, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.0625 = fieldNorm(doc=1042)
          0.09897282 = weight(abstract_txt:extract in 1042) [ClassicSimilarity], result of:
            0.09897282 = score(doc=1042,freq=2.0), product of:
              0.16973567 = queryWeight, product of:
                1.6313646 = boost
                6.5970206 = idf(docFreq=163, maxDocs=44218)
                0.015771545 = queryNorm
              0.5830997 = fieldWeight in 1042, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5970206 = idf(docFreq=163, maxDocs=44218)
                0.0625 = fieldNorm(doc=1042)
          0.073930345 = weight(abstract_txt:learning in 1042) [ClassicSimilarity], result of:
            0.073930345 = score(doc=1042,freq=2.0), product of:
              0.1760574 = queryWeight, product of:
                2.3496685 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.015771545 = queryNorm
              0.41992182 = fieldWeight in 1042, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=1042)
          0.1267841 = weight(abstract_txt:metadata in 1042) [ClassicSimilarity], result of:
            0.1267841 = score(doc=1042,freq=5.0), product of:
              0.18585275 = queryWeight, product of:
                2.4141483 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.015771545 = queryNorm
              0.68217504 = fieldWeight in 1042, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.0625 = fieldNorm(doc=1042)
          0.16364463 = weight(abstract_txt:extraction in 1042) [ClassicSimilarity], result of:
            0.16364463 = score(doc=1042,freq=2.0), product of:
              0.29902464 = queryWeight, product of:
                3.0621958 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.015771545 = queryNorm
              0.54726136 = fieldWeight in 1042, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0625 = fieldNorm(doc=1042)
        0.24 = coord(6/25)
    
  5. Hooland, S. van; Verborgh, R.: Linked data for Lilibraries, archives and museums : how to clean, link, and publish your metadata (2014) 0.12
    0.12053901 = sum of:
      0.12053901 = product of:
        0.43049645 = sum of:
          0.008188113 = weight(abstract_txt:from in 5153) [ClassicSimilarity], result of:
            0.008188113 = score(doc=5153,freq=2.0), product of:
              0.04468975 = queryWeight, product of:
                1.0252129 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.015771545 = queryNorm
              0.18322127 = fieldWeight in 5153, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.046875 = fieldNorm(doc=5153)
          0.10255428 = weight(abstract_txt:cleaning in 5153) [ClassicSimilarity], result of:
            0.10255428 = score(doc=5153,freq=3.0), product of:
              0.14598797 = queryWeight, product of:
                1.0698134 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.015771545 = queryNorm
              0.7024844 = fieldWeight in 5153, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.046875 = fieldNorm(doc=5153)
          0.022802075 = weight(abstract_txt:tools in 5153) [ClassicSimilarity], result of:
            0.022802075 = score(doc=5153,freq=2.0), product of:
              0.0772748 = queryWeight, product of:
                1.1007366 = boost
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.015771545 = queryNorm
              0.29507777 = fieldWeight in 5153, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.451232 = idf(docFreq=1401, maxDocs=44218)
                0.046875 = fieldNorm(doc=5153)
          0.018281618 = weight(abstract_txt:models in 5153) [ClassicSimilarity], result of:
            0.018281618 = score(doc=5153,freq=1.0), product of:
              0.08402491 = queryWeight, product of:
                1.147806 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.015771545 = queryNorm
              0.21757379 = fieldWeight in 5153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.046875 = fieldNorm(doc=5153)
          0.032162838 = weight(abstract_txt:automated in 5153) [ClassicSimilarity], result of:
            0.032162838 = score(doc=5153,freq=1.0), product of:
              0.12245256 = queryWeight, product of:
                1.3856336 = boost
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.015771545 = queryNorm
              0.2626555 = fieldWeight in 5153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6033173 = idf(docFreq=442, maxDocs=44218)
                0.046875 = fieldNorm(doc=5153)
          0.0711738 = weight(abstract_txt:museum in 5153) [ClassicSimilarity], result of:
            0.0711738 = score(doc=5153,freq=1.0), product of:
              0.23803574 = queryWeight, product of:
                2.3660896 = boost
                6.378767 = idf(docFreq=203, maxDocs=44218)
                0.015771545 = queryNorm
              0.2990047 = fieldWeight in 5153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.378767 = idf(docFreq=203, maxDocs=44218)
                0.046875 = fieldNorm(doc=5153)
          0.17533374 = weight(abstract_txt:metadata in 5153) [ClassicSimilarity], result of:
            0.17533374 = score(doc=5153,freq=17.0), product of:
              0.18585275 = queryWeight, product of:
                2.4141483 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.015771545 = queryNorm
              0.9434014 = fieldWeight in 5153, product of:
                4.1231055 = tf(freq=17.0), with freq of:
                  17.0 = termFreq=17.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.046875 = fieldNorm(doc=5153)
        0.28 = coord(7/25)