Document (#38889)

Author
Pope, J.T.
Holley, R.P.
Title
Google Book Search and metadata
Source
Cataloging and classification quarterly. 49(2011) no.1, S.1-13
Year
2011
Abstract
This article summarizes published documents on metadata provided by Google for books scanned as part of the Google Book Search (GBS) project and provides suggestions for improvement. The faulty, misleading, and confusing metadata in current Google records can pose potentially serious problems for users of GBS. Google admits that it took data, which proved to be inaccurate, from many sources and is attempting to correct errors. Some argue that metadata is not needed with keyword searching; but optical character recognition (OCR) errors, synonym control, and materials in foreign languages make reliable metadata a requirement for academic researchers. The authors recommend that users should be able to submit error reports to Google to correct faulty metadata.
Theme
Formalerschließung
Metadaten
Object
Google Book Search

Similar documents (author)

  1. Holley, R.P.: Report from the section on classification and indexing : 1988-89 (1989) 5.35
    5.3510256 = sum of:
      5.3510256 = weight(author_txt:holley in 425) [ClassicSimilarity], result of:
        5.3510256 = score(doc=425,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.116800025 = queryNorm
          5.351026 = fieldWeight in 425, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.625 = fieldNorm(doc=425)
    
  2. Holley, R.P.: Subject access in the online catalog (1989) 5.35
    5.3510256 = sum of:
      5.3510256 = weight(author_txt:holley in 443) [ClassicSimilarity], result of:
        5.3510256 = score(doc=443,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.116800025 = queryNorm
          5.351026 = fieldWeight in 443, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.625 = fieldNorm(doc=443)
    
  3. Holley, R.P.: Entwicklung und Fortschritt bei Klassifikation und Indexierung (1987) 5.35
    5.3510256 = sum of:
      5.3510256 = weight(author_txt:holley in 929) [ClassicSimilarity], result of:
        5.3510256 = score(doc=929,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.116800025 = queryNorm
          5.351026 = fieldWeight in 929, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.625 = fieldNorm(doc=929)
    
  4. Holley, E.G.: ¬The trend to LC : thoughts on changing library classification schemes (1967) 5.35
    5.3510256 = sum of:
      5.3510256 = weight(author_txt:holley in 1713) [ClassicSimilarity], result of:
        5.3510256 = score(doc=1713,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.116800025 = queryNorm
          5.351026 = fieldWeight in 1713, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.625 = fieldNorm(doc=1713)
    
  5. Holley, R.P.: Classification in the USA (1985) 5.35
    5.3510256 = sum of:
      5.3510256 = weight(author_txt:holley in 1730) [ClassicSimilarity], result of:
        5.3510256 = score(doc=1730,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.116800025 = queryNorm
          5.351026 = fieldWeight in 1730, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.561642 = idf(docFreq=21, maxDocs=42306)
            0.625 = fieldNorm(doc=1730)
    

Similar documents (content)

  1. Dawson, A.; Hamilton, V.: Optimising metadata to make high-value content more accessible to Google users (2006) 0.11
    0.10974305 = sum of:
      0.10974305 = product of:
        0.54871523 = sum of:
          0.02003274 = weight(abstract_txt:users in 599) [ClassicSimilarity], result of:
            0.02003274 = score(doc=599,freq=2.0), product of:
              0.063337795 = queryWeight, product of:
                1.0511414 = boost
                3.5783465 = idf(docFreq=3210, maxDocs=42306)
                0.01683912 = queryNorm
              0.31628412 = fieldWeight in 599, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5783465 = idf(docFreq=3210, maxDocs=42306)
                0.0625 = fieldNorm(doc=599)
          0.011171128 = weight(abstract_txt:that in 599) [ClassicSimilarity], result of:
            0.011171128 = score(doc=599,freq=3.0), product of:
              0.042910878 = queryWeight, product of:
                1.0596416 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.01683912 = queryNorm
              0.26033324 = fieldWeight in 599, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0625 = fieldNorm(doc=599)
          0.033773232 = weight(abstract_txt:search in 599) [ClassicSimilarity], result of:
            0.033773232 = score(doc=599,freq=5.0), product of:
              0.06610553 = queryWeight, product of:
                1.0738622 = boost
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.01683912 = queryNorm
              0.51089877 = fieldWeight in 599, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.0625 = fieldNorm(doc=599)
          0.19026788 = weight(abstract_txt:metadata in 599) [ClassicSimilarity], result of:
            0.19026788 = score(doc=599,freq=3.0), product of:
              0.35789558 = queryWeight, product of:
                4.3278127 = boost
                4.9109836 = idf(docFreq=846, maxDocs=42306)
                0.01683912 = queryNorm
              0.53162956 = fieldWeight in 599, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9109836 = idf(docFreq=846, maxDocs=42306)
                0.0625 = fieldNorm(doc=599)
          0.29347026 = weight(abstract_txt:google in 599) [ClassicSimilarity], result of:
            0.29347026 = score(doc=599,freq=4.0), product of:
              0.43408608 = queryWeight, product of:
                4.7662654 = boost
                5.4085174 = idf(docFreq=514, maxDocs=42306)
                0.01683912 = queryNorm
              0.67606467 = fieldWeight in 599, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4085174 = idf(docFreq=514, maxDocs=42306)
                0.0625 = fieldNorm(doc=599)
        0.2 = coord(5/25)
    
  2. Dawson, A.: Creating metadata that work for digital libraries and Google (2004) 0.11
    0.10955673 = sum of:
      0.10955673 = product of:
        0.6847296 = sum of:
          0.021247929 = weight(abstract_txt:users in 763) [ClassicSimilarity], result of:
            0.021247929 = score(doc=763,freq=1.0), product of:
              0.063337795 = queryWeight, product of:
                1.0511414 = boost
                3.5783465 = idf(docFreq=3210, maxDocs=42306)
                0.01683912 = queryNorm
              0.33547 = fieldWeight in 763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5783465 = idf(docFreq=3210, maxDocs=42306)
                0.09375 = fieldNorm(doc=763)
          0.022655772 = weight(abstract_txt:search in 763) [ClassicSimilarity], result of:
            0.022655772 = score(doc=763,freq=1.0), product of:
              0.06610553 = queryWeight, product of:
                1.0738622 = boost
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.01683912 = queryNorm
              0.34272128 = fieldWeight in 763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.09375 = fieldNorm(doc=763)
          0.3295536 = weight(abstract_txt:metadata in 763) [ClassicSimilarity], result of:
            0.3295536 = score(doc=763,freq=4.0), product of:
              0.35789558 = queryWeight, product of:
                4.3278127 = boost
                4.9109836 = idf(docFreq=846, maxDocs=42306)
                0.01683912 = queryNorm
              0.9208094 = fieldWeight in 763, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.9109836 = idf(docFreq=846, maxDocs=42306)
                0.09375 = fieldNorm(doc=763)
          0.31127223 = weight(abstract_txt:google in 763) [ClassicSimilarity], result of:
            0.31127223 = score(doc=763,freq=2.0), product of:
              0.43408608 = queryWeight, product of:
                4.7662654 = boost
                5.4085174 = idf(docFreq=514, maxDocs=42306)
                0.01683912 = queryNorm
              0.7170749 = fieldWeight in 763, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4085174 = idf(docFreq=514, maxDocs=42306)
                0.09375 = fieldNorm(doc=763)
        0.16 = coord(4/25)
    
  3. Golderman, G.M.; Connolly, B.: Between the book covers : going beyond OPAC keyword searching with the deep linking capabilities of Google Scholar and Google Book Search (2004/05) 0.11
    0.1063164 = sum of:
      0.1063164 = product of:
        0.44298503 = sum of:
          0.014165286 = weight(abstract_txt:users in 2732) [ClassicSimilarity], result of:
            0.014165286 = score(doc=2732,freq=1.0), product of:
              0.063337795 = queryWeight, product of:
                1.0511414 = boost
                3.5783465 = idf(docFreq=3210, maxDocs=42306)
                0.01683912 = queryNorm
              0.22364666 = fieldWeight in 2732, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5783465 = idf(docFreq=3210, maxDocs=42306)
                0.0625 = fieldNorm(doc=2732)
          0.018242376 = weight(abstract_txt:that in 2732) [ClassicSimilarity], result of:
            0.018242376 = score(doc=2732,freq=8.0), product of:
              0.042910878 = queryWeight, product of:
                1.0596416 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.01683912 = queryNorm
              0.4251224 = fieldWeight in 2732, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0625 = fieldNorm(doc=2732)
          0.015103849 = weight(abstract_txt:search in 2732) [ClassicSimilarity], result of:
            0.015103849 = score(doc=2732,freq=1.0), product of:
              0.06610553 = queryWeight, product of:
                1.0738622 = boost
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.01683912 = queryNorm
              0.22848086 = fieldWeight in 2732, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.0625 = fieldNorm(doc=2732)
          0.066309385 = weight(abstract_txt:attempting in 2732) [ClassicSimilarity], result of:
            0.066309385 = score(doc=2732,freq=1.0), product of:
              0.1406758 = queryWeight, product of:
                1.1077056 = boost
                7.5418105 = idf(docFreq=60, maxDocs=42306)
                0.01683912 = queryNorm
              0.47136316 = fieldWeight in 2732, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5418105 = idf(docFreq=60, maxDocs=42306)
                0.0625 = fieldNorm(doc=2732)
          0.035693865 = weight(abstract_txt:book in 2732) [ClassicSimilarity], result of:
            0.035693865 = score(doc=2732,freq=1.0), product of:
              0.117284805 = queryWeight, product of:
                1.4303771 = boost
                4.869359 = idf(docFreq=882, maxDocs=42306)
                0.01683912 = queryNorm
              0.30433494 = fieldWeight in 2732, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.869359 = idf(docFreq=882, maxDocs=42306)
                0.0625 = fieldNorm(doc=2732)
          0.29347026 = weight(abstract_txt:google in 2732) [ClassicSimilarity], result of:
            0.29347026 = score(doc=2732,freq=4.0), product of:
              0.43408608 = queryWeight, product of:
                4.7662654 = boost
                5.4085174 = idf(docFreq=514, maxDocs=42306)
                0.01683912 = queryNorm
              0.67606467 = fieldWeight in 2732, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4085174 = idf(docFreq=514, maxDocs=42306)
                0.0625 = fieldNorm(doc=2732)
        0.24 = coord(6/25)
    
  4. Lewandowski, D.: Evaluating the retrieval effectiveness of web search engines using a representative query sample (2015) 0.11
    0.10609825 = sum of:
      0.10609825 = product of:
        0.53049123 = sum of:
          0.017706608 = weight(abstract_txt:users in 4158) [ClassicSimilarity], result of:
            0.017706608 = score(doc=4158,freq=1.0), product of:
              0.063337795 = queryWeight, product of:
                1.0511414 = boost
                3.5783465 = idf(docFreq=3210, maxDocs=42306)
                0.01683912 = queryNorm
              0.27955833 = fieldWeight in 4158, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5783465 = idf(docFreq=3210, maxDocs=42306)
                0.078125 = fieldNorm(doc=4158)
          0.013963911 = weight(abstract_txt:that in 4158) [ClassicSimilarity], result of:
            0.013963911 = score(doc=4158,freq=3.0), product of:
              0.042910878 = queryWeight, product of:
                1.0596416 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.01683912 = queryNorm
              0.32541656 = fieldWeight in 4158, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.078125 = fieldNorm(doc=4158)
          0.03775962 = weight(abstract_txt:search in 4158) [ClassicSimilarity], result of:
            0.03775962 = score(doc=4158,freq=4.0), product of:
              0.06610553 = queryWeight, product of:
                1.0738622 = boost
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.01683912 = queryNorm
              0.57120216 = fieldWeight in 4158, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.078125 = fieldNorm(doc=4158)
          0.20166758 = weight(abstract_txt:correct in 4158) [ClassicSimilarity], result of:
            0.20166758 = score(doc=4158,freq=3.0), product of:
              0.22230864 = queryWeight, product of:
                1.9692817 = boost
                6.703924 = idf(docFreq=140, maxDocs=42306)
                0.01683912 = queryNorm
              0.90715134 = fieldWeight in 4158, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.703924 = idf(docFreq=140, maxDocs=42306)
                0.078125 = fieldNorm(doc=4158)
          0.2593935 = weight(abstract_txt:google in 4158) [ClassicSimilarity], result of:
            0.2593935 = score(doc=4158,freq=2.0), product of:
              0.43408608 = queryWeight, product of:
                4.7662654 = boost
                5.4085174 = idf(docFreq=514, maxDocs=42306)
                0.01683912 = queryNorm
              0.5975624 = fieldWeight in 4158, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4085174 = idf(docFreq=514, maxDocs=42306)
                0.078125 = fieldNorm(doc=4158)
        0.2 = coord(5/25)
    
  5. Wallis, R.; Isaac, A.; Charles, V.; Manguinhas, H.: Recommendations for the application of Schema.org to aggregated cultural heritage metadata to increase relevance and visibility to search engines : the case of Europeana (2017) 0.10
    0.10196793 = sum of:
      0.10196793 = product of:
        0.50983965 = sum of:
          0.02003274 = weight(abstract_txt:users in 291) [ClassicSimilarity], result of:
            0.02003274 = score(doc=291,freq=2.0), product of:
              0.063337795 = queryWeight, product of:
                1.0511414 = boost
                3.5783465 = idf(docFreq=3210, maxDocs=42306)
                0.01683912 = queryNorm
              0.31628412 = fieldWeight in 291, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5783465 = idf(docFreq=3210, maxDocs=42306)
                0.0625 = fieldNorm(doc=291)
          0.0064496538 = weight(abstract_txt:that in 291) [ClassicSimilarity], result of:
            0.0064496538 = score(doc=291,freq=1.0), product of:
              0.042910878 = queryWeight, product of:
                1.0596416 = boost
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.01683912 = queryNorm
              0.15030347 = fieldWeight in 291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4048555 = idf(docFreq=10381, maxDocs=42306)
                0.0625 = fieldNorm(doc=291)
          0.030207697 = weight(abstract_txt:search in 291) [ClassicSimilarity], result of:
            0.030207697 = score(doc=291,freq=4.0), product of:
              0.06610553 = queryWeight, product of:
                1.0738622 = boost
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.01683912 = queryNorm
              0.45696172 = fieldWeight in 291, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6556938 = idf(docFreq=2971, maxDocs=42306)
                0.0625 = fieldNorm(doc=291)
          0.24563478 = weight(abstract_txt:metadata in 291) [ClassicSimilarity], result of:
            0.24563478 = score(doc=291,freq=5.0), product of:
              0.35789558 = queryWeight, product of:
                4.3278127 = boost
                4.9109836 = idf(docFreq=846, maxDocs=42306)
                0.01683912 = queryNorm
              0.68633085 = fieldWeight in 291, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.9109836 = idf(docFreq=846, maxDocs=42306)
                0.0625 = fieldNorm(doc=291)
          0.20751481 = weight(abstract_txt:google in 291) [ClassicSimilarity], result of:
            0.20751481 = score(doc=291,freq=2.0), product of:
              0.43408608 = queryWeight, product of:
                4.7662654 = boost
                5.4085174 = idf(docFreq=514, maxDocs=42306)
                0.01683912 = queryNorm
              0.4780499 = fieldWeight in 291, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4085174 = idf(docFreq=514, maxDocs=42306)
                0.0625 = fieldNorm(doc=291)
        0.2 = coord(5/25)