Document (#38888)

Pope, J.T.
Holley, R.P.
Google Book Search and metadata
Cataloging and classification quarterly. 49(2011) no.1, pp. 1-13
This article summarizes published documents on metadata provided by Google for books scanned as part of the Google Book Search (GBS) project and provides suggestions for improvement. The faulty, misleading, and confusing metadata in current Google records can pose potentially serious problems for users of GBS. Google admits that it took data, which proved to be inaccurate, from many sources and is attempting to correct errors. Some argue that metadata is not needed with keyword searching. However, optical character recognition (OCR) errors, the need for synonym control, and materials in foreign languages make reliable metadata a requirement for academic researchers. The authors recommend that users be able to submit error reports to Google to correct faulty metadata.
Google Book Search

Similar documents (author)

  1. Holley, R.P.: Report from the section on classification and indexing : 1988-89 (1989) 5.35
    5.3508706 = sum of:
      5.3508706 = weight(author_txt:holley in 425) [ClassicSimilarity], result of:
        5.3508706 = fieldWeight in 425, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.561393 = idf(docFreq=22, maxDocs=44218)
          0.625 = fieldNorm(doc=425)
  2. Holley, R.P.: Subject access in the online catalog (1989) 5.35
    5.3508706 = sum of:
      5.3508706 = weight(author_txt:holley in 443) [ClassicSimilarity], result of:
        5.3508706 = fieldWeight in 443, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.561393 = idf(docFreq=22, maxDocs=44218)
          0.625 = fieldNorm(doc=443)
  3. Holley, R.P.: Entwicklung und Fortschritt bei Klassifikation und Indexierung (1987) 5.35
    5.3508706 = sum of:
      5.3508706 = weight(author_txt:holley in 929) [ClassicSimilarity], result of:
        5.3508706 = fieldWeight in 929, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.561393 = idf(docFreq=22, maxDocs=44218)
          0.625 = fieldNorm(doc=929)
  4. Holley, E.G.: The trend to LC : thoughts on changing library classification schemes (1967) 5.35
    5.3508706 = sum of:
      5.3508706 = weight(author_txt:holley in 1713) [ClassicSimilarity], result of:
        5.3508706 = fieldWeight in 1713, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.561393 = idf(docFreq=22, maxDocs=44218)
          0.625 = fieldNorm(doc=1713)
  5. Holley, R.P.: Classification in the USA (1985) 5.35
    5.3508706 = sum of:
      5.3508706 = weight(author_txt:holley in 1730) [ClassicSimilarity], result of:
        5.3508706 = fieldWeight in 1730, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.561393 = idf(docFreq=22, maxDocs=44218)
          0.625 = fieldNorm(doc=1730)
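
The author-match scores listed above can be reproduced by hand. The sketch below assumes the standard Lucene ClassicSimilarity formulas, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are illustrative and not part of the listing, and fieldNorm is taken as given from the explain output rather than recomputed.

```python
import math

def classic_tf(freq: float) -> float:
    # ClassicSimilarity term frequency: square root of the raw count.
    return math.sqrt(freq)

def classic_idf(doc_freq: int, max_docs: int) -> float:
    # ClassicSimilarity inverse document frequency.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm, as in the explain tree.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values copied from the "author_txt:holley" entries above.
score = field_weight(freq=1.0, doc_freq=22, max_docs=44218, field_norm=0.625)
print(round(score, 2))  # 5.35
```

All five author matches share the same inputs (one occurrence of "holley", docFreq=22, fieldNorm=0.625), which is why they all score 5.35.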

Similar documents (content)

  1. Dawson, A.; Hamilton, V.: Optimising metadata to make high-value content more accessible to Google users (2006) 0.11
    0.10837219 = sum of:
      0.10837219 = product of:
        0.54186094 = sum of:
          0.01075554 = weight(abstract_txt:that in 5598) [ClassicSimilarity], result of:
            0.01075554 = score(doc=5598,freq=3.0), product of:
              0.041931406 = queryWeight, product of:
                1.0533923 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.01679953 = queryNorm
              0.2565032 = fieldWeight in 5598, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=5598)
          0.020019926 = weight(abstract_txt:users in 5598) [ClassicSimilarity], result of:
            0.020019926 = score(doc=5598,freq=2.0), product of:
              0.06344921 = queryWeight, product of:
                1.0580055 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.01679953 = queryNorm
              0.31552678 = fieldWeight in 5598, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0625 = fieldNorm(doc=5598)
          0.0340611 = weight(abstract_txt:search in 5598) [ClassicSimilarity], result of:
            0.0340611 = score(doc=5598,freq=5.0), product of:
              0.066626 = queryWeight, product of:
                1.0841682 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.01679953 = queryNorm
              0.5112284 = fieldWeight in 5598, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=5598)
          0.18806072 = weight(abstract_txt:metadata in 5598) [ClassicSimilarity], result of:
            0.18806072 = score(doc=5598,freq=3.0), product of:
              0.35589892 = queryWeight, product of:
                4.3400903 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.01679953 = queryNorm
              0.5284105 = fieldWeight in 5598, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.0625 = fieldNorm(doc=5598)
          0.2889637 = weight(abstract_txt:google in 5598) [ClassicSimilarity], result of:
            0.2889637 = score(doc=5598,freq=4.0), product of:
              0.43057013 = queryWeight, product of:
                4.7737246 = boost
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.01679953 = queryNorm
              0.6711188 = fieldWeight in 5598, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.0625 = fieldNorm(doc=5598)
        0.2 = coord(5/25)
  2. Dawson, A.: Creating metadata that work for digital libraries and Google (2004) 0.11
    0.108209 = sum of:
      0.108209 = product of:
        0.67630625 = sum of:
          0.021234335 = weight(abstract_txt:users in 4762) [ClassicSimilarity], result of:
            0.021234335 = score(doc=4762,freq=1.0), product of:
              0.06344921 = queryWeight, product of:
                1.0580055 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.01679953 = queryNorm
              0.33466667 = fieldWeight in 4762, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.09375 = fieldNorm(doc=4762)
          0.02284888 = weight(abstract_txt:search in 4762) [ClassicSimilarity], result of:
            0.02284888 = score(doc=4762,freq=1.0), product of:
              0.066626 = queryWeight, product of:
                1.0841682 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.01679953 = queryNorm
              0.34294242 = fieldWeight in 4762, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.09375 = fieldNorm(doc=4762)
          0.32573074 = weight(abstract_txt:metadata in 4762) [ClassicSimilarity], result of:
            0.32573074 = score(doc=4762,freq=4.0), product of:
              0.35589892 = queryWeight, product of:
                4.3400903 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.01679953 = queryNorm
              0.91523385 = fieldWeight in 4762, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.09375 = fieldNorm(doc=4762)
          0.30649227 = weight(abstract_txt:google in 4762) [ClassicSimilarity], result of:
            0.30649227 = score(doc=4762,freq=2.0), product of:
              0.43057013 = queryWeight, product of:
                4.7737246 = boost
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.01679953 = queryNorm
              0.71182895 = fieldWeight in 4762, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.09375 = fieldNorm(doc=4762)
        0.16 = coord(4/25)
  3. Lewandowski, D.: Evaluating the retrieval effectiveness of web search engines using a representative query sample (2015) 0.11
    0.105570935 = sum of:
      0.105570935 = product of:
        0.5278547 = sum of:
          0.013444425 = weight(abstract_txt:that in 2157) [ClassicSimilarity], result of:
            0.013444425 = score(doc=2157,freq=3.0), product of:
              0.041931406 = queryWeight, product of:
                1.0533923 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.01679953 = queryNorm
              0.320629 = fieldWeight in 2157, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=2157)
          0.017695282 = weight(abstract_txt:users in 2157) [ClassicSimilarity], result of:
            0.017695282 = score(doc=2157,freq=1.0), product of:
              0.06344921 = queryWeight, product of:
                1.0580055 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.01679953 = queryNorm
              0.2788889 = fieldWeight in 2157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.078125 = fieldNorm(doc=2157)
          0.038081467 = weight(abstract_txt:search in 2157) [ClassicSimilarity], result of:
            0.038081467 = score(doc=2157,freq=4.0), product of:
              0.066626 = queryWeight, product of:
                1.0841682 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.01679953 = queryNorm
              0.5715707 = fieldWeight in 2157, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.078125 = fieldNorm(doc=2157)
          0.2032233 = weight(abstract_txt:correct in 2157) [ClassicSimilarity], result of:
            0.2032233 = score(doc=2157,freq=3.0), product of:
              0.22393906 = queryWeight, product of:
                1.9876491 = boost
                6.7064548 = idf(docFreq=146, maxDocs=44218)
                0.01679953 = queryNorm
              0.90749377 = fieldWeight in 2157, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.7064548 = idf(docFreq=146, maxDocs=44218)
                0.078125 = fieldNorm(doc=2157)
          0.25541022 = weight(abstract_txt:google in 2157) [ClassicSimilarity], result of:
            0.25541022 = score(doc=2157,freq=2.0), product of:
              0.43057013 = queryWeight, product of:
                4.7737246 = boost
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.01679953 = queryNorm
              0.5931908 = fieldWeight in 2157, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.078125 = fieldNorm(doc=2157)
        0.2 = coord(5/25)
  4. Golderman, G.M.; Connolly, B.: Between the book covers : going beyond OPAC keyword searching with the deep linking capabilities of Google Scholar and Google Book Search (2004/05) 0.11
    0.105069906 = sum of:
      0.105069906 = product of:
        0.4377913 = sum of:
          0.017563723 = weight(abstract_txt:that in 731) [ClassicSimilarity], result of:
            0.017563723 = score(doc=731,freq=8.0), product of:
              0.041931406 = queryWeight, product of:
                1.0533923 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.01679953 = queryNorm
              0.41886798 = fieldWeight in 731, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=731)
          0.014156225 = weight(abstract_txt:users in 731) [ClassicSimilarity], result of:
            0.014156225 = score(doc=731,freq=1.0), product of:
              0.06344921 = queryWeight, product of:
                1.0580055 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.01679953 = queryNorm
              0.22311112 = fieldWeight in 731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0625 = fieldNorm(doc=731)
          0.015232587 = weight(abstract_txt:search in 731) [ClassicSimilarity], result of:
            0.015232587 = score(doc=731,freq=1.0), product of:
              0.066626 = queryWeight, product of:
                1.0841682 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.01679953 = queryNorm
              0.22862828 = fieldWeight in 731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=731)
          0.06623392 = weight(abstract_txt:attempting in 731) [ClassicSimilarity], result of:
            0.06623392 = score(doc=731,freq=1.0), product of:
              0.1408764 = queryWeight, product of:
                1.1147537 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.01679953 = queryNorm
              0.47015625 = fieldWeight in 731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.0625 = fieldNorm(doc=731)
          0.035641123 = weight(abstract_txt:book in 731) [ClassicSimilarity], result of:
            0.035641123 = score(doc=731,freq=1.0), product of:
              0.117425434 = queryWeight, product of:
                1.4393151 = boost
                4.856341 = idf(docFreq=934, maxDocs=44218)
                0.01679953 = queryNorm
              0.3035213 = fieldWeight in 731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.856341 = idf(docFreq=934, maxDocs=44218)
                0.0625 = fieldNorm(doc=731)
          0.2889637 = weight(abstract_txt:google in 731) [ClassicSimilarity], result of:
            0.2889637 = score(doc=731,freq=4.0), product of:
              0.43057013 = queryWeight, product of:
                4.7737246 = boost
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.01679953 = queryNorm
              0.6711188 = fieldWeight in 731, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.0625 = fieldNorm(doc=731)
        0.24 = coord(6/25)
  5. Zakaria, M.S.: Measuring typographical errors in online catalogs of academic libraries using Ballard's list : a case study from Egypt (2023) 0.10
    0.10287014 = sum of:
      0.10287014 = product of:
        0.42862558 = sum of:
          0.070809916 = weight(abstract_txt:error in 1184) [ClassicSimilarity], result of:
            0.070809916 = score(doc=1184,freq=2.0), product of:
              0.11690614 = queryWeight, product of:
                1.0154966 = boost
                6.8527 = idf(docFreq=126, maxDocs=44218)
                0.01679953 = queryNorm
              0.6056988 = fieldWeight in 1184, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8527 = idf(docFreq=126, maxDocs=44218)
                0.0625 = fieldNorm(doc=1184)
          0.051698685 = weight(abstract_txt:serious in 1184) [ClassicSimilarity], result of:
            0.051698685 = score(doc=1184,freq=1.0), product of:
              0.11942748 = queryWeight, product of:
                1.0263889 = boost
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.01679953 = queryNorm
              0.43288767 = fieldWeight in 1184, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.0625 = fieldNorm(doc=1184)
          0.01075554 = weight(abstract_txt:that in 1184) [ClassicSimilarity], result of:
            0.01075554 = score(doc=1184,freq=3.0), product of:
              0.041931406 = queryWeight, product of:
                1.0533923 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.01679953 = queryNorm
              0.2565032 = fieldWeight in 1184, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1184)
          0.014156225 = weight(abstract_txt:users in 1184) [ClassicSimilarity], result of:
            0.014156225 = score(doc=1184,freq=1.0), product of:
              0.06344921 = queryWeight, product of:
                1.0580055 = boost
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.01679953 = queryNorm
              0.22311112 = fieldWeight in 1184, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.569778 = idf(docFreq=3384, maxDocs=44218)
                0.0625 = fieldNorm(doc=1184)
          0.06706287 = weight(abstract_txt:pose in 1184) [ClassicSimilarity], result of:
            0.06706287 = score(doc=1184,freq=1.0), product of:
              0.14204939 = queryWeight, product of:
                1.119385 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.01679953 = queryNorm
              0.47210953 = fieldWeight in 1184, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0625 = fieldNorm(doc=1184)
          0.21414237 = weight(abstract_txt:errors in 1184) [ClassicSimilarity], result of:
            0.21414237 = score(doc=1184,freq=6.0), product of:
              0.21357279 = queryWeight, product of:
                1.9410993 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.01679953 = queryNorm
              1.002667 = fieldWeight in 1184, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.0625 = fieldNorm(doc=1184)
        0.24 = coord(6/25)
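
The content-match scores combine a per-term score (queryWeight times fieldWeight) with a coordination factor. As a rough check, the sketch below recomputes the score of the second entry (doc 4762) from the boosts, document frequencies, and term frequencies shown in its explain tree. It again assumes the standard ClassicSimilarity formulas; the helper names are illustrative, and queryNorm and fieldNorm are copied from the listing rather than derived.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.01679953  # copied from the explain output

def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    # ClassicSimilarity inverse document frequency.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM            # queryWeight
    field_weight = math.sqrt(freq) * term_idf * field_norm  # fieldWeight
    return query_weight * field_weight

# (boost, docFreq, freq) for the four query terms matched in doc 4762.
matched = [
    (1.0580055, 3384, 1.0),  # users
    (1.0841682, 3098, 1.0),  # search
    (4.3400903, 911, 4.0),   # metadata
    (4.7737246, 559, 2.0),   # google
]
FIELD_NORM = 0.09375
coord = len(matched) / 25  # 4 of 25 query terms matched

total = coord * sum(term_score(b, df, f, FIELD_NORM) for b, df, f in matched)
print(round(total, 4))  # 0.1082
```

The result agrees with the listed 0.108209 up to rounding; small differences in later digits are expected because Lucene computes these values in single precision.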