Document (#11118)

Author
Dubois, C.P.R.
Title
Free text vs. controlled vocabulary; a reassessment
Source
Online review. 11(1987), pp. 243-253
Year
1987
Abstract
Free text and controlled vocabulary searching can no longer be viewed as antagonistic techniques in information retrieval: both have advantages and weaknesses that depend on a fairly wide range of contexts, and the option to use both is increasingly favoured. An attempt is made to present a list of features associated with the two techniques and to suggest a methodology to assist in deciding on the optimal retrieval technique for a particular purpose. The relevance of the techniques in expert systems and full text contexts is also discussed. Finally, recommendations for further research are suggested, concentrating on survey techniques in real-life retrieval situations.
Theme
Full-text retrieval

Similar documents (author)

  1. Dubois, C.P.R.: Text retrieval 92 : summary of papers and trends (1993) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:dubois in 6255) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 6255, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=6255)
    
  2. Schulz-DuBois, E.O.: Arbeiten deutscher Wissenschaftler, die weltweit am häufigsten zitiert wurden (1984) 4.95
    4.952564 = sum of:
      4.952564 = weight(author_txt:dubois in 359) [ClassicSimilarity], result of:
        4.952564 = fieldWeight in 359, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.5 = fieldNorm(doc=359)
    
  3. Dubois, D.; Prade, H.: Measuring and updating information (1991) 4.95
    4.952564 = sum of:
      4.952564 = weight(author_txt:dubois in 4106) [ClassicSimilarity], result of:
        4.952564 = fieldWeight in 4106, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.5 = fieldNorm(doc=4106)
    
  4. Bosc, P.; Dubois, D.; Prade, H.: Fuzzy functional dependencies and redundancy elimination (1998) 3.71
    3.7144227 = sum of:
      3.7144227 = weight(author_txt:dubois in 590) [ClassicSimilarity], result of:
        3.7144227 = fieldWeight in 590, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.375 = fieldNorm(doc=590)
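
The author-match breakdowns above all follow the same pattern: fieldWeight = tf x idf x fieldNorm. The following is a minimal Python sketch of that arithmetic, assuming the classic Lucene TF-IDF definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which appear to reproduce the printed values. The function names are illustrative and are not Lucene's API; fieldNorm is taken as given from the listing rather than derived.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency (assumed form, matches the listing)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int,
                 field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain trees."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from entry 1 (doc 6255): freq=1, docFreq=5, maxDocs=44218.
idf = classic_idf(5, 44218)
score = field_weight(1.0, 5, 44218, 0.625)
print(f"idf={idf:.6f} score={score:.6f}")
```

Because tf and idf are identical for all four entries, the ranking in this list is decided by fieldNorm alone (0.625, 0.5, 0.5, 0.375), which in this scoring model is smaller for longer author fields.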
    

Similar documents (content)

  1. Srinivasan, P.: Optimal document-indexing vocabulary for MEDLINE (1996) 0.19
    0.18574001 = sum of:
      0.18574001 = product of:
        0.7739167 = sum of:
          0.0644896 = weight(abstract_txt:contexts in 6634) [ClassicSimilarity], result of:
            0.0644896 = score(doc=6634,freq=1.0), product of:
              0.11921902 = queryWeight, product of:
                1.0041273 = boost
                5.7699614 = idf(docFreq=374, maxDocs=44218)
                0.020577086 = queryNorm
              0.54093385 = fieldWeight in 6634, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7699614 = idf(docFreq=374, maxDocs=44218)
                0.09375 = fieldNorm(doc=6634)
          0.084535465 = weight(abstract_txt:retrieval in 6634) [ClassicSimilarity], result of:
            0.084535465 = score(doc=6634,freq=4.0), product of:
              0.12973748 = queryWeight, product of:
                1.814301 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.020577086 = queryNorm
              0.6515886 = fieldWeight in 6634, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=6634)
          0.10329526 = weight(abstract_txt:vocabulary in 6634) [ClassicSimilarity], result of:
            0.10329526 = score(doc=6634,freq=1.0), product of:
              0.2056282 = queryWeight, product of:
                1.864972 = boost
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.020577086 = queryNorm
              0.50233996 = fieldWeight in 6634, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.09375 = fieldNorm(doc=6634)
          0.15361345 = weight(abstract_txt:controlled in 6634) [ClassicSimilarity], result of:
            0.15361345 = score(doc=6634,freq=2.0), product of:
              0.21263687 = queryWeight, product of:
                1.8964888 = boost
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.020577086 = queryNorm
              0.7224215 = fieldWeight in 6634, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.09375 = fieldNorm(doc=6634)
          0.20484376 = weight(abstract_txt:free in 6634) [ClassicSimilarity], result of:
            0.20484376 = score(doc=6634,freq=3.0), product of:
              0.22504558 = queryWeight, product of:
                1.9510403 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.020577086 = queryNorm
              0.9102323 = fieldWeight in 6634, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.09375 = fieldNorm(doc=6634)
          0.1631392 = weight(abstract_txt:text in 6634) [ClassicSimilarity], result of:
            0.1631392 = score(doc=6634,freq=6.0), product of:
              0.17567688 = queryWeight, product of:
                2.111222 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020577086 = queryNorm
              0.92863214 = fieldWeight in 6634, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=6634)
        0.24 = coord(6/25)
    
  2. Velios, A.; St.John, K.: Linked conservation data : the adoption and use of vocabularies in the field of heritage conservation for publishing conservation records as linked data (2021) 0.15
    0.14934342 = sum of:
      0.14934342 = product of:
        0.46669817 = sum of:
          0.0424651 = weight(abstract_txt:recommendations in 580) [ClassicSimilarity], result of:
            0.0424651 = score(doc=580,freq=1.0), product of:
              0.118240975 = queryWeight, product of:
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.020577086 = queryNorm
              0.3591403 = fieldWeight in 580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=580)
          0.051710587 = weight(abstract_txt:assist in 580) [ClassicSimilarity], result of:
            0.051710587 = score(doc=580,freq=1.0), product of:
              0.13483404 = queryWeight, product of:
                1.0678636 = boost
                6.1362057 = idf(docFreq=259, maxDocs=44218)
                0.020577086 = queryNorm
              0.38351285 = fieldWeight in 580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1362057 = idf(docFreq=259, maxDocs=44218)
                0.0625 = fieldNorm(doc=580)
          0.024786623 = weight(abstract_txt:both in 580) [ClassicSimilarity], result of:
            0.024786623 = score(doc=580,freq=1.0), product of:
              0.10404826 = queryWeight, product of:
                1.3266257 = boost
                3.811558 = idf(docFreq=2657, maxDocs=44218)
                0.020577086 = queryNorm
              0.23822238 = fieldWeight in 580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.811558 = idf(docFreq=2657, maxDocs=44218)
                0.0625 = fieldNorm(doc=580)
          0.06886351 = weight(abstract_txt:vocabulary in 580) [ClassicSimilarity], result of:
            0.06886351 = score(doc=580,freq=1.0), product of:
              0.2056282 = queryWeight, product of:
                1.864972 = boost
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.020577086 = queryNorm
              0.33489332 = fieldWeight in 580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.0625 = fieldNorm(doc=580)
          0.07241408 = weight(abstract_txt:controlled in 580) [ClassicSimilarity], result of:
            0.07241408 = score(doc=580,freq=1.0), product of:
              0.21263687 = queryWeight, product of:
                1.8964888 = boost
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.020577086 = queryNorm
              0.34055278 = fieldWeight in 580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.0625 = fieldNorm(doc=580)
          0.0788444 = weight(abstract_txt:free in 580) [ClassicSimilarity], result of:
            0.0788444 = score(doc=580,freq=1.0), product of:
              0.22504558 = queryWeight, product of:
                1.9510403 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.020577086 = queryNorm
              0.3503486 = fieldWeight in 580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.0625 = fieldNorm(doc=580)
          0.044400867 = weight(abstract_txt:text in 580) [ClassicSimilarity], result of:
            0.044400867 = score(doc=580,freq=1.0), product of:
              0.17567688 = queryWeight, product of:
                2.111222 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020577086 = queryNorm
              0.25274166 = fieldWeight in 580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=580)
          0.08321299 = weight(abstract_txt:techniques in 580) [ClassicSimilarity], result of:
            0.08321299 = score(doc=580,freq=1.0), product of:
              0.29391876 = queryWeight, product of:
                3.153259 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.020577086 = queryNorm
              0.2831156 = fieldWeight in 580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.0625 = fieldNorm(doc=580)
        0.32 = coord(8/25)
    
  3. Z39.19-2005: Guidelines for the construction, format, and management of monolingual controlled vocabularies (2005) 0.14
    0.1399725 = sum of:
      0.1399725 = product of:
        0.58321875 = sum of:
          0.053081375 = weight(abstract_txt:recommendations in 708) [ClassicSimilarity], result of:
            0.053081375 = score(doc=708,freq=1.0), product of:
              0.118240975 = queryWeight, product of:
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.020577086 = queryNorm
              0.44892538 = fieldWeight in 708, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.078125 = fieldNorm(doc=708)
          0.08502306 = weight(abstract_txt:display in 708) [ClassicSimilarity], result of:
            0.08502306 = score(doc=708,freq=2.0), product of:
              0.12847571 = queryWeight, product of:
                1.042381 = boost
                5.989777 = idf(docFreq=300, maxDocs=44218)
                0.020577086 = queryNorm
              0.66178316 = fieldWeight in 708, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.989777 = idf(docFreq=300, maxDocs=44218)
                0.078125 = fieldNorm(doc=708)
          0.03522311 = weight(abstract_txt:retrieval in 708) [ClassicSimilarity], result of:
            0.03522311 = score(doc=708,freq=1.0), product of:
              0.12973748 = queryWeight, product of:
                1.814301 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.020577086 = queryNorm
              0.27149525 = fieldWeight in 708, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=708)
          0.14909388 = weight(abstract_txt:vocabulary in 708) [ClassicSimilarity], result of:
            0.14909388 = score(doc=708,freq=3.0), product of:
              0.2056282 = queryWeight, product of:
                1.864972 = boost
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.020577086 = queryNorm
              0.72506535 = fieldWeight in 708, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.078125 = fieldNorm(doc=708)
          0.15678108 = weight(abstract_txt:controlled in 708) [ClassicSimilarity], result of:
            0.15678108 = score(doc=708,freq=3.0), product of:
              0.21263687 = queryWeight, product of:
                1.8964888 = boost
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.020577086 = queryNorm
              0.7373184 = fieldWeight in 708, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.078125 = fieldNorm(doc=708)
          0.10401623 = weight(abstract_txt:techniques in 708) [ClassicSimilarity], result of:
            0.10401623 = score(doc=708,freq=1.0), product of:
              0.29391876 = queryWeight, product of:
                3.153259 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.020577086 = queryNorm
              0.3538945 = fieldWeight in 708, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.078125 = fieldNorm(doc=708)
        0.24 = coord(6/25)
    
  4. Jones, I.; Cunliffe, D.; Tudhope, D.: Natural language processing and knowledge organization systems as an aid to retrieval (2004) 0.13
    0.12808925 = sum of:
      0.12808925 = product of:
        0.53370523 = sum of:
          0.03522311 = weight(abstract_txt:retrieval in 2677) [ClassicSimilarity], result of:
            0.03522311 = score(doc=2677,freq=1.0), product of:
              0.12973748 = queryWeight, product of:
                1.814301 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.020577086 = queryNorm
              0.27149525 = fieldWeight in 2677, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=2677)
          0.08607939 = weight(abstract_txt:vocabulary in 2677) [ClassicSimilarity], result of:
            0.08607939 = score(doc=2677,freq=1.0), product of:
              0.2056282 = queryWeight, product of:
                1.864972 = boost
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.020577086 = queryNorm
              0.41861665 = fieldWeight in 2677, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.078125 = fieldNorm(doc=2677)
          0.090517595 = weight(abstract_txt:controlled in 2677) [ClassicSimilarity], result of:
            0.090517595 = score(doc=2677,freq=1.0), product of:
              0.21263687 = queryWeight, product of:
                1.8964888 = boost
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.020577086 = queryNorm
              0.42569098 = fieldWeight in 2677, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.078125 = fieldNorm(doc=2677)
          0.13937852 = weight(abstract_txt:free in 2677) [ClassicSimilarity], result of:
            0.13937852 = score(doc=2677,freq=2.0), product of:
              0.22504558 = queryWeight, product of:
                1.9510403 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.020577086 = queryNorm
              0.61933464 = fieldWeight in 2677, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.078125 = fieldNorm(doc=2677)
          0.07849039 = weight(abstract_txt:text in 2677) [ClassicSimilarity], result of:
            0.07849039 = score(doc=2677,freq=2.0), product of:
              0.17567688 = queryWeight, product of:
                2.111222 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020577086 = queryNorm
              0.44678837 = fieldWeight in 2677, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=2677)
          0.10401623 = weight(abstract_txt:techniques in 2677) [ClassicSimilarity], result of:
            0.10401623 = score(doc=2677,freq=1.0), product of:
              0.29391876 = queryWeight, product of:
                3.153259 = boost
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.020577086 = queryNorm
              0.3538945 = fieldWeight in 2677, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5298495 = idf(docFreq=1295, maxDocs=44218)
                0.078125 = fieldNorm(doc=2677)
        0.24 = coord(6/25)
    
  5. Boyce, B.R.; McLain, J.P.: Entry point depth and online search using a controlled vocabulary (1989) 0.13
    0.12502554 = sum of:
      0.12502554 = product of:
        0.6251277 = sum of:
          0.07320987 = weight(abstract_txt:retrieval in 2287) [ClassicSimilarity], result of:
            0.07320987 = score(doc=2287,freq=3.0), product of:
              0.12973748 = queryWeight, product of:
                1.814301 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.020577086 = queryNorm
              0.5642923 = fieldWeight in 2287, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=2287)
          0.17891265 = weight(abstract_txt:vocabulary in 2287) [ClassicSimilarity], result of:
            0.17891265 = score(doc=2287,freq=3.0), product of:
              0.2056282 = queryWeight, product of:
                1.864972 = boost
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.020577086 = queryNorm
              0.8700784 = fieldWeight in 2287, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.09375 = fieldNorm(doc=2287)
          0.1881373 = weight(abstract_txt:controlled in 2287) [ClassicSimilarity], result of:
            0.1881373 = score(doc=2287,freq=3.0), product of:
              0.21263687 = queryWeight, product of:
                1.8964888 = boost
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.020577086 = queryNorm
              0.8847821 = fieldWeight in 2287, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4488444 = idf(docFreq=516, maxDocs=44218)
                0.09375 = fieldNorm(doc=2287)
          0.118266605 = weight(abstract_txt:free in 2287) [ClassicSimilarity], result of:
            0.118266605 = score(doc=2287,freq=1.0), product of:
              0.22504558 = queryWeight, product of:
                1.9510403 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.020577086 = queryNorm
              0.5255229 = fieldWeight in 2287, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.09375 = fieldNorm(doc=2287)
          0.0666013 = weight(abstract_txt:text in 2287) [ClassicSimilarity], result of:
            0.0666013 = score(doc=2287,freq=1.0), product of:
              0.17567688 = queryWeight, product of:
                2.111222 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.020577086 = queryNorm
              0.37911248 = fieldWeight in 2287, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=2287)
        0.2 = coord(5/25)
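
The content-match breakdowns above combine three pieces per term: queryWeight = boost x idf x queryNorm, fieldWeight = tf x idf x fieldNorm, and a final coord factor (matched query terms divided by total query terms). The following Python sketch recomputes the total for entry 5 (doc 2287) from the numbers printed in its tree. It is an illustration of the arithmetic only, assuming tf = sqrt(freq); the names TERMS and term_score are hypothetical and not part of Lucene.

```python
import math

QUERY_NORM = 0.020577086   # queryNorm as printed in the listing
FIELD_NORM = 0.09375       # fieldNorm(doc=2287) as printed
COORD = 5 / 25             # 5 of the 25 query terms matched

# (term, boost, idf, termFreq) copied from the explain tree for doc 2287
TERMS = [
    ("retrieval",  1.814301,  3.4751394, 3.0),
    ("vocabulary", 1.864972,  5.358293,  3.0),
    ("controlled", 1.8964888, 5.4488444, 3.0),
    ("free",       1.9510403, 5.6055775, 1.0),
    ("text",       2.111222,  4.0438666, 1.0),
]


def term_score(boost: float, idf: float, freq: float) -> float:
    """Per-term score = queryWeight * fieldWeight."""
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight


raw_sum = sum(term_score(boost, idf, freq) for _, boost, idf, freq in TERMS)
total = COORD * raw_sum
print(f"sum={raw_sum:.7f} total={total:.7f}")
```

The result should agree with the 0.12502554 shown for entry 5 up to single-precision rounding. Note that coord penalises documents matching few query terms, which is why entry 2, with 8 of 25 terms matched, outranks entries whose per-term sums are larger.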