Document (#20370)

Author
Albus, W.
Smulders, H.
Title
Doeltreffend zoeken in volledige teksten : 2. full-text retrieval bij de HavenInformatieBank
Source
Informatie professional. 2(1998) no.3, S.28-33
Year
1998
Abstract
At Rotterdam Port Authority an information database has been created with approx. 100.000 full text documents online. Topic software has been used to identify word groups and refine search strategies to optimize precision and recall. The software guides users from selected terms to other relevant word combinations. Although the system would benefit from further refinement, users are generally satisfied. The database includes a number of foreign language documents but lacks a thesaurus of foreign terms
Footnote
Übers. d. Titels: Effective searching on full texts: 1. full-text-retrieval on the Harbour information database
Theme
Volltextretrieval
Object
Verity
Topic

Similar documents (content)

  1. Albus, W.; Smulders, H.: Doeltreffend zoeken in volledige teksten : 1. full-text retrieval bij de HavenInformatieBank (1998) 0.51
    0.5135316 = sum of:
      0.5135316 = product of:
        1.4264767 = sum of:
          0.06831582 = weight(abstract_txt:precision in 2683) [ClassicSimilarity], result of:
            0.06831582 = score(doc=2683,freq=1.0), product of:
              0.11322302 = queryWeight, product of:
                5.5165615 = idf(docFreq=466, maxDocs=42740)
                0.020524202 = queryNorm
              0.6033739 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5165615 = idf(docFreq=466, maxDocs=42740)
                0.109375 = fieldNorm(doc=2683)
          0.07668848 = weight(abstract_txt:recall in 2683) [ClassicSimilarity], result of:
            0.07668848 = score(doc=2683,freq=1.0), product of:
              0.12229461 = queryWeight, product of:
                1.0392889 = boost
                5.733301 = idf(docFreq=375, maxDocs=42740)
                0.020524202 = queryNorm
              0.62707984 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.733301 = idf(docFreq=375, maxDocs=42740)
                0.109375 = fieldNorm(doc=2683)
          0.1406853 = weight(abstract_txt:combinations in 2683) [ClassicSimilarity], result of:
            0.1406853 = score(doc=2683,freq=1.0), product of:
              0.18326789 = queryWeight, product of:
                1.2722598 = boost
                7.0185 = idf(docFreq=103, maxDocs=42740)
                0.020524202 = queryNorm
              0.7676484 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0185 = idf(docFreq=103, maxDocs=42740)
                0.109375 = fieldNorm(doc=2683)
          0.054066718 = weight(abstract_txt:text in 2683) [ClassicSimilarity], result of:
            0.054066718 = score(doc=2683,freq=1.0), product of:
              0.12205359 = queryWeight, product of:
                1.4683274 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.020524202 = queryNorm
              0.44297522 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.109375 = fieldNorm(doc=2683)
          0.063018605 = weight(abstract_txt:database in 2683) [ClassicSimilarity], result of:
            0.063018605 = score(doc=2683,freq=1.0), product of:
              0.13517916 = queryWeight, product of:
                1.5452633 = boost
                4.26227 = idf(docFreq=1636, maxDocs=42740)
                0.020524202 = queryNorm
              0.46618578 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.26227 = idf(docFreq=1636, maxDocs=42740)
                0.109375 = fieldNorm(doc=2683)
          0.0946304 = weight(abstract_txt:software in 2683) [ClassicSimilarity], result of:
            0.0946304 = score(doc=2683,freq=2.0), product of:
              0.1406936 = queryWeight, product of:
                1.5764667 = boost
                4.3483377 = idf(docFreq=1501, maxDocs=42740)
                0.020524202 = queryNorm
              0.67259914 = fieldWeight in 2683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3483377 = idf(docFreq=1501, maxDocs=42740)
                0.109375 = fieldNorm(doc=2683)
          0.700434 = weight(title_txt:zoeken in 2683) [ClassicSimilarity], result of:
            0.700434 = score(doc=2683,freq=1.0), product of:
              0.30795276 = queryWeight, product of:
                1.6492051 = boost
                9.097941 = idf(docFreq=12, maxDocs=42740)
                0.020524202 = queryNorm
              2.2744853 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.097941 = idf(docFreq=12, maxDocs=42740)
                0.25 = fieldNorm(doc=2683)
          0.09657975 = weight(abstract_txt:full in 2683) [ClassicSimilarity], result of:
            0.09657975 = score(doc=2683,freq=1.0), product of:
              0.17968892 = queryWeight, product of:
                1.7815921 = boost
                4.914131 = idf(docFreq=852, maxDocs=42740)
                0.020524202 = queryNorm
              0.5374831 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.914131 = idf(docFreq=852, maxDocs=42740)
                0.109375 = fieldNorm(doc=2683)
          0.13205749 = weight(abstract_txt:word in 2683) [ClassicSimilarity], result of:
            0.13205749 = score(doc=2683,freq=1.0), product of:
              0.22136346 = queryWeight, product of:
                1.9774276 = boost
                5.4543004 = idf(docFreq=496, maxDocs=42740)
                0.020524202 = queryNorm
              0.5965641 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4543004 = idf(docFreq=496, maxDocs=42740)
                0.109375 = fieldNorm(doc=2683)
        0.36 = coord(9/25)
    
  2. Sieverts, E.: Liever browsen dan zoeken (1998) 0.18
    0.18243761 = sum of:
      0.18243761 = product of:
        1.5203135 = sum of:
          0.052531585 = weight(abstract_txt:users in 5723) [ClassicSimilarity], result of:
            0.052531585 = score(doc=5723,freq=2.0), product of:
              0.0950315 = queryWeight, product of:
                1.2956313 = boost
                3.5737147 = idf(docFreq=3258, maxDocs=42740)
                0.020524202 = queryNorm
              0.55278075 = fieldWeight in 5723, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5737147 = idf(docFreq=3258, maxDocs=42740)
                0.109375 = fieldNorm(doc=5723)
          0.0669138 = weight(abstract_txt:software in 5723) [ClassicSimilarity], result of:
            0.0669138 = score(doc=5723,freq=1.0), product of:
              0.1406936 = queryWeight, product of:
                1.5764667 = boost
                4.3483377 = idf(docFreq=1501, maxDocs=42740)
                0.020524202 = queryNorm
              0.47559944 = fieldWeight in 5723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3483377 = idf(docFreq=1501, maxDocs=42740)
                0.109375 = fieldNorm(doc=5723)
          1.400868 = weight(title_txt:zoeken in 5723) [ClassicSimilarity], result of:
            1.400868 = score(doc=5723,freq=1.0), product of:
              0.30795276 = queryWeight, product of:
                1.6492051 = boost
                9.097941 = idf(docFreq=12, maxDocs=42740)
                0.020524202 = queryNorm
              4.5489707 = fieldWeight in 5723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.097941 = idf(docFreq=12, maxDocs=42740)
                0.5 = fieldNorm(doc=5723)
        0.12 = coord(3/25)
    
  3. Tseng, Y.-H.: Solving vocabulary problems with interactive query expansion (1998) 0.17
    0.17113228 = sum of:
      0.17113228 = product of:
        0.47536743 = sum of:
          0.07807522 = weight(abstract_txt:precision in 75) [ClassicSimilarity], result of:
            0.07807522 = score(doc=75,freq=4.0), product of:
              0.11322302 = queryWeight, product of:
                5.5165615 = idf(docFreq=466, maxDocs=42740)
                0.020524202 = queryNorm
              0.6895702 = fieldWeight in 75, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5165615 = idf(docFreq=466, maxDocs=42740)
                0.0625 = fieldNorm(doc=75)
          0.017469656 = weight(abstract_txt:from in 75) [ClassicSimilarity], result of:
            0.017469656 = score(doc=75,freq=3.0), product of:
              0.057867993 = queryWeight, product of:
                1.0110365 = boost
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.020524202 = queryNorm
              0.30188805 = fieldWeight in 75, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.0625 = fieldNorm(doc=75)
          0.09798895 = weight(abstract_txt:recall in 75) [ClassicSimilarity], result of:
            0.09798895 = score(doc=75,freq=5.0), product of:
              0.12229461 = queryWeight, product of:
                1.0392889 = boost
                5.733301 = idf(docFreq=375, maxDocs=42740)
                0.020524202 = queryNorm
              0.8012532 = fieldWeight in 75, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.733301 = idf(docFreq=375, maxDocs=42740)
                0.0625 = fieldNorm(doc=75)
          0.030018048 = weight(abstract_txt:users in 75) [ClassicSimilarity], result of:
            0.030018048 = score(doc=75,freq=2.0), product of:
              0.0950315 = queryWeight, product of:
                1.2956313 = boost
                3.5737147 = idf(docFreq=3258, maxDocs=42740)
                0.020524202 = queryNorm
              0.31587473 = fieldWeight in 75, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5737147 = idf(docFreq=3258, maxDocs=42740)
                0.0625 = fieldNorm(doc=75)
          0.043692507 = weight(abstract_txt:text in 75) [ClassicSimilarity], result of:
            0.043692507 = score(doc=75,freq=2.0), product of:
              0.12205359 = queryWeight, product of:
                1.4683274 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.020524202 = queryNorm
              0.35797805 = fieldWeight in 75, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.0625 = fieldNorm(doc=75)
          0.06959335 = weight(abstract_txt:terms in 75) [ClassicSimilarity], result of:
            0.06959335 = score(doc=75,freq=5.0), product of:
              0.12265287 = queryWeight, product of:
                1.4719278 = boost
                4.05999 = idf(docFreq=2003, maxDocs=42740)
                0.020524202 = queryNorm
              0.5674009 = fieldWeight in 75, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.05999 = idf(docFreq=2003, maxDocs=42740)
                0.0625 = fieldNorm(doc=75)
          0.03241458 = weight(abstract_txt:documents in 75) [ClassicSimilarity], result of:
            0.03241458 = score(doc=75,freq=1.0), product of:
              0.12602292 = queryWeight, product of:
                1.4920123 = boost
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.020524202 = queryNorm
              0.2572118 = fieldWeight in 75, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.0625 = fieldNorm(doc=75)
          0.050926723 = weight(abstract_txt:database in 75) [ClassicSimilarity], result of:
            0.050926723 = score(doc=75,freq=2.0), product of:
              0.13517916 = queryWeight, product of:
                1.5452633 = boost
                4.26227 = idf(docFreq=1636, maxDocs=42740)
                0.020524202 = queryNorm
              0.376735 = fieldWeight in 75, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.26227 = idf(docFreq=1636, maxDocs=42740)
                0.0625 = fieldNorm(doc=75)
          0.055188432 = weight(abstract_txt:full in 75) [ClassicSimilarity], result of:
            0.055188432 = score(doc=75,freq=1.0), product of:
              0.17968892 = queryWeight, product of:
                1.7815921 = boost
                4.914131 = idf(docFreq=852, maxDocs=42740)
                0.020524202 = queryNorm
              0.3071332 = fieldWeight in 75, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.914131 = idf(docFreq=852, maxDocs=42740)
                0.0625 = fieldNorm(doc=75)
        0.36 = coord(9/25)
    
  4. Sieverts, E.: Citatie-zoeken op het Web (1997) 0.17
    0.16811585 = sum of:
      0.16811585 = product of:
        1.4009655 = sum of:
          0.06482916 = weight(abstract_txt:documents in 1144) [ClassicSimilarity], result of:
            0.06482916 = score(doc=1144,freq=1.0), product of:
              0.12602292 = queryWeight, product of:
                1.4920123 = boost
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.020524202 = queryNorm
              0.5144236 = fieldWeight in 1144, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.125 = fieldNorm(doc=1144)
          1.2257595 = weight(title_txt:zoeken in 1144) [ClassicSimilarity], result of:
            1.2257595 = score(doc=1144,freq=1.0), product of:
              0.30795276 = queryWeight, product of:
                1.6492051 = boost
                9.097941 = idf(docFreq=12, maxDocs=42740)
                0.020524202 = queryNorm
              3.9803493 = fieldWeight in 1144, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.097941 = idf(docFreq=12, maxDocs=42740)
                0.4375 = fieldNorm(doc=1144)
          0.110376865 = weight(abstract_txt:full in 1144) [ClassicSimilarity], result of:
            0.110376865 = score(doc=1144,freq=1.0), product of:
              0.17968892 = queryWeight, product of:
                1.7815921 = boost
                4.914131 = idf(docFreq=852, maxDocs=42740)
                0.020524202 = queryNorm
              0.6142664 = fieldWeight in 1144, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.914131 = idf(docFreq=852, maxDocs=42740)
                0.125 = fieldNorm(doc=1144)
        0.12 = coord(3/25)
    
  5. Evans, R.: Beyond Boolean : relevance ranking, natural language and the new search paradigm (1994) 0.15
    0.15159172 = sum of:
      0.15159172 = product of:
        0.4210881 = sum of:
          0.017829893 = weight(abstract_txt:from in 578) [ClassicSimilarity], result of:
            0.017829893 = score(doc=578,freq=2.0), product of:
              0.057867993 = queryWeight, product of:
                1.0110365 = boost
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.020524202 = queryNorm
              0.30811322 = fieldWeight in 578, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7887225 = idf(docFreq=7144, maxDocs=42740)
                0.078125 = fieldNorm(doc=578)
          0.07746707 = weight(abstract_txt:recall in 578) [ClassicSimilarity], result of:
            0.07746707 = score(doc=578,freq=2.0), product of:
              0.12229461 = queryWeight, product of:
                1.0392889 = boost
                5.733301 = idf(docFreq=375, maxDocs=42740)
                0.020524202 = queryNorm
              0.6334463 = fieldWeight in 578, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.733301 = idf(docFreq=375, maxDocs=42740)
                0.078125 = fieldNorm(doc=578)
          0.04595557 = weight(abstract_txt:users in 578) [ClassicSimilarity], result of:
            0.04595557 = score(doc=578,freq=3.0), product of:
              0.0950315 = queryWeight, product of:
                1.2956313 = boost
                3.5737147 = idf(docFreq=3258, maxDocs=42740)
                0.020524202 = queryNorm
              0.48358247 = fieldWeight in 578, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5737147 = idf(docFreq=3258, maxDocs=42740)
                0.078125 = fieldNorm(doc=578)
          0.038619086 = weight(abstract_txt:text in 578) [ClassicSimilarity], result of:
            0.038619086 = score(doc=578,freq=1.0), product of:
              0.12205359 = queryWeight, product of:
                1.4683274 = boost
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.020524202 = queryNorm
              0.3164109 = fieldWeight in 578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0500593 = idf(docFreq=2023, maxDocs=42740)
                0.078125 = fieldNorm(doc=578)
          0.038903862 = weight(abstract_txt:terms in 578) [ClassicSimilarity], result of:
            0.038903862 = score(doc=578,freq=1.0), product of:
              0.12265287 = queryWeight, product of:
                1.4719278 = boost
                4.05999 = idf(docFreq=2003, maxDocs=42740)
                0.020524202 = queryNorm
              0.3171867 = fieldWeight in 578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.05999 = idf(docFreq=2003, maxDocs=42740)
                0.078125 = fieldNorm(doc=578)
          0.040518228 = weight(abstract_txt:documents in 578) [ClassicSimilarity], result of:
            0.040518228 = score(doc=578,freq=1.0), product of:
              0.12602292 = queryWeight, product of:
                1.4920123 = boost
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.020524202 = queryNorm
              0.32151476 = fieldWeight in 578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.115389 = idf(docFreq=1895, maxDocs=42740)
                0.078125 = fieldNorm(doc=578)
          0.045013286 = weight(abstract_txt:database in 578) [ClassicSimilarity], result of:
            0.045013286 = score(doc=578,freq=1.0), product of:
              0.13517916 = queryWeight, product of:
                1.5452633 = boost
                4.26227 = idf(docFreq=1636, maxDocs=42740)
                0.020524202 = queryNorm
              0.33298984 = fieldWeight in 578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.26227 = idf(docFreq=1636, maxDocs=42740)
                0.078125 = fieldNorm(doc=578)
          0.047795568 = weight(abstract_txt:software in 578) [ClassicSimilarity], result of:
            0.047795568 = score(doc=578,freq=1.0), product of:
              0.1406936 = queryWeight, product of:
                1.5764667 = boost
                4.3483377 = idf(docFreq=1501, maxDocs=42740)
                0.020524202 = queryNorm
              0.33971387 = fieldWeight in 578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3483377 = idf(docFreq=1501, maxDocs=42740)
                0.078125 = fieldNorm(doc=578)
          0.06898554 = weight(abstract_txt:full in 578) [ClassicSimilarity], result of:
            0.06898554 = score(doc=578,freq=1.0), product of:
              0.17968892 = queryWeight, product of:
                1.7815921 = boost
                4.914131 = idf(docFreq=852, maxDocs=42740)
                0.020524202 = queryNorm
              0.3839165 = fieldWeight in 578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.914131 = idf(docFreq=852, maxDocs=42740)
                0.078125 = fieldNorm(doc=578)
        0.36 = coord(9/25)