Document (#34459)

Author
MacFarlane, A.
Tuson, A.
Title
Local search : a guide for the information retrieval practitioner
Source
Information processing and management. 45(2009) no.1, S.159-174
Year
2009
Abstract
There are a number of combinatorial optimisation problems in information retrieval in which the use of local search methods are worthwhile. The purpose of this paper is to show how local search can be used to solve some well known tasks in information retrieval (IR), how previous research in the field is piecemeal, bereft of a structure and methodologically flawed, and to suggest more rigorous ways of applying local search methods to solve IR problems. We provide a query based taxonomy for analysing the use of local search in IR tasks and an overview of issues such as fitness functions, statistical significance and test collections when conducting experiments on combinatorial optimisation problems. The paper gives a guide on the pitfalls and problems for IR practitioners who wish to use local search to solve their research issues, and gives practical advice on the use of such methods. The query based taxonomy is a novel structure which can be used by the IR practitioner in order to examine the use of local search in IR.

Similar documents (author)

  1. MacFarlane, A.: On open source IR (2003) 5.48
    5.480715 = sum of:
      5.480715 = weight(author_txt:macfarlane in 3011) [ClassicSimilarity], result of:
        5.480715 = fieldWeight in 3011, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.769144 = idf(docFreq=17, maxDocs=42596)
          0.625 = fieldNorm(doc=3011)
    
  2. MacFarlane, A.: Evaluation of web search for the information practitioner (2007) 5.48
    5.480715 = sum of:
      5.480715 = weight(author_txt:macfarlane in 1997) [ClassicSimilarity], result of:
        5.480715 = fieldWeight in 1997, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.769144 = idf(docFreq=17, maxDocs=42596)
          0.625 = fieldNorm(doc=1997)
    
  3. MacFarlane, A.: Knowledge organisation and its role in multimedia information retrieval (2016) 5.48
    5.480715 = sum of:
      5.480715 = weight(author_txt:macfarlane in 3912) [ClassicSimilarity], result of:
        5.480715 = fieldWeight in 3912, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.769144 = idf(docFreq=17, maxDocs=42596)
          0.625 = fieldNorm(doc=3912)
    
  4. MacFarlane, A.; Robertson, S.E.; McCann, J.A.: Parallel computing for passage retrieval (2004) 3.29
    3.288429 = sum of:
      3.288429 = weight(author_txt:macfarlane in 5177) [ClassicSimilarity], result of:
        3.288429 = fieldWeight in 5177, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.769144 = idf(docFreq=17, maxDocs=42596)
          0.375 = fieldNorm(doc=5177)
    
  5. MacFarlane, A.; Robertson, S.E.; McCann, J.A.: Parallel computing in information retrieval : an updated review (1997) 3.29
    3.288429 = sum of:
      3.288429 = weight(author_txt:macfarlane in 520) [ClassicSimilarity], result of:
        3.288429 = fieldWeight in 520, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.769144 = idf(docFreq=17, maxDocs=42596)
          0.375 = fieldNorm(doc=520)
    

Similar documents (content)

  1. Wollf, J.G.: ¬A scalable technique for best-match retrieval of sequential information using metrics-guided search (1994) 0.09
    0.08832331 = sum of:
      0.08832331 = product of:
        0.3680138 = sum of:
          0.04264377 = weight(abstract_txt:query in 6335) [ClassicSimilarity], result of:
            0.04264377 = score(doc=6335,freq=1.0), product of:
              0.09608672 = queryWeight, product of:
                1.2987316 = boost
                4.7339206 = idf(docFreq=1017, maxDocs=42596)
                0.015628705 = queryNorm
              0.44380504 = fieldWeight in 6335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7339206 = idf(docFreq=1017, maxDocs=42596)
                0.09375 = fieldNorm(doc=6335)
          0.025036141 = weight(abstract_txt:retrieval in 6335) [ClassicSimilarity], result of:
            0.025036141 = score(doc=6335,freq=1.0), product of:
              0.07712023 = queryWeight, product of:
                1.4250087 = boost
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.015628705 = queryNorm
              0.3246378 = fieldWeight in 6335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.09375 = fieldNorm(doc=6335)
          0.06055545 = weight(abstract_txt:gives in 6335) [ClassicSimilarity], result of:
            0.06055545 = score(doc=6335,freq=1.0), product of:
              0.121393405 = queryWeight, product of:
                1.4597728 = boost
                5.3209214 = idf(docFreq=565, maxDocs=42596)
                0.015628705 = queryNorm
              0.4988364 = fieldWeight in 6335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3209214 = idf(docFreq=565, maxDocs=42596)
                0.09375 = fieldNorm(doc=6335)
          0.07724387 = weight(abstract_txt:guide in 6335) [ClassicSimilarity], result of:
            0.07724387 = score(doc=6335,freq=1.0), product of:
              0.14278053 = queryWeight, product of:
                1.5831506 = boost
                5.7706375 = idf(docFreq=360, maxDocs=42596)
                0.015628705 = queryNorm
              0.54099727 = fieldWeight in 6335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7706375 = idf(docFreq=360, maxDocs=42596)
                0.09375 = fieldNorm(doc=6335)
          0.043830216 = weight(abstract_txt:methods in 6335) [ClassicSimilarity], result of:
            0.043830216 = score(doc=6335,freq=1.0), product of:
              0.11202264 = queryWeight, product of:
                1.7174585 = boost
                4.173463 = idf(docFreq=1782, maxDocs=42596)
                0.015628705 = queryNorm
              0.39126214 = fieldWeight in 6335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.173463 = idf(docFreq=1782, maxDocs=42596)
                0.09375 = fieldNorm(doc=6335)
          0.11870435 = weight(abstract_txt:search in 6335) [ClassicSimilarity], result of:
            0.11870435 = score(doc=6335,freq=3.0), product of:
              0.20016415 = queryWeight, product of:
                3.5068314 = boost
                3.6521485 = idf(docFreq=3002, maxDocs=42596)
                0.015628705 = queryNorm
              0.593035 = fieldWeight in 6335, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6521485 = idf(docFreq=3002, maxDocs=42596)
                0.09375 = fieldNorm(doc=6335)
        0.24 = coord(6/25)
    
  2. Bjorner, S.: DIALOG's RANK command for rank and file searchers (1993) 0.09
    0.0866421 = sum of:
      0.0866421 = product of:
        0.54151314 = sum of:
          0.05844029 = weight(abstract_txt:methods in 6267) [ClassicSimilarity], result of:
            0.05844029 = score(doc=6267,freq=1.0), product of:
              0.11202264 = queryWeight, product of:
                1.7174585 = boost
                4.173463 = idf(docFreq=1782, maxDocs=42596)
                0.015628705 = queryNorm
              0.52168286 = fieldWeight in 6267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.173463 = idf(docFreq=1782, maxDocs=42596)
                0.125 = fieldNorm(doc=6267)
          0.084776156 = weight(abstract_txt:problems in 6267) [ClassicSimilarity], result of:
            0.084776156 = score(doc=6267,freq=1.0), product of:
              0.15800092 = queryWeight, product of:
                2.3552256 = boost
                4.2924385 = idf(docFreq=1582, maxDocs=42596)
                0.015628705 = queryNorm
              0.5365548 = fieldWeight in 6267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2924385 = idf(docFreq=1582, maxDocs=42596)
                0.125 = fieldNorm(doc=6267)
          0.12922893 = weight(abstract_txt:search in 6267) [ClassicSimilarity], result of:
            0.12922893 = score(doc=6267,freq=2.0), product of:
              0.20016415 = queryWeight, product of:
                3.5068314 = boost
                3.6521485 = idf(docFreq=3002, maxDocs=42596)
                0.015628705 = queryNorm
              0.64561474 = fieldWeight in 6267, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6521485 = idf(docFreq=3002, maxDocs=42596)
                0.125 = fieldNorm(doc=6267)
          0.26906776 = weight(abstract_txt:local in 6267) [ClassicSimilarity], result of:
            0.26906776 = score(doc=6267,freq=1.0), product of:
              0.41121057 = queryWeight, product of:
                5.0263634 = boost
                5.234647 = idf(docFreq=616, maxDocs=42596)
                0.015628705 = queryNorm
              0.65433085 = fieldWeight in 6267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.234647 = idf(docFreq=616, maxDocs=42596)
                0.125 = fieldNorm(doc=6267)
        0.16 = coord(4/25)
    
  3. Glassco, R.A.: Evaluating commercial text search-and-retrieval packages (1993) 0.09
    0.08644896 = sum of:
      0.08644896 = product of:
        0.43224478 = sum of:
          0.05685836 = weight(abstract_txt:query in 7414) [ClassicSimilarity], result of:
            0.05685836 = score(doc=7414,freq=1.0), product of:
              0.09608672 = queryWeight, product of:
                1.2987316 = boost
                4.7339206 = idf(docFreq=1017, maxDocs=42596)
                0.015628705 = queryNorm
              0.5917401 = fieldWeight in 7414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7339206 = idf(docFreq=1017, maxDocs=42596)
                0.125 = fieldNorm(doc=7414)
          0.03338152 = weight(abstract_txt:retrieval in 7414) [ClassicSimilarity], result of:
            0.03338152 = score(doc=7414,freq=1.0), product of:
              0.07712023 = queryWeight, product of:
                1.4250087 = boost
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.015628705 = queryNorm
              0.4328504 = fieldWeight in 7414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4628031 = idf(docFreq=3628, maxDocs=42596)
                0.125 = fieldNorm(doc=7414)
          0.08074059 = weight(abstract_txt:gives in 7414) [ClassicSimilarity], result of:
            0.08074059 = score(doc=7414,freq=1.0), product of:
              0.121393405 = queryWeight, product of:
                1.4597728 = boost
                5.3209214 = idf(docFreq=565, maxDocs=42596)
                0.015628705 = queryNorm
              0.6651152 = fieldWeight in 7414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3209214 = idf(docFreq=565, maxDocs=42596)
                0.125 = fieldNorm(doc=7414)
          0.102991834 = weight(abstract_txt:guide in 7414) [ClassicSimilarity], result of:
            0.102991834 = score(doc=7414,freq=1.0), product of:
              0.14278053 = queryWeight, product of:
                1.5831506 = boost
                5.7706375 = idf(docFreq=360, maxDocs=42596)
                0.015628705 = queryNorm
              0.7213297 = fieldWeight in 7414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7706375 = idf(docFreq=360, maxDocs=42596)
                0.125 = fieldNorm(doc=7414)
          0.15827246 = weight(abstract_txt:search in 7414) [ClassicSimilarity], result of:
            0.15827246 = score(doc=7414,freq=3.0), product of:
              0.20016415 = queryWeight, product of:
                3.5068314 = boost
                3.6521485 = idf(docFreq=3002, maxDocs=42596)
                0.015628705 = queryNorm
              0.7907133 = fieldWeight in 7414, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6521485 = idf(docFreq=3002, maxDocs=42596)
                0.125 = fieldNorm(doc=7414)
        0.2 = coord(5/25)
    
  4. Johnson, P.: Selecting electronic resources : developing a local decision-making matrix (1996) 0.09
    0.085942715 = sum of:
      0.085942715 = product of:
        0.537142 = sum of:
          0.037570912 = weight(abstract_txt:issues in 550) [ClassicSimilarity], result of:
            0.037570912 = score(doc=550,freq=1.0), product of:
              0.07968249 = queryWeight, product of:
                1.1826853 = boost
                4.310928 = idf(docFreq=1553, maxDocs=42596)
                0.015628705 = queryNorm
              0.47150773 = fieldWeight in 550, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.310928 = idf(docFreq=1553, maxDocs=42596)
                0.109375 = fieldNorm(doc=550)
          0.039170828 = weight(abstract_txt:structure in 550) [ClassicSimilarity], result of:
            0.039170828 = score(doc=550,freq=1.0), product of:
              0.081928864 = queryWeight, product of:
                1.1992402 = boost
                4.371271 = idf(docFreq=1462, maxDocs=42596)
                0.015628705 = queryNorm
              0.47810778 = fieldWeight in 550, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.371271 = idf(docFreq=1462, maxDocs=42596)
                0.109375 = fieldNorm(doc=550)
          0.12744589 = weight(abstract_txt:guide in 550) [ClassicSimilarity], result of:
            0.12744589 = score(doc=550,freq=2.0), product of:
              0.14278053 = queryWeight, product of:
                1.5831506 = boost
                5.7706375 = idf(docFreq=360, maxDocs=42596)
                0.015628705 = queryNorm
              0.89259994 = fieldWeight in 550, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7706375 = idf(docFreq=360, maxDocs=42596)
                0.109375 = fieldNorm(doc=550)
          0.33295435 = weight(abstract_txt:local in 550) [ClassicSimilarity], result of:
            0.33295435 = score(doc=550,freq=2.0), product of:
              0.41121057 = queryWeight, product of:
                5.0263634 = boost
                5.234647 = idf(docFreq=616, maxDocs=42596)
                0.015628705 = queryNorm
              0.8096931 = fieldWeight in 550, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.234647 = idf(docFreq=616, maxDocs=42596)
                0.109375 = fieldNorm(doc=550)
        0.16 = coord(4/25)
    
  5. Schwarz, K.: Domain model enhanced search : a comparison of taxonomy, thesaurus and ontology (2005) 0.08
    0.08452893 = sum of:
      0.08452893 = product of:
        0.35220388 = sum of:
          0.016101819 = weight(abstract_txt:issues in 570) [ClassicSimilarity], result of:
            0.016101819 = score(doc=570,freq=1.0), product of:
              0.07968249 = queryWeight, product of:
                1.1826853 = boost
                4.310928 = idf(docFreq=1553, maxDocs=42596)
                0.015628705 = queryNorm
              0.20207474 = fieldWeight in 570, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.310928 = idf(docFreq=1553, maxDocs=42596)
                0.046875 = fieldNorm(doc=570)
          0.030277725 = weight(abstract_txt:gives in 570) [ClassicSimilarity], result of:
            0.030277725 = score(doc=570,freq=1.0), product of:
              0.121393405 = queryWeight, product of:
                1.4597728 = boost
                5.3209214 = idf(docFreq=565, maxDocs=42596)
                0.015628705 = queryNorm
              0.2494182 = fieldWeight in 570, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3209214 = idf(docFreq=565, maxDocs=42596)
                0.046875 = fieldNorm(doc=570)
          0.054347858 = weight(abstract_txt:taxonomy in 570) [ClassicSimilarity], result of:
            0.054347858 = score(doc=570,freq=1.0), product of:
              0.17929488 = queryWeight, product of:
                1.7740737 = boost
                6.466559 = idf(docFreq=179, maxDocs=42596)
                0.015628705 = queryNorm
              0.30311996 = fieldWeight in 570, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.466559 = idf(docFreq=179, maxDocs=42596)
                0.046875 = fieldNorm(doc=570)
          0.063582115 = weight(abstract_txt:problems in 570) [ClassicSimilarity], result of:
            0.063582115 = score(doc=570,freq=4.0), product of:
              0.15800092 = queryWeight, product of:
                2.3552256 = boost
                4.2924385 = idf(docFreq=1582, maxDocs=42596)
                0.015628705 = queryNorm
              0.4024161 = fieldWeight in 570, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.2924385 = idf(docFreq=1582, maxDocs=42596)
                0.046875 = fieldNorm(doc=570)
          0.08509338 = weight(abstract_txt:solve in 570) [ClassicSimilarity], result of:
            0.08509338 = score(doc=570,freq=1.0), product of:
              0.27674124 = queryWeight, product of:
                2.699419 = boost
                6.559649 = idf(docFreq=163, maxDocs=42596)
                0.015628705 = queryNorm
              0.30748355 = fieldWeight in 570, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.559649 = idf(docFreq=163, maxDocs=42596)
                0.046875 = fieldNorm(doc=570)
          0.10280099 = weight(abstract_txt:search in 570) [ClassicSimilarity], result of:
            0.10280099 = score(doc=570,freq=9.0), product of:
              0.20016415 = queryWeight, product of:
                3.5068314 = boost
                3.6521485 = idf(docFreq=3002, maxDocs=42596)
                0.015628705 = queryNorm
              0.5135834 = fieldWeight in 570, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.6521485 = idf(docFreq=3002, maxDocs=42596)
                0.046875 = fieldNorm(doc=570)
        0.24 = coord(6/25)