Document (#41123)

Author
Herceg, P.M.
Allison, T.B.
Belvin, R.S.
Tzoukermann, E.
Title
Collaborative exploratory search for information filtering and large-scale information triage
Source
Journal of the Association for Information Science and Technology. 69(2018) no.3, S.395-409
Year
2018
Abstract
Modern information seekers face dynamic streams of large-scale heterogeneous data that are both intimidating and overwhelming. They need a strategy to filter this barrage of massive data sets, and to find all of the information responding to their information needs, despite the pressures imposed by schedules and budgets. In this applied research, we present an exploratory search strategy that allows professional information seekers to efficiently and effectively triage all of the data. We demonstrate that exploratory search is particularly useful for information filtering and large-scale information triage, regardless of the language of the data, and regardless of the particular industry, whether finance, medical, business, government, information technology, news, or legal. Our strategy reduces a dauntingly large volume of information into a manageable, high-precision data set, suitable for focused reading. This strategy is interdisciplinary, integrating concepts from information filtering, information triage, and exploratory search. Key aspects include advanced search software, interdisciplinary paired search, asynchronous collaborative search, attention to linguistic phenomena, and aggregated search results in the form of a search matrix or search grid. We present the positive results of a task-oriented evaluation in a real-world setting, discuss these results from a qualitative perspective, and share future research areas.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23961/full.

Similar documents (content)

  1. Lee, J.H.; Cho, H.; Kim, Y.-S.: Users' music information needs and behaviors : design implications for music information retrieval systems (2016) 0.18
    0.17580362 = sum of:
      0.17580362 = product of:
        0.5493863 = sum of:
          0.02899833 = weight(abstract_txt:present in 3006) [ClassicSimilarity], result of:
            0.02899833 = score(doc=3006,freq=1.0), product of:
              0.08534915 = queryWeight, product of:
                1.0928067 = boost
                4.348943 = idf(docFreq=1552, maxDocs=44218)
                0.017958585 = queryNorm
              0.3397612 = fieldWeight in 3006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.348943 = idf(docFreq=1552, maxDocs=44218)
                0.078125 = fieldNorm(doc=3006)
          0.022333523 = weight(abstract_txt:results in 3006) [ClassicSimilarity], result of:
            0.022333523 = score(doc=3006,freq=1.0), product of:
              0.082089156 = queryWeight, product of:
                1.3125997 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.017958585 = queryNorm
              0.27206424 = fieldWeight in 3006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.078125 = fieldNorm(doc=3006)
          0.071664505 = weight(abstract_txt:collaborative in 3006) [ClassicSimilarity], result of:
            0.071664505 = score(doc=3006,freq=1.0), product of:
              0.15601031 = queryWeight, product of:
                1.4774759 = boost
                5.879776 = idf(docFreq=335, maxDocs=44218)
                0.017958585 = queryNorm
              0.4593575 = fieldWeight in 3006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.879776 = idf(docFreq=335, maxDocs=44218)
                0.078125 = fieldNorm(doc=3006)
          0.08685084 = weight(abstract_txt:scale in 3006) [ClassicSimilarity], result of:
            0.08685084 = score(doc=3006,freq=1.0), product of:
              0.20300044 = queryWeight, product of:
                2.064134 = boost
                5.476297 = idf(docFreq=502, maxDocs=44218)
                0.017958585 = queryNorm
              0.4278357 = fieldWeight in 3006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.476297 = idf(docFreq=502, maxDocs=44218)
                0.078125 = fieldNorm(doc=3006)
          0.03273224 = weight(abstract_txt:data in 3006) [ClassicSimilarity], result of:
            0.03273224 = score(doc=3006,freq=1.0), product of:
              0.12557824 = queryWeight, product of:
                2.095902 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.017958585 = queryNorm
              0.26065218 = fieldWeight in 3006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=3006)
          0.0623058 = weight(abstract_txt:large in 3006) [ClassicSimilarity], result of:
            0.0623058 = score(doc=3006,freq=1.0), product of:
              0.17905214 = queryWeight, product of:
                2.2384558 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.017958585 = queryNorm
              0.34797573 = fieldWeight in 3006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.078125 = fieldNorm(doc=3006)
          0.19251473 = weight(abstract_txt:exploratory in 3006) [ClassicSimilarity], result of:
            0.19251473 = score(doc=3006,freq=1.0), product of:
              0.3798422 = queryWeight, product of:
                3.260321 = boost
                6.487401 = idf(docFreq=182, maxDocs=44218)
                0.017958585 = queryNorm
              0.5068282 = fieldWeight in 3006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.487401 = idf(docFreq=182, maxDocs=44218)
                0.078125 = fieldNorm(doc=3006)
          0.05198633 = weight(abstract_txt:information in 3006) [ClassicSimilarity], result of:
            0.05198633 = score(doc=3006,freq=3.0), product of:
              0.15869138 = queryWeight, product of:
                3.6500268 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.017958585 = queryNorm
              0.32759392 = fieldWeight in 3006, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=3006)
        0.32 = coord(8/25)
    
  2. Brin, S.; Page, L.: ¬The anatomy of a large-scale hypertextual Web search engine (1998) 0.16
    0.16171275 = sum of:
      0.16171275 = product of:
        0.5775455 = sum of:
          0.046397325 = weight(abstract_txt:present in 947) [ClassicSimilarity], result of:
            0.046397325 = score(doc=947,freq=4.0), product of:
              0.08534915 = queryWeight, product of:
                1.0928067 = boost
                4.348943 = idf(docFreq=1552, maxDocs=44218)
                0.017958585 = queryNorm
              0.5436179 = fieldWeight in 947, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.348943 = idf(docFreq=1552, maxDocs=44218)
                0.0625 = fieldNorm(doc=947)
          0.025267497 = weight(abstract_txt:results in 947) [ClassicSimilarity], result of:
            0.025267497 = score(doc=947,freq=2.0), product of:
              0.082089156 = queryWeight, product of:
                1.3125997 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.017958585 = queryNorm
              0.30780554 = fieldWeight in 947, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=947)
          0.13896133 = weight(abstract_txt:scale in 947) [ClassicSimilarity], result of:
            0.13896133 = score(doc=947,freq=4.0), product of:
              0.20300044 = queryWeight, product of:
                2.064134 = boost
                5.476297 = idf(docFreq=502, maxDocs=44218)
                0.017958585 = queryNorm
              0.6845371 = fieldWeight in 947, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.476297 = idf(docFreq=502, maxDocs=44218)
                0.0625 = fieldNorm(doc=947)
          0.026185792 = weight(abstract_txt:data in 947) [ClassicSimilarity], result of:
            0.026185792 = score(doc=947,freq=1.0), product of:
              0.12557824 = queryWeight, product of:
                2.095902 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.017958585 = queryNorm
              0.20852174 = fieldWeight in 947, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=947)
          0.099689275 = weight(abstract_txt:large in 947) [ClassicSimilarity], result of:
            0.099689275 = score(doc=947,freq=4.0), product of:
              0.17905214 = queryWeight, product of:
                2.2384558 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.017958585 = queryNorm
              0.55676115 = fieldWeight in 947, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0625 = fieldNorm(doc=947)
          0.03395733 = weight(abstract_txt:information in 947) [ClassicSimilarity], result of:
            0.03395733 = score(doc=947,freq=2.0), product of:
              0.15869138 = queryWeight, product of:
                3.6500268 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.017958585 = queryNorm
              0.21398345 = fieldWeight in 947, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=947)
          0.20708698 = weight(abstract_txt:search in 947) [ClassicSimilarity], result of:
            0.20708698 = score(doc=947,freq=9.0), product of:
              0.30192676 = queryWeight, product of:
                4.595995 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.017958585 = queryNorm
              0.68588483 = fieldWeight in 947, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=947)
        0.28 = coord(7/25)
    
  3. Mukhopadhyay, S.; Peng, S.; Raje, R.; Mostafa, J.; Palakal, M.: Distributed multi-agent information filtering : a comparative study (2005) 0.15
    0.15448807 = sum of:
      0.15448807 = product of:
        0.7724403 = sum of:
          0.17370167 = weight(abstract_txt:scale in 3559) [ClassicSimilarity], result of:
            0.17370167 = score(doc=3559,freq=4.0), product of:
              0.20300044 = queryWeight, product of:
                2.064134 = boost
                5.476297 = idf(docFreq=502, maxDocs=44218)
                0.017958585 = queryNorm
              0.8556714 = fieldWeight in 3559, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.476297 = idf(docFreq=502, maxDocs=44218)
                0.078125 = fieldNorm(doc=3559)
          0.03273224 = weight(abstract_txt:data in 3559) [ClassicSimilarity], result of:
            0.03273224 = score(doc=3559,freq=1.0), product of:
              0.12557824 = queryWeight, product of:
                2.095902 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.017958585 = queryNorm
              0.26065218 = fieldWeight in 3559, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=3559)
          0.1246116 = weight(abstract_txt:large in 3559) [ClassicSimilarity], result of:
            0.1246116 = score(doc=3559,freq=4.0), product of:
              0.17905214 = queryWeight, product of:
                2.2384558 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.017958585 = queryNorm
              0.69595146 = fieldWeight in 3559, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.078125 = fieldNorm(doc=3559)
          0.3619844 = weight(abstract_txt:filtering in 3559) [ClassicSimilarity], result of:
            0.3619844 = score(doc=3559,freq=6.0), product of:
              0.28932798 = queryWeight, product of:
                2.4642491 = boost
                6.537832 = idf(docFreq=173, maxDocs=44218)
                0.017958585 = queryNorm
              1.2511213 = fieldWeight in 3559, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.537832 = idf(docFreq=173, maxDocs=44218)
                0.078125 = fieldNorm(doc=3559)
          0.07941043 = weight(abstract_txt:information in 3559) [ClassicSimilarity], result of:
            0.07941043 = score(doc=3559,freq=7.0), product of:
              0.15869138 = queryWeight, product of:
                3.6500268 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.017958585 = queryNorm
              0.50040793 = fieldWeight in 3559, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=3559)
        0.2 = coord(5/25)
    
  4. Mirizzi, R.: Exploratory browsing in the Web of Data (2011) 0.15
    0.15086885 = sum of:
      0.15086885 = product of:
        0.5388173 = sum of:
          0.015792185 = weight(abstract_txt:results in 4803) [ClassicSimilarity], result of:
            0.015792185 = score(doc=4803,freq=2.0), product of:
              0.082089156 = queryWeight, product of:
                1.3125997 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.017958585 = queryNorm
              0.19237846 = fieldWeight in 4803, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4803)
          0.043300685 = weight(abstract_txt:data in 4803) [ClassicSimilarity], result of:
            0.043300685 = score(doc=4803,freq=7.0), product of:
              0.12557824 = queryWeight, product of:
                2.095902 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.017958585 = queryNorm
              0.34481043 = fieldWeight in 4803, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4803)
          0.0311529 = weight(abstract_txt:large in 4803) [ClassicSimilarity], result of:
            0.0311529 = score(doc=4803,freq=1.0), product of:
              0.17905214 = queryWeight, product of:
                2.2384558 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.017958585 = queryNorm
              0.17398787 = fieldWeight in 4803, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4803)
          0.073889755 = weight(abstract_txt:filtering in 4803) [ClassicSimilarity], result of:
            0.073889755 = score(doc=4803,freq=1.0), product of:
              0.28932798 = queryWeight, product of:
                2.4642491 = boost
                6.537832 = idf(docFreq=173, maxDocs=44218)
                0.017958585 = queryNorm
              0.25538406 = fieldWeight in 4803, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.537832 = idf(docFreq=173, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4803)
          0.21523802 = weight(abstract_txt:exploratory in 4803) [ClassicSimilarity], result of:
            0.21523802 = score(doc=4803,freq=5.0), product of:
              0.3798422 = queryWeight, product of:
                3.260321 = boost
                6.487401 = idf(docFreq=182, maxDocs=44218)
                0.017958585 = queryNorm
              0.56665117 = fieldWeight in 4803, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.487401 = idf(docFreq=182, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4803)
          0.030014321 = weight(abstract_txt:information in 4803) [ClassicSimilarity], result of:
            0.030014321 = score(doc=4803,freq=4.0), product of:
              0.15869138 = queryWeight, product of:
                3.6500268 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.017958585 = queryNorm
              0.18913643 = fieldWeight in 4803, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4803)
          0.12942937 = weight(abstract_txt:search in 4803) [ClassicSimilarity], result of:
            0.12942937 = score(doc=4803,freq=9.0), product of:
              0.30192676 = queryWeight, product of:
                4.595995 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.017958585 = queryNorm
              0.42867804 = fieldWeight in 4803, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0390625 = fieldNorm(doc=4803)
        0.28 = coord(7/25)
    
  5. Orso, V.; Ruotsalo, T.; Leino, J.; Gamberini, L.; Jacucci, G.: Overlaying social information : the effects on users' search and information-selection behavior (2017) 0.15
    0.14702477 = sum of:
      0.14702477 = product of:
        0.5250884 = sum of:
          0.02029883 = weight(abstract_txt:present in 5097) [ClassicSimilarity], result of:
            0.02029883 = score(doc=5097,freq=1.0), product of:
              0.08534915 = queryWeight, product of:
                1.0928067 = boost
                4.348943 = idf(docFreq=1552, maxDocs=44218)
                0.017958585 = queryNorm
              0.23783283 = fieldWeight in 5097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.348943 = idf(docFreq=1552, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5097)
          0.015633466 = weight(abstract_txt:results in 5097) [ClassicSimilarity], result of:
            0.015633466 = score(doc=5097,freq=1.0), product of:
              0.082089156 = queryWeight, product of:
                1.3125997 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.017958585 = queryNorm
              0.19044496 = fieldWeight in 5097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5097)
          0.050165154 = weight(abstract_txt:collaborative in 5097) [ClassicSimilarity], result of:
            0.050165154 = score(doc=5097,freq=1.0), product of:
              0.15601031 = queryWeight, product of:
                1.4774759 = boost
                5.879776 = idf(docFreq=335, maxDocs=44218)
                0.017958585 = queryNorm
              0.32155025 = fieldWeight in 5097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.879776 = idf(docFreq=335, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5097)
          0.03968573 = weight(abstract_txt:data in 5097) [ClassicSimilarity], result of:
            0.03968573 = score(doc=5097,freq=3.0), product of:
              0.12557824 = queryWeight, product of:
                2.095902 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.017958585 = queryNorm
              0.31602395 = fieldWeight in 5097, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5097)
          0.10344566 = weight(abstract_txt:filtering in 5097) [ClassicSimilarity], result of:
            0.10344566 = score(doc=5097,freq=1.0), product of:
              0.28932798 = queryWeight, product of:
                2.4642491 = boost
                6.537832 = idf(docFreq=173, maxDocs=44218)
                0.017958585 = queryNorm
              0.3575377 = fieldWeight in 5097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.537832 = idf(docFreq=173, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5097)
          0.086626545 = weight(abstract_txt:information in 5097) [ClassicSimilarity], result of:
            0.086626545 = score(doc=5097,freq=17.0), product of:
              0.15869138 = queryWeight, product of:
                3.6500268 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.017958585 = queryNorm
              0.5458806 = fieldWeight in 5097, product of:
                4.1231055 = tf(freq=17.0), with freq of:
                  17.0 = termFreq=17.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5097)
          0.20923302 = weight(abstract_txt:search in 5097) [ClassicSimilarity], result of:
            0.20923302 = score(doc=5097,freq=12.0), product of:
              0.30192676 = queryWeight, product of:
                4.595995 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.017958585 = queryNorm
              0.6929926 = fieldWeight in 5097, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5097)
        0.28 = coord(7/25)