Document (#30049)

Author
Crestani, F.
Du, H.
Title
Written versus spoken queries : a qualitative and quantitative comparative analysis
Source
Journal of the American Society for Information Science and Technology. 57(2006) no.7, p.881-890
Year
2006
Abstract
The authors report on an experimental study on the differences between spoken and written queries. A set of written and spontaneous spoken queries are generated by users from written topics. These two sets of queries are compared in qualitative terms and in terms of their retrieval effectiveness. Written and spoken queries are compared in terms of length, duration, and part of speech. In addition, assuming perfect transcription of the spoken queries, written and spoken queries are compared in terms of their aptitude to describe relevant documents. The retrieval effectiveness of spoken and written queries is compared using three different information retrieval models. The results show that using speech to formulate one's information need provides a way to express it more naturally and encourages the formulation of longer queries. Despite that, longer spoken queries do not seem to significantly improve retrieval effectiveness compared with written queries.
Theme
Search tactics

Similar documents (author)

  1. Crestani, F.: Combination of similarity measures for effective spoken document retrieval (2003) 5.45
    5.4490323 = sum of:
      5.4490323 = weight(author_txt:crestani in 691) [ClassicSimilarity], result of:
        5.4490323 = fieldWeight in 691, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.7184515 = idf(docFreq=18, maxDocs=42740)
          0.625 = fieldNorm(doc=691)
    
  2. Crestani, F.; Lee, P.L.: Searching the web by constraining spreading activities (2000) 4.36
    4.3592257 = sum of:
      4.3592257 = weight(author_txt:crestani in 1395) [ClassicSimilarity], result of:
        4.3592257 = fieldWeight in 1395, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.7184515 = idf(docFreq=18, maxDocs=42740)
          0.5 = fieldNorm(doc=1395)
    
  3. Tombros, T.; Crestani, F.: Users' perception of relevance of spoken documents (2000) 4.36
    4.3592257 = sum of:
      4.3592257 = weight(author_txt:crestani in 5997) [ClassicSimilarity], result of:
        4.3592257 = fieldWeight in 5997, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.7184515 = idf(docFreq=18, maxDocs=42740)
          0.5 = fieldNorm(doc=5997)
    
  4. Crestani, F.; Wu, S.: Testing the cluster hypothesis in distributed information retrieval (2006) 4.36
    4.3592257 = sum of:
      4.3592257 = weight(author_txt:crestani in 2985) [ClassicSimilarity], result of:
        4.3592257 = fieldWeight in 2985, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.7184515 = idf(docFreq=18, maxDocs=42740)
          0.5 = fieldNorm(doc=2985)
    
  5. Crestani, F.; Rijsbergen, C.J. van: Information retrieval by logical imaging (1995) 3.81
    3.8143225 = sum of:
      3.8143225 = weight(author_txt:crestani in 1828) [ClassicSimilarity], result of:
        3.8143225 = fieldWeight in 1828, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.7184515 = idf(docFreq=18, maxDocs=42740)
          0.4375 = fieldNorm(doc=1828)
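
The author scores above each reduce to a single fieldWeight, the product of tf, idf, and fieldNorm. The sketch below recomputes the first hit, assuming the ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are illustrative and are not part of the record. The fieldNorm value is copied from the listing, since it is a stored, lossily encoded length norm that cannot be derived from the figures shown here.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as defined by Lucene's ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term frequency factor: square root of the raw term count."""
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain tree above."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values taken from hit 1 (doc 691) in the author listing.
top_hit = field_weight(freq=1.0, doc_freq=18, max_docs=42740, field_norm=0.625)
print(f"{top_hit:.4f}")
```

With fieldNorm set to 0.5 or 0.4375 instead, the same function should reproduce the scores of hits 2 to 5, up to the single-precision rounding Lucene applies.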
    

Similar documents (content)

  1. Sparck Jones, K.; Jones, G.J.F.; Foote, J.T.; Young, S.J.: Experiments in spoken document retrieval (1996) 0.19
    0.19008534 = sum of:
      0.19008534 = product of:
        1.1880333 = sum of:
          0.08997473 = weight(abstract_txt:transcription in 2952) [ClassicSimilarity], result of:
            0.08997473 = score(doc=2952,freq=1.0), product of:
              0.10869372 = queryWeight, product of:
                1.3658942 = boost
                8.829678 = idf(docFreq=16, maxDocs=42740)
                0.009012443 = queryNorm
              0.8277823 = fieldWeight in 2952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.829678 = idf(docFreq=16, maxDocs=42740)
                0.09375 = fieldNorm(doc=2952)
          0.122395314 = weight(abstract_txt:speech in 2952) [ClassicSimilarity], result of:
            0.122395314 = score(doc=2952,freq=2.0), product of:
              0.13344447 = queryWeight, product of:
                2.140327 = boost
                6.9179583 = idf(docFreq=114, maxDocs=42740)
                0.009012443 = queryNorm
              0.9172003 = fieldWeight in 2952, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9179583 = idf(docFreq=114, maxDocs=42740)
                0.09375 = fieldNorm(doc=2952)
          0.048625715 = weight(abstract_txt:retrieval in 2952) [ClassicSimilarity], result of:
            0.048625715 = score(doc=2952,freq=5.0), product of:
              0.06694704 = queryWeight, product of:
                2.1439297 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.009012443 = queryNorm
              0.72633106 = fieldWeight in 2952, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.09375 = fieldNorm(doc=2952)
          0.9270376 = weight(abstract_txt:spoken in 2952) [ClassicSimilarity], result of:
            0.9270376 = score(doc=2952,freq=3.0), product of:
              0.7136937 = queryWeight, product of:
                9.899557 = boost
                7.999329 = idf(docFreq=38, maxDocs=42740)
                0.009012443 = queryNorm
              1.2989292 = fieldWeight in 2952, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.999329 = idf(docFreq=38, maxDocs=42740)
                0.09375 = fieldNorm(doc=2952)
        0.16 = coord(4/25)
    
  2. Bacchin, M.; Ferro, N.; Melucci, M.: A probabilistic model for stemmer generation (2005) 0.17
    0.16836639 = sum of:
      0.16836639 = product of:
        0.8418319 = sum of:
          0.025628002 = weight(abstract_txt:retrieval in 3002) [ClassicSimilarity], result of:
            0.025628002 = score(doc=3002,freq=2.0), product of:
              0.06694704 = queryWeight, product of:
                2.1439297 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.009012443 = queryNorm
              0.3828101 = fieldWeight in 3002, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.078125 = fieldNorm(doc=3002)
          0.043793537 = weight(abstract_txt:effectiveness in 3002) [ClassicSimilarity], result of:
            0.043793537 = score(doc=3002,freq=1.0), product of:
              0.109536454 = queryWeight, product of:
                2.3749518 = boost
                5.117541 = idf(docFreq=695, maxDocs=42740)
                0.009012443 = queryNorm
              0.39980787 = fieldWeight in 3002, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.117541 = idf(docFreq=695, maxDocs=42740)
                0.078125 = fieldNorm(doc=3002)
          0.16846497 = weight(abstract_txt:written in 3002) [ClassicSimilarity], result of:
            0.16846497 = score(doc=3002,freq=1.0), product of:
              0.3729191 = queryWeight, product of:
                7.1559477 = boost
                5.7823577 = idf(docFreq=357, maxDocs=42740)
                0.009012443 = queryNorm
              0.4517467 = fieldWeight in 3002, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7823577 = idf(docFreq=357, maxDocs=42740)
                0.078125 = fieldNorm(doc=3002)
          0.1579242 = weight(abstract_txt:queries in 3002) [ClassicSimilarity], result of:
            0.1579242 = score(doc=3002,freq=1.0), product of:
              0.3971991 = queryWeight, product of:
                8.659948 = boost
                5.0892105 = idf(docFreq=715, maxDocs=42740)
                0.009012443 = queryNorm
              0.39759457 = fieldWeight in 3002, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0892105 = idf(docFreq=715, maxDocs=42740)
                0.078125 = fieldNorm(doc=3002)
          0.44602117 = weight(abstract_txt:spoken in 3002) [ClassicSimilarity], result of:
            0.44602117 = score(doc=3002,freq=1.0), product of:
              0.7136937 = queryWeight, product of:
                9.899557 = boost
                7.999329 = idf(docFreq=38, maxDocs=42740)
                0.009012443 = queryNorm
              0.6249476 = fieldWeight in 3002, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.999329 = idf(docFreq=38, maxDocs=42740)
                0.078125 = fieldNorm(doc=3002)
        0.2 = coord(5/25)
    
  3. SARA (SGML Aware Retrieval Application) Workshop, 19th June 1994 (1994) 0.13
    0.1334462 = sum of:
      0.1334462 = product of:
        0.83403873 = sum of:
          0.009176618 = weight(abstract_txt:using in 825) [ClassicSimilarity], result of:
            0.009176618 = score(doc=825,freq=1.0), product of:
              0.033758 = queryWeight, product of:
                1.0765103 = boost
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.009012443 = queryNorm
              0.2718354 = fieldWeight in 825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.078125 = fieldNorm(doc=825)
          0.025628002 = weight(abstract_txt:retrieval in 825) [ClassicSimilarity], result of:
            0.025628002 = score(doc=825,freq=2.0), product of:
              0.06694704 = queryWeight, product of:
                2.1439297 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.009012443 = queryNorm
              0.3828101 = fieldWeight in 825, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.078125 = fieldNorm(doc=825)
          0.16846497 = weight(abstract_txt:written in 825) [ClassicSimilarity], result of:
            0.16846497 = score(doc=825,freq=1.0), product of:
              0.3729191 = queryWeight, product of:
                7.1559477 = boost
                5.7823577 = idf(docFreq=357, maxDocs=42740)
                0.009012443 = queryNorm
              0.4517467 = fieldWeight in 825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7823577 = idf(docFreq=357, maxDocs=42740)
                0.078125 = fieldNorm(doc=825)
          0.63076913 = weight(abstract_txt:spoken in 825) [ClassicSimilarity], result of:
            0.63076913 = score(doc=825,freq=2.0), product of:
              0.7136937 = queryWeight, product of:
                9.899557 = boost
                7.999329 = idf(docFreq=38, maxDocs=42740)
                0.009012443 = queryNorm
              0.8838093 = fieldWeight in 825, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.999329 = idf(docFreq=38, maxDocs=42740)
                0.078125 = fieldNorm(doc=825)
        0.16 = coord(4/25)
    
  4. Pilch, H.: Empirical linguistics (1976) 0.13
    0.12534134 = sum of:
      0.12534134 = product of:
        0.7833834 = sum of:
          0.011011943 = weight(abstract_txt:using in 7860) [ClassicSimilarity], result of:
            0.011011943 = score(doc=7860,freq=1.0), product of:
              0.033758 = queryWeight, product of:
                1.0765103 = boost
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.009012443 = queryNorm
              0.32620248 = fieldWeight in 7860, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.09375 = fieldNorm(doc=7860)
          0.034988143 = weight(abstract_txt:terms in 7860) [ClassicSimilarity], result of:
            0.034988143 = score(doc=7860,freq=1.0), product of:
              0.0919231 = queryWeight, product of:
                2.512217 = boost
                4.05999 = idf(docFreq=2003, maxDocs=42740)
                0.009012443 = queryNorm
              0.38062406 = fieldWeight in 7860, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.05999 = idf(docFreq=2003, maxDocs=42740)
                0.09375 = fieldNorm(doc=7860)
          0.20215796 = weight(abstract_txt:written in 7860) [ClassicSimilarity], result of:
            0.20215796 = score(doc=7860,freq=1.0), product of:
              0.3729191 = queryWeight, product of:
                7.1559477 = boost
                5.7823577 = idf(docFreq=357, maxDocs=42740)
                0.009012443 = queryNorm
              0.542096 = fieldWeight in 7860, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7823577 = idf(docFreq=357, maxDocs=42740)
                0.09375 = fieldNorm(doc=7860)
          0.5352254 = weight(abstract_txt:spoken in 7860) [ClassicSimilarity], result of:
            0.5352254 = score(doc=7860,freq=1.0), product of:
              0.7136937 = queryWeight, product of:
                9.899557 = boost
                7.999329 = idf(docFreq=38, maxDocs=42740)
                0.009012443 = queryNorm
              0.7499371 = fieldWeight in 7860, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.999329 = idf(docFreq=38, maxDocs=42740)
                0.09375 = fieldNorm(doc=7860)
        0.16 = coord(4/25)
    
  5. Srinivasan, P.: Query expansion and MEDLINE (1996) 0.12
    0.12007462 = sum of:
      0.12007462 = product of:
        0.6003731 = sum of:
          0.02225212 = weight(abstract_txt:using in 453) [ClassicSimilarity], result of:
            0.02225212 = score(doc=453,freq=3.0), product of:
              0.033758 = queryWeight, product of:
                1.0765103 = boost
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.009012443 = queryNorm
              0.65916586 = fieldWeight in 453, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4794931 = idf(docFreq=3580, maxDocs=42740)
                0.109375 = fieldNorm(doc=453)
          0.050740857 = weight(abstract_txt:retrieval in 453) [ClassicSimilarity], result of:
            0.050740857 = score(doc=453,freq=4.0), product of:
              0.06694704 = queryWeight, product of:
                2.1439297 = boost
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.009012443 = queryNorm
              0.7579253 = fieldWeight in 453, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4648013 = idf(docFreq=3633, maxDocs=42740)
                0.109375 = fieldNorm(doc=453)
          0.08670678 = weight(abstract_txt:effectiveness in 453) [ClassicSimilarity], result of:
            0.08670678 = score(doc=453,freq=2.0), product of:
              0.109536454 = queryWeight, product of:
                2.3749518 = boost
                5.117541 = idf(docFreq=695, maxDocs=42740)
                0.009012443 = queryNorm
              0.7915792 = fieldWeight in 453, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.117541 = idf(docFreq=695, maxDocs=42740)
                0.109375 = fieldNorm(doc=453)
          0.05772749 = weight(abstract_txt:terms in 453) [ClassicSimilarity], result of:
            0.05772749 = score(doc=453,freq=2.0), product of:
              0.0919231 = queryWeight, product of:
                2.512217 = boost
                4.05999 = idf(docFreq=2003, maxDocs=42740)
                0.009012443 = queryNorm
              0.62799764 = fieldWeight in 453, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.05999 = idf(docFreq=2003, maxDocs=42740)
                0.109375 = fieldNorm(doc=453)
          0.38294584 = weight(abstract_txt:queries in 453) [ClassicSimilarity], result of:
            0.38294584 = score(doc=453,freq=3.0), product of:
              0.3971991 = queryWeight, product of:
                8.659948 = boost
                5.0892105 = idf(docFreq=715, maxDocs=42740)
                0.009012443 = queryNorm
              0.96411556 = fieldWeight in 453, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.0892105 = idf(docFreq=715, maxDocs=42740)
                0.109375 = fieldNorm(doc=453)
        0.2 = coord(5/25)
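
The content scores above combine several matching terms. Each term contributes queryWeight * fieldWeight, where queryWeight = boost * idf * queryNorm and fieldWeight = tf * idf * fieldNorm, and the sum is scaled by the coord factor (matched query terms over total query terms). The following is a minimal sketch that recomputes the first content hit (doc 2952) from the figures in its explain tree. The function and variable names are illustrative assumptions, and small differences from the listed value are expected because Lucene computes in single precision.

```python
import math

MAX_DOCS = 42740          # maxDocs from the explain output
QUERY_NORM = 0.009012443  # queryNorm from the explain output


def classic_idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency as defined by Lucene's ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, boost: float, field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm: float, total_query_terms: int) -> float:
    """Sum the per-term scores and apply coord(matched / total)."""
    raw = sum(term_score(f, df, b, field_norm) for f, df, b in terms)
    coord = len(terms) / total_query_terms
    return raw * coord


# (termFreq, docFreq, boost) for the four terms matched in doc 2952:
# transcription, speech, retrieval, spoken.
doc_2952_terms = [
    (1.0, 16, 1.3658942),
    (2.0, 114, 2.140327),
    (5.0, 3633, 2.1439297),
    (3.0, 38, 9.899557),
]

score = document_score(doc_2952_terms, field_norm=0.09375, total_query_terms=25)
print(f"{score:.4f}")
```

The term "spoken" dominates the result because it pairs a high boost with a high idf, which is why every document in this list matches on it.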