Document (#33873)

Author
Fu, T.
Abbasi, A.
Chen, H.
Title
¬A hybrid approach to Web forum interactional coherence analysis
Source
Journal of the American Society for Information Science and Technology. 59(2008) no.8, pp.1195-1209
Year
2008
Abstract
Despite the rapid growth of text-based computer-mediated communication (CMC), its limitations have rendered the media highly incoherent. This poses problems for content analysis of online discourse archives. Interactional coherence analysis (ICA) attempts to accurately identify and construct CMC interaction networks. In this study, we propose the Hybrid Interactional Coherence (HIC) algorithm for identification of web forum interaction. HIC utilizes a bevy of system and linguistic features, including message header information, quotations, direct address, and lexical relations. Furthermore, several similarity-based methods including a Lexical Match Algorithm (LMA) and a sliding window method are utilized to account for interactional idiosyncrasies. Experimental results on two web forums revealed that the proposed HIC algorithm significantly outperformed comparison techniques in terms of precision, recall, and F-measure at both the forum and thread levels. Additionally, an example was used to illustrate how the improved ICA results can facilitate enhanced social network and role analysis capabilities.
Theme
Internet

Similar documents (author)

  1. Chen, Y.N.; Chen, S.J.: ¬A metadata practice of the IFLA FRBR model : a case study for the National Palace Museum in Taipei (2004) 4.35
    4.3499155 = sum of:
      4.3499155 = weight(author_txt:chen in 3384) [ClassicSimilarity], result of:
        4.3499155 = fieldWeight in 3384, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.1517096 = idf(docFreq=255, maxDocs=44218)
          0.5 = fieldNorm(doc=3384)
    
  2. Chen, C.C.; Chen, H.H.; Chen, K.H.: ¬The design of the XML/Metadata management system (2000) 4.00
    3.9956524 = sum of:
      3.9956524 = weight(author_txt:chen in 4633) [ClassicSimilarity], result of:
        3.9956524 = fieldWeight in 4633, product of:
          1.7320508 = tf(freq=3.0), with freq of:
            3.0 = termFreq=3.0
          6.1517096 = idf(docFreq=255, maxDocs=44218)
          0.375 = fieldNorm(doc=4633)
    
  3. Chen, W.Y.: Observations on cataloguing and classification (1991) 3.84
    3.8448186 = sum of:
      3.8448186 = weight(author_txt:chen in 4184) [ClassicSimilarity], result of:
        3.8448186 = fieldWeight in 4184, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.1517096 = idf(docFreq=255, maxDocs=44218)
          0.625 = fieldNorm(doc=4184)
    
  4. Chen, H.: Knowledge-based document retrieval : framework and design (1992) 3.84
    3.8448186 = sum of:
      3.8448186 = weight(author_txt:chen in 5283) [ClassicSimilarity], result of:
        3.8448186 = fieldWeight in 5283, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.1517096 = idf(docFreq=255, maxDocs=44218)
          0.625 = fieldNorm(doc=5283)
    
  5. Chen, P.S.: On inference rules of logic-based information retrieval systems (1994) 3.84
    3.8448186 = sum of:
      3.8448186 = weight(author_txt:chen in 6731) [ClassicSimilarity], result of:
        3.8448186 = fieldWeight in 6731, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.1517096 = idf(docFreq=255, maxDocs=44218)
          0.625 = fieldNorm(doc=6731)
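
The author scores above can be checked by hand. The sketch below recomputes the fieldWeight lines from the numbers printed in the listing, assuming the usual ClassicSimilarity definitions: tf is the square root of the term frequency and idf is 1 + ln(maxDocs / (docFreq + 1)). The function names are illustrative only and are not part of any library; fieldNorm is copied from the listing rather than derived, and Lucene works in 32-bit floats, so the last digits may differ slightly.

```python
import math

def classic_tf(freq: float) -> float:
    # tf(freq) as shown in the listing: square root of the term frequency
    return math.sqrt(freq)

def classic_idf(doc_freq: int, max_docs: int) -> float:
    # idf(docFreq, maxDocs) = 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm (fieldNorm taken from the listing)
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

MAX_DOCS = 44218
CHEN_DOC_FREQ = 255  # docFreq of "chen" in author_txt, per the listing

# Entry 1 (doc 3384): freq=2.0, fieldNorm=0.5
w_3384 = field_weight(2.0, CHEN_DOC_FREQ, MAX_DOCS, 0.5)
# Entry 2 (doc 4633): freq=3.0, fieldNorm=0.375
w_4633 = field_weight(3.0, CHEN_DOC_FREQ, MAX_DOCS, 0.375)
# Entries 3-5: freq=1.0, fieldNorm=0.625
w_single = field_weight(1.0, CHEN_DOC_FREQ, MAX_DOCS, 0.625)

print(round(w_3384, 2), round(w_4633, 2), round(w_single, 2))
```

The three values should agree with the 4.35, 4.00 and 3.84 shown next to the titles. Entry 1 ranks first even though entry 2 contains "chen" more often, because its larger fieldNorm (a shorter author field) outweighs the smaller tf.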
    

Similar documents (content)

  1. Fu, T.; Abbasi, A.; Chen, H.: ¬A focused crawler for Dark Web forums (2010) 0.08
    0.07827002 = sum of:
      0.07827002 = product of:
        0.3913501 = sum of:
          0.009965435 = weight(abstract_txt:results in 3471) [ClassicSimilarity], result of:
            0.009965435 = score(doc=3471,freq=1.0), product of:
              0.045786224 = queryWeight, product of:
                1.0092373 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0130274715 = queryNorm
              0.21765138 = fieldWeight in 3471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=3471)
          0.12746026 = weight(abstract_txt:forums in 3471) [ClassicSimilarity], result of:
            0.12746026 = score(doc=3471,freq=5.0), product of:
              0.116230614 = queryWeight, product of:
                1.137028 = boost
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.0130274715 = queryNorm
              1.0966152 = fieldWeight in 3471, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.0625 = fieldNorm(doc=3471)
          0.06579666 = weight(abstract_txt:outperformed in 3471) [ClassicSimilarity], result of:
            0.06579666 = score(doc=3471,freq=1.0), product of:
              0.12789784 = queryWeight, product of:
                1.1927309 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.0130274715 = queryNorm
              0.514447 = fieldWeight in 3471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.0625 = fieldNorm(doc=3471)
          0.023015767 = weight(abstract_txt:analysis in 3471) [ClassicSimilarity], result of:
            0.023015767 = score(doc=3471,freq=1.0), product of:
              0.10079314 = queryWeight, product of:
                2.1176605 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0130274715 = queryNorm
              0.22834657 = fieldWeight in 3471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=3471)
          0.16511196 = weight(abstract_txt:forum in 3471) [ClassicSimilarity], result of:
            0.16511196 = score(doc=3471,freq=2.0), product of:
              0.27036062 = queryWeight, product of:
                3.0036077 = boost
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.0130274715 = queryNorm
              0.6107101 = fieldWeight in 3471, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.0625 = fieldNorm(doc=3471)
        0.2 = coord(5/25)
    
  2. Cohan, A.; Young, S.; Yates, A.; Goharian, N.: Triaging content severity in online mental health forums (2017) 0.07
    0.06766468 = sum of:
      0.06766468 = product of:
        0.3383234 = sum of:
          0.098730296 = weight(abstract_txt:forums in 3930) [ClassicSimilarity], result of:
            0.098730296 = score(doc=3930,freq=3.0), product of:
              0.116230614 = queryWeight, product of:
                1.137028 = boost
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.0130274715 = queryNorm
              0.8494345 = fieldWeight in 3930, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.0625 = fieldNorm(doc=3930)
          0.03423011 = weight(abstract_txt:interaction in 3930) [ClassicSimilarity], result of:
            0.03423011 = score(doc=3930,freq=1.0), product of:
              0.104234025 = queryWeight, product of:
                1.522757 = boost
                5.254347 = idf(docFreq=627, maxDocs=44218)
                0.0130274715 = queryNorm
              0.32839668 = fieldWeight in 3930, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.254347 = idf(docFreq=627, maxDocs=44218)
                0.0625 = fieldNorm(doc=3930)
          0.065595455 = weight(abstract_txt:lexical in 3930) [ClassicSimilarity], result of:
            0.065595455 = score(doc=3930,freq=1.0), product of:
              0.16081251 = queryWeight, product of:
                1.8914105 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.0130274715 = queryNorm
              0.4079002 = fieldWeight in 3930, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.0625 = fieldNorm(doc=3930)
          0.023015767 = weight(abstract_txt:analysis in 3930) [ClassicSimilarity], result of:
            0.023015767 = score(doc=3930,freq=1.0), product of:
              0.10079314 = queryWeight, product of:
                2.1176605 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0130274715 = queryNorm
              0.22834657 = fieldWeight in 3930, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=3930)
          0.11675178 = weight(abstract_txt:forum in 3930) [ClassicSimilarity], result of:
            0.11675178 = score(doc=3930,freq=1.0), product of:
              0.27036062 = queryWeight, product of:
                3.0036077 = boost
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.0130274715 = queryNorm
              0.43183723 = fieldWeight in 3930, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.0625 = fieldNorm(doc=3930)
        0.2 = coord(5/25)
    
  3. Bhatia, S.; Biyani, P.; Mitra, P.: Identifying the role of individual user messages in an online discussion and its use in thread retrieval (2016) 0.07
    0.06698189 = sum of:
      0.06698189 = product of:
        0.41863683 = sum of:
          0.057001963 = weight(abstract_txt:forums in 2650) [ClassicSimilarity], result of:
            0.057001963 = score(doc=2650,freq=1.0), product of:
              0.116230614 = queryWeight, product of:
                1.137028 = boost
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.0130274715 = queryNorm
              0.49042124 = fieldWeight in 2650, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.0625 = fieldNorm(doc=2650)
          0.2218673 = weight(abstract_txt:thread in 2650) [ClassicSimilarity], result of:
            0.2218673 = score(doc=2650,freq=7.0), product of:
              0.15034541 = queryWeight, product of:
                1.293171 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.0130274715 = queryNorm
              1.4757171 = fieldWeight in 2650, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.0625 = fieldNorm(doc=2650)
          0.023015767 = weight(abstract_txt:analysis in 2650) [ClassicSimilarity], result of:
            0.023015767 = score(doc=2650,freq=1.0), product of:
              0.10079314 = queryWeight, product of:
                2.1176605 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0130274715 = queryNorm
              0.22834657 = fieldWeight in 2650, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=2650)
          0.11675178 = weight(abstract_txt:forum in 2650) [ClassicSimilarity], result of:
            0.11675178 = score(doc=2650,freq=1.0), product of:
              0.27036062 = queryWeight, product of:
                3.0036077 = boost
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.0130274715 = queryNorm
              0.43183723 = fieldWeight in 2650, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.0625 = fieldNorm(doc=2650)
        0.16 = coord(4/25)
    
  4. Landauer, T.K.; Foltz, P.W.; Laham, D.: ¬An introduction to Latent Semantic Analysis (1998) 0.06
    0.062302716 = sum of:
      0.062302716 = product of:
        0.389392 = sum of:
          0.05351364 = weight(abstract_txt:accurately in 1162) [ClassicSimilarity], result of:
            0.05351364 = score(doc=1162,freq=1.0), product of:
              0.096035175 = queryWeight, product of:
                1.033537 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.0130274715 = queryNorm
              0.5572296 = fieldWeight in 1162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.078125 = fieldNorm(doc=1162)
          0.081994325 = weight(abstract_txt:lexical in 1162) [ClassicSimilarity], result of:
            0.081994325 = score(doc=1162,freq=1.0), product of:
              0.16081251 = queryWeight, product of:
                1.8914105 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.0130274715 = queryNorm
              0.5098753 = fieldWeight in 1162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.078125 = fieldNorm(doc=1162)
          0.02876971 = weight(abstract_txt:analysis in 1162) [ClassicSimilarity], result of:
            0.02876971 = score(doc=1162,freq=1.0), product of:
              0.10079314 = queryWeight, product of:
                2.1176605 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0130274715 = queryNorm
              0.2854332 = fieldWeight in 1162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.078125 = fieldNorm(doc=1162)
          0.22511432 = weight(abstract_txt:coherence in 1162) [ClassicSimilarity], result of:
            0.22511432 = score(doc=1162,freq=1.0), product of:
              0.36093566 = queryWeight, product of:
                3.470455 = boost
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.0130274715 = queryNorm
              0.6236965 = fieldWeight in 1162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.078125 = fieldNorm(doc=1162)
        0.16 = coord(4/25)
    
  5. Chuang, K.Y.; Yang, C.C.: Informational support exchanges using different computer-mediated communication formats in a social media alcoholism community (2014) 0.06
    0.058333475 = sum of:
      0.058333475 = product of:
        0.24305615 = sum of:
          0.03964085 = weight(abstract_txt:mediated in 1179) [ClassicSimilarity], result of:
            0.03964085 = score(doc=1179,freq=1.0), product of:
              0.09123384 = queryWeight, product of:
                1.0073696 = boost
                6.9519553 = idf(docFreq=114, maxDocs=44218)
                0.0130274715 = queryNorm
              0.4344972 = fieldWeight in 1179, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9519553 = idf(docFreq=114, maxDocs=44218)
                0.0625 = fieldNorm(doc=1179)
          0.009965435 = weight(abstract_txt:results in 1179) [ClassicSimilarity], result of:
            0.009965435 = score(doc=1179,freq=1.0), product of:
              0.045786224 = queryWeight, product of:
                1.0092373 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0130274715 = queryNorm
              0.21765138 = fieldWeight in 1179, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=1179)
          0.019452212 = weight(abstract_txt:including in 1179) [ClassicSimilarity], result of:
            0.019452212 = score(doc=1179,freq=1.0), product of:
              0.07151272 = queryWeight, product of:
                1.2612975 = boost
                4.352168 = idf(docFreq=1547, maxDocs=44218)
                0.0130274715 = queryNorm
              0.2720105 = fieldWeight in 1179, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.352168 = idf(docFreq=1547, maxDocs=44218)
                0.0625 = fieldNorm(doc=1179)
          0.03423011 = weight(abstract_txt:interaction in 1179) [ClassicSimilarity], result of:
            0.03423011 = score(doc=1179,freq=1.0), product of:
              0.104234025 = queryWeight, product of:
                1.522757 = boost
                5.254347 = idf(docFreq=627, maxDocs=44218)
                0.0130274715 = queryNorm
              0.32839668 = fieldWeight in 1179, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.254347 = idf(docFreq=627, maxDocs=44218)
                0.0625 = fieldNorm(doc=1179)
          0.023015767 = weight(abstract_txt:analysis in 1179) [ClassicSimilarity], result of:
            0.023015767 = score(doc=1179,freq=1.0), product of:
              0.10079314 = queryWeight, product of:
                2.1176605 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0130274715 = queryNorm
              0.22834657 = fieldWeight in 1179, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=1179)
          0.11675178 = weight(abstract_txt:forum in 1179) [ClassicSimilarity], result of:
            0.11675178 = score(doc=1179,freq=1.0), product of:
              0.27036062 = queryWeight, product of:
                3.0036077 = boost
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.0130274715 = queryNorm
              0.43183723 = fieldWeight in 1179, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.0625 = fieldNorm(doc=1179)
        0.24 = coord(6/25)
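
The content scores combine more factors than the author scores. As a rough check, the sketch below rebuilds the 0.078 score of the first content match (doc 3471) from the values printed in its tree. It assumes the same ClassicSimilarity definitions as above, with each term contributing queryWeight * fieldWeight, where queryWeight = boost * idf * queryNorm, and the sum scaled by coord (matched query terms divided by total query terms). The boost, queryNorm and fieldNorm values are copied from the listing; the helper names are hypothetical.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.0130274715   # queryNorm, copied from the listing
FIELD_NORM = 0.0625         # fieldNorm(doc=3471), copied from the listing

def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    # idf = 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq: float, doc_freq: int, boost: float) -> float:
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    # fieldWeight = sqrt(freq) * idf * fieldNorm
    field_weight = math.sqrt(freq) * idf(doc_freq) * FIELD_NORM
    return query_weight * field_weight

# (term, freq in doc 3471, docFreq, boost), all read off the tree for entry 1
matched_terms = [
    ("results",      1.0, 3693, 1.0092373),
    ("forums",       5.0,   46, 1.137028),
    ("outperformed", 1.0,   31, 1.1927309),
    ("analysis",     1.0, 3112, 2.1176605),
    ("forum",        2.0,  119, 3.0036077),
]

total = sum(term_score(freq, df, boost) for _, freq, df, boost in matched_terms)
coord = len(matched_terms) / 25   # coord(5/25): 5 of 25 query terms matched
score = coord * total

print(round(total, 4), round(score, 4))
```

The printed sum and final score should be close to the 0.3913501 and 0.07827002 lines in the tree. The coord factor explains why the content scores are so much lower than the author scores: only a handful of the 25 query terms drawn from the abstract occur in each matched document.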