Document (#30981)

Author
Frobese, D.T.
Title
Klassifikationsaufgaben mit der SENTRAX : Konkreter Fall: Automatische Detektion von SPAM
Source
http://web1.bib.uni-hildesheim.de/edocs/2006/51992004X/doc/51992004X.pdf
Year
2006
Abstract
Die Suchfunktionen des SENTRAX-Verfahrens werden für die Klassifizierung von Mails und im Besonderen für die Detektion von SPAM eingesetzt. Die Eigenschaften einer kontextähnlichen Suche und die Fehlertoleranz sollen genutzt werden, um SPAM Nachrichten treffsicher aufzuspüren.
Footnote
Beitrag der Proceedings des Fünften Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2006), Hildesheim, xx.x.2006.
Theme
Computerlinguistik
Automatisches Klassifizieren
Object
SENTRAX

Similar documents (content)

  1. Goodman, J.; Heckerman, D.; Rounthwaite, R.: Schutzwälle gegen Spam (2005) 0.33
    0.330009 = sum of:
      0.330009 = product of:
        1.1880324 = sum of:
          0.010235171 = weight(abstract_txt:einer in 3696) [ClassicSimilarity], result of:
            0.010235171 = score(doc=3696,freq=1.0), product of:
              0.04819049 = queryWeight, product of:
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.012408397 = queryNorm
              0.21238984 = fieldWeight in 3696, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3696)
          0.032665614 = weight(abstract_txt:sollen in 3696) [ClassicSimilarity], result of:
            0.032665614 = score(doc=3696,freq=1.0), product of:
              0.10446204 = queryWeight, product of:
                1.4723077 = boost
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.012408397 = queryNorm
              0.3127032 = fieldWeight in 3696, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3696)
          0.03368246 = weight(abstract_txt:werden in 3696) [ClassicSimilarity], result of:
            0.03368246 = score(doc=3696,freq=5.0), product of:
              0.07855741 = queryWeight, product of:
                1.8056264 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.012408397 = queryNorm
              0.42876238 = fieldWeight in 3696, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3696)
          0.19488117 = weight(abstract_txt:mails in 3696) [ClassicSimilarity], result of:
            0.19488117 = score(doc=3696,freq=4.0), product of:
              0.21646675 = queryWeight, product of:
                2.1194098 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.012408397 = queryNorm
              0.9002822 = fieldWeight in 3696, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3696)
          0.916568 = weight(abstract_txt:spam in 3696) [ClassicSimilarity], result of:
            0.916568 = score(doc=3696,freq=8.0), product of:
              0.6955871 = queryWeight, product of:
                6.5804515 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.012408397 = queryNorm
              1.3176898 = fieldWeight in 3696, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3696)
        0.2777778 = coord(5/18)
    
  2. Krüger, A.: Angriffe aus dem Netz : die neue Szene des digitalen Verbrechens (2006) 0.16
    0.15551104 = sum of:
      0.15551104 = product of:
        0.9330662 = sum of:
          0.037271887 = weight(abstract_txt:werden in 141) [ClassicSimilarity], result of:
            0.037271887 = score(doc=141,freq=3.0), product of:
              0.07855741 = queryWeight, product of:
                1.8056264 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.012408397 = queryNorm
              0.47445413 = fieldWeight in 141, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.078125 = fieldNorm(doc=141)
          0.2411029 = weight(abstract_txt:mails in 141) [ClassicSimilarity], result of:
            0.2411029 = score(doc=141,freq=3.0), product of:
              0.21646675 = queryWeight, product of:
                2.1194098 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.012408397 = queryNorm
              1.1138103 = fieldWeight in 141, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.078125 = fieldNorm(doc=141)
          0.6546914 = weight(abstract_txt:spam in 141) [ClassicSimilarity], result of:
            0.6546914 = score(doc=141,freq=2.0), product of:
              0.6955871 = queryWeight, product of:
                6.5804515 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.012408397 = queryNorm
              0.94120693 = fieldWeight in 141, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.078125 = fieldNorm(doc=141)
        0.16666667 = coord(3/18)
    
  3. Krüger, K.: Suchmaschinen-Spamming : Vergleichend-kritische Analysen zur Wirkung kommerzieller Strategien der Website-Optimierung auf das Ranking in www-Suchmaschinen (2004) 0.15
    0.15029034 = sum of:
      0.15029034 = product of:
        0.90174204 = sum of:
          0.079593465 = weight(abstract_txt:eingesetzt in 3700) [ClassicSimilarity], result of:
            0.079593465 = score(doc=3700,freq=1.0), product of:
              0.1320568 = queryWeight, product of:
                1.6553876 = boost
                6.429029 = idf(docFreq=193, maxDocs=44218)
                0.012408397 = queryNorm
              0.60272145 = fieldWeight in 3700, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.429029 = idf(docFreq=193, maxDocs=44218)
                0.09375 = fieldNorm(doc=3700)
          0.036518842 = weight(abstract_txt:werden in 3700) [ClassicSimilarity], result of:
            0.036518842 = score(doc=3700,freq=2.0), product of:
              0.07855741 = queryWeight, product of:
                1.8056264 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.012408397 = queryNorm
              0.46486822 = fieldWeight in 3700, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.09375 = fieldNorm(doc=3700)
          0.78562975 = weight(abstract_txt:spam in 3700) [ClassicSimilarity], result of:
            0.78562975 = score(doc=3700,freq=2.0), product of:
              0.6955871 = queryWeight, product of:
                6.5804515 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.012408397 = queryNorm
              1.1294484 = fieldWeight in 3700, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.09375 = fieldNorm(doc=3700)
        0.16666667 = coord(3/18)
    
  4. Terliesner, J.: Information Retrieval in Wikis : wie können klassische Methoden des Information Retrievals die Suchfunktion eines Wikis bereichern? (2010) 0.15
    0.14504638 = sum of:
      0.14504638 = product of:
        0.43513912 = sum of:
          0.014621671 = weight(abstract_txt:einer in 3504) [ClassicSimilarity], result of:
            0.014621671 = score(doc=3504,freq=1.0), product of:
              0.04819049 = queryWeight, product of:
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.012408397 = queryNorm
              0.30341405 = fieldWeight in 3504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.078125 = fieldNorm(doc=3504)
          0.046665166 = weight(abstract_txt:sollen in 3504) [ClassicSimilarity], result of:
            0.046665166 = score(doc=3504,freq=1.0), product of:
              0.10446204 = queryWeight, product of:
                1.4723077 = boost
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.012408397 = queryNorm
              0.44671887 = fieldWeight in 3504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.078125 = fieldNorm(doc=3504)
          0.06508503 = weight(abstract_txt:genutzt in 3504) [ClassicSimilarity], result of:
            0.06508503 = score(doc=3504,freq=1.0), product of:
              0.13040195 = queryWeight, product of:
                1.6449828 = boost
                6.3886194 = idf(docFreq=201, maxDocs=44218)
                0.012408397 = queryNorm
              0.49911088 = fieldWeight in 3504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3886194 = idf(docFreq=201, maxDocs=44218)
                0.078125 = fieldNorm(doc=3504)
          0.05271041 = weight(abstract_txt:werden in 3504) [ClassicSimilarity], result of:
            0.05271041 = score(doc=3504,freq=6.0), product of:
              0.07855741 = queryWeight, product of:
                1.8056264 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.012408397 = queryNorm
              0.6709795 = fieldWeight in 3504, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.078125 = fieldNorm(doc=3504)
          0.094373785 = weight(abstract_txt:besonderen in 3504) [ClassicSimilarity], result of:
            0.094373785 = score(doc=3504,freq=1.0), product of:
              0.16705683 = queryWeight, product of:
                1.8618789 = boost
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.012408397 = queryNorm
              0.56492025 = fieldWeight in 3504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.078125 = fieldNorm(doc=3504)
          0.16168305 = weight(abstract_txt:suchfunktionen in 3504) [ClassicSimilarity], result of:
            0.16168305 = score(doc=3504,freq=1.0), product of:
              0.23918813 = queryWeight, product of:
                2.2278664 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.012408397 = queryNorm
              0.675966 = fieldWeight in 3504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.078125 = fieldNorm(doc=3504)
        0.33333334 = coord(6/18)
    
  5. Sixtus, M.: Schlüssel gegen Spam : Yahoo macht seine Technik öffentlich, die gefälschte Mails erkennt - in Hoffnung dass sie zum Standard wird (2004) 0.13
    0.12555583 = sum of:
      0.12555583 = product of:
        0.75333494 = sum of:
          0.15119733 = weight(abstract_txt:nachrichten in 2217) [ClassicSimilarity], result of:
            0.15119733 = score(doc=2217,freq=2.0), product of:
              0.18154435 = queryWeight, product of:
                1.9409337 = boost
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.012408397 = queryNorm
              0.8328396 = fieldWeight in 2217, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.078125 = fieldNorm(doc=2217)
          0.13920084 = weight(abstract_txt:mails in 2217) [ClassicSimilarity], result of:
            0.13920084 = score(doc=2217,freq=1.0), product of:
              0.21646675 = queryWeight, product of:
                2.1194098 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.012408397 = queryNorm
              0.6430587 = fieldWeight in 2217, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.078125 = fieldNorm(doc=2217)
          0.46293676 = weight(abstract_txt:spam in 2217) [ClassicSimilarity], result of:
            0.46293676 = score(doc=2217,freq=1.0), product of:
              0.6955871 = queryWeight, product of:
                6.5804515 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.012408397 = queryNorm
              0.66553384 = fieldWeight in 2217, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.078125 = fieldNorm(doc=2217)
        0.16666667 = coord(3/18)