Document (#30982)

Author
Frobese, D.T.
Title
Klassifikationsaufgaben mit der SENTRAX : Konkreter Fall: Automatische Detektion von SPAM
Source
http://web1.bib.uni-hildesheim.de/edocs/2006/51992004X/doc/51992004X.pdf
Year
2006
Abstract
The search functions of the SENTRAX method are applied to the classification of e-mails and, in particular, to the detection of spam. The properties of its context-similarity search and its fault tolerance are to be used to detect spam messages accurately.
Footnote
Contribution to the proceedings of the Fünfter Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2006), Hildesheim, xx.x.2006.
Theme
Computational linguistics
Automatic classification
Object
SENTRAX

Similar documents (content)

  1. Goodman, J.; Heckerman, D.; Rounthwaite, R.: Schutzwälle gegen Spam (2005) 0.33
    0.3276713 = sum of:
      0.3276713 = product of:
        1.1796166 = sum of:
          0.010490829 = weight(abstract_txt:einer in 4697) [ClassicSimilarity], result of:
            0.010490829 = score(doc=4697,freq=1.0), product of:
              0.049063873 = queryWeight, product of:
                3.9098482 = idf(docFreq=2304, maxDocs=42306)
                0.012548792 = queryNorm
              0.21381983 = fieldWeight in 4697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9098482 = idf(docFreq=2304, maxDocs=42306)
                0.0546875 = fieldNorm(doc=4697)
          0.03322723 = weight(abstract_txt:sollen in 4697) [ClassicSimilarity], result of:
            0.03322723 = score(doc=4697,freq=1.0), product of:
              0.105816014 = queryWeight, product of:
                1.4685705 = boost
                5.7418876 = idf(docFreq=368, maxDocs=42306)
                0.012548792 = queryNorm
              0.3140095 = fieldWeight in 4697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7418876 = idf(docFreq=368, maxDocs=42306)
                0.0546875 = fieldNorm(doc=4697)
          0.034537014 = weight(abstract_txt:werden in 4697) [ClassicSimilarity], result of:
            0.034537014 = score(doc=4697,freq=5.0), product of:
              0.08000156 = queryWeight, product of:
                1.8058568 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.012548792 = queryNorm
              0.43170422 = fieldWeight in 4697, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.0546875 = fieldNorm(doc=4697)
          0.19488074 = weight(abstract_txt:mails in 4697) [ClassicSimilarity], result of:
            0.19488074 = score(doc=4697,freq=4.0), product of:
              0.21679433 = queryWeight, product of:
                2.10205 = boost
                8.218697 = idf(docFreq=30, maxDocs=42306)
                0.012548792 = queryNorm
              0.89891994 = fieldWeight in 4697, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.218697 = idf(docFreq=30, maxDocs=42306)
                0.0546875 = fieldNorm(doc=4697)
          0.9064808 = weight(abstract_txt:spam in 4697) [ClassicSimilarity], result of:
            0.9064808 = score(doc=4697,freq=8.0), product of:
              0.6915201 = queryWeight, product of:
                6.5025263 = boost
                8.47463 = idf(docFreq=23, maxDocs=42306)
                0.012548792 = queryNorm
              1.3108524 = fieldWeight in 4697, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                8.47463 = idf(docFreq=23, maxDocs=42306)
                0.0546875 = fieldNorm(doc=4697)
        0.2777778 = coord(5/18)
    
  2. Krüger, A.: Angriffe aus dem Netz : die neue Szene des digitalen Verbrechens (2006) 0.15
    0.1544677 = sum of:
      0.1544677 = product of:
        0.9268062 = sum of:
          0.038217504 = weight(abstract_txt:werden in 1267) [ClassicSimilarity], result of:
            0.038217504 = score(doc=1267,freq=3.0), product of:
              0.08000156 = queryWeight, product of:
                1.8058568 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.012548792 = queryNorm
              0.47770947 = fieldWeight in 1267, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.078125 = fieldNorm(doc=1267)
          0.2411024 = weight(abstract_txt:mails in 1267) [ClassicSimilarity], result of:
            0.2411024 = score(doc=1267,freq=3.0), product of:
              0.21679433 = queryWeight, product of:
                2.10205 = boost
                8.218697 = idf(docFreq=30, maxDocs=42306)
                0.012548792 = queryNorm
              1.112125 = fieldWeight in 1267, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.218697 = idf(docFreq=30, maxDocs=42306)
                0.078125 = fieldNorm(doc=1267)
          0.64748627 = weight(abstract_txt:spam in 1267) [ClassicSimilarity], result of:
            0.64748627 = score(doc=1267,freq=2.0), product of:
              0.6915201 = queryWeight, product of:
                6.5025263 = boost
                8.47463 = idf(docFreq=23, maxDocs=42306)
                0.012548792 = queryNorm
              0.93632317 = fieldWeight in 1267, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.47463 = idf(docFreq=23, maxDocs=42306)
                0.078125 = fieldNorm(doc=1267)
        0.16666667 = coord(3/18)
    
  3. Krüger, K.: Suchmaschinen-Spamming : Vergleichend-kritische Analysen zur Wirkung kommerzieller Strategien der Website-Optimierung auf das Ranking in www-Suchmaschinen (2004) 0.15
    0.14898476 = sum of:
      0.14898476 = product of:
        0.8939085 = sum of:
          0.079479575 = weight(abstract_txt:eingesetzt in 4701) [ClassicSimilarity], result of:
            0.079479575 = score(doc=4701,freq=1.0), product of:
              0.13213064 = queryWeight, product of:
                1.6410464 = boost
                6.416242 = idf(docFreq=187, maxDocs=42306)
                0.012548792 = queryNorm
              0.6015227 = fieldWeight in 4701, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.416242 = idf(docFreq=187, maxDocs=42306)
                0.09375 = fieldNorm(doc=4701)
          0.037445355 = weight(abstract_txt:werden in 4701) [ClassicSimilarity], result of:
            0.037445355 = score(doc=4701,freq=2.0), product of:
              0.08000156 = queryWeight, product of:
                1.8058568 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.012548792 = queryNorm
              0.4680578 = fieldWeight in 4701, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.09375 = fieldNorm(doc=4701)
          0.77698356 = weight(abstract_txt:spam in 4701) [ClassicSimilarity], result of:
            0.77698356 = score(doc=4701,freq=2.0), product of:
              0.6915201 = queryWeight, product of:
                6.5025263 = boost
                8.47463 = idf(docFreq=23, maxDocs=42306)
                0.012548792 = queryNorm
              1.1235878 = fieldWeight in 4701, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.47463 = idf(docFreq=23, maxDocs=42306)
                0.09375 = fieldNorm(doc=4701)
        0.16666667 = coord(3/18)
    
  4. Terliesner, J.: Information Retrieval in Wikis : wie können klassische Methoden des Information Retrievals die Suchfunktion eines Wikis bereichern? (2010) 0.15
    0.14698821 = sum of:
      0.14698821 = product of:
        0.4409646 = sum of:
          0.014986897 = weight(abstract_txt:einer in 505) [ClassicSimilarity], result of:
            0.014986897 = score(doc=505,freq=1.0), product of:
              0.049063873 = queryWeight, product of:
                3.9098482 = idf(docFreq=2304, maxDocs=42306)
                0.012548792 = queryNorm
              0.30545688 = fieldWeight in 505, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9098482 = idf(docFreq=2304, maxDocs=42306)
                0.078125 = fieldNorm(doc=505)
          0.047467474 = weight(abstract_txt:sollen in 505) [ClassicSimilarity], result of:
            0.047467474 = score(doc=505,freq=1.0), product of:
              0.105816014 = queryWeight, product of:
                1.4685705 = boost
                5.7418876 = idf(docFreq=368, maxDocs=42306)
                0.012548792 = queryNorm
              0.44858497 = fieldWeight in 505, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7418876 = idf(docFreq=368, maxDocs=42306)
                0.078125 = fieldNorm(doc=505)
          0.06590582 = weight(abstract_txt:genutzt in 505) [ClassicSimilarity], result of:
            0.06590582 = score(doc=505,freq=1.0), product of:
              0.13169517 = queryWeight, product of:
                1.6383399 = boost
                6.40566 = idf(docFreq=189, maxDocs=42306)
                0.012548792 = queryNorm
              0.5004422 = fieldWeight in 505, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.40566 = idf(docFreq=189, maxDocs=42306)
                0.078125 = fieldNorm(doc=505)
          0.054047715 = weight(abstract_txt:werden in 505) [ClassicSimilarity], result of:
            0.054047715 = score(doc=505,freq=6.0), product of:
              0.08000156 = queryWeight, product of:
                1.8058568 = boost
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.012548792 = queryNorm
              0.67558324 = fieldWeight in 505, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.530313 = idf(docFreq=3368, maxDocs=42306)
                0.078125 = fieldNorm(doc=505)
          0.09587936 = weight(abstract_txt:besonderen in 505) [ClassicSimilarity], result of:
            0.09587936 = score(doc=505,freq=1.0), product of:
              0.16908461 = queryWeight, product of:
                1.8563981 = boost
                7.258235 = idf(docFreq=80, maxDocs=42306)
                0.012548792 = queryNorm
              0.5670496 = fieldWeight in 505, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.258235 = idf(docFreq=80, maxDocs=42306)
                0.078125 = fieldNorm(doc=505)
          0.16267735 = weight(abstract_txt:suchfunktionen in 505) [ClassicSimilarity], result of:
            0.16267735 = score(doc=505,freq=1.0), product of:
              0.24053155 = queryWeight, product of:
                2.2141402 = boost
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.012548792 = queryNorm
              0.67632437 = fieldWeight in 505, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656952 = idf(docFreq=19, maxDocs=42306)
                0.078125 = fieldNorm(doc=505)
        0.33333334 = coord(6/18)
    
  5. Sixtus, M.: Schlüssel gegen Spam : Yahoo macht seine Technik öffentlich, die gefälschte Mails erkennt - in Hoffnung dass sie zum Standard wird (2004) 0.12
    0.124859706 = sum of:
      0.124859706 = product of:
        0.7491582 = sum of:
          0.15211566 = weight(abstract_txt:nachrichten in 3218) [ClassicSimilarity], result of:
            0.15211566 = score(doc=3218,freq=2.0), product of:
              0.18255481 = queryWeight, product of:
                1.9289267 = boost
                7.5418105 = idf(docFreq=60, maxDocs=42306)
                0.012548792 = queryNorm
              0.8332602 = fieldWeight in 3218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5418105 = idf(docFreq=60, maxDocs=42306)
                0.078125 = fieldNorm(doc=3218)
          0.13920054 = weight(abstract_txt:mails in 3218) [ClassicSimilarity], result of:
            0.13920054 = score(doc=3218,freq=1.0), product of:
              0.21679433 = queryWeight, product of:
                2.10205 = boost
                8.218697 = idf(docFreq=30, maxDocs=42306)
                0.012548792 = queryNorm
              0.6420857 = fieldWeight in 3218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.218697 = idf(docFreq=30, maxDocs=42306)
                0.078125 = fieldNorm(doc=3218)
          0.457842 = weight(abstract_txt:spam in 3218) [ClassicSimilarity], result of:
            0.457842 = score(doc=3218,freq=1.0), product of:
              0.6915201 = queryWeight, product of:
                6.5025263 = boost
                8.47463 = idf(docFreq=23, maxDocs=42306)
                0.012548792 = queryNorm
              0.6620805 = fieldWeight in 3218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.47463 = idf(docFreq=23, maxDocs=42306)
                0.078125 = fieldNorm(doc=3218)
        0.16666667 = coord(3/18)
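
The score breakdowns above can be reproduced by hand. The following is a minimal sketch, assuming the classic Lucene TF-IDF definitions that the listed values are consistent with: tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names (`idf`, `tf`, `term_score`) are illustrative and not part of any library. The sketch recomputes the `spam` term score for doc 4697 and the final score of the first hit.

```python
import math

# Constants copied from the explain output above.
QUERY_NORM = 0.012548792
MAX_DOCS = 42306


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq: float) -> float:
    """Term frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def term_score(freq, doc_freq, field_norm, boost=1.0,
               query_norm=QUERY_NORM, max_docs=MAX_DOCS):
    """score = queryWeight * fieldWeight for a single query term."""
    i = idf(doc_freq, max_docs)
    query_weight = boost * i * query_norm   # boost * idf * queryNorm
    field_weight = tf(freq) * i * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# "spam" in doc 4697: freq=8, docFreq=23, fieldNorm=0.0546875, boost=6.5025263
spam_score = term_score(8.0, 23, 0.0546875, boost=6.5025263)

# Final score of hit 1: sum of the five matching term scores, scaled by
# coord(5/18), i.e. 5 of the 18 query terms matched.
hit1_terms = [0.010490829, 0.03322723, 0.034537014, 0.19488074, 0.9064808]
hit1_score = sum(hit1_terms) * (5 / 18)

print(round(spam_score, 4), round(hit1_score, 4))
```

The coord factor explains why hit 4 does not rank higher even though it matches six query terms: its matches are on low-boost terms, while hits 1 to 3 all match the heavily boosted term `spam`.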