Search (155 results, page 1 of 8)

Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.10

0.097582184 = sum of:
  0.05333692 = product of:
    0.21334767 = sum of:
      0.21334767 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
        0.21334767 = score(doc=562,freq=2.0), product of:
          0.37961 = queryWeight, product of:
            8.478011 = idf(docFreq=24, maxDocs=44218)
            0.044775832 = queryNorm
          0.56201804 = fieldWeight in 562, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            8.478011 = idf(docFreq=24, maxDocs=44218)
            0.046875 = fieldNorm(doc=562)
    0.25 = coord(1/4)
  0.044245265 = product of:
    0.066367894 = sum of:
      0.029968852 = weight(_text_:j in 562) [ClassicSimilarity], result of:
        0.029968852 = score(doc=562,freq=2.0), product of:
          0.14227505 = queryWeight, product of:
            3.1774964 = idf(docFreq=5010, maxDocs=44218)
            0.044775832 = queryNorm
          0.21064025 = fieldWeight in 562, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.1774964 = idf(docFreq=5010, maxDocs=44218)
            0.046875 = fieldNorm(doc=562)
      0.03639904 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
        0.03639904 = score(doc=562,freq=2.0), product of:
          0.15679733 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.044775832 = queryNorm
          0.23214069 = fieldWeight in 562, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=562)
    0.6666667 = coord(2/3)

Content: Cf.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
Date: 8. 1.2013 10:22:32
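
The leaf weights in the explanation above can be recomputed by hand. The sketch below assumes the formulas that the printed numbers are consistent with: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = idf * queryNorm, and fieldWeight = tf * idf * fieldNorm. The function names are hypothetical, and queryNorm and fieldNorm are copied from the listing rather than derived.

```python
import math

def idf(doc_freq: int, max_docs: int) -> float:
    # Assumed form: 1 + ln(maxDocs / (docFreq + 1)); it reproduces the listed idf values.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_weight(freq: float, doc_freq: int, max_docs: int,
                query_norm: float, field_norm: float) -> float:
    # weight = queryWeight * fieldWeight, as in the explanation tree.
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight

# Term "3a" in doc 562; the values are taken from the listing above.
weight_3a = term_weight(2.0, 24, 44218, 0.044775832, 0.046875)
print(round(weight_3a, 6))  # compare with the listed 0.21334767
```

The listing prints single-precision values, so small differences in the last digits are to be expected.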

Gonzalo, J.; Verdejo, F.; Peters, C.; Calzolari, N.: Applying EuroWordNet to cross-language text retrieval (1998) 0.07

0.06622753 = product of:
  0.13245507 = sum of:
    0.13245507 = product of:
      0.19868259 = sum of:
        0.06992732 = weight(_text_:j in 6445) [ClassicSimilarity], result of:
          0.06992732 = score(doc=6445,freq=2.0), product of:
            0.14227505 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.044775832 = queryNorm
            0.4914939 = fieldWeight in 6445, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.109375 = fieldNorm(doc=6445)
        0.12875527 = weight(_text_:n in 6445) [ClassicSimilarity], result of:
          0.12875527 = score(doc=6445,freq=2.0), product of:
            0.19305801 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.044775832 = queryNorm
            0.6669253 = fieldWeight in 6445, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.109375 = fieldNorm(doc=6445)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)
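
The roll-up from leaf weights to the final score can be sketched the same way. In the explanation above, each clause sums the weights of its matching sub-clauses and multiplies the sum by coord(matched/total). The helper name coord_sum is hypothetical, and the leaf weights are copied from the listing for doc 6445.

```python
def coord_sum(weights, total_clauses):
    """Sum the weights of the matching clauses and scale by matched/total."""
    matched = len(weights)
    return sum(weights) * matched / total_clauses

# Inner clause: terms "j" and "n" matched, 2 of 3 clauses.
inner = coord_sum([0.06992732, 0.12875527], 3)

# Outer clause: 1 of 2 clauses matched.
score = coord_sum([inner], 2)
print(round(score, 8))  # compare with the listed 0.06622753
```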

Vichot, F.; Wolinski, F.; Tomeh, J.; Guennou, S.; Dillet, B.; Aydjian, S.: High precision hypertext navigation based on NLP automatic extractions (1997) 0.06

0.056766458 = product of:
  0.113532916 = sum of:
    0.113532916 = product of:
      0.17029937 = sum of:
        0.059937704 = weight(_text_:j in 733) [ClassicSimilarity], result of:
          0.059937704 = score(doc=733,freq=2.0), product of:
            0.14227505 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.044775832 = queryNorm
            0.4212805 = fieldWeight in 733, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.09375 = fieldNorm(doc=733)
        0.110361665 = weight(_text_:n in 733) [ClassicSimilarity], result of:
          0.110361665 = score(doc=733,freq=2.0), product of:
            0.19305801 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.044775832 = queryNorm
            0.57165027 = fieldWeight in 733, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.09375 = fieldNorm(doc=733)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Source: Hypertext - Information Retrieval - Multimedia '97: Theorien, Modelle und Implementierungen integrierter elektronischer Informationssysteme. Proceedings HIM '97. Ed.: N. Fuhr et al.

Vazov, N.: Identification des differentes structures temporelles dans des textes et leur rôles dans le raisonnement temporel (1999) 0.04

0.037844308 = product of:
  0.075688615 = sum of:
    0.075688615 = product of:
      0.113532916 = sum of:
        0.03995847 = weight(_text_:j in 6203) [ClassicSimilarity], result of:
          0.03995847 = score(doc=6203,freq=2.0), product of:
            0.14227505 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.044775832 = queryNorm
            0.28085366 = fieldWeight in 6203, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.0625 = fieldNorm(doc=6203)
        0.073574446 = weight(_text_:n in 6203) [ClassicSimilarity], result of:
          0.073574446 = score(doc=6203,freq=2.0), product of:
            0.19305801 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.044775832 = queryNorm
            0.38110018 = fieldWeight in 6203, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.0625 = fieldNorm(doc=6203)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Source: Organisation des connaissances en vue de leur intégration dans les systèmes de représentation et de recherche d'information. Ed.: J. Maniez, et al
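
The three entries above that match the term "n" with freq=2.0 (docs 6445, 733 and 6203) share the same idf, queryNorm and tf, so fieldNorm is the only factor that differs between them. The sketch below recomputes their weights under that reading. The constant and function names are hypothetical, and all numbers are copied from the listing. Reading fieldNorm as a field-length normalization is an assumption; the listing does not say so.

```python
import math

QUERY_NORM = 0.044775832   # queryNorm as printed in the listing
IDF_N = 4.3116565          # idf of term "n" as printed in the listing
TF = math.sqrt(2.0)        # tf for freq=2.0

def weight_for_norm(field_norm: float) -> float:
    # queryWeight * fieldWeight with everything fixed except fieldNorm.
    query_weight = IDF_N * QUERY_NORM
    field_weight = TF * IDF_N * field_norm
    return query_weight * field_weight

# doc id -> (fieldNorm, weight of "n" as listed)
listed = {
    6445: (0.109375, 0.12875527),
    733: (0.09375, 0.110361665),
    6203: (0.0625, 0.073574446),
}
for doc, (norm, listed_weight) in listed.items():
    print(doc, norm, round(weight_for_norm(norm), 6), listed_weight)
```

Because the weight is linear in fieldNorm, the ranking among these three entries follows the fieldNorm values directly.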

Ferret, O.; Grau, B.; Masson, N.: Utilisation d'un réseau de cooccurrences lexicales pour améliorer une analyse thématique fondée sur la distribution des mots (1999) 0.04

0.037844308 = product of:
  0.075688615 = sum of:
    0.075688615 = product of:
      0.113532916 = sum of:
        0.03995847 = weight(_text_:j in 6295) [ClassicSimilarity], result of:
          0.03995847 = score(doc=6295,freq=2.0), product of:
            0.14227505 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.044775832 = queryNorm
            0.28085366 = fieldWeight in 6295, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.0625 = fieldNorm(doc=6295)
        0.073574446 = weight(_text_:n in 6295) [ClassicSimilarity], result of:
          0.073574446 = score(doc=6295,freq=2.0), product of:
            0.19305801 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.044775832 = queryNorm
            0.38110018 = fieldWeight in 6295, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.0625 = fieldNorm(doc=6295)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Source: Organisation des connaissances en vue de leur intégration dans les systèmes de représentation et de recherche d'information. Ed.: J. Maniez, et al

Hutchins, J.: From first conception to first demonstration : the nascent years of machine translation, 1947-1954. A chronology (1997) 0.04

0.036871053 = product of:
  0.07374211 = sum of:
    0.07374211 = product of:
      0.11061315 = sum of:
        0.049948085 = weight(_text_:j in 1463) [ClassicSimilarity], result of:
          0.049948085 = score(doc=1463,freq=2.0), product of:
            0.14227505 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.044775832 = queryNorm
            0.35106707 = fieldWeight in 1463, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.078125 = fieldNorm(doc=1463)
        0.06066507 = weight(_text_:22 in 1463) [ClassicSimilarity], result of:
          0.06066507 = score(doc=1463,freq=2.0), product of:
            0.15679733 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.044775832 = queryNorm
            0.38690117 = fieldWeight in 1463, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=1463)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Date: 31. 7.1996 9:22:19

Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I.: Attention is all you need (2017) 0.04

0.03600211 = product of:
  0.07200422 = sum of:
    0.07200422 = product of:
      0.10800633 = sum of:
        0.029968852 = weight(_text_:j in 970) [ClassicSimilarity], result of:
          0.029968852 = score(doc=970,freq=2.0), product of:
            0.14227505 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.044775832 = queryNorm
            0.21064025 = fieldWeight in 970, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.046875 = fieldNorm(doc=970)
        0.07803748 = weight(_text_:n in 970) [ClassicSimilarity], result of:
          0.07803748 = score(doc=970,freq=4.0), product of:
            0.19305801 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.044775832 = queryNorm
            0.40421778 = fieldWeight in 970, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.046875 = fieldNorm(doc=970)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Schwarz, C.: THESYS: Thesaurus Syntax System : a fully automatic thesaurus building aid (1988) 0.03

0.03063721 = product of:
  0.06127442 = sum of:
    0.06127442 = product of:
      0.09191163 = sum of:
        0.049446084 = weight(_text_:j in 1361) [ClassicSimilarity], result of:
          0.049446084 = score(doc=1361,freq=4.0), product of:
            0.14227505 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.044775832 = queryNorm
            0.34753868 = fieldWeight in 1361, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1361)
        0.04246555 = weight(_text_:22 in 1361) [ClassicSimilarity], result of:
          0.04246555 = score(doc=1361,freq=2.0), product of:
            0.15679733 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.044775832 = queryNorm
            0.2708308 = fieldWeight in 1361, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1361)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Date: 6. 1.1999 10:22:07
Source: Wissensorganisation im Wandel: Dezimalklassifikation - Thesaurusfragen - Warenklassifikation. Proc. 11. Jahrestagung der Gesellschaft für Klassifikation, Aachen, 29.6.-1.7.1987. Ed.: H.-J. Hermes and J. Hölzl

Bager, J.: Die Text-KI ChatGPT schreibt Fachtexte, Prosa, Gedichte und Programmcode (2023) 0.03

0.029496845 = product of:
  0.05899369 = sum of:
    0.05899369 = product of:
      0.08849053 = sum of:
        0.03995847 = weight(_text_:j in 835) [ClassicSimilarity], result of:
          0.03995847 = score(doc=835,freq=2.0), product of:
            0.14227505 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.044775832 = queryNorm
            0.28085366 = fieldWeight in 835, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.0625 = fieldNorm(doc=835)
        0.048532058 = weight(_text_:22 in 835) [ClassicSimilarity], result of:
          0.048532058 = score(doc=835,freq=2.0), product of:
            0.15679733 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.044775832 = queryNorm
            0.30952093 = fieldWeight in 835, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0625 = fieldNorm(doc=835)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Date: 29.12.2022 18:22:55

Perovsek, M.; Kranjc, J.; Erjavec, T.; Cestnik, B.; Lavrac, N.: TextFlows : a visual programming platform for text mining and natural language processing (2016) 0.03

0.028383229 = product of:
  0.056766458 = sum of:
    0.056766458 = product of:
      0.08514968 = sum of:
        0.029968852 = weight(_text_:j in 2697) [ClassicSimilarity], result of:
          0.029968852 = score(doc=2697,freq=2.0), product of:
            0.14227505 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.044775832 = queryNorm
            0.21064025 = fieldWeight in 2697, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.046875 = fieldNorm(doc=2697)
        0.055180833 = weight(_text_:n in 2697) [ClassicSimilarity], result of:
          0.055180833 = score(doc=2697,freq=2.0), product of:
            0.19305801 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.044775832 = queryNorm
            0.28582513 = fieldWeight in 2697, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.046875 = fieldNorm(doc=2697)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Li, N.; Sun, J.: Improving Chinese term association from the linguistic perspective (2017) 0.03

0.028383229 = product of:
  0.056766458 = sum of:
    0.056766458 = product of:
      0.08514968 = sum of:
        0.029968852 = weight(_text_:j in 3381) [ClassicSimilarity], result of:
          0.029968852 = score(doc=3381,freq=2.0), product of:
            0.14227505 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.044775832 = queryNorm
            0.21064025 = fieldWeight in 3381, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.046875 = fieldNorm(doc=3381)
        0.055180833 = weight(_text_:n in 3381) [ClassicSimilarity], result of:
          0.055180833 = score(doc=3381,freq=2.0), product of:
            0.19305801 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.044775832 = queryNorm
            0.28582513 = fieldWeight in 3381, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.046875 = fieldNorm(doc=3381)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Noever, D.; Ciolino, M.: The Turing deception (2022) 0.03

0.02666846 = product of:
  0.05333692 = sum of:
    0.05333692 = product of:
      0.21334767 = sum of:
        0.21334767 = weight(_text_:3a in 862) [ClassicSimilarity], result of:
          0.21334767 = score(doc=862,freq=2.0), product of:
            0.37961 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.044775832 = queryNorm
            0.56201804 = fieldWeight in 862, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.046875 = fieldNorm(doc=862)
      0.25 = coord(1/4)
  0.5 = coord(1/2)

Source: https%3A%2F%2Farxiv.org%2Fabs%2F2212.06721&usg=AOvVaw3i_9pZm9y_dQWoHi6uv0EN

Godby, J.: WordSmith research project bridges gap between tokens and indexes (1998) 0.03

0.025809735 = product of:
  0.05161947 = sum of:
    0.05161947 = product of:
      0.077429205 = sum of:
        0.03496366 = weight(_text_:j in 4729) [ClassicSimilarity], result of:
          0.03496366 = score(doc=4729,freq=2.0), product of:
            0.14227505 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.044775832 = queryNorm
            0.24574696 = fieldWeight in 4729, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4729)
        0.04246555 = weight(_text_:22 in 4729) [ClassicSimilarity], result of:
          0.04246555 = score(doc=4729,freq=2.0), product of:
            0.15679733 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.044775832 = queryNorm
            0.2708308 = fieldWeight in 4729, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4729)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Source: OCLC newsletter. 1998, no.234, Jul/Aug, S.22-24

Brown, T.B.; Mann, B.; Ryder, N.; Subbiah, M.; Kaplan, J.; Dhariwal, P.; Neelakantan, A.; Shyam, P.; Sastry, G.; Askell, A.; Agarwal, S.; Herbert-Voss, A.; Krueger, G.; Henighan, T.; Child, R.; Ramesh, A.; Ziegler, D.M.; Wu, J.; Winter, C.; Hesse, C.; Chen, M.; Sigler, E.; Litwin, M.; Gray, S.; Chess, B.; Clark, J.; Berner, C.; McCandlish, S.; Radford, A.; Sutskever, I.; Amodei, D.: Language models are few-shot learners (2020) 0.02

0.023797423 = product of:
  0.047594845 = sum of:
    0.047594845 = product of:
      0.07139227 = sum of:
        0.03460505 = weight(_text_:j in 872) [ClassicSimilarity], result of:
          0.03460505 = score(doc=872,freq=6.0), product of:
            0.14227505 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.044775832 = queryNorm
            0.24322641 = fieldWeight in 872, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.03125 = fieldNorm(doc=872)
        0.036787223 = weight(_text_:n in 872) [ClassicSimilarity], result of:
          0.036787223 = score(doc=872,freq=2.0), product of:
            0.19305801 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.044775832 = queryNorm
            0.19055009 = fieldWeight in 872, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.03125 = fieldNorm(doc=872)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Malo, P.; Sinha, A.; Korhonen, P.; Wallenius, J.; Takala, P.: Good debt or bad debt : detecting semantic orientations in economic texts (2014) 0.02
```
0.023652691 = product of:
  0.047305383 = sum of:
    0.047305383 = product of:
      0.07095807 = sum of:
        0.024974043 = weight(_text_:j in 1226) [ClassicSimilarity], result of:
          0.024974043 = score(doc=1226,freq=2.0), product of:
            0.14227505 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.044775832 = queryNorm
            0.17553353 = fieldWeight in 1226, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1226)
        0.045984026 = weight(_text_:n in 1226) [ClassicSimilarity], result of:
          0.045984026 = score(doc=1226,freq=2.0), product of:
            0.19305801 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.044775832 = queryNorm
            0.23818761 = fieldWeight in 1226, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1226)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)
```
Abstract: The use of robo-readers to analyze news texts is an emerging technology trend in computational finance. Recent research has developed sophisticated financial polarity lexicons for investigating how financial sentiments relate to future company performance. However, based on experience from fields that commonly analyze sentiment, it is well known that the overall semantic orientation of a sentence may differ from that of individual words. This article investigates how semantic orientations can be better detected in financial and economic news by accommodating the overall phrase-structure information and domain-specific use of language. Our three main contributions are the following: (a) a human-annotated finance phrase bank that can be used for training and evaluating alternative models; (b) a technique to enhance financial lexicons with attributes that help to identify expected direction of events that affect sentiment; and (c) a linearized phrase-structure model for detecting contextual semantic orientations in economic texts. The relevance of the newly added lexicon features and the benefit of using the proposed learning algorithm are demonstrated in a comparative study against general sentiment models as well as the popular word frequency models used in recent financial studies. The proposed framework is parsimonious and avoids the explosion in feature space caused by the use of conventional n-gram features.

Lawrie, D.; Mayfield, J.; McNamee, P.; Oard, D.W.: Cross-language person-entity linking from 20 languages (2015) 0.02

0.022122633 = product of:
  0.044245265 = sum of:
    0.044245265 = product of:
      0.066367894 = sum of:
        0.029968852 = weight(_text_:j in 1848) [ClassicSimilarity], result of:
          0.029968852 = score(doc=1848,freq=2.0), product of:
            0.14227505 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.044775832 = queryNorm
            0.21064025 = fieldWeight in 1848, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.046875 = fieldNorm(doc=1848)
        0.03639904 = weight(_text_:22 in 1848) [ClassicSimilarity], result of:
          0.03639904 = score(doc=1848,freq=2.0), product of:
            0.15679733 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.044775832 = queryNorm
            0.23214069 = fieldWeight in 1848, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=1848)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Abstract: The goal of entity linking is to associate references to an entity that is found in unstructured natural language content to an authoritative inventory of known entities. This article describes the construction of 6 test collections for cross-language person-entity linking that together span 22 languages. Fully automated components were used together with 2 crowdsourced validation stages to affordably generate ground-truth annotations with an accuracy comparable to that of a completely manual process. The resulting test collections each contain between 642 (Arabic) and 2,361 (Romanian) person references in non-English texts for which the correct resolution in English Wikipedia is known, plus a similar number of references for which no correct resolution into English Wikipedia is believed to exist. Fully automated cross-language person-name linking experiments with 20 non-English languages yielded a resolution accuracy of between 0.84 (Serbian) and 0.98 (Romanian), which compares favorably with previously reported cross-language entity linking results for Spanish.

Ahmed, F.; Nürnberger, A.: Evaluation of n-gram conflation approaches for Arabic text retrieval (2009) 0.02
```
0.020564683 = product of:
  0.041129366 = sum of:
    0.041129366 = product of:
      0.1233881 = sum of:
        0.1233881 = weight(_text_:n in 2941) [ClassicSimilarity], result of:
          0.1233881 = score(doc=2941,freq=10.0), product of:
            0.19305801 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.044775832 = queryNorm
            0.63912445 = fieldWeight in 2941, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.046875 = fieldNorm(doc=2941)
      0.33333334 = coord(1/3)
  0.5 = coord(1/2)
```
Abstract: In this paper we present a language-independent approach to conflation that does not depend on predefined rules or prior knowledge of the target language. The proposed unsupervised method is based on an enhancement of the pure n-gram model that can group related words based on various string-similarity measures, while restricting the search to specific locations of the target word by taking into account the order of n-grams. We show that the method achieves high similarity scores for all word-form variations and reduces ambiguity, i.e., obtains higher precision and recall than pure n-gram-based approaches, for English, Portuguese, and Arabic. The proposed method is especially suited to conflation in Arabic, since Arabic is a highly inflectional language. We therefore also present an adaptive user interface for Arabic text retrieval called araSearch. araSearch serves as a metasearch interface to existing search engines. The system can extend a query using the proposed conflation approach so that additional results for relevant subwords are found automatically.

Object: n-grams

Sienel, J.; Weiss, M.; Laube, M.: Sprachtechnologien für die Informationsgesellschaft des 21. Jahrhunderts (2000) 0.02

0.018435527 = product of:
  0.036871053 = sum of:
    0.036871053 = product of:
      0.055306576 = sum of:
        0.024974043 = weight(_text_:j in 5557) [ClassicSimilarity], result of:
          0.024974043 = score(doc=5557,freq=2.0), product of:
            0.14227505 = queryWeight, product of:
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.044775832 = queryNorm
            0.17553353 = fieldWeight in 5557, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.1774964 = idf(docFreq=5010, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5557)
        0.030332536 = weight(_text_:22 in 5557) [ClassicSimilarity], result of:
          0.030332536 = score(doc=5557,freq=2.0), product of:
            0.15679733 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.044775832 = queryNorm
            0.19345059 = fieldWeight in 5557, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5557)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Date: 26.12.2000 13:22:17

Alonge, A.; Calzolari, N.; Vossen, P.; Bloksma, L.; Castellon, I.; Marti, M.A.; Peters, W.: The linguistic design of the EuroWordNet database (1998) 0.02

0.018393612 = product of:
  0.036787223 = sum of:
    0.036787223 = product of:
      0.110361665 = sum of:
        0.110361665 = weight(_text_:n in 6440) [ClassicSimilarity], result of:
          0.110361665 = score(doc=6440,freq=2.0), product of:
            0.19305801 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.044775832 = queryNorm
            0.57165027 = fieldWeight in 6440, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.09375 = fieldNorm(doc=6440)
      0.33333334 = coord(1/3)
  0.5 = coord(1/2)

Figuerola, C.G.; Gomez, R.; Lopez de San Roman, E.: Stemming and n-grams in Spanish : an evaluation of their impact in information retrieval (2000) 0.02

0.018393612 = product of:
  0.036787223 = sum of:
    0.036787223 = product of:
      0.110361665 = sum of:
        0.110361665 = weight(_text_:n in 6501) [ClassicSimilarity], result of:
          0.110361665 = score(doc=6501,freq=2.0), product of:
            0.19305801 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.044775832 = queryNorm
            0.57165027 = fieldWeight in 6501, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.09375 = fieldNorm(doc=6501)
      0.33333334 = coord(1/3)
  0.5 = coord(1/2)
