Search (347 results, page 1 of 18)

  • × theme_ss:"Computerlinguistik"
  1. Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.08
    0.08447748 = sum of:
      0.06298564 = product of:
        0.25194255 = sum of:
          0.25194255 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
            0.25194255 = score(doc=562,freq=2.0), product of:
              0.44828197 = queryWeight, product of:
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.052875843 = queryNorm
              0.56201804 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.046875 = fieldNorm(doc=562)
        0.25 = coord(1/4)
      0.021491839 = product of:
        0.042983677 = sum of:
          0.042983677 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
            0.042983677 = score(doc=562,freq=2.0), product of:
              0.18516219 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.052875843 = queryNorm
              0.23214069 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=562)
        0.5 = coord(1/2)
    
    Content
    Vgl.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
    Date
    8. 1.2013 10:22:32
  2. Noever, D.; Ciolino, M.: ¬The Turing deception (2022) 0.07
    0.06931168 = sum of:
      0.06298564 = product of:
        0.25194255 = sum of:
          0.25194255 = weight(_text_:3a in 862) [ClassicSimilarity], result of:
            0.25194255 = score(doc=862,freq=2.0), product of:
              0.44828197 = queryWeight, product of:
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.052875843 = queryNorm
              0.56201804 = fieldWeight in 862, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.046875 = fieldNorm(doc=862)
        0.25 = coord(1/4)
      0.0063260444 = product of:
        0.012652089 = sum of:
          0.012652089 = weight(_text_:d in 862) [ClassicSimilarity], result of:
            0.012652089 = score(doc=862,freq=2.0), product of:
              0.10045733 = queryWeight, product of:
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.052875843 = queryNorm
              0.1259449 = fieldWeight in 862, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.046875 = fieldNorm(doc=862)
        0.5 = coord(1/2)
    
    Source
    https%3A%2F%2Farxiv.org%2Fabs%2F2212.06721&usg=AOvVaw3i_9pZm9y_dQWoHi6uv0EN
  3. ¬Der Student aus dem Computer (2023) 0.06
    0.0649084 = product of:
      0.1298168 = sum of:
        0.1298168 = sum of:
          0.02952154 = weight(_text_:d in 1079) [ClassicSimilarity], result of:
            0.02952154 = score(doc=1079,freq=2.0), product of:
              0.10045733 = queryWeight, product of:
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.052875843 = queryNorm
              0.29387143 = fieldWeight in 1079, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.109375 = fieldNorm(doc=1079)
          0.10029525 = weight(_text_:22 in 1079) [ClassicSimilarity], result of:
            0.10029525 = score(doc=1079,freq=2.0), product of:
              0.18516219 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.052875843 = queryNorm
              0.5416616 = fieldWeight in 1079, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.109375 = fieldNorm(doc=1079)
      0.5 = coord(1/2)
    
    Date
    27. 1.2023 16:22:55
    Language
    d
  4. Monnerjahn, P.: Vorsprung ohne Technik : Übersetzen: Computer und Qualität (2000) 0.06
    0.055635765 = product of:
      0.11127153 = sum of:
        0.11127153 = sum of:
          0.025304178 = weight(_text_:d in 5429) [ClassicSimilarity], result of:
            0.025304178 = score(doc=5429,freq=2.0), product of:
              0.10045733 = queryWeight, product of:
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.052875843 = queryNorm
              0.2518898 = fieldWeight in 5429, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.09375 = fieldNorm(doc=5429)
          0.085967354 = weight(_text_:22 in 5429) [ClassicSimilarity], result of:
            0.085967354 = score(doc=5429,freq=2.0), product of:
              0.18516219 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.052875843 = queryNorm
              0.46428138 = fieldWeight in 5429, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.09375 = fieldNorm(doc=5429)
      0.5 = coord(1/2)
    
    Language
    d
    Source
    c't. 2000, H.22, S.230-231
  5. Lezius, W.; Rapp, R.; Wettler, M.: ¬A morphology-system and part-of-speech tagger for German (1996) 0.05
    0.050730363 = product of:
      0.101460725 = sum of:
        0.101460725 = sum of:
          0.029821262 = weight(_text_:d in 1693) [ClassicSimilarity], result of:
            0.029821262 = score(doc=1693,freq=4.0), product of:
              0.10045733 = queryWeight, product of:
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.052875843 = queryNorm
              0.296855 = fieldWeight in 1693, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.078125 = fieldNorm(doc=1693)
          0.07163946 = weight(_text_:22 in 1693) [ClassicSimilarity], result of:
            0.07163946 = score(doc=1693,freq=2.0), product of:
              0.18516219 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.052875843 = queryNorm
              0.38690117 = fieldWeight in 1693, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.078125 = fieldNorm(doc=1693)
      0.5 = coord(1/2)
    
    Date
    22. 3.2015 9:37:18
    Language
    d
    Source
    Natural language processing and speech technology: Results of the 3rd KONVENS Conference, Bielefeld, October 1996. Ed.: D. Gibbon
  6. Kuhlmann, U.; Monnerjahn, P.: Sprache auf Knopfdruck : Sieben automatische Übersetzungsprogramme im Test (2000) 0.05
    0.046363138 = product of:
      0.092726275 = sum of:
        0.092726275 = sum of:
          0.021086816 = weight(_text_:d in 5428) [ClassicSimilarity], result of:
            0.021086816 = score(doc=5428,freq=2.0), product of:
              0.10045733 = queryWeight, product of:
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.052875843 = queryNorm
              0.20990817 = fieldWeight in 5428, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.078125 = fieldNorm(doc=5428)
          0.07163946 = weight(_text_:22 in 5428) [ClassicSimilarity], result of:
            0.07163946 = score(doc=5428,freq=2.0), product of:
              0.18516219 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.052875843 = queryNorm
              0.38690117 = fieldWeight in 5428, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.078125 = fieldNorm(doc=5428)
      0.5 = coord(1/2)
    
    Language
    d
    Source
    c't. 2000, H.22, S.220-229
  7. Deventer, J.P. van; Kruger, C.J.; Johnson, R.D.: Delineating knowledge management through lexical analysis : a retrospective (2015) 0.04
    0.040644467 = sum of:
      0.028107561 = product of:
        0.112430245 = sum of:
          0.112430245 = weight(_text_:authors in 3807) [ClassicSimilarity], result of:
            0.112430245 = score(doc=3807,freq=14.0), product of:
              0.24105114 = queryWeight, product of:
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.052875843 = queryNorm
              0.46641657 = fieldWeight in 3807, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.02734375 = fieldNorm(doc=3807)
        0.25 = coord(1/4)
      0.012536907 = product of:
        0.025073813 = sum of:
          0.025073813 = weight(_text_:22 in 3807) [ClassicSimilarity], result of:
            0.025073813 = score(doc=3807,freq=2.0), product of:
              0.18516219 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.052875843 = queryNorm
              0.1354154 = fieldWeight in 3807, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.02734375 = fieldNorm(doc=3807)
        0.5 = coord(1/2)
    
    Abstract
    Purpose Academic authors tend to define terms that meet their own needs. Knowledge Management (KM) is a term that comes to mind and is examined in this study. Lexicographical research identified KM terms used by authors from 1996 to 2006 in academic outlets to define KM. Data were collected based on strict criteria which included that definitions should be unique instances. From 2006 onwards, these authors could not identify new unique instances of definitions with repetitive usage of such definition instances. Analysis revealed that KM is directly defined by People (Person and Organisation), Processes (Codify, Share, Leverage, and Process) and Contextualised Content (Information). The paper aims to discuss these issues. Design/methodology/approach The aim of this paper is to add to the body of knowledge in the KM discipline and supply KM practitioners and scholars with insight into what is commonly regarded to be KM so as to reignite the debate on what one could consider as KM. The lexicon used by KM scholars was evaluated though the application of lexicographical research methods as extended though Knowledge Discovery and Text Analysis methods. Findings By simplifying term relationships through the application of lexicographical research methods, as extended though Knowledge Discovery and Text Analysis methods, it was found that KM is directly defined by People (Person and Organisation), Processes (Codify, Share, Leverage, Process) and Contextualised Content (Information). One would therefore be able to indicate that KM, from an academic point of view, refers to people processing contextualised content.
    Research limitations/implications In total, 42 definitions were identified spanning a period of 11 years. This represented the first use of KM through the estimated apex of terms used. From 2006 onwards definitions were used in repetition, and all definitions that were considered to repeat were therefore subsequently excluded as not being unique instances. All definitions listed are by no means complete and exhaustive. The definitions are viewed outside the scope and context in which they were originally formulated and then used to review the key concepts in the definitions themselves. Social implications When the authors refer to the aforementioned discussion of KM content as well as the presentation of the method followed in this paper, the authors may have a few implications for future research in KM. First the research validates ideas presented by the OECD in 2005 pertaining to KM. It also validates that through the evolution of KM, the authors ended with a description of KM that may be seen as a standardised description. If the authors as academics and practitioners, for example, refer to KM as the same construct and/or idea, it has the potential to speculatively, distinguish between what KM may or may not be. Originality/value By simplifying the term used to define KM, by focusing on the most common definitions, the paper assist in refocusing KM by reconsidering the dimensions that is the most common in how it has been defined over time. This would hopefully assist in reigniting discussions about KM and how it may be used to the benefit of an organisation.
    Date
    20. 1.2015 18:30:22
  8. Ruge, G.: Sprache und Computer : Wortbedeutung und Termassoziation. Methoden zur automatischen semantischen Klassifikation (1995) 0.04
    0.04058429 = product of:
      0.08116858 = sum of:
        0.08116858 = sum of:
          0.023857009 = weight(_text_:d in 1534) [ClassicSimilarity], result of:
            0.023857009 = score(doc=1534,freq=4.0), product of:
              0.10045733 = queryWeight, product of:
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.052875843 = queryNorm
              0.237484 = fieldWeight in 1534, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.0625 = fieldNorm(doc=1534)
          0.057311572 = weight(_text_:22 in 1534) [ClassicSimilarity], result of:
            0.057311572 = score(doc=1534,freq=2.0), product of:
              0.18516219 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.052875843 = queryNorm
              0.30952093 = fieldWeight in 1534, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0625 = fieldNorm(doc=1534)
      0.5 = coord(1/2)
    
    Footnote
    Rez. in: Knowledge organization 22(1995) no.3/4, S.182-184 (M.T. Rolland)
    Language
    d
    Type
    d
  9. Lezius, W.: Morphy - Morphologie und Tagging für das Deutsche (2013) 0.04
    0.03709051 = product of:
      0.07418102 = sum of:
        0.07418102 = sum of:
          0.016869452 = weight(_text_:d in 1490) [ClassicSimilarity], result of:
            0.016869452 = score(doc=1490,freq=2.0), product of:
              0.10045733 = queryWeight, product of:
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.052875843 = queryNorm
              0.16792654 = fieldWeight in 1490, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.0625 = fieldNorm(doc=1490)
          0.057311572 = weight(_text_:22 in 1490) [ClassicSimilarity], result of:
            0.057311572 = score(doc=1490,freq=2.0), product of:
              0.18516219 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.052875843 = queryNorm
              0.30952093 = fieldWeight in 1490, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0625 = fieldNorm(doc=1490)
      0.5 = coord(1/2)
    
    Date
    22. 3.2015 9:30:24
    Language
    d
  10. Bager, J.: ¬Die Text-KI ChatGPT schreibt Fachtexte, Prosa, Gedichte und Programmcode (2023) 0.04
    0.03709051 = product of:
      0.07418102 = sum of:
        0.07418102 = sum of:
          0.016869452 = weight(_text_:d in 835) [ClassicSimilarity], result of:
            0.016869452 = score(doc=835,freq=2.0), product of:
              0.10045733 = queryWeight, product of:
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.052875843 = queryNorm
              0.16792654 = fieldWeight in 835, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.0625 = fieldNorm(doc=835)
          0.057311572 = weight(_text_:22 in 835) [ClassicSimilarity], result of:
            0.057311572 = score(doc=835,freq=2.0), product of:
              0.18516219 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.052875843 = queryNorm
              0.30952093 = fieldWeight in 835, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0625 = fieldNorm(doc=835)
      0.5 = coord(1/2)
    
    Date
    29.12.2022 18:22:55
    Language
    d
  11. Rieger, F.: Lügende Computer (2023) 0.04
    0.03709051 = product of:
      0.07418102 = sum of:
        0.07418102 = sum of:
          0.016869452 = weight(_text_:d in 912) [ClassicSimilarity], result of:
            0.016869452 = score(doc=912,freq=2.0), product of:
              0.10045733 = queryWeight, product of:
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.052875843 = queryNorm
              0.16792654 = fieldWeight in 912, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.0625 = fieldNorm(doc=912)
          0.057311572 = weight(_text_:22 in 912) [ClassicSimilarity], result of:
            0.057311572 = score(doc=912,freq=2.0), product of:
              0.18516219 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.052875843 = queryNorm
              0.30952093 = fieldWeight in 912, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0625 = fieldNorm(doc=912)
      0.5 = coord(1/2)
    
    Date
    16. 3.2023 19:22:55
    Language
    d
  12. Schneider, R.: Web 3.0 ante portas? : Integration von Social Web und Semantic Web (2008) 0.03
    0.0324542 = product of:
      0.0649084 = sum of:
        0.0649084 = sum of:
          0.01476077 = weight(_text_:d in 4184) [ClassicSimilarity], result of:
            0.01476077 = score(doc=4184,freq=2.0), product of:
              0.10045733 = queryWeight, product of:
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.052875843 = queryNorm
              0.14693572 = fieldWeight in 4184, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4184)
          0.050147627 = weight(_text_:22 in 4184) [ClassicSimilarity], result of:
            0.050147627 = score(doc=4184,freq=2.0), product of:
              0.18516219 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.052875843 = queryNorm
              0.2708308 = fieldWeight in 4184, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4184)
      0.5 = coord(1/2)
    
    Date
    22. 1.2011 10:38:28
    Language
    d
  13. Warner, A.J.: Natural language processing (1987) 0.03
    0.028655786 = product of:
      0.057311572 = sum of:
        0.057311572 = product of:
          0.114623144 = sum of:
            0.114623144 = weight(_text_:22 in 337) [ClassicSimilarity], result of:
              0.114623144 = score(doc=337,freq=2.0), product of:
                0.18516219 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.052875843 = queryNorm
                0.61904186 = fieldWeight in 337, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.125 = fieldNorm(doc=337)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Annual review of information science and technology. 22(1987), S.79-108
  14. Lorenz, S.: Konzeption und prototypische Realisierung einer begriffsbasierten Texterschließung (2006) 0.03
    0.027817883 = product of:
      0.055635765 = sum of:
        0.055635765 = sum of:
          0.012652089 = weight(_text_:d in 1746) [ClassicSimilarity], result of:
            0.012652089 = score(doc=1746,freq=2.0), product of:
              0.10045733 = queryWeight, product of:
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.052875843 = queryNorm
              0.1259449 = fieldWeight in 1746, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.046875 = fieldNorm(doc=1746)
          0.042983677 = weight(_text_:22 in 1746) [ClassicSimilarity], result of:
            0.042983677 = score(doc=1746,freq=2.0), product of:
              0.18516219 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.052875843 = queryNorm
              0.23214069 = fieldWeight in 1746, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=1746)
      0.5 = coord(1/2)
    
    Date
    22. 3.2015 9:17:30
    Language
    d
  15. Lawrie, D.; Mayfield, J.; McNamee, P.; Oard, P.W.: Cross-language person-entity linking from 20 languages (2015) 0.03
    0.027817883 = product of:
      0.055635765 = sum of:
        0.055635765 = sum of:
          0.012652089 = weight(_text_:d in 1848) [ClassicSimilarity], result of:
            0.012652089 = score(doc=1848,freq=2.0), product of:
              0.10045733 = queryWeight, product of:
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.052875843 = queryNorm
              0.1259449 = fieldWeight in 1848, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.046875 = fieldNorm(doc=1848)
          0.042983677 = weight(_text_:22 in 1848) [ClassicSimilarity], result of:
            0.042983677 = score(doc=1848,freq=2.0), product of:
              0.18516219 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.052875843 = queryNorm
              0.23214069 = fieldWeight in 1848, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=1848)
      0.5 = coord(1/2)
    
    Abstract
    The goal of entity linking is to associate references to an entity that is found in unstructured natural language content to an authoritative inventory of known entities. This article describes the construction of 6 test collections for cross-language person-entity linking that together span 22 languages. Fully automated components were used together with 2 crowdsourced validation stages to affordably generate ground-truth annotations with an accuracy comparable to that of a completely manual process. The resulting test collections each contain between 642 (Arabic) and 2,361 (Romanian) person references in non-English texts for which the correct resolution in English Wikipedia is known, plus a similar number of references for which no correct resolution into English Wikipedia is believed to exist. Fully automated cross-language person-name linking experiments with 20 non-English languages yielded a resolution accuracy of between 0.84 (Serbian) and 0.98 (Romanian), which compares favorably with previously reported cross-language entity linking results for Spanish.
  16. Levin, M.; Krawczyk, S.; Bethard, S.; Jurafsky, D.: Citation-based bootstrapping for large-scale author disambiguation (2012) 0.03
    0.02673474 = sum of:
      0.021463035 = product of:
        0.08585214 = sum of:
          0.08585214 = weight(_text_:authors in 246) [ClassicSimilarity], result of:
            0.08585214 = score(doc=246,freq=4.0), product of:
              0.24105114 = queryWeight, product of:
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.052875843 = queryNorm
              0.35615736 = fieldWeight in 246, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.558814 = idf(docFreq=1258, maxDocs=44218)
                0.0390625 = fieldNorm(doc=246)
        0.25 = coord(1/4)
      0.005271704 = product of:
        0.010543408 = sum of:
          0.010543408 = weight(_text_:d in 246) [ClassicSimilarity], result of:
            0.010543408 = score(doc=246,freq=2.0), product of:
              0.10045733 = queryWeight, product of:
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.052875843 = queryNorm
              0.104954086 = fieldWeight in 246, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.0390625 = fieldNorm(doc=246)
        0.5 = coord(1/2)
    
    Abstract
    We present a new, two-stage, self-supervised algorithm for author disambiguation in large bibliographic databases. In the first "bootstrap" stage, a collection of high-precision features is used to bootstrap a training set with positive and negative examples of coreferring authors. A supervised feature-based classifier is then trained on the bootstrap clusters and used to cluster the authors in a larger unlabeled dataset. Our self-supervised approach shares the advantages of unsupervised approaches (no need for expensive hand labels) as well as supervised approaches (a rich set of features that can be discriminatively trained). The algorithm disambiguates 54,000,000 author instances in Thomson Reuters' Web of Knowledge with B3 F1 of.807. We analyze parameters and features, particularly those from citation networks, which have not been deeply investigated in author disambiguation. The most important citation feature is self-citation, which can be approximated without expensive extraction of the full network. For the supervised stage, the minor improvement due to other citation features (increasing F1 from.748 to.767) suggests they may not be worth the trouble of extracting from databases that don't already have them. A lean feature set without expensive abstract and title features performs 130 times faster with about equal F1.
  17. Sienel, J.; Weiss, M.; Laube, M.: Sprachtechnologien für die Informationsgesellschaft des 21. Jahrhunderts (2000) 0.03
    0.025365181 = product of:
      0.050730363 = sum of:
        0.050730363 = sum of:
          0.014910631 = weight(_text_:d in 5557) [ClassicSimilarity], result of:
            0.014910631 = score(doc=5557,freq=4.0), product of:
              0.10045733 = queryWeight, product of:
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.052875843 = queryNorm
              0.1484275 = fieldWeight in 5557, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                1.899872 = idf(docFreq=17979, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5557)
          0.03581973 = weight(_text_:22 in 5557) [ClassicSimilarity], result of:
            0.03581973 = score(doc=5557,freq=2.0), product of:
              0.18516219 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.052875843 = queryNorm
              0.19345059 = fieldWeight in 5557, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0390625 = fieldNorm(doc=5557)
      0.5 = coord(1/2)
    
    Date
    26.12.2000 13:22:17
    Language
    d
    Source
    Sprachtechnologie für eine dynamische Wirtschaft im Medienzeitalter - Language technologies for dynamic business in the age of the media - L'ingénierie linguistique au service de la dynamisation économique à l'ère du multimédia: Tagungsakten der XXVI. Jahrestagung der Internationalen Vereinigung Sprache und Wirtschaft e.V., 23.-25.11.2000, Fachhochschule Köln. Hrsg.: K.-D. Schmitz
  18. McMahon, J.G.; Smith, F.J.: Improved statistical language model performance with automatic generated word hierarchies (1996) 0.03
    0.025073813 = product of:
      0.050147627 = sum of:
        0.050147627 = product of:
          0.10029525 = sum of:
            0.10029525 = weight(_text_:22 in 3164) [ClassicSimilarity], result of:
              0.10029525 = score(doc=3164,freq=2.0), product of:
                0.18516219 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.052875843 = queryNorm
                0.5416616 = fieldWeight in 3164, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=3164)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Computational linguistics. 22(1996) no.2, S.217-248
  19. Ruge, G.: ¬A spreading activation network for automatic generation of thesaurus relationships (1991) 0.03
    0.025073813 = product of:
      0.050147627 = sum of:
        0.050147627 = product of:
          0.10029525 = sum of:
            0.10029525 = weight(_text_:22 in 4506) [ClassicSimilarity], result of:
              0.10029525 = score(doc=4506,freq=2.0), product of:
                0.18516219 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.052875843 = queryNorm
                0.5416616 = fieldWeight in 4506, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4506)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    8.10.2000 11:52:22
  20. Somers, H.: Example-based machine translation : Review article (1999) 0.03
    0.025073813 = product of:
      0.050147627 = sum of:
        0.050147627 = product of:
          0.10029525 = sum of:
            0.10029525 = weight(_text_:22 in 6672) [ClassicSimilarity], result of:
              0.10029525 = score(doc=6672,freq=2.0), product of:
                0.18516219 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.052875843 = queryNorm
                0.5416616 = fieldWeight in 6672, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=6672)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    31. 7.1996 9:22:19

Languages

  • d 234
  • e 106
  • m 4
  • f 3
  • da 1
  • slv 1
  • More… Less…

Types

  • a 262
  • el 52
  • m 45
  • s 21
  • x 11
  • d 3
  • p 2
  • b 1
  • More… Less…

Subjects

Classifications