Search (343 results, page 1 of 18)

  • Filter: theme_ss:"Computerlinguistik"
  1. Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.07
    Score 0.065247305 (Lucene ClassicSimilarity, TF-IDF). Per matching term, weight = coord × (idf × queryNorm) × (√freq × idf × fieldNorm), with queryNorm = 0.044623576 and fieldNorm = 0.046875:
      term "3a" (idf 8.478011, freq 2.0): 0.25 × 0.37831917 × 0.56201804 = 0.05315555
      term "22" (idf 3.5018296, freq 2.0): 0.33333334 × 0.15626416 × 0.23214069 = 0.012091756
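    A minimal sketch reproducing the breakdown above, assuming Lucene's standard ClassicSimilarity formula (the helper below is illustrative, not Lucene's API):

      import math

      def term_score(freq, idf, field_norm, query_norm, coord):
          # One term's contribution: coord * queryWeight * fieldWeight,
          # with tf = sqrt(freq), queryWeight = idf * queryNorm,
          # and fieldWeight = tf * idf * fieldNorm.
          return coord * (idf * query_norm) * (math.sqrt(freq) * idf * field_norm)

      QUERY_NORM = 0.044623576
      score = (term_score(2.0, 8.478011, 0.046875, QUERY_NORM, 0.25)
               + term_score(2.0, 3.5018296, 0.046875, QUERY_NORM, 1 / 3))
      print(f"{score:.9f}")  # ~0.065247305, displayed as 0.07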
    
    Content
    Cf.: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.91.4940&rep=rep1&type=pdf.
    Date
    8. 1.2013 10:22:32
  2. Noever, D.; Ciolino, M.: ¬The Turing deception (2022) 0.06
    
    Source
    https://arxiv.org/abs/2212.06721
  3. Sprachtechnologie : ein Überblick (2012) 0.05
    
    Abstract
    Serious and seriously pursued attempts to process human language by machine have existed for more than half a century. Machine translation and "natural" dialogue with computers were among the first ideas that staked out the field of what later became computational linguistics, or language technology, and guided its projects. Today this field, also known as natural language processing (NLP), is highly diversified: the rapid development of computer science has made much that was previously unimaginable a reality (e.g. automated telephone information services), and some things once impossible have at least become possible (e.g. handhelds with speech input and output serving as personal digital (information) assistants). There are various applications of computational linguistics, some of which have made the leap into commercial use (e.g. dictation systems, text classification, machine translation). Natural language systems (NLS) of widely varying functionality (e.g. for answering arbitrary questions or generating complex texts) remain the subject of intensive research, even though the ambitious goals of the early days are still far from being reached (and have accordingly been scaled back). Where machine language processing stands today, however, is neither obvious nor easy to find out, given the manifold activities in computational linguistics and language technology (for students of the field, and all the more so for laypersons). One aim of this book is to improve the current state of the literature in this respect by compiling the system-related aspects of computational linguistics into an overview of language technology.
    Language
    d
  4. Doszkocs, T.E.; Zamora, A.: Dictionary services and spelling aids for Web searching (2004) 0.04
    
    Abstract
    The Specialized Information Services Division (SIS) of the National Library of Medicine (NLM) provides Web access to more than a dozen scientific databases on toxicology and the environment on TOXNET. Search queries on TOXNET often include misspelled or variant English words, medical and scientific jargon, and chemical names. Following the example of search engines like Google and ClinicalTrials.gov, we set out to develop a spelling "suggestion" system for increased recall and precision in TOXNET searching. This paper describes the development of dictionary technology that can be used in a variety of applications such as orthographic verification, writing aid, natural language processing, and information storage and retrieval. The design of the technology allows complex applications to be built in a modular fashion from the components developed in the earlier phases of the work, without extensive rewriting of computer code. Since many of the potential applications envisioned for this work have on-line or web-based interfaces, the dictionaries and other computer components must have fast response and must be adaptable to open-ended database vocabularies, including chemical nomenclature. The dictionary vocabulary for this work was derived from SIS and other databases and specialized resources, such as NLM's Unified Medical Language System (UMLS). The resulting technology, A-Z Dictionary (AZdict), has three major constituents: 1) the vocabulary list; 2) the word attributes that define part of speech and morphological relationships between words in the list; and 3) a set of programs that implements the retrieval of words and their attributes and determines similarity between words (ChemSpell). These three components can be used in various applications such as spelling verification, spelling aid, part-of-speech tagging, paraphrasing, and many other natural language processing functions.
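    A minimal sketch of the dictionary-backed spelling suggestion the abstract describes (similarity ranking against a vocabulary list; the toy vocabulary and function name are illustrative, not AZdict's actual components):

      import difflib

      # Toy stand-in for the AZdict vocabulary derived from SIS/NLM resources.
      VOCABULARY = ["toxicology", "benzene", "arsenic", "formaldehyde", "chloroform"]

      def suggest(query, n=3):
          # Rank vocabulary words by string similarity to a possibly
          # misspelled query term.
          return difflib.get_close_matches(query.lower(), VOCABULARY, n=n, cutoff=0.6)

      print(suggest("toxocology"))  # ['toxicology']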
    Date
    14. 8.2004 17:22:56
    Source
    Online. 28(2004) no.3, S.22-29
  5. Granitzer, M.: Statistische Verfahren der Textanalyse (2006) 0.04
    
    Abstract
    This article offers an overview of statistical methods of text analysis in the context of the Semantic Web. By way of introduction, it discusses methods and common techniques for preprocessing texts, such as stemming or part-of-speech tagging. The representations introduced this way serve as the basis for statistical feature analyses as well as for more advanced techniques such as information extraction and machine learning. These specialized techniques are presented as an overview, with the most important aspects relating to the Semantic Web treated in detail. The article closes with the application of the presented techniques to the construction and maintenance of ontologies, and with pointers to further reading.
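    A minimal sketch of the preprocessing step the article starts from, turning raw text into a bag-of-stems term-frequency representation (the crude suffix stripper is a toy stand-in for a real stemmer such as Porter's):

      import re
      from collections import Counter

      def toy_stem(word):
          # Crude suffix stripping; a real system would use a proper
          # stemmer or lemmatizer.
          for suffix in ("ungen", "ung", "en", "er", "e", "s"):
              if word.endswith(suffix) and len(word) > len(suffix) + 3:
                  return word[:-len(suffix)]
          return word

      def features(text):
          # Tokenize, stem, and count: the representation that downstream
          # statistical feature analyses build on.
          tokens = re.findall(r"[a-zäöüß]+", text.lower())
          return Counter(toy_stem(t) for t in tokens)

      print(features("Statistische Verfahren der Textanalyse"))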
    Language
    d
  6. ¬Der Student aus dem Computer (2023) 0.04
    
    Date
    27. 1.2023 16:22:55
    Language
    d
  7. Endres-Niggemeyer, B.: Thinkie: Lautes Denken mit Spracherkennung (mobil) (2013) 0.03
    
    Abstract
    Thinking aloud is a proven method for investigating cognitive processes. It is used in many disciplines, e.g. to uncover what experiences users have when interacting with computer interfaces. After a brief explanation of thinking aloud, the app Thinkie is presented. Thinkie is a mobile solution for thinking aloud on the iPhone and iPad. The test person records the audio on the iPhone; the speech recognition software Siri (http://www.apple.com/de/ios/siri/) transcribes it. In parallel, video is recorded on the iPad or another device. On the iPad, with the video in view, the transcript can be worked up and interpreted. Thinkie transfers the text files via a cloud collection; the videos are transferred with iTunes. Thinkie is not yet ready for practical use: the segments Siri can process are still too short. That will change.
    Language
    d
  8. Monnerjahn, P.: Vorsprung ohne Technik : Übersetzen: Computer und Qualität (2000) 0.03
    
    Language
    d
    Source
    c't. 2000, H.22, S.230-231
  9. Lezius, W.; Rapp, R.; Wettler, M.: ¬A morphology-system and part-of-speech tagger for German (1996) 0.03
    
    Date
    22. 3.2015 9:37:18
    Language
    d
    Source
    Natural language processing and speech technology: Results of the 3rd KONVENS Conference, Bielefeld, October 1996. Ed.: D. Gibbon
  10. Kuhlmann, U.; Monnerjahn, P.: Sprache auf Knopfdruck : Sieben automatische Übersetzungsprogramme im Test (2000) 0.03
    
    Language
    d
    Source
    c't. 2000, H.22, S.220-229
  11. Ruge, G.: Sprache und Computer : Wortbedeutung und Termassoziation. Methoden zur automatischen semantischen Klassifikation (1995) 0.02
    
    Footnote
    Reviewed in: Knowledge organization 22(1995) no.3/4, S.182-184 (M.T. Rolland)
    Language
    d
    Type
    d
  12. Lezius, W.: Morphy - Morphologie und Tagging für das Deutsche (2013) 0.02
    
    Date
    22. 3.2015 9:30:24
    Language
    d
  13. Bager, J.: ¬Die Text-KI ChatGPT schreibt Fachtexte, Prosa, Gedichte und Programmcode (2023) 0.02
    
    Date
    29.12.2022 18:22:55
    Language
    d
  14. Rieger, F.: Lügende Computer (2023) 0.02
    
    Date
    16. 3.2023 19:22:55
    Language
    d
  15. Xianghao, G.; Yixin, Z.; Li, Y.: ¬A new method of news text understanding and abstracting based on speech acts theory (1998) 0.02
    
  16. Schneider, R.: Web 3.0 ante portas? : Integration von Social Web und Semantic Web (2008) 0.02
    
    Date
    22. 1.2011 10:38:28
    Language
    d
  17. Savoy, J.: Text representation strategies : an example with the State of the Union addresses (2016) 0.02
    
    Abstract
    Based on State of the Union addresses from 1790 to 2014 (225 speeches delivered by 42 presidents), this paper describes and evaluates different text representation strategies. To determine the most important words of a given text, the term frequencies (tf) or the tf-idf weighting scheme can be applied. Recently, latent Dirichlet allocation (LDA) has been proposed to define the topics included in a corpus. As another strategy, this study proposes to apply a vocabulary specificity measure (Z-score) to determine the most significantly overused word-types or short sequences of them. Our experiments show that the simple term frequency measure is not able to discriminate between specific terms associated with a document or a set of texts. Using the tf-idf or LDA approach, the selection requires some arbitrary decisions. Based on the term-specific measure (Z-score), the term selection has a clear theoretical basis. Moreover, the most significant sentences for each presidency can be determined. As another facet, we can visualize the dynamic evolution of the usage of some terms associated with their specificity measures. Finally, this technique can be employed to define the most important lexical leaders introducing terms overused by the k following presidencies.
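    A minimal sketch of a Z-score specificity measure of the kind described above, under a binomial model (a common formulation; the paper's exact normalization may differ in detail):

      import math
      from collections import Counter

      def z_scores(sub_tokens, corpus_tokens):
          # z = (tf - n*p) / sqrt(n*p*(1-p)): how far a word's frequency in
          # one subcorpus (e.g. one presidency) deviates from its corpus-wide
          # relative frequency p, given the subcorpus length n.
          corpus_tf = Counter(corpus_tokens)
          total = len(corpus_tokens)
          n = len(sub_tokens)
          scores = {}
          for word, tf in Counter(sub_tokens).items():
              p = corpus_tf[word] / total
              if 0 < p < 1:  # variance is zero at p = 0 or 1
                  scores[word] = (tf - n * p) / math.sqrt(n * p * (1 - p))
          return scores

    Sorting the result in descending order yields the most significantly overused word-types for that subcorpus.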
  18. Warner, A.J.: Natural language processing (1987) 0.02
    
    Source
    Annual review of information science and technology. 22(1987), S.79-108
  19. Lorenz, S.: Konzeption und prototypische Realisierung einer begriffsbasierten Texterschließung (2006) 0.02
    
    Date
    22. 3.2015 9:17:30
    Language
    d
  20. Lawrie, D.; Mayfield, J.; McNamee, P.; Oard, P.W.: Cross-language person-entity linking from 20 languages (2015) 0.02
    
    Abstract
    The goal of entity linking is to associate references to an entity that is found in unstructured natural language content to an authoritative inventory of known entities. This article describes the construction of 6 test collections for cross-language person-entity linking that together span 22 languages. Fully automated components were used together with 2 crowdsourced validation stages to affordably generate ground-truth annotations with an accuracy comparable to that of a completely manual process. The resulting test collections each contain between 642 (Arabic) and 2,361 (Romanian) person references in non-English texts for which the correct resolution in English Wikipedia is known, plus a similar number of references for which no correct resolution into English Wikipedia is believed to exist. Fully automated cross-language person-name linking experiments with 20 non-English languages yielded a resolution accuracy of between 0.84 (Serbian) and 0.98 (Romanian), which compares favorably with previously reported cross-language entity linking results for Spanish.

Types

  • a 259
  • el 51
  • m 44
  • s 21
  • x 11
  • d 3
  • p 2
  • b 1
