Search (106 results, page 1 of 6)

Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.18

0.17643562 = product of:
  0.29405937 = sum of:
    0.0690943 = product of:
      0.2072829 = sum of:
        0.2072829 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
          0.2072829 = score(doc=562,freq=2.0), product of:
            0.36881894 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.043503 = queryNorm
            0.56201804 = fieldWeight in 562, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.046875 = fieldNorm(doc=562)
      0.33333334 = coord(1/3)
    0.2072829 = weight(_text_:2f in 562) [ClassicSimilarity], result of:
      0.2072829 = score(doc=562,freq=2.0), product of:
        0.36881894 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.043503 = queryNorm
        0.56201804 = fieldWeight in 562, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=562)
    0.017682169 = product of:
      0.035364337 = sum of:
        0.035364337 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
          0.035364337 = score(doc=562,freq=2.0), product of:
            0.1523401 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043503 = queryNorm
            0.23214069 = fieldWeight in 562, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=562)
      0.5 = coord(1/2)
  0.6 = coord(3/5)

Content: Cf.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
Date: 8. 1.2013 10:22:32
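
The leaf values in the explain tree above can be checked by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the function names are illustrative and not part of any library. queryNorm and fieldNorm are taken as given from the tree rather than derived.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as printed in the idf(...) lines."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term frequency factor as printed in the tf(freq=...) lines."""
    return math.sqrt(freq)


def term_score(freq: float, doc_freq: int, max_docs: int,
               query_norm: float, field_norm: float) -> float:
    """Leaf score = queryWeight * fieldWeight for a single term."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = idf * query_norm                      # idf * queryNorm
    field_weight = classic_tf(freq) * idf * field_norm   # tf * idf * fieldNorm
    return query_weight * field_weight


# Inputs copied from the "_text_:3a in 562" leaf above.
score_3a = term_score(freq=2.0, doc_freq=24, max_docs=44218,
                      query_norm=0.043503, field_norm=0.046875)
print(score_3a)  # should be close to the 0.2072829 shown in the tree
```

Because idf enters both queryWeight and fieldWeight, a rare term such as `3a` (docFreq=24) contributes far more than a common one such as `22` (docFreq=3622) at the same frequency and field norm.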

Noever, D.; Ciolino, M.: ¬The Turing deception (2022) 0.11

0.11055088 = product of:
  0.2763772 = sum of:
    0.0690943 = product of:
      0.2072829 = sum of:
        0.2072829 = weight(_text_:3a in 862) [ClassicSimilarity], result of:
          0.2072829 = score(doc=862,freq=2.0), product of:
            0.36881894 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.043503 = queryNorm
            0.56201804 = fieldWeight in 862, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.046875 = fieldNorm(doc=862)
      0.33333334 = coord(1/3)
    0.2072829 = weight(_text_:2f in 862) [ClassicSimilarity], result of:
      0.2072829 = score(doc=862,freq=2.0), product of:
        0.36881894 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.043503 = queryNorm
        0.56201804 = fieldWeight in 862, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=862)
  0.4 = coord(2/5)

Source: https%3A%2F%2Farxiv.org%2Fabs%2F2212.06721&usg=AOvVaw3i_9pZm9y_dQWoHi6uv0EN
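
The way the tree above rolls leaf scores up into the final 0.11 can be sketched as follows. This assumes the coord factor is simply matched clauses divided by total clauses, as the `coord(1/3)` and `coord(2/5)` lines suggest; `combine` is a hypothetical helper name, not a Lucene API.

```python
def combine(clause_scores: list[float], matched: int, total: int) -> float:
    """Sum the matching clause scores, then scale by coord(matched/total)."""
    return sum(clause_scores) * (matched / total)


# Leaf score shared by the "3a" and "2f" terms in doc 862 (copied from the tree).
leaf = 0.2072829

# The "3a" clause sits inside a nested query where 1 of 3 sub-clauses matched.
nested_3a = combine([leaf], matched=1, total=3)

# At the top level, 2 of the 5 query clauses matched.
doc_862 = combine([nested_3a, leaf], matched=2, total=5)

print(nested_3a)  # should be close to 0.0690943
print(doc_862)    # should be close to 0.11055088
```

The same leaf score therefore counts three times as much when it matches at the top level as when it matches inside the nested clause, which is why the `2f` hit dominates this document's total.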

Weingarten, R.: ¬Die Verkabelung der Sprache : Grenzen der Technisierung von Kommunikation (1989) 0.10

0.10495796 = product of:
  0.2623949 = sum of:
    0.05972695 = weight(_text_:b in 7156) [ClassicSimilarity], result of:
      0.05972695 = score(doc=7156,freq=4.0), product of:
        0.15412949 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.043503 = queryNorm
        0.3875115 = fieldWeight in 7156, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.0546875 = fieldNorm(doc=7156)
    0.20266797 = weight(_text_:191 in 7156) [ClassicSimilarity], result of:
      0.20266797 = score(doc=7156,freq=4.0), product of:
        0.28391814 = queryWeight, product of:
          6.5264034 = idf(docFreq=175, maxDocs=44218)
          0.043503 = queryNorm
        0.71382535 = fieldWeight in 7156, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          6.5264034 = idf(docFreq=175, maxDocs=44218)
          0.0546875 = fieldNorm(doc=7156)
  0.4 = coord(2/5)

Classification: Spr B 191 / Kommunikation
SBB: Spr B 191 / Kommunikation

Huo, W.: Automatic multi-word term extraction and its application to Web-page summarization (2012) 0.09

0.08998603 = product of:
  0.22496507 = sum of:
    0.2072829 = weight(_text_:2f in 563) [ClassicSimilarity], result of:
      0.2072829 = score(doc=563,freq=2.0), product of:
        0.36881894 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.043503 = queryNorm
        0.56201804 = fieldWeight in 563, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=563)
    0.017682169 = product of:
      0.035364337 = sum of:
        0.035364337 = weight(_text_:22 in 563) [ClassicSimilarity], result of:
          0.035364337 = score(doc=563,freq=2.0), product of:
            0.1523401 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043503 = queryNorm
            0.23214069 = fieldWeight in 563, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=563)
      0.5 = coord(1/2)
  0.4 = coord(2/5)

Content: A thesis presented to the University of Guelph in partial fulfilment of requirements for the degree of Master of Science in Computer Science. Cf.: http://www.inf.ufrgs.br%2F~ceramisch%2Fdownload_files%2Fpublications%2F2009%2Fp01.pdf.
Date: 10. 1.2013 19:22:47

Sprachtechnologie : ein Überblick (2012) 0.05
```
0.045090124 = product of:
  0.11272531 = sum of:
    0.060333326 = weight(_text_:b in 1750) [ClassicSimilarity], result of:
      0.060333326 = score(doc=1750,freq=8.0), product of:
        0.15412949 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.043503 = queryNorm
        0.3914457 = fieldWeight in 1750, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1750)
    0.052391984 = product of:
      0.10478397 = sum of:
        0.10478397 = weight(_text_:erfahrung in 1750) [ClassicSimilarity], result of:
          0.10478397 = score(doc=1750,freq=2.0), product of:
            0.28725627 = queryWeight, product of:
              6.603137 = idf(docFreq=162, maxDocs=44218)
              0.043503 = queryNorm
            0.3647752 = fieldWeight in 1750, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              6.603137 = idf(docFreq=162, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1750)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Abstract

For more than half a century there have been serious and credible attempts to process human language by machine. Machine translation and "natural" dialogues with computers were among the first ideas that marked out the field of what later became computational linguistics or language technology and guided its aims. Today this field, also called natural language processing (NLP), is highly diversified: thanks to the rapid development of computer science, much that was previously unimaginable has become reality (e.g. automated telephone directory enquiries), and some things that were once impossible have at least become possible (e.g. handhelds with speech input and output as digital personal (information) assistants). Computational linguistics has various applications, some of which have made the leap into commercial use (e.g. dictation systems, text classification, machine translation). Intensive research continues on natural language systems (NLS) with a wide range of functionality (e.g. answering arbitrary questions or generating complex texts), even though the ambitious goals of the past are far from achieved (and have therefore been scaled back accordingly). Where natural language processing stands today, however, is neither obvious nor easy to find out, given the many activities in computational linguistics and language technology (for students of the subject, and all the more so for lay readers). One aim of this book is to improve the current state of the literature in this respect by bringing together specifically system-related aspects of computational linguistics as an overview of language technology.

Winterschladen, S.; Gurevych, I.: ¬Die perfekte Suchmaschine : Forschungsgruppe entwickelt ein System, das artverwandte Begriffe finden soll (2006) 0.04
```
0.043331336 = product of:
  0.10832834 = sum of:
    0.07165395 = weight(_text_:191 in 5912) [ClassicSimilarity], result of:
      0.07165395 = score(doc=5912,freq=2.0), product of:
        0.28391814 = queryWeight, product of:
          6.5264034 = idf(docFreq=175, maxDocs=44218)
          0.043503 = queryNorm
        0.25237536 = fieldWeight in 5912, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.5264034 = idf(docFreq=175, maxDocs=44218)
          0.02734375 = fieldNorm(doc=5912)
    0.03667439 = product of:
      0.07334878 = sum of:
        0.07334878 = weight(_text_:erfahrung in 5912) [ClassicSimilarity], result of:
          0.07334878 = score(doc=5912,freq=2.0), product of:
            0.28725627 = queryWeight, product of:
              6.603137 = idf(docFreq=162, maxDocs=44218)
              0.043503 = queryNorm
            0.25534266 = fieldWeight in 5912, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              6.603137 = idf(docFreq=162, maxDocs=44218)
              0.02734375 = fieldNorm(doc=5912)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Content

"KÖLNER STADT-ANZEIGER: Ms. Gurevych, you are developing a next-generation search engine? What should we picture? IRYNA GUREVYCH: Everyone knows conventional search engines such as Google, Yahoo or Altavista. But they are not perfect, because they work only on the principle of character matching. Conventional search engines cannot satisfy the growing demand for information. KStA: Why not? GUREVYCH: Let's take a concrete example: you search Google for a recipe for a cake that should contain no fruit. No search engine in the world can yet handle such queries sensibly. Usually thousands of results come back, in which the user has to look for the relevant information like a needle in a haystack. KStA: And you can solve this problem? GUREVYCH: We are developing a search engine that does not rely solely on character matching but also uses linguistic features. Our search engine should therefore also show related terms. KStA: How far along is your research? GUREVYCH: The project is planned to run for two years. We started six months ago, so most of it is still ahead of us. Even so, the first interim results are already very impressive. KStA: And when will the search engine go online? GUREVYCH: Since this is a project of the Deutsche Forschungsgemeinschaft, the search engine will not be released for the time being. We see it as our task to use our research to demonstrate how smart search algorithms can bring improvements and to eliminate the shortcomings of the well-known search engines. And we are well on the way there. KStA: Are you also working on a very specific project?
GUREVYCH: Yes, the new technology has to pass its first test in what is, at first glance, an unusual field: our research group at the Technische Universität Darmstadt is currently developing a novel system to support young people in choosing a career. For this, the Bundesagentur für Arbeit is providing us with descriptions of 5,800 occupations in Germany. KStA: And what are you supposed to do with this concrete information? GUREVYCH: Young people are to feed our search engine with an essay about their career preferences. The system will then run a search query and pick out possible occupations based on the young person's interests. In this way the personal advice offered by the Bundesagentur für Arbeit can be extended to alternative options. A first prototype is due to be ready at the end of the year. KStA: So for now it is not about finding a job for the young person, but about identifying the perfect occupation for them? GUREVYCH: Yes, based on the young person's description the search engine runs a semantic query and picks out the matching occupation. KStA: Have there already been further enquiries from industry? GUREVYCH: No, we have not done any advertising so far. My experience shows that respected conferences are the best platform for presenting results and attracting attention. Some first publications are already under way and will appear in 2006. KStA: What, in your view, will the search engine of the future look like? GUREVYCH: Search engines are becoming ever more specialized. That means there will be dedicated search engines in, say, medicine, health insurance or sport. In addition, the trend will increasingly be towards linguistic search engines that look for related terms. The perfect search engine will probably be a combination of statistical and linguistic-semantic search behaviour.
Algorithms that we are developing in the Telecooperation group at TU Darmstadt will be of the greatest importance for the next qualitative leap in the development of search engines."

Source

Kölner Stadtanzeiger. No. 191 of 18.8.2006, Magazin p. 7

Semantik, Lexikographie und Computeranwendungen : Workshop ... (Bonn) : 1995.01.27-28 (1996) 0.02

0.022958897 = product of:
  0.057397243 = sum of:
    0.042662103 = weight(_text_:b in 190) [ClassicSimilarity], result of:
      0.042662103 = score(doc=190,freq=4.0), product of:
        0.15412949 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.043503 = queryNorm
        0.2767939 = fieldWeight in 190, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.0390625 = fieldNorm(doc=190)
    0.014735141 = product of:
      0.029470282 = sum of:
        0.029470282 = weight(_text_:22 in 190) [ClassicSimilarity], result of:
          0.029470282 = score(doc=190,freq=2.0), product of:
            0.1523401 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043503 = queryNorm
            0.19345059 = fieldWeight in 190, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=190)
      0.5 = coord(1/2)
  0.4 = coord(2/5)

Classification: Spr B 68 / Computerlinguistik
Date: 14. 4.2007 10:04:22
SBB: Spr B 68 / Computerlinguistik

Endres-Niggemeyer, B.: Sprachverarbeitung im Informationsbereich (1989) 0.02

0.019306665 = product of:
  0.09653332 = sum of:
    0.09653332 = weight(_text_:b in 4860) [ClassicSimilarity], result of:
      0.09653332 = score(doc=4860,freq=2.0), product of:
        0.15412949 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.043503 = queryNorm
        0.6263131 = fieldWeight in 4860, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.125 = fieldNorm(doc=4860)
  0.2 = coord(1/5)

Natürlichsprachlicher Entwurf von Informationssystemen (1996) 0.02

0.019306665 = product of:
  0.09653332 = sum of:
    0.09653332 = weight(_text_:b in 722) [ClassicSimilarity], result of:
      0.09653332 = score(doc=722,freq=2.0), product of:
        0.15412949 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.043503 = queryNorm
        0.6263131 = fieldWeight in 722, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.125 = fieldNorm(doc=722)
  0.2 = coord(1/5)

Editor: Ortner, E., B. Schienmann and H. Thoma

Luo, L.; Ju, J.; Li, Y.-F.; Haffari, G.; Xiong, B.; Pan, S.: ChatRule: mining logical rules with large language models for knowledge graph reasoning (2023) 0.02

0.017960722 = product of:
  0.044901803 = sum of:
    0.030166663 = weight(_text_:b in 1171) [ClassicSimilarity], result of:
      0.030166663 = score(doc=1171,freq=2.0), product of:
        0.15412949 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.043503 = queryNorm
        0.19572285 = fieldWeight in 1171, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1171)
    0.014735141 = product of:
      0.029470282 = sum of:
        0.029470282 = weight(_text_:22 in 1171) [ClassicSimilarity], result of:
          0.029470282 = score(doc=1171,freq=2.0), product of:
            0.1523401 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043503 = queryNorm
            0.19345059 = fieldWeight in 1171, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1171)
      0.5 = coord(1/2)
  0.4 = coord(2/5)

Date: 23.11.2023 19:07:22

Caseiro, D.: Automatic language identification bibliography : Last Update: 20 September 1999 (1999) 0.02

0.016893331 = product of:
  0.08446665 = sum of:
    0.08446665 = weight(_text_:b in 1842) [ClassicSimilarity], result of:
      0.08446665 = score(doc=1842,freq=2.0), product of:
        0.15412949 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.043503 = queryNorm
        0.54802394 = fieldWeight in 1842, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.109375 = fieldNorm(doc=1842)
  0.2 = coord(1/5)

Type: b

Campe, P.: Case, semantic roles, and grammatical relations : a comprehensive bibliography (1994) 0.01

0.014479998 = product of:
  0.07239999 = sum of:
    0.07239999 = weight(_text_:b in 8663) [ClassicSimilarity], result of:
      0.07239999 = score(doc=8663,freq=2.0), product of:
        0.15412949 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.043503 = queryNorm
        0.46973482 = fieldWeight in 8663, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.09375 = fieldNorm(doc=8663)
  0.2 = coord(1/5)

Type: b

Vichot, F.; Wolinksi, F.; Tomeh, J.; Guennou, S.; Dillet, B.; Aydjian, S.: High precision hypertext navigation based on NLP automation extractions (1997) 0.01

0.014479998 = product of:
  0.07239999 = sum of:
    0.07239999 = weight(_text_:b in 733) [ClassicSimilarity], result of:
      0.07239999 = score(doc=733,freq=2.0), product of:
        0.15412949 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.043503 = queryNorm
        0.46973482 = fieldWeight in 733, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.09375 = fieldNorm(doc=733)
  0.2 = coord(1/5)

Jones, D.: Analogical natural language processing (1996) 0.01

0.013651873 = product of:
  0.068259366 = sum of:
    0.068259366 = weight(_text_:b in 4698) [ClassicSimilarity], result of:
      0.068259366 = score(doc=4698,freq=4.0), product of:
        0.15412949 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.043503 = queryNorm
        0.44287026 = fieldWeight in 4698, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.0625 = fieldNorm(doc=4698)
  0.2 = coord(1/5)

Classification: Spr B 68 / Computerlinguistik
SBB: Spr B 68 / Computerlinguistik

Yang, C.C.; Luk, J.: Automatic generation of English/Chinese thesaurus based on a parallel corpus in laws (2003) 0.01
```
0.012572505 = product of:
  0.03143126 = sum of:
    0.021116663 = weight(_text_:b in 1616) [ClassicSimilarity], result of:
      0.021116663 = score(doc=1616,freq=2.0), product of:
        0.15412949 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.043503 = queryNorm
        0.13700598 = fieldWeight in 1616, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.02734375 = fieldNorm(doc=1616)
    0.010314598 = product of:
      0.020629195 = sum of:
        0.020629195 = weight(_text_:22 in 1616) [ClassicSimilarity], result of:
          0.020629195 = score(doc=1616,freq=2.0), product of:
            0.1523401 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.043503 = queryNorm
            0.1354154 = fieldWeight in 1616, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02734375 = fieldNorm(doc=1616)
      0.5 = coord(1/2)
  0.4 = coord(2/5)
```
Abstract

The information available on the World Wide Web in languages other than English is increasing significantly. According to a report from Computer Economics in 1999, 54% of Internet users are English speakers ("English Will Dominate Web for Only Three More Years," Computer Economics, July 9, 1999, http://www.computereconomics.com/new4/pr/pr990610.html). However, it is predicted that there will be only a 60% increase in Internet users among English speakers versus a 150% growth among non-English speakers over the next five years. By 2005, 57% of Internet users will be non-English speakers. A report by CNN.com in 2000 showed that the number of Internet users in China had increased from 8.9 million to 16.9 million between January and June 2000 ("Report: China Internet users double to 17 million," CNN.com, July, 2000, http://cnn.org/2000/TECH/computing/07/27/china.internet.reut/index.html). According to Nielsen/NetRatings, there was a dramatic leap from 22.5 million to 56.6 million Internet users from 2001 to 2002. China had become the second largest global at-home Internet population in 2002 (the US Internet population was 166 million) (Robyn Greenspan, "China Pulls Ahead of Japan," Internet.com, April 22, 2002, http://cyberatias.internet.com/big-picture/geographics/article/0,,5911_1013841,00.html). All of this evidence reveals the importance of cross-lingual research to satisfy needs in the near future. Digital library research has focused on structural and semantic interoperability in the past. Searching and retrieving objects across variations in protocols, formats and disciplines has been widely explored (Schatz, B., & Chen, H. (1999). Digital libraries: technological advances and social impacts. IEEE Computer, Special Issue on Digital Libraries, February, 32(2), 45-50; Chen, H., Yen, J., & Yang, C.C. (1999). International activities: development of Asian digital libraries. IEEE Computer, Special Issue on Digital Libraries, 32(2), 48-49).
However, research on crossing language boundaries, especially between European and Oriental languages, is still at an initial stage. In this proposal, we focus on cross-lingual semantic interoperability by developing automatic generation of a cross-lingual thesaurus based on an English/Chinese parallel corpus. When searchers encounter retrieval problems, professional librarians usually consult the thesaurus to identify other relevant vocabulary. For searching across language boundaries, a cross-lingual thesaurus generated by co-occurrence analysis and a Hopfield network can be used to generate additional semantically relevant terms that cannot be obtained from a dictionary. In particular, the automatically generated cross-lingual thesaurus is able to capture unknown words that do not exist in a dictionary, such as names of persons, organizations, and events. Owing to Hong Kong's unique historical background, both English and Chinese are used as official languages in all legal documents. Therefore, English/Chinese cross-lingual information retrieval is critical for applications in the courts and the government. In this paper, we develop an automatic thesaurus using the Hopfield network, based on a parallel corpus collected from the Web site of the Department of Justice of the Hong Kong Special Administrative Region (HKSAR) Government. Experiments are conducted to measure the precision and recall of the automatically generated English/Chinese thesaurus. The results show that such a thesaurus is a promising tool for retrieving relevant terms, especially in the language that is not the same as that of the input term. The direct translation of the input term can also be retrieved in most cases.

Sabourin, C.F. (Bearb.): Computational linguistics in information science : bibliography (1994) 0.01

0.012066665 = product of:
  0.060333326 = sum of:
    0.060333326 = weight(_text_:b in 8280) [ClassicSimilarity], result of:
      0.060333326 = score(doc=8280,freq=2.0), product of:
        0.15412949 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.043503 = queryNorm
        0.3914457 = fieldWeight in 8280, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.078125 = fieldNorm(doc=8280)
  0.2 = coord(1/5)

Type: b

Dreehsen, B.: ¬Der PC als Dolmetscher (1998) 0.01

0.012066665 = product of:
  0.060333326 = sum of:
    0.060333326 = weight(_text_:b in 1474) [ClassicSimilarity], result of:
      0.060333326 = score(doc=1474,freq=2.0), product of:
        0.15412949 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.043503 = queryNorm
        0.3914457 = fieldWeight in 1474, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.078125 = fieldNorm(doc=1474)
  0.2 = coord(1/5)

Rettinger, A.; Schumilin, A.; Thoma, S.; Ell, B.: Learning a cross-lingual semantic representation of relations expressed in text (2015) 0.01

0.012066665 = product of:
  0.060333326 = sum of:
    0.060333326 = weight(_text_:b in 2027) [ClassicSimilarity], result of:
      0.060333326 = score(doc=2027,freq=2.0), product of:
        0.15412949 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.043503 = queryNorm
        0.3914457 = fieldWeight in 2027, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.078125 = fieldNorm(doc=2027)
  0.2 = coord(1/5)

Hofstadter, D.: Artificial neural networks today are not conscious (2022) 0.01

0.012066665 = product of:
  0.060333326 = sum of:
    0.060333326 = weight(_text_:b in 860) [ClassicSimilarity], result of:
      0.060333326 = score(doc=860,freq=2.0), product of:
        0.15412949 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.043503 = queryNorm
        0.3914457 = fieldWeight in 860, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.078125 = fieldNorm(doc=860)
  0.2 = coord(1/5)

Content: See also: Agüera y Arcas, B.: Artificial neural networks are making strides towards consciousness.

Agüera y Arcas, B.: Artificial neural networks are making strides towards consciousness (2022) 0.01

0.012066665 = product of:
  0.060333326 = sum of:
    0.060333326 = weight(_text_:b in 861) [ClassicSimilarity], result of:
      0.060333326 = score(doc=861,freq=2.0), product of:
        0.15412949 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.043503 = queryNorm
        0.3914457 = fieldWeight in 861, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.078125 = fieldNorm(doc=861)
  0.2 = coord(1/5)
