Search (698 results, page 1 of 35)

Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.11

0.11318469 = product of:
  0.35572332 = sum of:
    0.03266342 = product of:
      0.09799025 = sum of:
        0.09799025 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
          0.09799025 = score(doc=562,freq=2.0), product of:
            0.17435429 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.02056547 = queryNorm
            0.56201804 = fieldWeight in 562, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.046875 = fieldNorm(doc=562)
      0.33333334 = coord(1/3)
    0.09799025 = weight(_text_:2f in 562) [ClassicSimilarity], result of:
      0.09799025 = score(doc=562,freq=2.0), product of:
        0.17435429 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.02056547 = queryNorm
        0.56201804 = fieldWeight in 562, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=562)
    0.09799025 = weight(_text_:2f in 562) [ClassicSimilarity], result of:
      0.09799025 = score(doc=562,freq=2.0), product of:
        0.17435429 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.02056547 = queryNorm
        0.56201804 = fieldWeight in 562, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=562)
    0.09799025 = weight(_text_:2f in 562) [ClassicSimilarity], result of:
      0.09799025 = score(doc=562,freq=2.0), product of:
        0.17435429 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.02056547 = queryNorm
        0.56201804 = fieldWeight in 562, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=562)
    0.0025225237 = weight(_text_:in in 562) [ClassicSimilarity], result of:
      0.0025225237 = score(doc=562,freq=2.0), product of:
        0.027974274 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.02056547 = queryNorm
        0.09017298 = fieldWeight in 562, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=562)
    0.018207615 = weight(_text_:computer in 562) [ClassicSimilarity], result of:
      0.018207615 = score(doc=562,freq=2.0), product of:
        0.0751567 = queryWeight, product of:
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.02056547 = queryNorm
        0.24226204 = fieldWeight in 562, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.046875 = fieldNorm(doc=562)
    0.008359012 = product of:
      0.016718024 = sum of:
        0.016718024 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
          0.016718024 = score(doc=562,freq=2.0), product of:
            0.072016776 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02056547 = queryNorm
            0.23214069 = fieldWeight in 562, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=562)
      0.5 = coord(1/2)
  0.3181818 = coord(7/22)

Abstract: Document representations for text classification are typically based on the classical Bag-Of-Words paradigm. This approach comes with deficiencies that motivate the integration of features on a higher semantic level than single words. In this paper we propose an enhancement of the classical document representation through concepts extracted from background knowledge. Boosting is used for actual classification. Experimental evaluations on two well known text corpora support our approach through consistent improvement of the results.
Content: Cf.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
Date: 8. 1.2013 10:22:32
Imprint: Washington, DC : IEEE Computer Society
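
The score explanation for doc 562 above is labeled [ClassicSimilarity], which suggests Lucene's classic TF-IDF scoring. As a minimal sketch, assuming tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), the weight of the term `2f` can be recomputed from the printed inputs. The function names here are illustrative and not part of any library.

```python
import math

def tf(freq: float) -> float:
    # Assumed term-frequency factor: square root of the raw count.
    return math.sqrt(freq)

def idf(doc_freq: int, max_docs: int) -> float:
    # Assumed inverse document frequency: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    # score = queryWeight * fieldWeight, as laid out in the explanation tree.
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm               # idf * queryNorm
    field_weight = tf(freq) * term_idf * field_norm    # tf * idf * fieldNorm
    return query_weight * field_weight

# Inputs copied from the `_text_:2f` clause of doc 562.
idf_2f = idf(doc_freq=24, max_docs=44218)
score_2f = term_score(freq=2.0, doc_freq=24, max_docs=44218,
                      query_norm=0.02056547, field_norm=0.046875)
print(idf_2f, score_2f)
```

Under these assumptions the result should agree with the listed 0.09799025 up to single-precision rounding.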

Huo, W.: Automatic multi-word term extraction and its application to Web-page summarization (2012) 0.09

0.094885804 = product of:
  0.3479146 = sum of:
    0.09799025 = weight(_text_:2f in 563) [ClassicSimilarity], result of:
      0.09799025 = score(doc=563,freq=2.0), product of:
        0.17435429 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.02056547 = queryNorm
        0.56201804 = fieldWeight in 563, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=563)
    0.09799025 = weight(_text_:2f in 563) [ClassicSimilarity], result of:
      0.09799025 = score(doc=563,freq=2.0), product of:
        0.17435429 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.02056547 = queryNorm
        0.56201804 = fieldWeight in 563, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=563)
    0.09799025 = weight(_text_:2f in 563) [ClassicSimilarity], result of:
      0.09799025 = score(doc=563,freq=2.0), product of:
        0.17435429 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.02056547 = queryNorm
        0.56201804 = fieldWeight in 563, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=563)
    0.005640535 = weight(_text_:in in 563) [ClassicSimilarity], result of:
      0.005640535 = score(doc=563,freq=10.0), product of:
        0.027974274 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.02056547 = queryNorm
        0.20163295 = fieldWeight in 563, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=563)
    0.018207615 = weight(_text_:computer in 563) [ClassicSimilarity], result of:
      0.018207615 = score(doc=563,freq=2.0), product of:
        0.0751567 = queryWeight, product of:
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.02056547 = queryNorm
        0.24226204 = fieldWeight in 563, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.046875 = fieldNorm(doc=563)
    0.030095704 = sum of:
      0.01337768 = weight(_text_:science in 563) [ClassicSimilarity], result of:
        0.01337768 = score(doc=563,freq=4.0), product of:
          0.0541719 = queryWeight, product of:
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.02056547 = queryNorm
          0.24694869 = fieldWeight in 563, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            2.6341193 = idf(docFreq=8627, maxDocs=44218)
            0.046875 = fieldNorm(doc=563)
      0.016718024 = weight(_text_:22 in 563) [ClassicSimilarity], result of:
        0.016718024 = score(doc=563,freq=2.0), product of:
          0.072016776 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.02056547 = queryNorm
          0.23214069 = fieldWeight in 563, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=563)
  0.27272728 = coord(6/22)

Abstract: In this thesis we propose three new word association measures for multi-word term extraction. We combine these association measures with LocalMaxs algorithm in our extraction model and compare the results of different multi-word term extraction methods. Our approach is language and domain independent and requires no training data. It can be applied to such tasks as text summarization, information retrieval, and document classification. We further explore the potential of using multi-word terms as an effective representation for general web-page summarization. We extract multi-word terms from human written summaries in a large collection of web-pages, and generate the summaries by aligning document words with these multi-word terms. Our system applies machine translation technology to learn the aligning process from a training set and focuses on selecting high quality multi-word terms from human written summaries to generate suitable results for web-page summarization.
Content: A thesis presented to the University of Guelph in partial fulfilment of requirements for the degree of Master of Science in Computer Science. Cf.: http://www.inf.ufrgs.br%2F~ceramisch%2Fdownload_files%2Fpublications%2F2009%2Fp01.pdf.
Date: 10. 1.2013 19:22:47
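
The top level of the explanation for doc 563 is a "product of" a clause sum and a coord factor. A minimal sketch of that last step, assuming coord(m/n) is simply the fraction m/n of query clauses that matched; the clause scores are copied from the tree above and the helper names are hypothetical.

```python
def coord(matched: int, total: int) -> float:
    # Assumed coordination factor: share of query clauses that matched.
    return matched / total

def combine(clause_scores, matched, total):
    # Final score = (sum of matching clause scores) * coord(matched/total).
    return sum(clause_scores) * coord(matched, total)

# Clause scores as printed for doc 563.
clause_scores_563 = [
    0.09799025,   # _text_:2f
    0.09799025,   # _text_:2f
    0.09799025,   # _text_:2f
    0.005640535,  # _text_:in
    0.018207615,  # _text_:computer
    0.030095704,  # nested sum of _text_:science and _text_:22
]

clause_sum_563 = sum(clause_scores_563)
doc_563_score = combine(clause_scores_563, matched=6, total=22)
print(clause_sum_563, doc_563_score)
```

The same pattern applies to the other records: only the clause list and the coord fraction change.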

Noever, D.; Ciolino, M.: ¬The Turing deception (2022) 0.08

0.07522803 = product of:
  0.3310033 = sum of:
    0.03266342 = product of:
      0.09799025 = sum of:
        0.09799025 = weight(_text_:3a in 862) [ClassicSimilarity], result of:
          0.09799025 = score(doc=862,freq=2.0), product of:
            0.17435429 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.02056547 = queryNorm
            0.56201804 = fieldWeight in 862, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.046875 = fieldNorm(doc=862)
      0.33333334 = coord(1/3)
    0.09799025 = weight(_text_:2f in 862) [ClassicSimilarity], result of:
      0.09799025 = score(doc=862,freq=2.0), product of:
        0.17435429 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.02056547 = queryNorm
        0.56201804 = fieldWeight in 862, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=862)
    0.09799025 = weight(_text_:2f in 862) [ClassicSimilarity], result of:
      0.09799025 = score(doc=862,freq=2.0), product of:
        0.17435429 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.02056547 = queryNorm
        0.56201804 = fieldWeight in 862, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=862)
    0.09799025 = weight(_text_:2f in 862) [ClassicSimilarity], result of:
      0.09799025 = score(doc=862,freq=2.0), product of:
        0.17435429 = queryWeight, product of:
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.02056547 = queryNorm
        0.56201804 = fieldWeight in 862, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.478011 = idf(docFreq=24, maxDocs=44218)
          0.046875 = fieldNorm(doc=862)
    0.0043691397 = weight(_text_:in in 862) [ClassicSimilarity], result of:
      0.0043691397 = score(doc=862,freq=6.0), product of:
        0.027974274 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.02056547 = queryNorm
        0.1561842 = fieldWeight in 862, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=862)
  0.22727273 = coord(5/22)

Abstract: This research revisits the classic Turing test and compares recent large language models such as ChatGPT for their abilities to reproduce human-level comprehension and compelling text generation. Two task challenges, summary and question answering, prompt ChatGPT to produce original content (98-99%) from a single text entry and sequential questions initially posed by Turing in 1950. We score the original and generated content against the OpenAI GPT-2 Output Detector from 2019, and establish multiple cases where the generated content proves original and undetectable (98%). The question of a machine fooling a human judge recedes in this work relative to the question of "how would one prove it?" The original contribution of the work presents a metric and simple grammatical set for understanding the writing mechanics of chatbots in evaluating their readability and statistical clarity, engagement, delivery, overall quality, and plagiarism risks. While Turing's original prose scores at least 14% below the machine-generated output, whether an algorithm displays hints of Turing's true initial thoughts (the "Lovelace 2.0" test) remains unanswerable.
Source: https%3A%2F%2Farxiv.org%2Fabs%2F2212.06721&usg=AOvVaw3i_9pZm9y_dQWoHi6uv0EN

Computational linguistics and intelligent text processing : second international conference; Proceedings. CICLing 2001, Mexico City, Mexico, 18.-24.2.2001 (2001) 0.06

0.055209864 = product of:
  0.2429234 = sum of:
    0.11417909 = weight(_text_:lecture in 3177) [ClassicSimilarity], result of:
      0.11417909 = score(doc=3177,freq=2.0), product of:
        0.13308205 = queryWeight, product of:
          6.4711404 = idf(docFreq=185, maxDocs=44218)
          0.02056547 = queryNorm
        0.8579601 = fieldWeight in 3177, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.4711404 = idf(docFreq=185, maxDocs=44218)
          0.09375 = fieldNorm(doc=3177)
    0.077824585 = weight(_text_:notes in 3177) [ClassicSimilarity], result of:
      0.077824585 = score(doc=3177,freq=2.0), product of:
        0.10987139 = queryWeight, product of:
          5.3425174 = idf(docFreq=574, maxDocs=44218)
          0.02056547 = queryNorm
        0.70832443 = fieldWeight in 3177, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.3425174 = idf(docFreq=574, maxDocs=44218)
          0.09375 = fieldNorm(doc=3177)
    0.0050450475 = weight(_text_:in in 3177) [ClassicSimilarity], result of:
      0.0050450475 = score(doc=3177,freq=2.0), product of:
        0.027974274 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.02056547 = queryNorm
        0.18034597 = fieldWeight in 3177, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.09375 = fieldNorm(doc=3177)
    0.03641523 = weight(_text_:computer in 3177) [ClassicSimilarity], result of:
      0.03641523 = score(doc=3177,freq=2.0), product of:
        0.0751567 = queryWeight, product of:
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.02056547 = queryNorm
        0.48452407 = fieldWeight in 3177, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.09375 = fieldNorm(doc=3177)
    0.009459447 = product of:
      0.018918894 = sum of:
        0.018918894 = weight(_text_:science in 3177) [ClassicSimilarity], result of:
          0.018918894 = score(doc=3177,freq=2.0), product of:
            0.0541719 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.02056547 = queryNorm
            0.34923816 = fieldWeight in 3177, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.09375 = fieldNorm(doc=3177)
      0.5 = coord(1/2)
  0.22727273 = coord(5/22)

Series: Lecture notes in computer science; vol.2004
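
In the explanation for doc 3177, the `computer` clause has the same freq, idf and queryNorm as in docs 562 and 563, yet it scores twice as high; the only input that differs is fieldNorm (0.09375 versus 0.046875). A minimal sketch of that comparison, assuming fieldNorm enters the score as a plain multiplicative factor and using the same assumed tf and idf formulas (sqrt(freq) and 1 + ln(maxDocs / (docFreq + 1))); all names are illustrative.

```python
import math

def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    # queryWeight * fieldWeight with assumed ClassicSimilarity-style factors.
    idf = 1.0 + math.log(max_docs / (doc_freq + 1))
    return (idf * query_norm) * (math.sqrt(freq) * idf * field_norm)

QUERY_NORM = 0.02056547
MAX_DOCS = 44218

# `computer`, freq=2.0, docFreq=3109 in both cases; only fieldNorm differs.
score_norm_low = term_score(2.0, 3109, MAX_DOCS, QUERY_NORM, 0.046875)   # as in docs 562, 563
score_norm_high = term_score(2.0, 3109, MAX_DOCS, QUERY_NORM, 0.09375)   # as in doc 3177
ratio = score_norm_high / score_norm_low
print(score_norm_low, score_norm_high, ratio)
```

Doubling fieldNorm doubles the clause score, which is consistent with the printed values 0.018207615 and 0.03641523.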

Schöneberg, U.; Sperber, W.: POS tagging and its applications for mathematics (2014) 0.05

0.049325477 = product of:
  0.18086007 = sum of:
    0.008434889 = product of:
      0.016869778 = sum of:
        0.016869778 = weight(_text_:29 in 1748) [ClassicSimilarity], result of:
          0.016869778 = score(doc=1748,freq=2.0), product of:
            0.072342895 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.02056547 = queryNorm
            0.23319192 = fieldWeight in 1748, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.046875 = fieldNorm(doc=1748)
      0.5 = coord(1/2)
    0.08073681 = weight(_text_:lecture in 1748) [ClassicSimilarity], result of:
      0.08073681 = score(doc=1748,freq=4.0), product of:
        0.13308205 = queryWeight, product of:
          6.4711404 = idf(docFreq=185, maxDocs=44218)
          0.02056547 = queryNorm
        0.6066694 = fieldWeight in 1748, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          6.4711404 = idf(docFreq=185, maxDocs=44218)
          0.046875 = fieldNorm(doc=1748)
    0.05503029 = weight(_text_:notes in 1748) [ClassicSimilarity], result of:
      0.05503029 = score(doc=1748,freq=4.0), product of:
        0.10987139 = queryWeight, product of:
          5.3425174 = idf(docFreq=574, maxDocs=44218)
          0.02056547 = queryNorm
        0.500861 = fieldWeight in 1748, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          5.3425174 = idf(docFreq=574, maxDocs=44218)
          0.046875 = fieldNorm(doc=1748)
    0.0061788964 = weight(_text_:in in 1748) [ClassicSimilarity], result of:
      0.0061788964 = score(doc=1748,freq=12.0), product of:
        0.027974274 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.02056547 = queryNorm
        0.22087781 = fieldWeight in 1748, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=1748)
    0.025749456 = weight(_text_:computer in 1748) [ClassicSimilarity], result of:
      0.025749456 = score(doc=1748,freq=4.0), product of:
        0.0751567 = queryWeight, product of:
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.02056547 = queryNorm
        0.34261024 = fieldWeight in 1748, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.046875 = fieldNorm(doc=1748)
    0.0047297236 = product of:
      0.009459447 = sum of:
        0.009459447 = weight(_text_:science in 1748) [ClassicSimilarity], result of:
          0.009459447 = score(doc=1748,freq=2.0), product of:
            0.0541719 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.02056547 = queryNorm
            0.17461908 = fieldWeight in 1748, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.046875 = fieldNorm(doc=1748)
      0.5 = coord(1/2)
  0.27272728 = coord(6/22)

Abstract: Content analysis of scientific publications is a nontrivial task, but a useful and important one for scientific information services. In the Gutenberg era it was a domain of human experts; in the digital age many machine-based methods, e.g., graph analysis tools and machine-learning techniques, have been developed for it. Natural Language Processing (NLP) is a powerful machine-learning approach to semiautomatic speech and language processing, which is also applicable to mathematics. The well-established methods of NLP have to be adjusted for the special needs of mathematics, in particular for handling mathematical formulae. We demonstrate a mathematics-aware part-of-speech tagger and give a short overview of our adaptation of NLP methods for mathematical publications. We show the use of the tools developed for key phrase extraction and classification in the database zbMATH.
Date: 29. 3.2015 19:34:37
Series: Lecture notes in computer science; 8543 (Lecture notes in artificial intelligence)
Source: Intelligent Computer Mathematics - International Conference, CICM 2014, Coimbra, Portugal, July 7-11, 2014. Proceedings

Semantik, Lexikographie und Computeranwendungen : Workshop ... (Bonn) : 1995.01.27-28 (1996) 0.03

0.031604268 = product of:
  0.13905877 = sum of:
    0.04183023 = weight(_text_:informatik in 190) [ClassicSimilarity], result of:
      0.04183023 = score(doc=190,freq=4.0), product of:
        0.104934774 = queryWeight, product of:
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.02056547 = queryNorm
        0.39863077 = fieldWeight in 190, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.0390625 = fieldNorm(doc=190)
    0.034402408 = weight(_text_:und in 190) [ClassicSimilarity], result of:
      0.034402408 = score(doc=190,freq=76.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.7547594 = fieldWeight in 190, product of:
          8.717798 = tf(freq=76.0), with freq of:
            76.0 = termFreq=76.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.0390625 = fieldNorm(doc=190)
    0.034402408 = weight(_text_:und in 190) [ClassicSimilarity], result of:
      0.034402408 = score(doc=190,freq=76.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.7547594 = fieldWeight in 190, product of:
          8.717798 = tf(freq=76.0), with freq of:
            76.0 = termFreq=76.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.0390625 = fieldNorm(doc=190)
    0.02145788 = weight(_text_:computer in 190) [ClassicSimilarity], result of:
      0.02145788 = score(doc=190,freq=4.0), product of:
        0.0751567 = queryWeight, product of:
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.02056547 = queryNorm
        0.28550854 = fieldWeight in 190, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.0390625 = fieldNorm(doc=190)
    0.0069658435 = product of:
      0.013931687 = sum of:
        0.013931687 = weight(_text_:22 in 190) [ClassicSimilarity], result of:
          0.013931687 = score(doc=190,freq=2.0), product of:
            0.072016776 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02056547 = queryNorm
            0.19345059 = fieldWeight in 190, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=190)
      0.5 = coord(1/2)
  0.22727273 = coord(5/22)

BK: 54.75 Sprachverarbeitung <Informatik>
18.00 Einzelne Sprachen und Literaturen allgemein
Classification: ES 940 Allgemeine und vergleichende Sprach- und Literaturwissenschaft. Indogermanistik. Außereuropäische Sprachen und Literaturen / Spezialbereiche der allgemeinen Sprachwissenschaft / Datenverarbeitung und Sprachwissenschaft. Computerlinguistik / Maschinelle Sprachanalyse
ET 400 Allgemeine und vergleichende Sprach- und Literaturwissenschaft. Indogermanistik. Außereuropäische Sprachen und Literaturen / Einzelgebiete der Sprachwissenschaft, Sprachbeschreibung / Semantik und Lexikologie / Allgemeines
ES 945 Allgemeine und vergleichende Sprach- und Literaturwissenschaft. Indogermanistik. Außereuropäische Sprachen und Literaturen / Spezialbereiche der allgemeinen Sprachwissenschaft / Datenverarbeitung und Sprachwissenschaft. Computerlinguistik / Spracherkennung
ET 580 Allgemeine und vergleichende Sprach- und Literaturwissenschaft. Indogermanistik. Außereuropäische Sprachen und Literaturen / Einzelgebiete der Sprachwissenschaft, Sprachbeschreibung / Semantik und Lexikologie / Lexikologie (diachrone und synchrone) / Lexikographie
54.75 Sprachverarbeitung <Informatik>
18.00 Einzelne Sprachen und Literaturen allgemein
Date: 14. 4.2007 10:04:22
RSWK: Computer / Anwendung / Computerunterstützte Lexikographie / Aufsatzsammlung
RVK: ES 940 Allgemeine und vergleichende Sprach- und Literaturwissenschaft. Indogermanistik. Außereuropäische Sprachen und Literaturen / Spezialbereiche der allgemeinen Sprachwissenschaft / Datenverarbeitung und Sprachwissenschaft. Computerlinguistik / Maschinelle Sprachanalyse
ET 400 Allgemeine und vergleichende Sprach- und Literaturwissenschaft. Indogermanistik. Außereuropäische Sprachen und Literaturen / Einzelgebiete der Sprachwissenschaft, Sprachbeschreibung / Semantik und Lexikologie / Allgemeines
ES 945 Allgemeine und vergleichende Sprach- und Literaturwissenschaft. Indogermanistik. Außereuropäische Sprachen und Literaturen / Spezialbereiche der allgemeinen Sprachwissenschaft / Datenverarbeitung und Sprachwissenschaft. Computerlinguistik / Spracherkennung
ET 580 Allgemeine und vergleichende Sprach- und Literaturwissenschaft. Indogermanistik. Außereuropäische Sprachen und Literaturen / Einzelgebiete der Sprachwissenschaft, Sprachbeschreibung / Semantik und Lexikologie / Lexikologie (diachrone und synchrone) / Lexikographie
Series: Sprache und Information ; 33
Subject: Computer / Anwendung / Computerunterstützte Lexikographie / Aufsatzsammlung
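
The explanation for doc 190 shows how strongly the tf factor dampens raw counts: `und` occurs 76 times but its tf is only 8.717798. A minimal sketch, assuming tf = sqrt(freq) as the printed values suggest; the dictionary and variable names are illustrative.

```python
import math

def tf(freq: float) -> float:
    # Assumed term-frequency factor: square root of the raw count.
    return math.sqrt(freq)

# Raw frequencies as printed for doc 190.
freqs_190 = {"22": 2.0, "informatik": 4.0, "und": 76.0}
tf_190 = {term: tf(freq) for term, freq in freqs_190.items()}

# `und` is 38 times as frequent as `22`, but its tf is only about 6 times larger.
freq_ratio = freqs_190["und"] / freqs_190["22"]
tf_ratio = tf_190["und"] / tf_190["22"]
print(tf_190, freq_ratio, tf_ratio)
```

Combined with the low idf of `und` (2.216367), this keeps a very frequent function word from dominating the document's score.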

Computerlinguistik und Sprachtechnologie : Eine Einführung (2010) 0.03

0.029412752 = product of:
  0.10784675 = sum of:
    0.033464182 = weight(_text_:informatik in 1735) [ClassicSimilarity], result of:
      0.033464182 = score(doc=1735,freq=4.0), product of:
        0.104934774 = queryWeight, product of:
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.02056547 = queryNorm
        0.3189046 = fieldWeight in 1735, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.03125 = fieldNorm(doc=1735)
    0.0172915 = weight(_text_:und in 1735) [ClassicSimilarity], result of:
      0.0172915 = score(doc=1735,freq=30.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.3793607 = fieldWeight in 1735, product of:
          5.477226 = tf(freq=30.0), with freq of:
            30.0 = termFreq=30.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.03125 = fieldNorm(doc=1735)
    0.0172915 = weight(_text_:und in 1735) [ClassicSimilarity], result of:
      0.0172915 = score(doc=1735,freq=30.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.3793607 = fieldWeight in 1735, product of:
          5.477226 = tf(freq=30.0), with freq of:
            30.0 = termFreq=30.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.03125 = fieldNorm(doc=1735)
    0.0037603567 = weight(_text_:in in 1735) [ClassicSimilarity], result of:
      0.0037603567 = score(doc=1735,freq=10.0), product of:
        0.027974274 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.02056547 = queryNorm
        0.13442196 = fieldWeight in 1735, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.03125 = fieldNorm(doc=1735)
    0.02973291 = weight(_text_:computer in 1735) [ClassicSimilarity], result of:
      0.02973291 = score(doc=1735,freq=12.0), product of:
        0.0751567 = queryWeight, product of:
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.02056547 = queryNorm
        0.39561224 = fieldWeight in 1735, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.03125 = fieldNorm(doc=1735)
    0.0063062985 = product of:
      0.012612597 = sum of:
        0.012612597 = weight(_text_:science in 1735) [ClassicSimilarity], result of:
          0.012612597 = score(doc=1735,freq=8.0), product of:
            0.0541719 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.02056547 = queryNorm
            0.23282544 = fieldWeight in 1735, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.03125 = fieldNorm(doc=1735)
      0.5 = coord(1/2)
  0.27272728 = coord(6/22)

Abstract: This textbook offers a comprehensive introduction to the foundations and methods of computational linguistics and presents the most important areas of application in language technology. It is aimed equally at students of computational linguistics and related subjects concerned with natural language processing and at developers of language-processing systems. After an overview of the tasks and goals of computational linguistics, the necessary theoretical foundations in logic, formal languages, graph theory, statistical methods, and text technology are explained with examples. This is followed by a presentation of the various methods for processing at the linguistic levels of description. First, the basic terms and concepts of phonetics and phonology, morphology, syntax, semantics, and pragmatics are introduced; building on these, the principles of their implementation in language technology are treated. The last three parts of the book give overviews of the information sources used in language technology, the wide range of practical applications, and evaluation methods for language-processing systems. Concrete problems, from the development of correction programs through information management to machine translation, are used to show how the individual methods work together. For the third edition, all chapters were revised and updated, and some were merged into new, independent chapters. In particular, the third edition takes account of the rapid development of computational linguistics and language technology through a stronger focus on statistical foundations and methods. "This handbook closes a painful gap for the German-speaking world. It covers the relevant aspects of computational-linguistic basic research and language-technology application in impressive breadth and at the current state of the art." [Manfred Pinkal]
BK: 18.00 Einzelne Sprachen und Literaturen allgemein
54.75 Sprachverarbeitung Informatik
Classification: 18.00 Einzelne Sprachen und Literaturen allgemein
54.75 Sprachverarbeitung Informatik
LCSH: Computer science
Translators (Computer programs)
Computer science
Subject: Computer science
Translators (Computer programs)
Computer science

Zimmermann, H.H.: Linguistisch-technische Aspekte der maschinellen Übersetzung (1990) 0.03

0.026245564 = product of:
  0.1443506 = sum of:
    0.013393938 = weight(_text_:und in 614) [ClassicSimilarity], result of:
      0.013393938 = score(doc=614,freq=2.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.29385152 = fieldWeight in 614, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.09375 = fieldNorm(doc=614)
    0.112517685 = weight(_text_:schriftenreihe in 614) [ClassicSimilarity], result of:
      0.112517685 = score(doc=614,freq=2.0), product of:
        0.13211027 = queryWeight, product of:
          6.4238877 = idf(docFreq=194, maxDocs=44218)
          0.02056547 = queryNorm
        0.85169524 = fieldWeight in 614, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.4238877 = idf(docFreq=194, maxDocs=44218)
          0.09375 = fieldNorm(doc=614)
    0.013393938 = weight(_text_:und in 614) [ClassicSimilarity], result of:
      0.013393938 = score(doc=614,freq=2.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.29385152 = fieldWeight in 614, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.09375 = fieldNorm(doc=614)
    0.0050450475 = weight(_text_:in in 614) [ClassicSimilarity], result of:
      0.0050450475 = score(doc=614,freq=2.0), product of:
        0.027974274 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.02056547 = queryNorm
        0.18034597 = fieldWeight in 614, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.09375 = fieldNorm(doc=614)
  0.18181819 = coord(4/22)
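
The explain tree above can be reproduced by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity definitions (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = idf * queryNorm, fieldWeight = tf * idf * fieldNorm). The function names are illustrative and not part of any Lucene API; the numeric inputs are copied from the tree for doc 614.

```python
import math

def idf(doc_freq, max_docs):
    # ClassicSimilarity inverse document frequency
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    # score = queryWeight * fieldWeight for a single matching term
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight

MAX_DOCS = 44218
QUERY_NORM = 0.02056547
FIELD_NORM = 0.09375  # fieldNorm(doc=614)

# (term, freq, docFreq) for the four matching clauses of doc 614
clauses = [
    ("und", 2.0, 13101),
    ("schriftenreihe", 2.0, 194),
    ("und", 2.0, 13101),
    ("in", 2.0, 30841),
]

clause_sum = sum(
    term_score(freq, df, MAX_DOCS, QUERY_NORM, FIELD_NORM)
    for _, freq, df in clauses
)
final_score = clause_sum * (4 / 22)  # coord(4/22): 4 of 22 query clauses matched
print(clause_sum, final_score)
```

Small differences in the last digits are to be expected, since Lucene computes these values in single precision.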

Series: DGD-Schriftenreihe; Bd.9
Source: Grundlagen der praktischen Information und Dokumentation: ein Handbuch zur Einführung in die fachliche Informationsarbeit. 3rd ed. Ed.: M. Buder et al. Vol.1

Lobin, H: Computerlinguistik und Texttechnologie (2010) 0.02

0.02417351 = product of:
  0.10636344 = sum of:
    0.05019627 = weight(_text_:informatik in 564) [ClassicSimilarity], result of:
      0.05019627 = score(doc=564,freq=4.0), product of:
        0.104934774 = queryWeight, product of:
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.02056547 = queryNorm
        0.4783569 = fieldWeight in 564, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.046875 = fieldNorm(doc=564)
    0.017718514 = weight(_text_:und in 564) [ClassicSimilarity], result of:
      0.017718514 = score(doc=564,freq=14.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.38872904 = fieldWeight in 564, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.046875 = fieldNorm(doc=564)
    0.017718514 = weight(_text_:und in 564) [ClassicSimilarity], result of:
      0.017718514 = score(doc=564,freq=14.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.38872904 = fieldWeight in 564, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.046875 = fieldNorm(doc=564)
    0.0025225237 = weight(_text_:in in 564) [ClassicSimilarity], result of:
      0.0025225237 = score(doc=564,freq=2.0), product of:
        0.027974274 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.02056547 = queryNorm
        0.09017298 = fieldWeight in 564, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=564)
    0.018207615 = weight(_text_:computer in 564) [ClassicSimilarity], result of:
      0.018207615 = score(doc=564,freq=2.0), product of:
        0.0751567 = queryWeight, product of:
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.02056547 = queryNorm
        0.24226204 = fieldWeight in 564, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.046875 = fieldNorm(doc=564)
  0.22727273 = coord(5/22)

Abstract: Computational linguistics (the processing of language by computer) and text technology (the automated handling of electronic texts) have taken different directions over the last decade. Both disciplines nevertheless draw on the same source: formal grammar. A joint presentation therefore makes sense. Referring to the shared foundations and contrasting individual subareas promotes understanding of the respective other discipline and opens up interesting cross-connections. This introduction is the first to link computational linguistics and text technology systematically in concise form, which makes it particularly suitable for modules in bachelor's degree programmes.
BK: 18.00 Einzelne Sprachen und Literaturen allgemein
54.75 Sprachverarbeitung Informatik
Classification: 18.00 Einzelne Sprachen und Literaturen allgemein
54.75 Sprachverarbeitung Informatik

Weber, N.: ¬Die Semantik von Bedeutungsexplikationen (1999) 0.02

0.02359684 = product of:
  0.12978262 = sum of:
    0.05019627 = weight(_text_:informatik in 1399) [ClassicSimilarity], result of:
      0.05019627 = score(doc=1399,freq=4.0), product of:
        0.104934774 = queryWeight, product of:
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.02056547 = queryNorm
        0.4783569 = fieldWeight in 1399, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.046875 = fieldNorm(doc=1399)
    0.030689366 = weight(_text_:und in 1399) [ClassicSimilarity], result of:
      0.030689366 = score(doc=1399,freq=42.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.6732984 = fieldWeight in 1399, product of:
          6.4807405 = tf(freq=42.0), with freq of:
            42.0 = termFreq=42.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.046875 = fieldNorm(doc=1399)
    0.030689366 = weight(_text_:und in 1399) [ClassicSimilarity], result of:
      0.030689366 = score(doc=1399,freq=42.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.6732984 = fieldWeight in 1399, product of:
          6.4807405 = tf(freq=42.0), with freq of:
            42.0 = termFreq=42.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.046875 = fieldNorm(doc=1399)
    0.018207615 = weight(_text_:computer in 1399) [ClassicSimilarity], result of:
      0.018207615 = score(doc=1399,freq=2.0), product of:
        0.0751567 = queryWeight, product of:
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.02056547 = queryNorm
        0.24226204 = fieldWeight in 1399, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.046875 = fieldNorm(doc=1399)
  0.18181819 = coord(4/22)

BK: 54.75 Sprachverarbeitung <Informatik>
18.00 Einzelne Sprachen und Literaturen allgemein
Classification: ES 950 Allgemeine und vergleichende Sprach- und Literaturwissenschaft. Indogermanistik. Außereuropäische Sprachen und Literaturen / Spezialbereiche der allgemeinen Sprachwissenschaft / Datenverarbeitung und Sprachwissenschaft. Computerlinguistik / Maschinelle Redeanalyse
ES 965 Allgemeine und vergleichende Sprach- und Literaturwissenschaft. Indogermanistik. Außereuropäische Sprachen und Literaturen / Spezialbereiche der allgemeinen Sprachwissenschaft / Datenverarbeitung und Sprachwissenschaft. Computerlinguistik / Datenverarbeitung und Einzelgebiete der Grammatik
54.75 Sprachverarbeitung <Informatik>
18.00 Einzelne Sprachen und Literaturen allgemein
RVK: ES 950 Allgemeine und vergleichende Sprach- und Literaturwissenschaft. Indogermanistik. Außereuropäische Sprachen und Literaturen / Spezialbereiche der allgemeinen Sprachwissenschaft / Datenverarbeitung und Sprachwissenschaft. Computerlinguistik / Maschinelle Redeanalyse
ES 965 Allgemeine und vergleichende Sprach- und Literaturwissenschaft. Indogermanistik. Außereuropäische Sprachen und Literaturen / Spezialbereiche der allgemeinen Sprachwissenschaft / Datenverarbeitung und Sprachwissenschaft. Computerlinguistik / Datenverarbeitung und Einzelgebiete der Grammatik
Series: Sprache, Sprechen und Computer ; 3

Manning, C.D.: Part-of-Speech Tagging from 97% to 100% : is it time for some linguistics? (2011) 0.02

0.023353849 = product of:
  0.10275693 = sum of:
    0.047574617 = weight(_text_:lecture in 1121) [ClassicSimilarity], result of:
      0.047574617 = score(doc=1121,freq=2.0), product of:
        0.13308205 = queryWeight, product of:
          6.4711404 = idf(docFreq=185, maxDocs=44218)
          0.02056547 = queryNorm
        0.35748336 = fieldWeight in 1121, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.4711404 = idf(docFreq=185, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1121)
    0.032426912 = weight(_text_:notes in 1121) [ClassicSimilarity], result of:
      0.032426912 = score(doc=1121,freq=2.0), product of:
        0.10987139 = queryWeight, product of:
          5.3425174 = idf(docFreq=574, maxDocs=44218)
          0.02056547 = queryNorm
        0.29513517 = fieldWeight in 1121, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.3425174 = idf(docFreq=574, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1121)
    0.00364095 = weight(_text_:in in 1121) [ClassicSimilarity], result of:
      0.00364095 = score(doc=1121,freq=6.0), product of:
        0.027974274 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.02056547 = queryNorm
        0.1301535 = fieldWeight in 1121, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1121)
    0.015173013 = weight(_text_:computer in 1121) [ClassicSimilarity], result of:
      0.015173013 = score(doc=1121,freq=2.0), product of:
        0.0751567 = queryWeight, product of:
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.02056547 = queryNorm
        0.20188503 = fieldWeight in 1121, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1121)
    0.0039414368 = product of:
      0.0078828735 = sum of:
        0.0078828735 = weight(_text_:science in 1121) [ClassicSimilarity], result of:
          0.0078828735 = score(doc=1121,freq=2.0), product of:
            0.0541719 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.02056547 = queryNorm
            0.1455159 = fieldWeight in 1121, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1121)
      0.5 = coord(1/2)
  0.22727273 = coord(5/22)

Abstract: I examine what would be necessary to move part-of-speech tagging performance from its current level of about 97.3% token accuracy (56% sentence accuracy) to close to 100% accuracy. I suggest that it must still be possible to greatly increase tagging performance and examine some useful improvements that have recently been made to the Stanford Part-of-Speech Tagger. However, an error analysis of some of the remaining errors suggests that there is limited further mileage to be had either from better machine learning or better features in a discriminative sequence classifier. The prospects for further gains from semisupervised learning also seem quite limited. Rather, I suggest and begin to demonstrate that the largest opportunity for further progress comes from improving the taxonomic basis of the linguistic resources from which taggers are trained. That is, from improved descriptive linguistics. However, I conclude by suggesting that there are also limits to this process. The status of some words may not be able to be adequately captured by assigning them to one of a small number of categories. While conventions can be used in such cases to improve tagging consistency, they lack a strong linguistic basis.
Series: Lecture notes in computer science; 6608

Schulz, A.: Motivierte Maschinen : Sprachcomputer (2002) 0.02

0.02330325 = product of:
  0.12816787 = sum of:
    0.059156876 = weight(_text_:informatik in 1309) [ClassicSimilarity], result of:
      0.059156876 = score(doc=1309,freq=2.0), product of:
        0.104934774 = queryWeight, product of:
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.02056547 = queryNorm
        0.563749 = fieldWeight in 1309, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.078125 = fieldNorm(doc=1309)
    0.019332483 = weight(_text_:und in 1309) [ClassicSimilarity], result of:
      0.019332483 = score(doc=1309,freq=6.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.42413816 = fieldWeight in 1309, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.078125 = fieldNorm(doc=1309)
    0.019332483 = weight(_text_:und in 1309) [ClassicSimilarity], result of:
      0.019332483 = score(doc=1309,freq=6.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.42413816 = fieldWeight in 1309, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.078125 = fieldNorm(doc=1309)
    0.030346027 = weight(_text_:computer in 1309) [ClassicSimilarity], result of:
      0.030346027 = score(doc=1309,freq=2.0), product of:
        0.0751567 = queryWeight, product of:
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.02056547 = queryNorm
        0.40377006 = fieldWeight in 1309, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.078125 = fieldNorm(doc=1309)
  0.18181819 = coord(4/22)

Abstract: Artificial intelligence experts simulate human language acquisition by computer. To do so, they imitate functional principles of the brain. The result: a program that gets hungry now and then.
Field: Informatik
Source: Gehirn und Geist: Das Magazin für Hirnforschung und Psychologie. 2002, no.3, pp.54-56

Rötzer, F.: Computer ergooglen die Bedeutung von Worten (2005) 0.02
```
0.023241052 = product of:
  0.08521719 = sum of:
    0.025098136 = weight(_text_:informatik in 3385) [ClassicSimilarity], result of:
      0.025098136 = score(doc=3385,freq=4.0), product of:
        0.104934774 = queryWeight, product of:
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.02056547 = queryNorm
        0.23917845 = fieldWeight in 3385, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.0234375 = fieldNorm(doc=3385)
    0.016058767 = weight(_text_:und in 3385) [ClassicSimilarity], result of:
      0.016058767 = score(doc=3385,freq=46.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.3523156 = fieldWeight in 3385, product of:
          6.78233 = tf(freq=46.0), with freq of:
            46.0 = termFreq=46.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.0234375 = fieldNorm(doc=3385)
    0.016058767 = weight(_text_:und in 3385) [ClassicSimilarity], result of:
      0.016058767 = score(doc=3385,freq=46.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.3523156 = fieldWeight in 3385, product of:
          6.78233 = tf(freq=46.0), with freq of:
            46.0 = termFreq=46.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.0234375 = fieldNorm(doc=3385)
    0.0033369856 = weight(_text_:in in 3385) [ClassicSimilarity], result of:
      0.0033369856 = score(doc=3385,freq=14.0), product of:
        0.027974274 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.02056547 = queryNorm
        0.119287655 = fieldWeight in 3385, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0234375 = fieldNorm(doc=3385)
    0.022299685 = weight(_text_:computer in 3385) [ClassicSimilarity], result of:
      0.022299685 = score(doc=3385,freq=12.0), product of:
        0.0751567 = queryWeight, product of:
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.02056547 = queryNorm
        0.29670918 = fieldWeight in 3385, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.0234375 = fieldNorm(doc=3385)
    0.0023648618 = product of:
      0.0047297236 = sum of:
        0.0047297236 = weight(_text_:science in 3385) [ClassicSimilarity], result of:
          0.0047297236 = score(doc=3385,freq=2.0), product of:
            0.0541719 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.02056547 = queryNorm
            0.08730954 = fieldWeight in 3385, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0234375 = fieldNorm(doc=3385)
      0.5 = coord(1/2)
  0.27272728 = coord(6/22)
```
Content

"How could computers learn language and also understand the meaning of words and the relationships between them? This problem of semantics is a huge task that has so far been tackled only in part, since words and word combinations often have several or even many meanings, which also depend on the extralinguistic context. The two Dutch researchers Paul Vitanyi (2) and Rudi Cilibrasi of the National Institute for Mathematics and Computer Science (3) in Amsterdam (see also: Ein künstliches Bewusstsein aus einfachen Aussagen (1)) propose an elegant solution: Google is simply used to look things up on the Internet, the largest database there is. Objects such as a mouse can be designated by their name "mouse"; the meaning of general terms has to be learned from their context. A semantic web for representing knowledge consists of the possible connections that objects and their names can enter into. In reality, of course, new names, but also new meanings and thus new links, can be created. Language is alive and flexible. To teach an artificial intelligence all word meanings, a huge database of the possible semantic networks would have to be built with the help of human experts or many collaborators, and then constantly updated. But that might not be necessary at all, because the Web is not only the largest semantic database, largely free to use, it is also constantly updated by countless Internet users. In addition, there are search engines such as Google that, in practice, measure the connections between words, and thus their context of meaning, quantitatively in terms of probability by reporting the web pages on which they were found.
Using a method previously developed by Paul Vitanyi and others that measures the relatedness of objects (normalized information distance - NID), the proximity between particular objects (images, words, patterns, intervals, genomes, programs, etc.) can be analysed on the basis of all their properties and determined by the dominant shared property. In a similar way, the commonly used, not necessarily "true" meanings of names can be inferred with Google searches. 'At this moment one database stands out as the pinnacle of computer-accessible human knowledge and the most inclusive summary of statistical information: the Google search engine. There can be no doubt that Google has already enabled science to accelerate tremendously and revolutionized the research process. It has dominated the attention of internet users for years, and has recently attracted substantial attention of many Wall Street investors, even reshaping their ideas of company financing.' (Paul Vitanyi and Rudi Cilibrasi) If one enters a word such as "Pferd" (horse), Google returns 4,310,000 indexed pages. For "Reiter" (rider) there are 3,400,000 pages. Combining both terms still yields 315,000 pages. For the co-occurrence of, say, "Pferd" and "Bart" (beard), a still astonishing 67,100 pages are listed, but one can already see that "Pferd" and "Reiter" are more closely related. This gives a certain probability for the co-occurrence of terms. From this frequency, taken relative to the maximum number (5,000,000,000) of indexed pages, the two scientists have developed a statistical measure that they call "normalised Google distance" (NGD) and that normally lies between 0 and 1. The lower the NGD, the more closely two terms are related. "This is automatic meaning generation," Vitanyi told New Scientist (4). "This could well be a way to make a computer understand things and act semi-intelligently."
If such searches are carried out again and again, a map of the connections between words can be drawn up. From this map, so the hope goes, a computer can in turn grasp the meaning of individual words in different natural languages and contexts. A number of searches are said to have shown that a computer can distinguish between colours and numbers, tell apart 17th-century Dutch painters as well as emergencies and near-emergencies, or understand electrical or religious terms. In addition, a simple automatic English-Spanish translation is said to have been achieved. In this way, the scientists hope, the meaning of words could be learned, speech recognition improved, a semantic web created and, of course, better automatic translation from one language into another finally achieved."

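The page counts quoted in the article are enough to compute the distance for one word pair. The sketch below assumes the formula published by Cilibrasi and Vitanyi, NGD(x, y) = (max(log f(x), log f(y)) - log f(x, y)) / (log N - min(log f(x), log f(y))), which the article itself does not spell out. The function name `ngd` is illustrative, and the counts are the 2005 figures cited in the text.

```python
import math

def ngd(fx, fy, fxy, n):
    """Normalised Google distance from page counts.

    fx, fy: pages containing each term; fxy: pages containing both;
    n: total number of indexed pages.
    """
    lx, ly, lxy = math.log(fx), math.log(fy), math.log(fxy)
    return (max(lx, ly) - lxy) / (math.log(n) - min(lx, ly))

# Counts quoted in the article for "Pferd" and "Reiter"
horse_rider = ngd(4_310_000, 3_400_000, 315_000, 5_000_000_000)
print(round(horse_rider, 3))
```

The pair "Pferd"/"Bart" cannot be computed the same way, because the article gives only the joint count (67,100) and not the page count for "Bart" alone.
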
Field

Informatik

Footnote

Article in the list Telepolis, 27.01.2005

Monnerjahn, P.: Vorsprung ohne Technik : Übersetzen: Computer und Qualität (2000) 0.02

0.022738606 = product of:
  0.10004986 = sum of:
    0.013393938 = weight(_text_:und in 5429) [ClassicSimilarity], result of:
      0.013393938 = score(doc=5429,freq=2.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.29385152 = fieldWeight in 5429, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.09375 = fieldNorm(doc=5429)
    0.013393938 = weight(_text_:und in 5429) [ClassicSimilarity], result of:
      0.013393938 = score(doc=5429,freq=2.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.29385152 = fieldWeight in 5429, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.09375 = fieldNorm(doc=5429)
    0.0050450475 = weight(_text_:in in 5429) [ClassicSimilarity], result of:
      0.0050450475 = score(doc=5429,freq=2.0), product of:
        0.027974274 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.02056547 = queryNorm
        0.18034597 = fieldWeight in 5429, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.09375 = fieldNorm(doc=5429)
    0.051498912 = weight(_text_:computer in 5429) [ClassicSimilarity], result of:
      0.051498912 = score(doc=5429,freq=4.0), product of:
        0.0751567 = queryWeight, product of:
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.02056547 = queryNorm
        0.6852205 = fieldWeight in 5429, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.09375 = fieldNorm(doc=5429)
    0.016718024 = product of:
      0.03343605 = sum of:
        0.03343605 = weight(_text_:22 in 5429) [ClassicSimilarity], result of:
          0.03343605 = score(doc=5429,freq=2.0), product of:
            0.072016776 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02056547 = queryNorm
            0.46428138 = fieldWeight in 5429, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=5429)
      0.5 = coord(1/2)
  0.22727273 = coord(5/22)
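
The tree above also contains a nested clause: the match on "22" is first scaled by its own coord(1/2) and only then added to the outer sum, which is in turn scaled by coord(5/22). A minimal sketch of that combination step, using the clause scores exactly as printed for doc 5429 (the helper name `combine` is hypothetical):

```python
def combine(clause_scores, matched, total):
    # Sum the clause scores, then scale by the coord factor matched/total
    return sum(clause_scores) * (matched / total)

# Inner boolean clause: one of two sub-clauses matched ("22")
inner = combine([0.03343605], 1, 2)

# Outer query: "und" twice, "in", "computer", plus the nested clause
outer = combine(
    [0.013393938, 0.013393938, 0.0050450475, 0.051498912, inner],
    5,
    22,
)
print(inner, outer)
```

Up to single-precision rounding, the two values correspond to the 0.016718024 and 0.022738606 lines of the explanation.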

Abstract: The human translator is still superior to the computer in linguistic terms. Translation software has improved, but the problems inherent in the systems remain.
Source: c't. 2000, no.22, pp.230-231

Hahn, U.: Automatische Sprachverarbeitung (2023) 0.02

0.022287842 = product of:
  0.0980665 = sum of:
    0.041409813 = weight(_text_:informatik in 790) [ClassicSimilarity], result of:
      0.041409813 = score(doc=790,freq=2.0), product of:
        0.104934774 = queryWeight, product of:
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.02056547 = queryNorm
        0.3946243 = fieldWeight in 790, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.0546875 = fieldNorm(doc=790)
    0.015626261 = weight(_text_:und in 790) [ClassicSimilarity], result of:
      0.015626261 = score(doc=790,freq=8.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.34282678 = fieldWeight in 790, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.0546875 = fieldNorm(doc=790)
    0.015626261 = weight(_text_:und in 790) [ClassicSimilarity], result of:
      0.015626261 = score(doc=790,freq=8.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.34282678 = fieldWeight in 790, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.0546875 = fieldNorm(doc=790)
    0.004161952 = weight(_text_:in in 790) [ClassicSimilarity], result of:
      0.004161952 = score(doc=790,freq=4.0), product of:
        0.027974274 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.02056547 = queryNorm
        0.14877784 = fieldWeight in 790, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0546875 = fieldNorm(doc=790)
    0.021242218 = weight(_text_:computer in 790) [ClassicSimilarity], result of:
      0.021242218 = score(doc=790,freq=2.0), product of:
        0.0751567 = queryWeight, product of:
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.02056547 = queryNorm
        0.28263903 = fieldWeight in 790, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.0546875 = fieldNorm(doc=790)
  0.22727273 = coord(5/22)

Abstract: This chapter gives an overview of the machine processing of natural languages (such as German or English; natural language - NL) by computers. Basic concepts of automatic language processing (natural language processing - NLP) originate in linguistics (see section 2) and have been combined, in an increasingly autonomous way, with formal methods and technical foundations of computer science into a discipline of its own, computational linguistics (CL; see sections 3 and 4). Natural language systems (NatS) with application-oriented functional requirements form the core of information-science-oriented NLP, which is often called language technology or, in German, also (by now outdated) Informationslinguistik (see section 5).
Source: Grundlagen der Informationswissenschaft. Ed.: Rainer Kuhlen, Dirk Lewandowski, Wolfgang Semar and Christa Womser-Hacker. 7th, completely rewritten ed.

Schürmann, H.: Software scannt Radio- und Fernsehsendungen : Recherche in Nachrichtenarchiven erleichtert (2001) 0.02
```
0.021862883 = product of:
  0.0801639 = sum of:
    0.020704906 = weight(_text_:informatik in 5759) [ClassicSimilarity], result of:
      0.020704906 = score(doc=5759,freq=2.0), product of:
        0.104934774 = queryWeight, product of:
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.02056547 = queryNorm
        0.19731215 = fieldWeight in 5759, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.02734375 = fieldNorm(doc=5759)
    0.017028322 = weight(_text_:und in 5759) [ClassicSimilarity], result of:
      0.017028322 = score(doc=5759,freq=38.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.3735868 = fieldWeight in 5759, product of:
          6.164414 = tf(freq=38.0), with freq of:
            38.0 = termFreq=38.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02734375 = fieldNorm(doc=5759)
    0.017028322 = weight(_text_:und in 5759) [ClassicSimilarity], result of:
      0.017028322 = score(doc=5759,freq=38.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.3735868 = fieldWeight in 5759, product of:
          6.164414 = tf(freq=38.0), with freq of:
            38.0 = termFreq=38.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02734375 = fieldNorm(doc=5759)
    0.005505745 = weight(_text_:in in 5759) [ClassicSimilarity], result of:
      0.005505745 = score(doc=5759,freq=28.0), product of:
        0.027974274 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.02056547 = queryNorm
        0.19681457 = fieldWeight in 5759, product of:
          5.2915025 = tf(freq=28.0), with freq of:
            28.0 = termFreq=28.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.02734375 = fieldNorm(doc=5759)
    0.015020517 = weight(_text_:computer in 5759) [ClassicSimilarity], result of:
      0.015020517 = score(doc=5759,freq=4.0), product of:
        0.0751567 = queryWeight, product of:
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.02056547 = queryNorm
        0.19985598 = fieldWeight in 5759, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.02734375 = fieldNorm(doc=5759)
    0.00487609 = product of:
      0.00975218 = sum of:
        0.00975218 = weight(_text_:22 in 5759) [ClassicSimilarity], result of:
          0.00975218 = score(doc=5759,freq=2.0), product of:
            0.072016776 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02056547 = queryNorm
            0.1354154 = fieldWeight in 5759, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02734375 = fieldNorm(doc=5759)
      0.5 = coord(1/2)
  0.27272728 = coord(6/22)
```
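The per-term weights in the tree above can be recomputed by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are hypothetical, and queryNorm and fieldNorm are taken from the tree as given rather than derived.

```python
import math

def classic_idf(doc_freq, max_docs):
    # ClassicSimilarity inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    # tf is the square root of the raw term frequency
    tf = math.sqrt(freq)
    idf = classic_idf(doc_freq, max_docs)
    query_weight = idf * query_norm        # "queryWeight" node in the tree
    field_weight = tf * idf * field_norm   # "fieldWeight" node in the tree
    return query_weight * field_weight

# Values copied from the "informatik in 5759" clause of the explain tree
score = classic_term_score(
    freq=2.0,
    doc_freq=730,
    max_docs=44218,
    query_norm=0.02056547,
    field_norm=0.02734375,
)
print(score)
```

With these inputs the result should agree with the 0.020704906 shown for `_text_:informatik`, up to floating-point rounding (Lucene computes in 32-bit floats).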
Abstract

Computers have to learn to understand human language. Researchers at the university in Duisburg have developed a method that lets a computer filter information out of radio broadcasts.

Content

To make media monitoring easier for companies and agencies, researchers at the Duisburg university are currently developing a system for automatic topic detection in radio and television. The so-called Alert system is meant to help users filter out and further process the spoken information relevant to them from news broadcasts. Because the computer performs the analysis automatically, several channels can be monitored around the clock. So far, information is still extracted from TV and radio broadcasts in the classic way: a person watches, listens, reads and evaluates. That is enormously time-consuming and, for a company that wants to have its competitors monitored or its media presence documented, also very expensive. This work could be automated with a speech recognizer, the Duisburg researchers reasoned. They are now working with partners from Germany, France and Portugal in a Europe-wide project to develop such a technology (http://alert.uni-duisburg.de). Two media monitoring companies are also involved in the project: Observer Argus Media GmbH from Baden-Baden and the French company Secodip. "Our work would already be made easier if information that appears in the media about our customers were preselected," says Simone Holderbach, head of product development at Observer, describing her interest in the technology. And how does Alert work? The speech recognition system is tuned to monitor news broadcasts on radio and television: everything that is said, whether by the newsreader, a reporter or an interviewee, is converted into text by automatic speech recognition. In the process, topics and keywords are detected and stored. These are compared with the user's search terms. Any matches found are displayed and automatically reported to the user.
Conventional speech recognition technology cannot be used for media monitoring because it was developed for a different purpose, stresses Prof. Gerhard Rigoll, head of the Technische Informatik (computer engineering) department at the Duisburg university. The Alert software was trained thoroughly for converting speech into text. So far, around 350 million words from newspaper texts, audio and video material have been processed. The system works in three languages. Yet the automatically produced text is not entirely error-free, Rigoll concedes. "At present the recognition rate is 40 to 70 percent. And that will not change in the foreseeable future." Music overlays or loud background noise in reports lead to inaccuracies in the text conversion. The Duisburg scientists have therefore developed methods that go beyond conventional keyword search and enable content-oriented assignment. "This way the user also receives news items that fit the topic but in which the keyword does not appear at all," is how Rigoll sums up the advantage of the technology. If, for example, "Ölpreis" (oil price) is entered as a search term, news items in which oil companies and energy agencies play a role are displayed as well. Rigoll: "The Alert system reads between the lines, so to speak!" The research project was launched a year ago and runs until mid-2002. Anyone who wants to find out about the state of the technology can do so this week at the industrial trade fair in Hannover. The Alert system is being presented at the joint stand "Forschungsland NRW" in Hall 18, Stand M12.

Source

Handelsblatt. No. 79, 24 April 2001, p. 22

Sagawe, H.: Einfluß 'intelligenter' Maschinen auf menschliches Verhalten (1994) 0.02

0.021296024 = product of:
  0.117128134 = sum of:
    0.0661394 = weight(_text_:informatik in 1714) [ClassicSimilarity], result of:
      0.0661394 = score(doc=1714,freq=10.0), product of:
        0.104934774 = queryWeight, product of:
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.02056547 = queryNorm
        0.63029057 = fieldWeight in 1714, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1714)
    0.014765427 = weight(_text_:und in 1714) [ClassicSimilarity], result of:
      0.014765427 = score(doc=1714,freq=14.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.32394084 = fieldWeight in 1714, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1714)
    0.014765427 = weight(_text_:und in 1714) [ClassicSimilarity], result of:
      0.014765427 = score(doc=1714,freq=14.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.32394084 = fieldWeight in 1714, product of:
          3.7416575 = tf(freq=14.0), with freq of:
            14.0 = termFreq=14.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1714)
    0.02145788 = weight(_text_:computer in 1714) [ClassicSimilarity], result of:
      0.02145788 = score(doc=1714,freq=4.0), product of:
        0.0751567 = queryWeight, product of:
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.02056547 = queryNorm
        0.28550854 = fieldWeight in 1714, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1714)
  0.18181819 = coord(4/22)

BK: 54.80 Angewandte Informatik
Classification: MS 4850 Soziologie / Spezielle Soziologien / Wirtschafts- und Industriesoziologie / Industrie (allgemeines) und Technik (Automatisierung)
ST 278 Informatik / Monographien / Software und -entwicklung / Mensch-Maschine-Kommunikation Software-Ergonomie
54.80 Angewandte Informatik
Content: 1. The human-machine relationship: Language as an element of the human-machine relationship. The speech act. The command as a language element in human-machine communication. Further language elements of human-machine communication. Do computers speak differently? Will we speak like computers? 2. Micro- and macro-sociological influences: Anthropomorphization of the computer. Digitized society. Subgroup-specific influence of the computer. 3. Concluding remarks 4. Summary 5. Appendix
Field: Informatik
RVK: MS 4850 Soziologie / Spezielle Soziologien / Wirtschafts- und Industriesoziologie / Industrie (allgemeines) und Technik (Automatisierung)
ST 278 Informatik / Monographien / Software und -entwicklung / Mensch-Maschine-Kommunikation Software-Ergonomie

Hodgson, J.P.E.: Knowledge representation and language in AI (1991) 0.02

0.021278007 = product of:
  0.09362323 = sum of:
    0.051231362 = weight(_text_:informatik in 1529) [ClassicSimilarity], result of:
      0.051231362 = score(doc=1529,freq=6.0), product of:
        0.104934774 = queryWeight, product of:
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.02056547 = queryNorm
        0.48822102 = fieldWeight in 1529, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1529)
    0.007892453 = weight(_text_:und in 1529) [ClassicSimilarity], result of:
      0.007892453 = score(doc=1529,freq=4.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.17315367 = fieldWeight in 1529, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1529)
    0.007892453 = weight(_text_:und in 1529) [ClassicSimilarity], result of:
      0.007892453 = score(doc=1529,freq=4.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.17315367 = fieldWeight in 1529, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1529)
    0.00514908 = weight(_text_:in in 1529) [ClassicSimilarity], result of:
      0.00514908 = score(doc=1529,freq=12.0), product of:
        0.027974274 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.02056547 = queryNorm
        0.18406484 = fieldWeight in 1529, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1529)
    0.02145788 = weight(_text_:computer in 1529) [ClassicSimilarity], result of:
      0.02145788 = score(doc=1529,freq=4.0), product of:
        0.0751567 = queryWeight, product of:
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.02056547 = queryNorm
        0.28550854 = fieldWeight in 1529, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1529)
  0.22727273 = coord(5/22)

Abstract: The aim of this book is to highlight the relationship between knowledge representation and language in artificial intelligence, and in particular the way in which the choice of representation influences the language used to discuss a problem - and vice versa. Opening with a discussion of knowledge representation methods, and following this with a look at reasoning methods, the author begins to make his case for the intimate relationship between language and representation. He shows how each representation method fits particularly well with some reasoning methods and less so with others, using specific languages as examples. The question of representation change, an important and complex issue about which very little is known, is addressed. Dr Hodgson gathers together recent work on problem solving, showing how, in some cases, it has been possible to use representation changes to recast problems into a language that makes them easier to solve. The author maintains throughout that the relationships that this book explores lie at the heart of the construction of large systems, examining a number of the current large AI systems from the viewpoint of representation and language to prove his point.
Classification: ST 285 Informatik / Monographien / Software und -entwicklung / Computer supported cooperative work (CSCW), Groupware
Field: Informatik
RVK: ST 285 Informatik / Monographien / Software und -entwicklung / Computer supported cooperative work (CSCW), Groupware
Series: Ellis Horwood series in artificial intelligence

Conceptual structures : logical, linguistic, and computational issues. 8th International Conference on Conceptual Structures, ICCS 2000, Darmstadt, Germany, August 14-18, 2000 (2000) 0.02
```
0.020455712 = product of:
  0.09000513 = sum of:
    0.040368404 = weight(_text_:lecture in 691) [ClassicSimilarity], result of:
      0.040368404 = score(doc=691,freq=4.0), product of:
        0.13308205 = queryWeight, product of:
          6.4711404 = idf(docFreq=185, maxDocs=44218)
          0.02056547 = queryNorm
        0.3033347 = fieldWeight in 691, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          6.4711404 = idf(docFreq=185, maxDocs=44218)
          0.0234375 = fieldNorm(doc=691)
    0.027515145 = weight(_text_:notes in 691) [ClassicSimilarity], result of:
      0.027515145 = score(doc=691,freq=4.0), product of:
        0.10987139 = queryWeight, product of:
          5.3425174 = idf(docFreq=574, maxDocs=44218)
          0.02056547 = queryNorm
        0.2504305 = fieldWeight in 691, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          5.3425174 = idf(docFreq=574, maxDocs=44218)
          0.0234375 = fieldNorm(doc=691)
    0.00398846 = weight(_text_:in in 691) [ClassicSimilarity], result of:
      0.00398846 = score(doc=691,freq=20.0), product of:
        0.027974274 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.02056547 = queryNorm
        0.14257601 = fieldWeight in 691, product of:
          4.472136 = tf(freq=20.0), with freq of:
            20.0 = termFreq=20.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.0234375 = fieldNorm(doc=691)
    0.015768258 = weight(_text_:computer in 691) [ClassicSimilarity], result of:
      0.015768258 = score(doc=691,freq=6.0), product of:
        0.0751567 = queryWeight, product of:
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.02056547 = queryNorm
        0.20980507 = fieldWeight in 691, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.6545093 = idf(docFreq=3109, maxDocs=44218)
          0.0234375 = fieldNorm(doc=691)
    0.0023648618 = product of:
      0.0047297236 = sum of:
        0.0047297236 = weight(_text_:science in 691) [ClassicSimilarity], result of:
          0.0047297236 = score(doc=691,freq=2.0), product of:
            0.0541719 = queryWeight, product of:
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.02056547 = queryNorm
            0.08730954 = fieldWeight in 691, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.6341193 = idf(docFreq=8627, maxDocs=44218)
              0.0234375 = fieldNorm(doc=691)
      0.5 = coord(1/2)
  0.22727273 = coord(5/22)
```
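The top of the tree combines the clause scores with a coordination factor: the sum of the matching clauses is multiplied by coord(matched/total), and a nested boolean clause applies its own coord first. The sketch below reproduces that arithmetic for doc 691. The helper name `combine_with_coord` is hypothetical, and the clause scores are copied from the tree rather than recomputed.

```python
def combine_with_coord(clause_scores, matched, total):
    # Boolean query score under ClassicSimilarity:
    # sum of matching clause scores, scaled by coord = matched / total
    return sum(clause_scores) * (matched / total)

# Nested clause: "science" matched 1 of 2 sub-clauses, so coord(1/2) applies
nested_science = combine_with_coord([0.0047297236], matched=1, total=2)

# Top-level clauses for doc 691: lecture, notes, in, computer, nested science clause
clauses_691 = [
    0.040368404,
    0.027515145,
    0.00398846,
    0.015768258,
    nested_science,
]

# 5 of the 22 top-level query clauses matched, hence coord(5/22)
score_691 = combine_with_coord(clauses_691, matched=5, total=22)
print(nested_science, score_691)
```

The two values should agree with the 0.0023648618 and 0.020455712 reported in the tree, up to floating-point rounding.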
Abstract

Computer scientists create models of a perceived reality. Through AI techniques, these models aim at providing the basic support for emulating cognitive behavior such as reasoning and learning, which is one of the main goals of the AI research effort. Such computer models are formed through the interaction of various acquisition and inference mechanisms: perception, concept learning, conceptual clustering, hypothesis testing, probabilistic inference, etc., and are represented using different paradigms tightly linked to the processes that use them. Among these paradigms let us cite: biological models (neural nets, genetic programming), logic-based models (first-order logic, modal logic, rule-based systems), virtual reality models (object systems, agent systems), probabilistic models (Bayesian nets, fuzzy logic), linguistic models (conceptual dependency graphs, language-based representations), etc. One of the strengths of the Conceptual Graph (CG) theory is its versatility in terms of the representation paradigms under which it falls. It can be viewed, and therefore used, under different representation paradigms, which makes it a popular choice for a wealth of applications. Its full coupling with different cognitive processes has led to the opening of the field toward related research communities such as the Description Logic, Formal Concept Analysis, and Computational Linguistic communities. We now see more and more research results from one community enrich the other, laying the foundations of common philosophical grounds from which a successful synergy can emerge. ICCS 2000 embodies this spirit of research collaboration. It presents a set of papers that we believe, by their exposure, will benefit the whole community. For instance, the technical program proposes tracks on Conceptual Ontologies, Language, Formal Concept Analysis, Computational Aspects of Conceptual Structures, and Formal Semantics, with some papers on pragmatism and human related aspects of computing.
Never before was the program of ICCS formed by so heterogeneously rooted theories of knowledge representation and use. We hope that this swirl of ideas will benefit you as much as it already has benefited us while putting together this program

Content

Concepts and Language: The Role of Conceptual Structure in Human Evolution (Keith Devlin) - Concepts in Linguistics - Concepts in Natural Language (Gisela Harras) - Patterns, Schemata, and Types: Author Support through Formalized Experience (Felix H. Gatzemeier) - Conventions and Notations for Knowledge Representation and Retrieval (Philippe Martin) - Conceptual Ontology: Ontology, Metadata, and Semiotics (John F. Sowa) - Pragmatically Yours (Mary Keeler) - Conceptual Modeling for Distributed Ontology Environments (Deborah L. McGuinness) - Discovery of Class Relations in Exception Structured Knowledge Bases (Hendra Suryanto, Paul Compton) - Conceptual Graphs: Perspectives: CGs Applications: Where Are We 7 Years after the First ICCS? (Michel Chein, David Genest) - The Engineering of a CG-Based System: Fundamental Issues (Guy W. Mineau) - Conceptual Graphs, Metamodeling, and Notation of Concepts (Olivier Gerbé, Guy W. Mineau, Rudolf K. Keller) - Knowledge Representation and Reasonings: Based on Graph Homomorphism (Marie-Laure Mugnier) - User Modeling Using Conceptual Graphs for Intelligent Agents (James F. Baldwin, Trevor P. Martin, Aimilia Tzanavari) - Towards a Unified Querying System of Both Structured and Semi-structured Imprecise Data Using Fuzzy View (Patrice Buche, Ollivier Haemmerlé) -
Formal Semantics of Conceptual Structures: The Extensional Semantics of the Conceptual Graph Formalism (Guy W. Mineau) - Semantics of Attribute Relations in Conceptual Graphs (Pavel Kocura) - Nested Concept Graphs and Triadic Power Context Families (Susanne Prediger) - Negations in Simple Concept Graphs (Frithjof Dau) - Extending the CG Model by Simulations (Jean-François Baget) - Contextual Logic and Formal Concept Analysis: Building and Structuring Description Logic Knowledge Bases: Using Least Common Subsumers and Concept Analysis (Franz Baader, Ralf Molitor) - On the Contextual Logic of Ordinal Data (Silke Pollandt, Rudolf Wille) - Boolean Concept Logic (Rudolf Wille) - Lattices of Triadic Concept Graphs (Bernd Groh, Rudolf Wille) - Formalizing Hypotheses with Concepts (Bernhard Ganter, Sergei O. Kuznetsov) - Generalized Formal Concept Analysis (Laurent Chaudron, Nicolas Maille) - A Logical Generalization of Formal Concept Analysis (Sébastien Ferré, Olivier Ridoux) - On the Treatment of Incomplete Knowledge in Formal Concept Analysis (Peter Burmeister, Richard Holzer) - Conceptual Structures in Practice: Logic-Based Networks: Concept Graphs and Conceptual Structures (Peter W. Eklund) - Conceptual Knowledge Discovery and Data Analysis (Joachim Hereth, Gerd Stumme, Rudolf Wille, Uta Wille) - CEM - A Conceptual Email Manager (Richard Cole, Gerd Stumme) - A Contextual-Logic Extension of TOSCANA (Peter Eklund, Bernd Groh, Gerd Stumme, Rudolf Wille) - A Conceptual Graph Model for W3C Resource Description Framework (Olivier Corby, Rose Dieng, Cédric Hébert) -
Computational Aspects of Conceptual Structures: Computing with Conceptual Structures (Bernhard Ganter) - Symmetry and the Computation of Conceptual Structures (Robert Levinson) - An Introduction to SNePS 3 (Stuart C. Shapiro) - Composition Norm Dynamics Calculation with Conceptual Graphs (Aldo de Moor) - From PROLOG++ to PROLOG+CG: A CG Object-Oriented Logic Programming Language (Adil Kabbaj, Martin Janta-Polczynski) - A Cost-Bounded Algorithm to Control Events Generalization (Gaël de Chalendar, Brigitte Grau, Olivier Ferret)

Series

Lecture notes in computer science; vol.1867: Lecture notes on artificial intelligence

Seelbach, D.: Computerlinguistik und Dokumentation : keyphrases in Dokumentationsprozessen (1975) 0.02

0.020200292 = product of:
  0.111101605 = sum of:
    0.05019627 = weight(_text_:informatik in 299) [ClassicSimilarity], result of:
      0.05019627 = score(doc=299,freq=4.0), product of:
        0.104934774 = queryWeight, product of:
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.02056547 = queryNorm
        0.4783569 = fieldWeight in 299, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          5.1024737 = idf(docFreq=730, maxDocs=44218)
          0.046875 = fieldNorm(doc=299)
    0.029191408 = weight(_text_:und in 299) [ClassicSimilarity], result of:
      0.029191408 = score(doc=299,freq=38.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.6404345 = fieldWeight in 299, product of:
          6.164414 = tf(freq=38.0), with freq of:
            38.0 = termFreq=38.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.046875 = fieldNorm(doc=299)
    0.029191408 = weight(_text_:und in 299) [ClassicSimilarity], result of:
      0.029191408 = score(doc=299,freq=38.0), product of:
        0.04558063 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02056547 = queryNorm
        0.6404345 = fieldWeight in 299, product of:
          6.164414 = tf(freq=38.0), with freq of:
            38.0 = termFreq=38.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.046875 = fieldNorm(doc=299)
    0.0025225237 = weight(_text_:in in 299) [ClassicSimilarity], result of:
      0.0025225237 = score(doc=299,freq=2.0), product of:
        0.027974274 = queryWeight, product of:
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.02056547 = queryNorm
        0.09017298 = fieldWeight in 299, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          1.3602545 = idf(docFreq=30841, maxDocs=44218)
          0.046875 = fieldNorm(doc=299)
  0.18181819 = coord(4/22)

BK: 54.80 Angewandte Informatik
Classification: ES 950 Allgemeine und vergleichende Sprach- und Literaturwissenschaft. Indogermanistik. Außereuropäische Sprachen und Literaturen / Spezialbereiche der allgemeinen Sprachwissenschaft / Datenverarbeitung und Sprachwissenschaft. Computerlinguistik / Maschinelle Redeanalyse
ES 955 Allgemeine und vergleichende Sprach- und Literaturwissenschaft. Indogermanistik. Außereuropäische Sprachen und Literaturen / Spezialbereiche der allgemeinen Sprachwissenschaft / Datenverarbeitung und Sprachwissenschaft. Computerlinguistik / Maschinelle Referatherstellung, linguistische Dokumentation und Information
54.80 Angewandte Informatik
RVK: ES 950 Allgemeine und vergleichende Sprach- und Literaturwissenschaft. Indogermanistik. Außereuropäische Sprachen und Literaturen / Spezialbereiche der allgemeinen Sprachwissenschaft / Datenverarbeitung und Sprachwissenschaft. Computerlinguistik / Maschinelle Redeanalyse
ES 955 Allgemeine und vergleichende Sprach- und Literaturwissenschaft. Indogermanistik. Außereuropäische Sprachen und Literaturen / Spezialbereiche der allgemeinen Sprachwissenschaft / Datenverarbeitung und Sprachwissenschaft. Computerlinguistik / Maschinelle Referatherstellung, linguistische Dokumentation und Information
