Document (#23564)

Author
Schirmer, K.
Haller, J.
Title
Zugang zu mehrsprachigen Nachrichten im Internet
Source
Sprachtechnologie für eine dynamische Wirtschaft im Medienzeitalter - Language technologies for dynamic business in the age of the media - L'ingénierie linguistique au service de la dynamisation économique à l'ère du multimédia: Tagungsakten der XXVI. Jahrestagung der Internationalen Vereinigung Sprache und Wirtschaft e.V., 23.-25.11.2000, Fachhochschule Köln. Hrsg.: K.-D. Schmitz
Imprint
Wien : Termnet
Year
2000
Pages
S.23-24
Abstract
In a cooperation between smart information and the IAI, about 20,000 current news items (in German) are linguistically indexed every day. The news items are collected daily from a wide range of Internet sources by newscan (http://www.newscan.de), the news search engine operated by smart information. Users can search with freely chosen terms. The result of such a keyword search is displayed as a table, ordered by frequency. For larger result sets (more than ten documents), the news items are automatically grouped (cluster analysis) and given a label (topic). These topics are displayed in a tree structure, so the user can go straight to a particular topic area. The cluster analysis is based on the automatic grouping of the documents and their keywords (descriptors), as generated by AUDESC, the IAI's automatic descriptor-assignment module. The news items, compiled into one large file, are sent to the IAI every night. A version of the indexing module AUTINDEX specially adapted to these news items assigns subject headings to each individual news item.
Theme
Multilingual problems
Internet

Similar documents (author)

  1. Haller, K.: ¬Das Katalogsystem der Bayerischen Staatsbibliothek (1991) 5.56
    5.56391 = sum of:
      5.56391 = weight(author_txt:haller in 526) [ClassicSimilarity], result of:
        5.56391 = fieldWeight in 526, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.902256 = idf(docFreq=15, maxDocs=43254)
          0.625 = fieldNorm(doc=526)
    
  2. Haller, K.: Regelwerke und Normdateien in Verbundbibliotheken (1988) 5.56
    5.56391 = sum of:
      5.56391 = weight(author_txt:haller in 702) [ClassicSimilarity], result of:
        5.56391 = fieldWeight in 702, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.902256 = idf(docFreq=15, maxDocs=43254)
          0.625 = fieldNorm(doc=702)
    
  3. Haller, K.: Kommunikation, Normung und Kataloge (1990) 5.56
    5.56391 = sum of:
      5.56391 = weight(author_txt:haller in 1147) [ClassicSimilarity], result of:
        5.56391 = fieldWeight in 1147, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.902256 = idf(docFreq=15, maxDocs=43254)
          0.625 = fieldNorm(doc=1147)
    
  4. Haller, K.: ¬Der Image-Katalog 1953-1981 der Bayerischen Staatsbibliothek (1997) 5.56
    5.56391 = sum of:
      5.56391 = weight(author_txt:haller in 1459) [ClassicSimilarity], result of:
        5.56391 = fieldWeight in 1459, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.902256 = idf(docFreq=15, maxDocs=43254)
          0.625 = fieldNorm(doc=1459)
    
  5. Haller, K.: Katalogkunde : Formalkataloge und formale Ordnungsmethoden (1983) 5.56
    5.56391 = sum of:
      5.56391 = weight(author_txt:haller in 2705) [ClassicSimilarity], result of:
        5.56391 = fieldWeight in 2705, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.902256 = idf(docFreq=15, maxDocs=43254)
          0.625 = fieldNorm(doc=2705)
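
The five author matches above all carry the same score because each is a single-term fieldWeight with identical inputs. As a minimal sketch, the arithmetic can be reproduced with the tf and idf definitions of Lucene's ClassicSimilarity (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))). The function names below are illustrative only and are not part of any library; the input values are copied from the listing.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as defined by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term frequency factor as defined by ClassicSimilarity."""
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as shown in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values from the listing: author_txt:haller, freq=1, docFreq=15, maxDocs=43254, fieldNorm=0.625
idf = classic_idf(doc_freq=15, max_docs=43254)
score = field_weight(freq=1.0, doc_freq=15, max_docs=43254, field_norm=0.625)
print(f"idf={idf:.6f} score={score:.5f}")
```

Up to floating-point rounding, this should agree with the idf of 8.902256 and the score of 5.56391 shown for each entry. Lucene computes in single precision, so the last digits can differ slightly.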
    

Similar documents (content)

  1. Deuble, M.; Niemann, J.: Elektronische Archivierung für die Lokalberichterstattung : Szenarien der Archivreorganisation am Beispiel der Cuxhavener Nachrichten (1997) 0.16
    0.15890566 = sum of:
      0.15890566 = product of:
        0.99316037 = sum of:
          0.15227574 = weight(abstract_txt:20.000 in 3484) [ClassicSimilarity], result of:
            0.15227574 = score(doc=3484,freq=1.0), product of:
              0.14036065 = queryWeight, product of:
                1.0263889 = boost
                8.679112 = idf(docFreq=19, maxDocs=43254)
                0.01575644 = queryNorm
              1.084889 = fieldWeight in 3484, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.679112 = idf(docFreq=19, maxDocs=43254)
                0.125 = fieldNorm(doc=3484)
          0.11678308 = weight(abstract_txt:einer in 3484) [ClassicSimilarity], result of:
            0.11678308 = score(doc=3484,freq=2.0), product of:
              0.16961019 = queryWeight, product of:
                2.7636998 = boost
                3.89496 = idf(docFreq=2391, maxDocs=43254)
                0.01575644 = queryNorm
              0.68853813 = fieldWeight in 3484, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.89496 = idf(docFreq=2391, maxDocs=43254)
                0.125 = fieldNorm(doc=3484)
          0.123202346 = weight(abstract_txt:werden in 3484) [ClassicSimilarity], result of:
            0.123202346 = score(doc=3484,freq=3.0), product of:
              0.16164532 = queryWeight, product of:
                2.9142034 = boost
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.01575644 = queryNorm
              0.762177 = fieldWeight in 3484, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.125 = fieldNorm(doc=3484)
          0.6008992 = weight(abstract_txt:nachrichten in 3484) [ClassicSimilarity], result of:
            0.6008992 = score(doc=3484,freq=1.0), product of:
              0.6369076 = queryWeight, product of:
                5.355538 = boost
                7.5477104 = idf(docFreq=61, maxDocs=43254)
                0.01575644 = queryNorm
              0.9434638 = fieldWeight in 3484, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5477104 = idf(docFreq=61, maxDocs=43254)
                0.125 = fieldNorm(doc=3484)
        0.16 = coord(4/25)
    
  2. Zoller, P.: ¬Die wohldurchdachte Aufbereitung des Informationsangebotes : Teil einer computergestützten Sachbearbeitung (1992) 0.16
    0.15887108 = sum of:
      0.15887108 = product of:
        0.7943554 = sum of:
          0.0337129 = weight(abstract_txt:diese in 2402) [ClassicSimilarity], result of:
            0.0337129 = score(doc=2402,freq=1.0), product of:
              0.07074465 = queryWeight, product of:
                1.0305073 = boost
                4.356969 = idf(docFreq=1506, maxDocs=43254)
                0.01575644 = queryNorm
              0.4765435 = fieldWeight in 2402, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.356969 = idf(docFreq=1506, maxDocs=43254)
                0.109375 = fieldNorm(doc=2402)
          0.10036026 = weight(abstract_txt:dokumente in 2402) [ClassicSimilarity], result of:
            0.10036026 = score(doc=2402,freq=1.0), product of:
              0.14639875 = queryWeight, product of:
                1.4824258 = boost
                6.267673 = idf(docFreq=222, maxDocs=43254)
                0.01575644 = queryNorm
              0.6855267 = fieldWeight in 2402, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.267673 = idf(docFreq=222, maxDocs=43254)
                0.109375 = fieldNorm(doc=2402)
          0.07225584 = weight(abstract_txt:einer in 2402) [ClassicSimilarity], result of:
            0.07225584 = score(doc=2402,freq=1.0), product of:
              0.16961019 = queryWeight, product of:
                2.7636998 = boost
                3.89496 = idf(docFreq=2391, maxDocs=43254)
                0.01575644 = queryNorm
              0.42601123 = fieldWeight in 2402, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.89496 = idf(docFreq=2391, maxDocs=43254)
                0.109375 = fieldNorm(doc=2402)
          0.062239546 = weight(abstract_txt:werden in 2402) [ClassicSimilarity], result of:
            0.062239546 = score(doc=2402,freq=1.0), product of:
              0.16164532 = queryWeight, product of:
                2.9142034 = boost
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.01575644 = queryNorm
              0.38503772 = fieldWeight in 2402, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.109375 = fieldNorm(doc=2402)
          0.5257868 = weight(abstract_txt:nachrichten in 2402) [ClassicSimilarity], result of:
            0.5257868 = score(doc=2402,freq=1.0), product of:
              0.6369076 = queryWeight, product of:
                5.355538 = boost
                7.5477104 = idf(docFreq=61, maxDocs=43254)
                0.01575644 = queryNorm
              0.8255308 = fieldWeight in 2402, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5477104 = idf(docFreq=61, maxDocs=43254)
                0.109375 = fieldNorm(doc=2402)
        0.2 = coord(5/25)
    
  3. Start von Wikinews (2005) 0.15
    0.15229064 = sum of:
      0.15229064 = product of:
        1.2690887 = sum of:
          0.054061107 = weight(abstract_txt:kann in 5301) [ClassicSimilarity], result of:
            0.054061107 = score(doc=5301,freq=1.0), product of:
              0.076410234 = queryWeight, product of:
                1.0709767 = boost
                4.528073 = idf(docFreq=1269, maxDocs=43254)
                0.01575644 = queryNorm
              0.70751137 = fieldWeight in 5301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.528073 = idf(docFreq=1269, maxDocs=43254)
                0.15625 = fieldNorm(doc=5301)
          0.15277782 = weight(abstract_txt:jeder in 5301) [ClassicSimilarity], result of:
            0.15277782 = score(doc=5301,freq=1.0), product of:
              0.15273377 = queryWeight, product of:
                1.5141602 = boost
                6.4018455 = idf(docFreq=194, maxDocs=43254)
                0.01575644 = queryNorm
              1.0002884 = fieldWeight in 5301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4018455 = idf(docFreq=194, maxDocs=43254)
                0.15625 = fieldNorm(doc=5301)
          1.0622498 = weight(abstract_txt:nachrichten in 5301) [ClassicSimilarity], result of:
            1.0622498 = score(doc=5301,freq=2.0), product of:
              0.6369076 = queryWeight, product of:
                5.355538 = boost
                7.5477104 = idf(docFreq=61, maxDocs=43254)
                0.01575644 = queryNorm
              1.667824 = fieldWeight in 5301, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5477104 = idf(docFreq=61, maxDocs=43254)
                0.15625 = fieldNorm(doc=5301)
        0.12 = coord(3/25)
    
  4. Albrecht, C.: ¬Die Entdeckung der Weitschweifigkeit : Über das Glück, mit Markow-Ketten zu rasseln: Die Schriften Claude E. Shannons (2001) 0.15
    0.15056658 = sum of:
      0.15056658 = product of:
        0.5377378 = sum of:
          0.07613787 = weight(abstract_txt:nachricht in 644) [ClassicSimilarity], result of:
            0.07613787 = score(doc=644,freq=4.0), product of:
              0.14036065 = queryWeight, product of:
                1.0263889 = boost
                8.679112 = idf(docFreq=19, maxDocs=43254)
                0.01575644 = queryNorm
              0.5424445 = fieldWeight in 644, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.679112 = idf(docFreq=19, maxDocs=43254)
                0.03125 = fieldNorm(doc=644)
          0.009632257 = weight(abstract_txt:diese in 644) [ClassicSimilarity], result of:
            0.009632257 = score(doc=644,freq=1.0), product of:
              0.07074465 = queryWeight, product of:
                1.0305073 = boost
                4.356969 = idf(docFreq=1506, maxDocs=43254)
                0.01575644 = queryNorm
              0.13615528 = fieldWeight in 644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.356969 = idf(docFreq=1506, maxDocs=43254)
                0.03125 = fieldNorm(doc=644)
          0.01529079 = weight(abstract_txt:kann in 644) [ClassicSimilarity], result of:
            0.01529079 = score(doc=644,freq=2.0), product of:
              0.076410234 = queryWeight, product of:
                1.0709767 = boost
                4.528073 = idf(docFreq=1269, maxDocs=43254)
                0.01575644 = queryNorm
              0.20011443 = fieldWeight in 644, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.528073 = idf(docFreq=1269, maxDocs=43254)
                0.03125 = fieldNorm(doc=644)
          0.02867436 = weight(abstract_txt:dokumente in 644) [ClassicSimilarity], result of:
            0.02867436 = score(doc=644,freq=1.0), product of:
              0.14639875 = queryWeight, product of:
                1.4824258 = boost
                6.267673 = idf(docFreq=222, maxDocs=43254)
                0.01575644 = queryNorm
              0.19586478 = fieldWeight in 644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.267673 = idf(docFreq=222, maxDocs=43254)
                0.03125 = fieldNorm(doc=644)
          0.041289054 = weight(abstract_txt:einer in 644) [ClassicSimilarity], result of:
            0.041289054 = score(doc=644,freq=4.0), product of:
              0.16961019 = queryWeight, product of:
                2.7636998 = boost
                3.89496 = idf(docFreq=2391, maxDocs=43254)
                0.01575644 = queryNorm
              0.243435 = fieldWeight in 644, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.89496 = idf(docFreq=2391, maxDocs=43254)
                0.03125 = fieldNorm(doc=644)
          0.030800587 = weight(abstract_txt:werden in 644) [ClassicSimilarity], result of:
            0.030800587 = score(doc=644,freq=3.0), product of:
              0.16164532 = queryWeight, product of:
                2.9142034 = boost
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.01575644 = queryNorm
              0.19054425 = fieldWeight in 644, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5203447 = idf(docFreq=3478, maxDocs=43254)
                0.03125 = fieldNorm(doc=644)
          0.33591288 = weight(abstract_txt:nachrichten in 644) [ClassicSimilarity], result of:
            0.33591288 = score(doc=644,freq=5.0), product of:
              0.6369076 = queryWeight, product of:
                5.355538 = boost
                7.5477104 = idf(docFreq=61, maxDocs=43254)
                0.01575644 = queryNorm
              0.5274123 = fieldWeight in 644, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5477104 = idf(docFreq=61, maxDocs=43254)
                0.03125 = fieldNorm(doc=644)
        0.28 = coord(7/25)
    
  5. Stock, M.: Neuigkeiten auf der Spur : Searches, Tracks und News Pages bei Factiva (2002) 0.14
    0.1357082 = sum of:
      0.1357082 = product of:
        0.8481763 = sum of:
          0.12322615 = weight(abstract_txt:geordnet in 2698) [ClassicSimilarity], result of:
            0.12322615 = score(doc=2698,freq=1.0), product of:
              0.13323596 = queryWeight, product of:
                8.455969 = idf(docFreq=24, maxDocs=43254)
                0.01575644 = queryNorm
              0.92487156 = fieldWeight in 2698, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.455969 = idf(docFreq=24, maxDocs=43254)
                0.109375 = fieldNorm(doc=2698)
          0.1269075 = weight(abstract_txt:tages in 2698) [ClassicSimilarity], result of:
            0.1269075 = score(doc=2698,freq=1.0), product of:
              0.1358765 = queryWeight, product of:
                1.0098606 = boost
                8.5393505 = idf(docFreq=22, maxDocs=43254)
                0.01575644 = queryNorm
              0.93399143 = fieldWeight in 2698, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.5393505 = idf(docFreq=22, maxDocs=43254)
                0.109375 = fieldNorm(doc=2698)
          0.07225584 = weight(abstract_txt:einer in 2698) [ClassicSimilarity], result of:
            0.07225584 = score(doc=2698,freq=1.0), product of:
              0.16961019 = queryWeight, product of:
                2.7636998 = boost
                3.89496 = idf(docFreq=2391, maxDocs=43254)
                0.01575644 = queryNorm
              0.42601123 = fieldWeight in 2698, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.89496 = idf(docFreq=2391, maxDocs=43254)
                0.109375 = fieldNorm(doc=2698)
          0.5257868 = weight(abstract_txt:nachrichten in 2698) [ClassicSimilarity], result of:
            0.5257868 = score(doc=2698,freq=1.0), product of:
              0.6369076 = queryWeight, product of:
                5.355538 = boost
                7.5477104 = idf(docFreq=61, maxDocs=43254)
                0.01575644 = queryNorm
              0.8255308 = fieldWeight in 2698, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5477104 = idf(docFreq=61, maxDocs=43254)
                0.109375 = fieldNorm(doc=2698)
        0.16 = coord(4/25)
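
The content-based scores above combine several term matches. Reading the explain trees, each term contributes queryWeight * fieldWeight, where queryWeight = boost * idf * queryNorm and fieldWeight = tf * idf * fieldNorm; the sum is then multiplied by coord (matched query terms divided by total query terms). The following sketch recomputes the score of the first entry (doc 3484) under those assumptions. The helper names are hypothetical; the boosts, frequencies, queryNorm, and fieldNorm are copied from the listing.

```python
import math

MAX_DOCS = 43254
QUERY_NORM = 0.01575644  # queryNorm as printed in the listing
FIELD_NORM = 0.125       # fieldNorm(doc=3484)


def classic_idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float) -> float:
    """Contribution of one matching term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight


# (term, boost, docFreq, freq in doc 3484), taken from the explain tree
matches = [
    ("20.000", 1.0263889, 19, 1.0),
    ("einer", 2.7636998, 2391, 2.0),
    ("werden", 2.9142034, 3478, 3.0),
    ("nachrichten", 5.355538, 61, 1.0),
]

coord = len(matches) / 25  # 4 of the 25 query terms matched
total = coord * sum(term_score(b, df, f) for _, b, df, f in matches)
print(f"score(doc=3484) = {total:.6f}")
```

The result should come out close to the 0.15890566 shown for entry 1. Because coord scales the whole sum, a document matching few of the 25 query terms stays low in the ranking even when one rare term such as "nachrichten" contributes most of the weight.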