Document (#23563)

Author
Schirmer, K.
Haller, J.
Title
Zugang zu mehrsprachigen Nachrichten im Internet
Source
Sprachtechnologie für eine dynamische Wirtschaft im Medienzeitalter - Language technologies for dynamic business in the age of the media - L'ingénierie linguistique au service de la dynamisation économique à l'ère du multimédia: Tagungsakten der XXVI. Jahrestagung der Internationalen Vereinigung Sprache und Wirtschaft e.V., 23.-25.11.2000, Fachhochschule Köln. Hrsg.: K.-D. Schmitz
Imprint
Wien : Termnet
Year
2000
Pages
S.23-24
Abstract
In einer Kooperation zwischen smart information und dem IAI werden täglich ca. 20.000 aktuelle Nachrichten des Tages (in deutscher Sprache) linguistisch indexiert. Die Nachrichten werden täglich von der Nachrichtensuchmaschine newscan http://www.newscan.de von smart information aus den verschiedensten InternetQuellen gesammelt. Der Benutzer kann mit frei gewählten Begriffen suchen. Das Ergebnis einer solchen Schlüsselwortsuche wird in Tabellenform ausgegeben, nach Häufigkeit geordnet. Bei einer größeren Ergebnismenge (mehr als zehn Dokumente) werden die Nachrichten automatisch gruppiert (Clusteranalyse) und mit einem Label (Thema) versehen. Diese Themen werden in einer Baumstruktur dargestellt. Der Nutzer kann gezielt auf einen Themenbereich zugreifen. Die Clusteranalyse beruht auf der automatischen Gruppierung der Dokumente und ihrer Stichwörter (Deskriptoren), wie sie von dem automatischen Deskribierungsmodul AUDESC des IAI erzeugt werden. Die in einer großen Datei zusammengestellten Nachrichten werden in jeder Nacht an das IAI geschickt. Mit einer speziell an diese Nachrichten angepaßte Version des Indexierungsmoduls AUTINDEX werden jeder einzelnen Nachricht Schlagwörter zugeordnet
Theme
Multilinguale Probleme
Internet

Similar documents (author)

  1. Haller, K.: ¬Das Katalogsystem der Bayerischen Staatsbibliothek (1991) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:haller in 526) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 526, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=526)
    
  2. Haller, K.: Regelwerke und Normdateien in Verbundbibliotheken (1988) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:haller in 702) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 702, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=702)
    
  3. Haller, K.: Kommunikation, Normung und Kataloge (1990) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:haller in 1147) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 1147, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=1147)
    
  4. Haller, K.: ¬Der Image-Katalog 1953-1981 der Bayerischen Staatsbibliothek (1997) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:haller in 458) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 458, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=458)
    
  5. Haller, K.: Katalogkunde : Formalkataloge und formale Ordnungsmethodem (1983) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:haller in 704) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 704, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=704)
    

Similar documents (content)

  1. Deuble, M.; Niemann, J.: Elektronische Archivierung für die Lokalberichterstattung : Szenarien der Archivreorganisation am Beispiel der Cuxhavener Nachrichten (1997) 0.16
    0.15848093 = sum of:
      0.15848093 = product of:
        0.9905058 = sum of:
          0.15359095 = weight(abstract_txt:20.000 in 1483) [ClassicSimilarity], result of:
            0.15359095 = score(doc=1483,freq=1.0), product of:
              0.14121431 = queryWeight, product of:
                1.0263202 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.01581317 = queryNorm
              1.0876443 = fieldWeight in 1483, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.125 = fieldNorm(doc=1483)
          0.11588794 = weight(abstract_txt:einer in 1483) [ClassicSimilarity], result of:
            0.11588794 = score(doc=1483,freq=2.0), product of:
              0.1687981 = queryWeight, product of:
                2.7485456 = boost
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.01581317 = queryNorm
              0.68654764 = fieldWeight in 1483, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.125 = fieldNorm(doc=1483)
          0.12184966 = weight(abstract_txt:werden in 1483) [ClassicSimilarity], result of:
            0.12184966 = score(doc=1483,freq=3.0), product of:
              0.16051297 = queryWeight, product of:
                2.8949938 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.01581317 = queryNorm
              0.7591266 = fieldWeight in 1483, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.125 = fieldNorm(doc=1483)
          0.59917724 = weight(abstract_txt:nachrichten in 1483) [ClassicSimilarity], result of:
            0.59917724 = score(doc=1483,freq=1.0), product of:
              0.63590014 = queryWeight, product of:
                5.3347445 = boost
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.01581317 = queryNorm
              0.9422505 = fieldWeight in 1483, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.125 = fieldNorm(doc=1483)
        0.16 = coord(4/25)
    
  2. Zoller, P.: ¬Die wohldurchdachte Aufbereitung des Informationsangebotes : Teil einer computergestützten Sachbearbeitung (1992) 0.16
    0.15812293 = sum of:
      0.15812293 = product of:
        0.7906146 = sum of:
          0.033250187 = weight(abstract_txt:diese in 1333) [ClassicSimilarity], result of:
            0.033250187 = score(doc=1333,freq=1.0), product of:
              0.07011899 = queryWeight, product of:
                1.0227662 = boost
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.01581317 = queryNorm
              0.47419658 = fieldWeight in 1333, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.109375 = fieldNorm(doc=1333)
          0.09982618 = weight(abstract_txt:dokumente in 1333) [ClassicSimilarity], result of:
            0.09982618 = score(doc=1333,freq=1.0), product of:
              0.14592709 = queryWeight, product of:
                1.4754567 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.01581317 = queryNorm
              0.68408257 = fieldWeight in 1333, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.109375 = fieldNorm(doc=1333)
          0.071702 = weight(abstract_txt:einer in 1333) [ClassicSimilarity], result of:
            0.071702 = score(doc=1333,freq=1.0), product of:
              0.1687981 = queryWeight, product of:
                2.7485456 = boost
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.01581317 = queryNorm
              0.42477968 = fieldWeight in 1333, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.109375 = fieldNorm(doc=1333)
          0.061556194 = weight(abstract_txt:werden in 1333) [ClassicSimilarity], result of:
            0.061556194 = score(doc=1333,freq=1.0), product of:
              0.16051297 = queryWeight, product of:
                2.8949938 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.01581317 = queryNorm
              0.3834967 = fieldWeight in 1333, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.109375 = fieldNorm(doc=1333)
          0.5242801 = weight(abstract_txt:nachrichten in 1333) [ClassicSimilarity], result of:
            0.5242801 = score(doc=1333,freq=1.0), product of:
              0.63590014 = queryWeight, product of:
                5.3347445 = boost
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.01581317 = queryNorm
              0.8244692 = fieldWeight in 1333, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.109375 = fieldNorm(doc=1333)
        0.2 = coord(5/25)
    
  3. Start von Wikinews (2005) 0.15
    0.15191467 = sum of:
      0.15191467 = product of:
        1.2659556 = sum of:
          0.053334672 = weight(abstract_txt:kann in 3300) [ClassicSimilarity], result of:
            0.053334672 = score(doc=3300,freq=1.0), product of:
              0.0757492 = queryWeight, product of:
                1.063035 = boost
                4.5062113 = idf(docFreq=1326, maxDocs=44218)
                0.01581317 = queryNorm
              0.7040955 = fieldWeight in 3300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5062113 = idf(docFreq=1326, maxDocs=44218)
                0.15625 = fieldNorm(doc=3300)
          0.15341529 = weight(abstract_txt:jeder in 3300) [ClassicSimilarity], result of:
            0.15341529 = score(doc=3300,freq=1.0), product of:
              0.15320893 = queryWeight, product of:
                1.5118216 = boost
                6.4086204 = idf(docFreq=197, maxDocs=44218)
                0.01581317 = queryNorm
              1.001347 = fieldWeight in 3300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4086204 = idf(docFreq=197, maxDocs=44218)
                0.15625 = fieldNorm(doc=3300)
          1.0592057 = weight(abstract_txt:nachrichten in 3300) [ClassicSimilarity], result of:
            1.0592057 = score(doc=3300,freq=2.0), product of:
              0.63590014 = queryWeight, product of:
                5.3347445 = boost
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.01581317 = queryNorm
              1.6656792 = fieldWeight in 3300, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.15625 = fieldNorm(doc=3300)
        0.12 = coord(3/25)
    
  4. Albrecht, C.: ¬Die Entdeckung der Weitschweifigkeit : Über das Glück, mit Markow-Ketten zu rasseln: Die Schriften Claude E. Shannons (2001) 0.15
    0.15016061 = sum of:
      0.15016061 = product of:
        0.5362879 = sum of:
          0.009500054 = weight(abstract_txt:diese in 5643) [ClassicSimilarity], result of:
            0.009500054 = score(doc=5643,freq=1.0), product of:
              0.07011899 = queryWeight, product of:
                1.0227662 = boost
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.01581317 = queryNorm
              0.13548474 = fieldWeight in 5643, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3355117 = idf(docFreq=1573, maxDocs=44218)
                0.03125 = fieldNorm(doc=5643)
          0.07679547 = weight(abstract_txt:nachricht in 5643) [ClassicSimilarity], result of:
            0.07679547 = score(doc=5643,freq=4.0), product of:
              0.14121431 = queryWeight, product of:
                1.0263202 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.01581317 = queryNorm
              0.54382217 = fieldWeight in 5643, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.03125 = fieldNorm(doc=5643)
          0.015085324 = weight(abstract_txt:kann in 5643) [ClassicSimilarity], result of:
            0.015085324 = score(doc=5643,freq=2.0), product of:
              0.0757492 = queryWeight, product of:
                1.063035 = boost
                4.5062113 = idf(docFreq=1326, maxDocs=44218)
                0.01581317 = queryNorm
              0.19914828 = fieldWeight in 5643, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5062113 = idf(docFreq=1326, maxDocs=44218)
                0.03125 = fieldNorm(doc=5643)
          0.028521765 = weight(abstract_txt:dokumente in 5643) [ClassicSimilarity], result of:
            0.028521765 = score(doc=5643,freq=1.0), product of:
              0.14592709 = queryWeight, product of:
                1.4754567 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.01581317 = queryNorm
              0.19545217 = fieldWeight in 5643, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.03125 = fieldNorm(doc=5643)
          0.04097257 = weight(abstract_txt:einer in 5643) [ClassicSimilarity], result of:
            0.04097257 = score(doc=5643,freq=4.0), product of:
              0.1687981 = queryWeight, product of:
                2.7485456 = boost
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.01581317 = queryNorm
              0.24273124 = fieldWeight in 5643, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.03125 = fieldNorm(doc=5643)
          0.030462416 = weight(abstract_txt:werden in 5643) [ClassicSimilarity], result of:
            0.030462416 = score(doc=5643,freq=3.0), product of:
              0.16051297 = queryWeight, product of:
                2.8949938 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.01581317 = queryNorm
              0.18978165 = fieldWeight in 5643, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.03125 = fieldNorm(doc=5643)
          0.33495027 = weight(abstract_txt:nachrichten in 5643) [ClassicSimilarity], result of:
            0.33495027 = score(doc=5643,freq=5.0), product of:
              0.63590014 = queryWeight, product of:
                5.3347445 = boost
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.01581317 = queryNorm
              0.52673405 = fieldWeight in 5643, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.03125 = fieldNorm(doc=5643)
        0.28 = coord(7/25)
    
  5. Stock, M.: Neuigkeiten auf der Spur : Searches, Tracks und News Pages bei Factiva (2002) 0.14
    0.13573073 = sum of:
      0.13573073 = product of:
        0.848317 = sum of:
          0.1243154 = weight(abstract_txt:geordnet in 697) [ClassicSimilarity], result of:
            0.1243154 = score(doc=697,freq=1.0), product of:
              0.13406423 = queryWeight, product of:
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.01581317 = queryNorm
              0.92728245 = fieldWeight in 697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.109375 = fieldNorm(doc=697)
          0.12801954 = weight(abstract_txt:tages in 697) [ClassicSimilarity], result of:
            0.12801954 = score(doc=697,freq=1.0), product of:
              0.13671425 = queryWeight, product of:
                1.009835 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.01581317 = queryNorm
              0.9364023 = fieldWeight in 697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.109375 = fieldNorm(doc=697)
          0.071702 = weight(abstract_txt:einer in 697) [ClassicSimilarity], result of:
            0.071702 = score(doc=697,freq=1.0), product of:
              0.1687981 = queryWeight, product of:
                2.7485456 = boost
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.01581317 = queryNorm
              0.42477968 = fieldWeight in 697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.109375 = fieldNorm(doc=697)
          0.5242801 = weight(abstract_txt:nachrichten in 697) [ClassicSimilarity], result of:
            0.5242801 = score(doc=697,freq=1.0), product of:
              0.63590014 = queryWeight, product of:
                5.3347445 = boost
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.01581317 = queryNorm
              0.8244692 = fieldWeight in 697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.538004 = idf(docFreq=63, maxDocs=44218)
                0.109375 = fieldNorm(doc=697)
        0.16 = coord(4/25)