Search (116 results, page 1 of 6)

Gillitzer, B.: Yewno (2017) 0.04

0.038712174 = product of:
  0.11060621 = sum of:
    0.014006989 = weight(_text_:software in 3447) [ClassicSimilarity], result of:
      0.014006989 = score(doc=3447,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.17532499 = fieldWeight in 3447, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.03125 = fieldNorm(doc=3447)
    0.01576312 = weight(_text_:und in 3447) [ClassicSimilarity], result of:
      0.01576312 = score(doc=3447,freq=26.0), product of:
        0.044633795 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02013827 = queryNorm
        0.3531656 = fieldWeight in 3447, product of:
          5.0990195 = tf(freq=26.0), with freq of:
            26.0 = termFreq=26.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.03125 = fieldNorm(doc=3447)
    0.014006989 = weight(_text_:software in 3447) [ClassicSimilarity], result of:
      0.014006989 = score(doc=3447,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.17532499 = fieldWeight in 3447, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.03125 = fieldNorm(doc=3447)
    0.033800744 = weight(_text_:methoden in 3447) [ClassicSimilarity], result of:
      0.033800744 = score(doc=3447,freq=4.0), product of:
        0.10436003 = queryWeight, product of:
          5.1821747 = idf(docFreq=674, maxDocs=44218)
          0.02013827 = queryNorm
        0.32388592 = fieldWeight in 3447, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          5.1821747 = idf(docFreq=674, maxDocs=44218)
          0.03125 = fieldNorm(doc=3447)
    0.015383439 = weight(_text_:der in 3447) [ClassicSimilarity], result of:
      0.015383439 = score(doc=3447,freq=24.0), product of:
        0.044984195 = queryWeight, product of:
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.02013827 = queryNorm
        0.34197432 = fieldWeight in 3447, product of:
          4.8989797 = tf(freq=24.0), with freq of:
            24.0 = termFreq=24.0
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.03125 = fieldNorm(doc=3447)
    0.014006989 = weight(_text_:software in 3447) [ClassicSimilarity], result of:
      0.014006989 = score(doc=3447,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.17532499 = fieldWeight in 3447, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.03125 = fieldNorm(doc=3447)
    0.0036379434 = product of:
      0.01091383 = sum of:
        0.01091383 = weight(_text_:22 in 3447) [ClassicSimilarity], result of:
          0.01091383 = score(doc=3447,freq=2.0), product of:
            0.07052079 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02013827 = queryNorm
            0.15476047 = fieldWeight in 3447, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=3447)
      0.33333334 = coord(1/3)
  0.35 = coord(7/20)
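
The explain tree above can be reproduced by hand. The following is a minimal sketch, assuming the standard ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the function and variable names are illustrative and not part of the search system, and the constants are copied from the listing for doc 3447.

```python
import math

def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_tf(freq: float) -> float:
    """Term frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    """score = queryWeight * fieldWeight for a single term clause."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = idf * query_norm                      # idf * queryNorm
    field_weight = classic_tf(freq) * idf * field_norm   # tf * idf * fieldNorm
    return query_weight * field_weight

def coord(matched: int, total: int) -> float:
    """Fraction of query clauses that matched."""
    return matched / total

# Constants taken from the explain output for doc 3447.
MAX_DOCS = 44218
QUERY_NORM = 0.02013827
FIELD_NORM = 0.03125

software = term_score(2.0, 2274, MAX_DOCS, QUERY_NORM, FIELD_NORM)
und = term_score(26.0, 13101, MAX_DOCS, QUERY_NORM, FIELD_NORM)
methoden = term_score(4.0, 674, MAX_DOCS, QUERY_NORM, FIELD_NORM)
der = term_score(24.0, 12875, MAX_DOCS, QUERY_NORM, FIELD_NORM)
# The "22" clause sits in a nested sub-query where 1 of 3 clauses matched.
t22 = term_score(2.0, 3622, MAX_DOCS, QUERY_NORM, FIELD_NORM) * coord(1, 3)

# "software" is listed three times in the tree; 7 of 20 top-level clauses matched.
total = (3 * software + und + methoden + der + t22) * coord(7, 20)
print(f"software={software:.9f} total={total:.9f}")
```

The values agree with the listing up to rounding, because the listing was computed with 32-bit floats. The sketch also shows why the rare term "methoden" (docFreq=674) contributes more than "und" (docFreq=13101) despite occurring far less often in the record: idf enters the product twice, once in queryWeight and once in fieldWeight.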

Abstract: Yewno finds topics and concepts (search terms and their abstractions) in English-language digital texts using methods of machine learning and artificial intelligence. As the result of your search query, the concepts relating to your query are presented in their many factual relationships as a graphical network that you can navigate easily. Hidden thematic relationships are also made visible here, leading from the familiar to new discoveries. During a pilot phase, you can search an interdisciplinary selection of current English-language scholarly journals from a wide range of disciplines. The articles belonging to the topics are displayed directly as excerpts and can in most cases be opened directly as full text.
"The Bayerische Staatsbibliothek is testing the semantic "Discovery Service" Yewno as an additional subject search engine for digital full texts. The service is available at the following link: https://www.bsb-muenchen.de/recherche-und-service/suchen-und-finden/yewno/. In Yewno, identifying the topics a text is about relies solely on methods of artificial intelligence and machine learning. Topics are not assigned to a text as a whole, as in classical catalogue systems, but to the individual passage. Entering a search word or topic, which Yewno calls a "concept", immediately produces a graphical display of a semantic network of relevant concepts and the relationships between them. This makes it possible to navigate via thematic relationships all the way to the matching passages in the text, which are then shown as so-called snippets. In the Bayerische Staatsbibliothek's test application, Yewno currently searches 40 million English-language documents from publications of renowned academic publishers such as Cambridge University Press, Oxford University Press, Wiley, Sage and Springer, as well as documents available in Open Access. After the three-month test phase, user feedback will be evaluated first. Whether and when the step from the classical search engine to the semantic "Discovery Service" will then come, and what role applications such as Yewno will play in this context, cannot yet be foreseen. The Yewno software was developed by the startup of the same name in collaboration with Stanford University, with which the Bayerische Staatsbibliothek also cooperates closely." [Inetbib posting of 22.02.2017].
Date: 22. 2.2017 10:16:49
Source: https://www.bsb-muenchen.de/recherche-und-service/suchen-und-finden/yewno/

Knorz, G.; Rein, B.: Semantische Suche in einer Hochschulontologie (2005) 0.03

0.034255974 = product of:
  0.11418658 = sum of:
    0.02451223 = weight(_text_:software in 1852) [ClassicSimilarity], result of:
      0.02451223 = score(doc=1852,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.30681872 = fieldWeight in 1852, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1852)
    0.01874063 = weight(_text_:und in 1852) [ClassicSimilarity], result of:
      0.01874063 = score(doc=1852,freq=12.0), product of:
        0.044633795 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02013827 = queryNorm
        0.41987535 = fieldWeight in 1852, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1852)
    0.02451223 = weight(_text_:software in 1852) [ClassicSimilarity], result of:
      0.02451223 = score(doc=1852,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.30681872 = fieldWeight in 1852, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1852)
    0.015542857 = weight(_text_:der in 1852) [ClassicSimilarity], result of:
      0.015542857 = score(doc=1852,freq=8.0), product of:
        0.044984195 = queryWeight, product of:
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.02013827 = queryNorm
        0.34551817 = fieldWeight in 1852, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1852)
    0.02451223 = weight(_text_:software in 1852) [ClassicSimilarity], result of:
      0.02451223 = score(doc=1852,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.30681872 = fieldWeight in 1852, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1852)
    0.006366401 = product of:
      0.019099202 = sum of:
        0.019099202 = weight(_text_:22 in 1852) [ClassicSimilarity], result of:
          0.019099202 = score(doc=1852,freq=2.0), product of:
            0.07052079 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02013827 = queryNorm
            0.2708308 = fieldWeight in 1852, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1852)
      0.33333334 = coord(1/3)
  0.3 = coord(6/20)

Abstract: Ontologies are used to provide, through semantic grounding, a fundamentally better basis than the current state of the art offers, particularly for document retrieval. The paper presents an ontology developed and deployed at FH Darmstadt that is intended to cover the domain of higher education broadly while also describing it semantically in fine detail. The problem of semantic search is that it should be as easy for information seekers to use as common search engines, while at the same time delivering high-quality results on the basis of the elaborate information model. The paper describes which capabilities the software used, K-Infinity, provides, and the concept by which these capabilities are applied to a semantic search for documents and other information units (persons, events, projects, etc.).
Date: 11. 2.2011 18:22:58
Source: Information - Wissenschaft und Praxis. 56(2005) no.5/6, pp.281-290

Knorz, G.; Rein, B.: Semantische Suche in einer Hochschulontologie : Ontologie-basiertes Information-Filtering und -Retrieval mit relationalen Datenbanken (2005) 0.03

0.034255974 = product of:
  0.11418658 = sum of:
    0.02451223 = weight(_text_:software in 4324) [ClassicSimilarity], result of:
      0.02451223 = score(doc=4324,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.30681872 = fieldWeight in 4324, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4324)
    0.01874063 = weight(_text_:und in 4324) [ClassicSimilarity], result of:
      0.01874063 = score(doc=4324,freq=12.0), product of:
        0.044633795 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02013827 = queryNorm
        0.41987535 = fieldWeight in 4324, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4324)
    0.02451223 = weight(_text_:software in 4324) [ClassicSimilarity], result of:
      0.02451223 = score(doc=4324,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.30681872 = fieldWeight in 4324, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4324)
    0.015542857 = weight(_text_:der in 4324) [ClassicSimilarity], result of:
      0.015542857 = score(doc=4324,freq=8.0), product of:
        0.044984195 = queryWeight, product of:
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.02013827 = queryNorm
        0.34551817 = fieldWeight in 4324, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4324)
    0.02451223 = weight(_text_:software in 4324) [ClassicSimilarity], result of:
      0.02451223 = score(doc=4324,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.30681872 = fieldWeight in 4324, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0546875 = fieldNorm(doc=4324)
    0.006366401 = product of:
      0.019099202 = sum of:
        0.019099202 = weight(_text_:22 in 4324) [ClassicSimilarity], result of:
          0.019099202 = score(doc=4324,freq=2.0), product of:
            0.07052079 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02013827 = queryNorm
            0.2708308 = fieldWeight in 4324, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4324)
      0.33333334 = coord(1/3)
  0.3 = coord(6/20)

Abstract: Ontologies are used to provide, through semantic grounding, a fundamentally better basis than the current state of the art offers, particularly for document retrieval. The paper presents an ontology developed and deployed at FH Darmstadt that is intended to cover the domain of higher education broadly while also describing it semantically in fine detail. The problem of semantic search is that it should be as easy for information seekers to use as common search engines, while at the same time delivering high-quality results on the basis of the elaborate information model. The paper describes which capabilities the software used, K-Infinity, provides, and the concept by which these capabilities are applied to a semantic search for documents and other information units (persons, events, projects, etc.).
Date: 11. 2.2011 18:22:25

Surfing versus Drilling for knowledge in science : When should you use your computer? When should you use your brain? (2018) 0.03

0.0321239 = product of:
  0.10707967 = sum of:
    0.0114324065 = weight(_text_:23 in 4564) [ClassicSimilarity], result of:
      0.0114324065 = score(doc=4564,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.15839456 = fieldWeight in 4564, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.03125 = fieldNorm(doc=4564)
    0.0114324065 = weight(_text_:23 in 4564) [ClassicSimilarity], result of:
      0.0114324065 = score(doc=4564,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.15839456 = fieldWeight in 4564, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.03125 = fieldNorm(doc=4564)
    0.024260817 = weight(_text_:software in 4564) [ClassicSimilarity], result of:
      0.024260817 = score(doc=4564,freq=6.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.3036718 = fieldWeight in 4564, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.03125 = fieldNorm(doc=4564)
    0.0114324065 = weight(_text_:23 in 4564) [ClassicSimilarity], result of:
      0.0114324065 = score(doc=4564,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.15839456 = fieldWeight in 4564, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.03125 = fieldNorm(doc=4564)
    0.024260817 = weight(_text_:software in 4564) [ClassicSimilarity], result of:
      0.024260817 = score(doc=4564,freq=6.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.3036718 = fieldWeight in 4564, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.03125 = fieldNorm(doc=4564)
    0.024260817 = weight(_text_:software in 4564) [ClassicSimilarity], result of:
      0.024260817 = score(doc=4564,freq=6.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.3036718 = fieldWeight in 4564, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.03125 = fieldNorm(doc=4564)
  0.3 = coord(6/20)

Abstract: For this second Special Issue of Infozine, we have invited students, teachers, researchers, and software developers to share their opinions about one or the other aspect of this broad topic: how to balance drilling (for depth) vs. surfing (for breadth) in scientific learning, teaching, research, and software design - and how the modern digital-liberal system affects our ability to strike this balance. This special issue is meant to provide a wide and unbiased spectrum of possible viewpoints on the topic, helping readers to define lucidly their own position and information use behavior.
Content: Editorial: Surfing versus Drilling for Knowledge in Science: When should you use your computer? When should you use your brain? Blaise Pascal: Les deux infinis - The two infinities / Philippe Hünenberger and Oliver Renn - "Surfing" vs. "drilling" in the modern scientific world / Antonio Loprieno - Of millimeter paper and machine learning / Philippe Hünenberger - From one to many, from breadth to depth - industrializing research / Janne Soetbeer - "Deep drilling" requires "surfing" / Gerd Folkers and Laura Folkers - Surfing vs. drilling in science: A delicate balance / Alzbeta Kubincová - Digital trends in academia - for the sake of critical thinking or comfort? / Leif-Thore Deck - I diagnose, therefore I am a Doctor? Will drilling computer software replace human doctors in the future? / Yi Zheng - Surfing versus drilling in fundamental research / Wilfred van Gunsteren - Using brain vs. brute force in computational studies of biological systems / Arieh Warshel - Laboratory literature boards in the digital age / Jeffrey Bode - Research strategies in computational chemistry / Sereina Riniker - Surfing on the hype waves or drilling deep for knowledge? A perspective from industry / Nadine Schneider and Nikolaus Stiefl - The use and purpose of articles and scientists / Philip Mark Lund - Can you look at papers like artwork? / Oliver Renn - Dynamite fishing in the data swamp / Frank Perabo - Streetlights, augmented intelligence, and information discovery / Jeffrey Saffer and Vicki Burnett - "Yes Dave. Happy to do that for you." Why AI, machine learning, and blockchain will lead to deeper "drilling" / Michiel Kolman and Sjors de Heuvel - Trends in scientific document search / Stefan Geißler - Power tools for text mining / Jane Reed - Publishing and patenting: Navigating the differences to ensure search success / Paul Peters
Date: 26.11.2018 12:48:23

Mandalka, M.: Open semantic search zum unabhängigen und datenschutzfreundlichen Erschliessen von Dokumenten (2015) 0.03
```
0.029589036 = product of:
  0.098630115 = sum of:
    0.018195612 = weight(_text_:software in 2133) [ClassicSimilarity], result of:
      0.018195612 = score(doc=2133,freq=6.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.22775385 = fieldWeight in 2133, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0234375 = fieldNorm(doc=2133)
    0.017959423 = weight(_text_:und in 2133) [ClassicSimilarity], result of:
      0.017959423 = score(doc=2133,freq=60.0), product of:
        0.044633795 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02013827 = queryNorm
        0.40237278 = fieldWeight in 2133, product of:
          7.745967 = tf(freq=60.0), with freq of:
            60.0 = termFreq=60.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.0234375 = fieldNorm(doc=2133)
    0.018195612 = weight(_text_:software in 2133) [ClassicSimilarity], result of:
      0.018195612 = score(doc=2133,freq=6.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.22775385 = fieldWeight in 2133, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0234375 = fieldNorm(doc=2133)
    0.017925551 = weight(_text_:methoden in 2133) [ClassicSimilarity], result of:
      0.017925551 = score(doc=2133,freq=2.0), product of:
        0.10436003 = queryWeight, product of:
          5.1821747 = idf(docFreq=674, maxDocs=44218)
          0.02013827 = queryNorm
        0.17176645 = fieldWeight in 2133, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          5.1821747 = idf(docFreq=674, maxDocs=44218)
          0.0234375 = fieldNorm(doc=2133)
    0.0081583 = weight(_text_:der in 2133) [ClassicSimilarity], result of:
      0.0081583 = score(doc=2133,freq=12.0), product of:
        0.044984195 = queryWeight, product of:
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.02013827 = queryNorm
        0.18135926 = fieldWeight in 2133, product of:
          3.4641016 = tf(freq=12.0), with freq of:
            12.0 = termFreq=12.0
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.0234375 = fieldNorm(doc=2133)
    0.018195612 = weight(_text_:software in 2133) [ClassicSimilarity], result of:
      0.018195612 = score(doc=2133,freq=6.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.22775385 = fieldWeight in 2133, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0234375 = fieldNorm(doc=2133)
  0.3 = coord(6/20)
```
Abstract

Whether it is a larger leak, or pooling together or (re)indexing more extensive (collaborative) research or archives: more and more often, journalism has to make large mountains of data and documents accessible. Analysis tools integrated into a search engine help (semi-)automatically.

Content

"Open Semantic Desktop Search: To coincide with the Netzwerk Recherche conference, the desktop search engine Open Semantic Desktop Search, for independent and privacy-friendly indexing and analysis of mountains of documents, is now available for the first time in a German-language version as well. Thanks to its powerful open source foundation, the free software, which is based on Debian GNU/Linux and Apache Solr, can be downloaded, used, shared and further developed free of charge as a virtual machine that runs on Linux, Windows or Mac. Indexing mountains of documents: Whether it is a larger leak, or pooling together or (re)indexing more extensive (collaborative) research or archives: every now and then, large mountains of data or documents have to be made accessible that contain so many documents that a person can no longer look through and classify them all one after another. Continuous research on specialist topics also accumulates, over time, larger quantities of digitized or born-digital documents into ever-growing mountains of data, whose information stays easier to find with a search engine for the archive. Modern data analysis tools, combined with enterprise search solutions and research tools built on them, help (semi-)automatically.
Independent searching and analysis of large volumes of data: This lets investigative journalists make hundreds, thousands, hundreds of thousands or even millions of documents, or hundreds of megabytes, gigabytes or even a few terabytes of data, full-text searchable independently, on their own hardware and in a privacy-friendly way. Automatic data enrichment and indexing using background knowledge: In addition, configurable background knowledge is used to automatically generate interactive navigation by names of members of the Bundestag or places in Germany that occur in documents, and text patterns are used to extract structured information such as monetary amounts. Using the Named Entities Manager for persons, organizations, terms and places, users can configure their own research priorities, from which interactive navigation (faceted search) and aggregated overviews are then generated automatically. Automatic data visualization: These can also be visualized, for example the distribution of search results over time as a trend chart, or connections derived from co-mentions in documents as a network or graph.
Automatic text recognition (OCR): Documents that exist not in text format but as images, such as scans, are automatically enriched by optical character recognition (OCR), so that the extracted text becomes searchable too. This also applies to image files or scans embedded in PDF files. Fuzzy search with lists: The research tool or search application "Suche mit Listen" (search with lists) is also integrated; it makes it quick and convenient to check whether each entry in a list has matches in the searchable document collection. Using fuzzy search, the tool also finds results that appear in erroneous or variant spellings. Semantic search and text mining: The tutorial on research, text analysis and document mining, which covers the included research tools and various combined methods for data analysis, enrichment and search, describes in more detail how even a large, heterogeneous and unstructured document collection, or a large number of documents in different formats, can easily be searched and analyzed.
A virtual machine for greater platform independence: The virtual machine Open Semantic Desktop Search, now also available in German and preconfigured with German data such as place names or members of the Bundestag, now also makes it possible to search and analyze documents with the Open Semantic Search search engine on individual desktop computers or notebooks running Windows or Mac OS. As a virtual machine (VM), the Open Semantic Search search engine can not only be installed, for particularly sensitive documents, with the encrypted live system InvestigateIX as a sealed-off system on encrypted external storage media; as a virtual machine for the desktop, it can also simply be integrated under Windows or on a Mac into a system environment that already exists in terms of other software and data, without depending on a search engine server (which is also possible for joint research in a team or for the newsroom). Privacy and independence: greater independence from central IT infrastructures for independent investigative data journalism. This makes investigative research possible with the greatest possible independence: without expensive, central servers that depend on administrators, without expensive software licenses tied to the number of documents, without the Internet and without spying cloud services. Data analysis and search take place on your own computer, not, as with many other solutions, in the so-called cloud."

Source

http://www.linux-community.de/Internal/Nachrichten/Open-Semantic-Search-zum-unabhaengigen-und-datenschutzfreundlichen-Erschliessen-von-Dokumenten

BOND: Assoziativ-OPAC SpiderSearch (2003) 0.02

0.021155374 = product of:
  0.0846215 = sum of:
    0.017508736 = weight(_text_:software in 1795) [ClassicSimilarity], result of:
      0.017508736 = score(doc=1795,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.21915624 = fieldWeight in 1795, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1795)
    0.016394636 = weight(_text_:und in 1795) [ClassicSimilarity], result of:
      0.016394636 = score(doc=1795,freq=18.0), product of:
        0.044633795 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02013827 = queryNorm
        0.3673144 = fieldWeight in 1795, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1795)
    0.017508736 = weight(_text_:software in 1795) [ClassicSimilarity], result of:
      0.017508736 = score(doc=1795,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.21915624 = fieldWeight in 1795, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1795)
    0.015700657 = weight(_text_:der in 1795) [ClassicSimilarity], result of:
      0.015700657 = score(doc=1795,freq=16.0), product of:
        0.044984195 = queryWeight, product of:
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.02013827 = queryNorm
        0.34902605 = fieldWeight in 1795, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1795)
    0.017508736 = weight(_text_:software in 1795) [ClassicSimilarity], result of:
      0.017508736 = score(doc=1795,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.21915624 = fieldWeight in 1795, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0390625 = fieldNorm(doc=1795)
  0.25 = coord(5/20)

Abstract: BOND, a maker of library software, is extending its product range with an innovative new offering, the associative OPAC SpiderSearch. For a given keyword, this graphical Web OPAC looks up associatives, i.e. semantically or linguistically related terms, and arranges them like a spider's web around the central search term. Using the associatives offered, readers can click their way through the library's media holdings easily and visually. They thus quickly and conveniently find relevant media that would be hard to retrieve with conventional search methods. Users are spared laborious thinking about related search terms and adjacent subject areas. SpiderSearch takes this over and, much like surfing through web pages, navigates the user through all topics connected with the search term. Nor is there any need to page laboriously through a huge result list. Using the terms suggested in the semantic network, users can narrow down their topic precisely and receive only matching media in their hit list. SpiderSearch ranks these by relevance, so that readers find the literature they need easily and conveniently. As in the normal Web OPAC, the hit list gives the title, location and availability of each item. To make the media type easy to identify, each item is assigned a corresponding icon. With a mouse click, the user gets detailed information on the item and, optionally, a view of the book cover. SpiderSearch is an add-on module for BOND's BIBLIOTHECA2000 software and builds on the Web OPAC. SpiderSearch is attracting great interest above all among public libraries. The first adopters already offer their readers this new search experience.

Rahmstorf, G.: Integriertes Management inhaltlicher Datenarten (2001) 0.02

0.02111046 = product of:
  0.08444184 = sum of:
    0.017148608 = weight(_text_:23 in 5856) [ClassicSimilarity], result of:
      0.017148608 = score(doc=5856,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.23759183 = fieldWeight in 5856, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.046875 = fieldNorm(doc=5856)
    0.017148608 = weight(_text_:23 in 5856) [ClassicSimilarity], result of:
      0.017148608 = score(doc=5856,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.23759183 = fieldWeight in 5856, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.046875 = fieldNorm(doc=5856)
    0.019673564 = weight(_text_:und in 5856) [ClassicSimilarity], result of:
      0.019673564 = score(doc=5856,freq=18.0), product of:
        0.044633795 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02013827 = queryNorm
        0.4407773 = fieldWeight in 5856, product of:
          4.2426405 = tf(freq=18.0), with freq of:
            18.0 = termFreq=18.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.046875 = fieldNorm(doc=5856)
    0.017148608 = weight(_text_:23 in 5856) [ClassicSimilarity], result of:
      0.017148608 = score(doc=5856,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.23759183 = fieldWeight in 5856, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.046875 = fieldNorm(doc=5856)
    0.013322448 = weight(_text_:der in 5856) [ClassicSimilarity], result of:
      0.013322448 = score(doc=5856,freq=8.0), product of:
        0.044984195 = queryWeight, product of:
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.02013827 = queryNorm
        0.29615843 = fieldWeight in 5856, product of:
          2.828427 = tf(freq=8.0), with freq of:
            8.0 = termFreq=8.0
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.046875 = fieldNorm(doc=5856)
  0.25 = coord(5/20)

Abstract: Content data, in contrast to measurement data, numbers, analog signals and other information, are data that can also be interpreted linguistically. They convey content that can be named. Content data include, for example, order data, advertising copy, product names and patent classifications. Most of the data communicated on the Internet are content data. Content data can be divided into four classes in two groups: knowledge data, comprising formatted data (facts and other data in structured form) and unformatted data (predominantly texts); and access data, comprising naming data (vocabulary, terminology, topics, etc.) and concept data (ordering and meaning structures). Knowledge organization is chiefly concerned with ordering the unmanageable abundance of knowledge and making it retrievable. The discipline therefore deals not only with knowledge itself but also with the means used to order knowledge and make it findable.
Series: Tagungen der Deutschen Gesellschaft für Informationswissenschaft und Informationspraxis; 4
Source: Information Research & Content Management: Orientierung, Ordnung und Organisation im Wissensmarkt; 23. DGI-Online-Tagung der DGI und 53. Jahrestagung der Deutschen Gesellschaft für Informationswissenschaft und Informationspraxis e.V. DGI, Frankfurt am Main, 8.-10.5.2001. Proceedings. Hrsg.: R. Schmidt
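
The score breakdown for this record (doc 5856) can be checked by hand. The following is a minimal Python sketch, assuming the standard Lucene ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))). The function names are illustrative and not part of any search-engine API, and queryNorm is copied from the output rather than derived.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.02013827  # copied from the explain output, not derived here
FIELD_NORM = 0.046875    # fieldNorm(doc=5856), also copied from the output


def idf(doc_freq, max_docs=MAX_DOCS):
    # ClassicSimilarity inverse document frequency
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_weight(freq, doc_freq, field_norm):
    # weight = queryWeight * fieldWeight
    query_weight = idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


# (freq, docFreq) for each matching clause of doc 5856, in listed order:
# "23", "23", "und", "23", "der"
clauses = [(2, 3336), (2, 3336), (18, 13101), (2, 3336), (8, 12875)]

total = sum(term_weight(f, df, FIELD_NORM) for f, df in clauses)
score = total * (5 / 20)  # coord: 5 of 20 query clauses matched
print(round(total, 6), round(score, 6))
```

Up to floating-point rounding, the two printed values should agree with the listed sum (0.08444184) and final score (0.02111046).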

AssoziativOPAC : SpiderSearch von BOND (2003) 0.02

0.020920968 = product of:
  0.08368387 = sum of:
    0.017508736 = weight(_text_:software in 2029) [ClassicSimilarity], result of:
      0.017508736 = score(doc=2029,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.21915624 = fieldWeight in 2029, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2029)
    0.015457011 = weight(_text_:und in 2029) [ClassicSimilarity], result of:
      0.015457011 = score(doc=2029,freq=16.0), product of:
        0.044633795 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02013827 = queryNorm
        0.34630734 = fieldWeight in 2029, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2029)
    0.017508736 = weight(_text_:software in 2029) [ClassicSimilarity], result of:
      0.017508736 = score(doc=2029,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.21915624 = fieldWeight in 2029, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2029)
    0.015700657 = weight(_text_:der in 2029) [ClassicSimilarity], result of:
      0.015700657 = score(doc=2029,freq=16.0), product of:
        0.044984195 = queryWeight, product of:
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.02013827 = queryNorm
        0.34902605 = fieldWeight in 2029, product of:
          4.0 = tf(freq=16.0), with freq of:
            16.0 = termFreq=16.0
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2029)
    0.017508736 = weight(_text_:software in 2029) [ClassicSimilarity], result of:
      0.017508736 = score(doc=2029,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.21915624 = fieldWeight in 2029, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2029)
  0.25 = coord(5/20)

Content: "BOND, a maker of library software, is extending its product range with the associative OPAC SpiderSearch. For a given keyword, this graphical Web OPAC looks up associatives, i.e. semantically or linguistically related terms, and arranges them like a spider's web around the central search term. Using the associatives offered, readers can click their way through the library's media holdings easily and visually. They thus quickly and conveniently find relevant media that would be hard to retrieve with conventional search methods. Users are spared thinking about related search terms and adjacent subject areas: much like surfing through web pages, SpiderSearch navigates the user through all topics connected with the search term. Nor is there any need to page laboriously through a huge result list. Using the terms suggested in the semantic network, users can narrow down their topic precisely and receive only matching media in their hit list. SpiderSearch ranks these by relevance, so that readers find the literature they need easily and conveniently. As in the normal Web OPAC, the hit list gives the title, location and availability of each item. To make the media type easy to identify, each item is assigned a corresponding icon. With a mouse click, the user gets detailed information on the item and, optionally, a view of the book cover. SpiderSearch is an add-on module for BOND's BIBLIOTHECA2000 software and builds on the Web OPAC."

Schek, M.: Automatische Klassifizierung und Visualisierung im Archiv der Süddeutschen Zeitung (2005) 0.02
```
0.018538194 = product of:
  0.074152775 = sum of:
    0.010003355 = weight(_text_:23 in 4884) [ClassicSimilarity], result of:
      0.010003355 = score(doc=4884,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.13859524 = fieldWeight in 4884, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02734375 = fieldNorm(doc=4884)
    0.010003355 = weight(_text_:23 in 4884) [ClassicSimilarity], result of:
      0.010003355 = score(doc=4884,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.13859524 = fieldWeight in 4884, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02734375 = fieldNorm(doc=4884)
    0.023581443 = weight(_text_:und in 4884) [ClassicSimilarity], result of:
      0.023581443 = score(doc=4884,freq=76.0), product of:
        0.044633795 = queryWeight, product of:
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02013827 = queryNorm
        0.5283316 = fieldWeight in 4884, product of:
          8.717798 = tf(freq=76.0), with freq of:
            76.0 = termFreq=76.0
          2.216367 = idf(docFreq=13101, maxDocs=44218)
          0.02734375 = fieldNorm(doc=4884)
    0.010003355 = weight(_text_:23 in 4884) [ClassicSimilarity], result of:
      0.010003355 = score(doc=4884,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.13859524 = fieldWeight in 4884, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02734375 = fieldNorm(doc=4884)
    0.020561269 = weight(_text_:der in 4884) [ClassicSimilarity], result of:
      0.020561269 = score(doc=4884,freq=56.0), product of:
        0.044984195 = queryWeight, product of:
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.02013827 = queryNorm
        0.4570776 = fieldWeight in 4884, product of:
          7.483315 = tf(freq=56.0), with freq of:
            56.0 = termFreq=56.0
          2.2337668 = idf(docFreq=12875, maxDocs=44218)
          0.02734375 = fieldNorm(doc=4884)
  0.25 = coord(5/20)
```
Abstract: Since its founding in 1945, the Süddeutsche Zeitung (SZ) has maintained a press archive that documents the texts of its own editors and of numerous national and international publications and makes them available on request for research purposes. The introduction of electronic data processing began in the early 1990s with the digital storage of, at first, the SZ data. Further technical development from the mid-1990s served two goals: (1) the complete switch from paper filing to digital storage and (2) the transformation from an in-house documentation and information office into an information service provider that is also present on the market. To share the resulting costs and at the same time exploit synergies between archives with related content, the Süddeutscher Verlag and the Bayerischer Rundfunk founded the Dokumentations- und Informationszentrum (DIZ) München GmbH in 1998, in which the press archives of the two partners and the picture archive of the Süddeutscher Verlag were merged. The jointly developed press database made possible cross-site indexing (Lektorat), browser-based searching for editors and external customers on the intranet and Internet, and customer-specific content feeds for publishers, broadcasters and portals. The DIZ press database currently contains 6.9 million articles, each retrievable as HTML or PDF. About 3,500 articles are added daily, of which about 1,000 are indexed. Indexing at DIZ is done not by assigning subject headings to the document but by linking articles to "virtual folders", the dossiers. These are the electronic representation of a paper folder and are the central object of indexing. In contrast to static classification systems, the dossier structure is dynamic and volume-dependent, i.e. new dossiers are created mainly on the basis of current news coverage.
In total the DIZ press database contains about 90,000 dossiers, of which 68,000 are subject topics, persons and institutions. The dossiers are linked to one another to form the "DIZ knowledge network" (DIZ-Wissensnetz).
DIZ defines the knowledge network as its unique selling point and devotes considerable staff resources to updating and quality assurance of the dossiers. After the switch to a fully digitized workflow in April 2001, DIZ identified four starting points for optimizing effort on the input side (indexing) while marketing the knowledge network better on the output side (search): 1. (semi-)automatic classification of press texts (suggestion system); 2. visualization of the knowledge network (topic mapping); 3. (fully) automatic classification and optimization of the knowledge network; 4. new retrieval options (clustering, concept search). Projects 1 and 2, "Automatic Classification and Visualization", started first and were accelerated by two developments. First, the Bayerischer Rundfunk (BR), originally co-founder and 50% shareholder of DIZ München GmbH, decided for strategic reasons to withdraw from the cooperation at the end of 2003. Second, the media crisis, caused by the massive decline in advertising revenue, forced massive savings at the Süddeutscher Verlag as well, together with a search for new sources of revenue. Both led to press documentation capacity falling from originally about 20 (SZ only, excluding the BR share) to about 13 as of 1 January 2004, while the effort spent on maintaining the knowledge network came under increased pressure to justify itself. For projects 1 and 2 this yielded three quantitative and qualitative goals: higher productivity in indexing, better consistency in indexing, and better marketing and more intensive use of the dossiers in search. All three goals were achieved, with productivity in indexing rising in particular. Projects 1 and 2 were successfully completed at the beginning of 2004.
Follow-up projects 3 and 4 have been running since mid-2004 and are to be completed by mid-2005. In what follows, Section 2 describes the product selection and the operation of the automatic classification. Section 3 describes the use of the knowledge-network visualization in indexing and search. Section 4 summarizes the results of projects 1 and 2 and gives an outlook on the goals of projects 3 and 4.

Date: 27. 1.2006 13:23:26

Faaborg, A.; Lagoze, C.: Semantic browsing (2003) 0.02

0.015980618 = product of:
  0.07990309 = sum of:
    0.02451223 = weight(_text_:software in 1026) [ClassicSimilarity], result of:
      0.02451223 = score(doc=1026,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.30681872 = fieldWeight in 1026, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1026)
    0.02451223 = weight(_text_:software in 1026) [ClassicSimilarity], result of:
      0.02451223 = score(doc=1026,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.30681872 = fieldWeight in 1026, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1026)
    0.02451223 = weight(_text_:software in 1026) [ClassicSimilarity], result of:
      0.02451223 = score(doc=1026,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.30681872 = fieldWeight in 1026, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0546875 = fieldNorm(doc=1026)
    0.006366401 = product of:
      0.019099202 = sum of:
        0.019099202 = weight(_text_:22 in 1026) [ClassicSimilarity], result of:
          0.019099202 = score(doc=1026,freq=2.0), product of:
            0.07052079 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02013827 = queryNorm
            0.2708308 = fieldWeight in 1026, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=1026)
      0.33333334 = coord(1/3)
  0.2 = coord(4/20)

Abstract: We have created software applications that allow users to both author and use Semantic Web metadata. To create and use a layer of semantic content on top of the existing Web, we have (1) implemented a user interface that expedites the task of attributing metadata to resources on the Web, and (2) augmented a Web browser to leverage this semantic metadata to provide relevant information and tasks to the user. This project provides a framework for annotating and reorganizing existing files, pages, and sites on the Web that is similar to Vannevar Bush's original concepts of trail blazing and associative indexing.
Source: Research and advanced technology for digital libraries : 7th European Conference, proceedings / ECDL 2003, Trondheim, Norway, August 17-22, 2003
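
In the breakdown for this record (doc 1026), the match on "22" sits one level deeper than the other clauses, so its weight is scaled by an inner coord(1/3) before the outer coord(4/20) is applied. The following is a minimal Python sketch of that arithmetic, taking the clause weights from the explain output as given. Reading the indented block as a nested sub-query is an assumption based on the layout of the output, and the variable names are illustrative.

```python
# Clause weights copied from the explain output for doc 1026
software_weight = 0.02451223   # listed three times, once per matching clause
inner_weight_22 = 0.019099202  # weight of "22" inside the nested block

# Inner coord: 1 of 3 clauses of the nested block matched
nested = inner_weight_22 * (1 / 3)

# Outer coord: 4 of 20 top-level clauses matched
total = 3 * software_weight + nested
score = total * (4 / 20)
print(round(nested, 9), round(total, 8), round(score, 9))
```

Up to floating-point rounding, these values should agree with the listed 0.006366401, 0.07990309 and 0.015980618.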

Fieldhouse, M.; Hancock-Beaulieu, M.: ¬The design of a graphical user interface for a highly interactive information retrieval system (1996) 0.01

0.013277307 = product of:
  0.066386536 = sum of:
    0.02000671 = weight(_text_:23 in 6958) [ClassicSimilarity], result of:
      0.02000671 = score(doc=6958,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.27719048 = fieldWeight in 6958, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.0546875 = fieldNorm(doc=6958)
    0.02000671 = weight(_text_:23 in 6958) [ClassicSimilarity], result of:
      0.02000671 = score(doc=6958,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.27719048 = fieldWeight in 6958, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.0546875 = fieldNorm(doc=6958)
    0.02000671 = weight(_text_:23 in 6958) [ClassicSimilarity], result of:
      0.02000671 = score(doc=6958,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.27719048 = fieldWeight in 6958, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.0546875 = fieldNorm(doc=6958)
    0.006366401 = product of:
      0.019099202 = sum of:
        0.019099202 = weight(_text_:22 in 6958) [ClassicSimilarity], result of:
          0.019099202 = score(doc=6958,freq=2.0), product of:
            0.07052079 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02013827 = queryNorm
            0.2708308 = fieldWeight in 6958, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0546875 = fieldNorm(doc=6958)
      0.33333334 = coord(1/3)
  0.2 = coord(4/20)

Source: Information retrieval: new systems and current research. Proceedings of the 16th Research Colloquium of the British Computer Society Information Retrieval Specialist Group, Drymen, Scotland, 22-23 Mar 94. Ed.: R. Leon

Schatz, B.R.; Johnson, E.H.; Cochrane, P.A.; Chen, H.: Interactive term suggestion for users of digital libraries : using thesauri and co-occurrence lists for information retrieval (1996) 0.01

0.012861458 = product of:
  0.08574305 = sum of:
    0.028581016 = weight(_text_:23 in 6417) [ClassicSimilarity], result of:
      0.028581016 = score(doc=6417,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.3959864 = fieldWeight in 6417, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.078125 = fieldNorm(doc=6417)
    0.028581016 = weight(_text_:23 in 6417) [ClassicSimilarity], result of:
      0.028581016 = score(doc=6417,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.3959864 = fieldWeight in 6417, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.078125 = fieldNorm(doc=6417)
    0.028581016 = weight(_text_:23 in 6417) [ClassicSimilarity], result of:
      0.028581016 = score(doc=6417,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.3959864 = fieldWeight in 6417, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.078125 = fieldNorm(doc=6417)
  0.15 = coord(3/20)

Date: 10. 8.2001 21:23:46

Shiri, A.A.; Revie, C.: Query expansion behavior within a thesaurus-enhanced search environment : a user-centered evaluation (2006) 0.01

0.011414727 = product of:
  0.057073634 = sum of:
    0.017508736 = weight(_text_:software in 56) [ClassicSimilarity], result of:
      0.017508736 = score(doc=56,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.21915624 = fieldWeight in 56, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0390625 = fieldNorm(doc=56)
    0.017508736 = weight(_text_:software in 56) [ClassicSimilarity], result of:
      0.017508736 = score(doc=56,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.21915624 = fieldWeight in 56, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0390625 = fieldNorm(doc=56)
    0.017508736 = weight(_text_:software in 56) [ClassicSimilarity], result of:
      0.017508736 = score(doc=56,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.21915624 = fieldWeight in 56, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0390625 = fieldNorm(doc=56)
    0.0045474293 = product of:
      0.013642288 = sum of:
        0.013642288 = weight(_text_:22 in 56) [ClassicSimilarity], result of:
          0.013642288 = score(doc=56,freq=2.0), product of:
            0.07052079 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02013827 = queryNorm
            0.19345059 = fieldWeight in 56, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=56)
      0.33333334 = coord(1/3)
  0.2 = coord(4/20)

Abstract: The study reported here investigated the query expansion behavior of end-users interacting with a thesaurus-enhanced search system on the Web. Two groups, namely academic staff and postgraduate students, were recruited into this study. Data were collected from 90 searches performed by 30 users using the OVID interface to the CAB abstracts database. Data-gathering techniques included questionnaires, screen capturing software, and interviews. The results presented here relate to issues of search-topic and search-term characteristics, number and types of expanded queries, usefulness of thesaurus terms, and behavioral differences between academic staff and postgraduate students in their interaction. The key conclusions drawn were that (a) academic staff chose more narrow and synonymous terms than did postgraduate students, who generally selected broader and related terms; (b) topic complexity affected users' interaction with the thesaurus in that complex topics required more query expansion and search term selection; (c) users' prior topic-search experience appeared to have a significant effect on their selection and evaluation of thesaurus terms; (d) in 50% of the searches where additional terms were suggested from the thesaurus, users stated that they had not been aware of the terms at the beginning of the search; this observation was particularly noticeable in the case of postgraduate students.
Date: 22. 7.2006 16:32:43

Bayer, O.; Höhfeld, S.; Josbächer, F.; Kimm, N.; Kradepohl, I.; Kwiatkowski, M.; Puschmann, C.; Sabbagh, M.; Werner, N.; Vollmer, U.: Evaluation of an ontology-based knowledge-management-system : a case study of Convera RetrievalWare 8.0 (2005) 0.01

0.011142492 = product of:
  0.07428328 = sum of:
    0.024761094 = weight(_text_:software in 624) [ClassicSimilarity], result of:
      0.024761094 = score(doc=624,freq=4.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.30993375 = fieldWeight in 624, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0390625 = fieldNorm(doc=624)
    0.024761094 = weight(_text_:software in 624) [ClassicSimilarity], result of:
      0.024761094 = score(doc=624,freq=4.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.30993375 = fieldWeight in 624, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0390625 = fieldNorm(doc=624)
    0.024761094 = weight(_text_:software in 624) [ClassicSimilarity], result of:
      0.024761094 = score(doc=624,freq=4.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.30993375 = fieldWeight in 624, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0390625 = fieldNorm(doc=624)
  0.15 = coord(3/20)

Abstract: With RetrievalWare 8.0(TM), the American company Convera offers an elaborate software package for information retrieval, information indexing and knowledge management. Convera promises that it can handle different file formats in many different languages. Compared with similar products, one innovation stands out: the ability to prepare and integrate an ontology. One tool in the software package can be used to build ontologies manually, to process existing ontologies and to import them. The processing of search results is also worth mentioning: by means of categorization strategies, search results can be classified dynamically and presented in personalized views. This study presents an evaluation of the functions and components of the system. Technological aspects and modes of operation beneath the surface of Convera RetrievalWare are analysed, with a focus on the creation of libraries and thesauri and on the problems posed by integrating an existing thesaurus. Broader aspects such as usability and system ergonomics are included in the examination as well.

Nagao, M.: Knowledge and inference (1990) 0.01

0.011142492 = product of:
  0.07428328 = sum of:
    0.024761094 = weight(_text_:software in 3304) [ClassicSimilarity], result of:
      0.024761094 = score(doc=3304,freq=4.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.30993375 = fieldWeight in 3304, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3304)
    0.024761094 = weight(_text_:software in 3304) [ClassicSimilarity], result of:
      0.024761094 = score(doc=3304,freq=4.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.30993375 = fieldWeight in 3304, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3304)
    0.024761094 = weight(_text_:software in 3304) [ClassicSimilarity], result of:
      0.024761094 = score(doc=3304,freq=4.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.30993375 = fieldWeight in 3304, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.0390625 = fieldNorm(doc=3304)
  0.15 = coord(3/20)
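
The arithmetic in the explain tree above can be reproduced by hand. The following is a minimal sketch, assuming the usual ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the function names are illustrative and not part of Lucene, and queryNorm and fieldNorm are simply copied from the tree rather than derived.

```python
import math


def idf(doc_freq, max_docs):
    # Assumed ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    # score = queryWeight * fieldWeight, as shown in the explain tree.
    tf = math.sqrt(freq)                       # tf(freq=4.0) -> 2.0
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm       # idf * queryNorm
    field_weight = tf * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# Values copied from the tree for doc=3304, term "software".
clause = term_score(freq=4.0, doc_freq=2274, max_docs=44218,
                    query_norm=0.02013827, field_norm=0.0390625)

# Three identical clauses matched, and coord(3/20) scales their sum.
total = 3 * clause * (3 / 20)

print(round(clause, 6), round(total, 6))
```

Small differences in the last digits are to be expected, since the explain output appears to be printed from single-precision floats while Python computes in double precision.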

Abstract: Knowledge and Inference discusses an important problem for software systems: How do we treat knowledge and ideas on a computer and how do we use inference to solve problems on a computer? The book talks about the problems of knowledge and inference for the purpose of merging artificial intelligence and library science. The book begins by clarifying the concept of "knowledge" from many points of view, followed by a chapter on the current state of library science and the place of artificial intelligence in library science. Subsequent chapters cover central topics in artificial intelligence: search and problem solving, methods of making proofs, and the use of knowledge in looking for a proof. There is also a discussion of how to use the knowledge system. The final chapter describes a popular expert system. It describes tools for building expert systems using an example based on Expert Systems-A Practical Introduction by P. Sell (Macmillan, 1985). This type of software is called an "expert system shell." This book was written as a textbook for undergraduate students, covering only the basics but explaining them in as much detail as possible.

Hoang, H.H.; Tjoa, A.M.: The state of the art of ontology-based query systems : a comparison of existing approaches (2006) 0.01

0.010289166 = product of:
  0.06859444 = sum of:
    0.022864813 = weight(_text_:23 in 792) [ClassicSimilarity], result of:
      0.022864813 = score(doc=792,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.31678912 = fieldWeight in 792, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.0625 = fieldNorm(doc=792)
    0.022864813 = weight(_text_:23 in 792) [ClassicSimilarity], result of:
      0.022864813 = score(doc=792,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.31678912 = fieldWeight in 792, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.0625 = fieldNorm(doc=792)
    0.022864813 = weight(_text_:23 in 792) [ClassicSimilarity], result of:
      0.022864813 = score(doc=792,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.31678912 = fieldWeight in 792, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.0625 = fieldNorm(doc=792)
  0.15 = coord(3/20)

Date: 28.11.2016 18:23:55

Walker, S.: Subject access in online catalogues (1991) 0.01

0.010289166 = product of:
  0.06859444 = sum of:
    0.022864813 = weight(_text_:23 in 5690) [ClassicSimilarity], result of:
      0.022864813 = score(doc=5690,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.31678912 = fieldWeight in 5690, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.0625 = fieldNorm(doc=5690)
    0.022864813 = weight(_text_:23 in 5690) [ClassicSimilarity], result of:
      0.022864813 = score(doc=5690,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.31678912 = fieldWeight in 5690, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.0625 = fieldNorm(doc=5690)
    0.022864813 = weight(_text_:23 in 5690) [ClassicSimilarity], result of:
      0.022864813 = score(doc=5690,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.31678912 = fieldWeight in 5690, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.0625 = fieldNorm(doc=5690)
  0.15 = coord(3/20)

Pages: S.23-33

Bernier-Colborne, G.: Identifying semantic relations in a specialized corpus through distributional analysis of a cooccurrence tensor (2014) 0.01

0.010289166 = product of:
  0.06859444 = sum of:
    0.022864813 = weight(_text_:23 in 2153) [ClassicSimilarity], result of:
      0.022864813 = score(doc=2153,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.31678912 = fieldWeight in 2153, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.0625 = fieldNorm(doc=2153)
    0.022864813 = weight(_text_:23 in 2153) [ClassicSimilarity], result of:
      0.022864813 = score(doc=2153,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.31678912 = fieldWeight in 2153, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.0625 = fieldNorm(doc=2153)
    0.022864813 = weight(_text_:23 in 2153) [ClassicSimilarity], result of:
      0.022864813 = score(doc=2153,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.31678912 = fieldWeight in 2153, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.0625 = fieldNorm(doc=2153)
  0.15 = coord(3/20)

Source: Proceedings of the Third Joint Conference on Lexical and Computational Semantics (*SEM 2014), Dublin, Ireland, August 23-24 2014

Jiang, Y.; Bai, W.; Zhang, X.; Hu, J.: Wikipedia-based information content and semantic similarity computation (2017) 0.01

0.009950917 = product of:
  0.049754586 = sum of:
    0.014290508 = weight(_text_:23 in 2877) [ClassicSimilarity], result of:
      0.014290508 = score(doc=2877,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.1979932 = fieldWeight in 2877, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2877)
    0.014290508 = weight(_text_:23 in 2877) [ClassicSimilarity], result of:
      0.014290508 = score(doc=2877,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.1979932 = fieldWeight in 2877, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2877)
    0.014290508 = weight(_text_:23 in 2877) [ClassicSimilarity], result of:
      0.014290508 = score(doc=2877,freq=2.0), product of:
        0.07217676 = queryWeight, product of:
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.02013827 = queryNorm
        0.1979932 = fieldWeight in 2877, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5840597 = idf(docFreq=3336, maxDocs=44218)
          0.0390625 = fieldNorm(doc=2877)
    0.0068830615 = product of:
      0.013766123 = sum of:
        0.013766123 = weight(_text_:29 in 2877) [ClassicSimilarity], result of:
          0.013766123 = score(doc=2877,freq=2.0), product of:
            0.070840135 = queryWeight, product of:
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.02013827 = queryNorm
            0.19432661 = fieldWeight in 2877, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5176873 = idf(docFreq=3565, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2877)
      0.5 = coord(1/2)
  0.2 = coord(4/20)

Date: 23.1.2017 14:06:29

Bradford, R.B.: Relationship discovery in large text collections using Latent Semantic Indexing (2006) 0.01

0.009131783 = product of:
  0.045658913 = sum of:
    0.014006989 = weight(_text_:software in 1163) [ClassicSimilarity], result of:
      0.014006989 = score(doc=1163,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.17532499 = fieldWeight in 1163, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.03125 = fieldNorm(doc=1163)
    0.014006989 = weight(_text_:software in 1163) [ClassicSimilarity], result of:
      0.014006989 = score(doc=1163,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.17532499 = fieldWeight in 1163, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.03125 = fieldNorm(doc=1163)
    0.014006989 = weight(_text_:software in 1163) [ClassicSimilarity], result of:
      0.014006989 = score(doc=1163,freq=2.0), product of:
        0.07989157 = queryWeight, product of:
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.02013827 = queryNorm
        0.17532499 = fieldWeight in 1163, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.9671519 = idf(docFreq=2274, maxDocs=44218)
          0.03125 = fieldNorm(doc=1163)
    0.0036379434 = product of:
      0.01091383 = sum of:
        0.01091383 = weight(_text_:22 in 1163) [ClassicSimilarity], result of:
          0.01091383 = score(doc=1163,freq=2.0), product of:
            0.07052079 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.02013827 = queryNorm
            0.15476047 = fieldWeight in 1163, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=1163)
      0.33333334 = coord(1/3)
  0.2 = coord(4/20)

Abstract: This paper addresses the problem of information discovery in large collections of text. For users, one of the key problems in working with such collections is determining where to focus their attention. In selecting documents for examination, users must be able to formulate reasonably precise queries. Queries that are too broad will greatly reduce the efficiency of information discovery efforts by overwhelming the users with peripheral information. In order to formulate efficient queries, a mechanism is needed to automatically alert users regarding potentially interesting information contained within the collection. This paper presents the results of an experiment designed to test one approach to generation of such alerts. The technique of latent semantic indexing (LSI) is used to identify relationships among entities of interest. Entity extraction software is used to pre-process the text of the collection so that the LSI space contains representation vectors for named entities in addition to those for individual terms. In the LSI space, the cosine of the angle between the representation vectors for two entities captures important information regarding the degree of association of those two entities. For appropriate choices of entities, determining the entity pairs with the highest mutual cosine values yields valuable information regarding the contents of the text collection. The test database used for the experiment consists of 150,000 news articles. The proposed approach for alert generation is tested using a counterterrorism analysis example. The approach is shown to have significant potential for aiding users in rapidly focusing on information of potential importance in large text collections. The approach also has value in identifying possible use of aliases.
Source: Proceedings of the Fourth Workshop on Link Analysis, Counterterrorism, and Security, SIAM Data Mining Conference, Bethesda, MD, 20-22 April, 2006. [http://www.siam.org/meetings/sdm06/workproceed/Link%20Analysis/15.pdf]
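
The ranking step described in the abstract above (finding the entity pairs with the highest mutual cosine values) can be sketched as follows. This is only an illustration under stated assumptions: the entity names and three-dimensional vectors are hypothetical stand-ins for vectors that would come from an LSI space, and the LSI decomposition itself is not shown.

```python
import math
from itertools import combinations


def cosine(u, v):
    # Cosine of the angle between two vectors (both assumed non-zero).
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def top_entity_pairs(vectors, k):
    # Score every unordered pair of entities and keep the k highest cosines.
    scored = [(cosine(vectors[a], vectors[b]), a, b)
              for a, b in combinations(sorted(vectors), 2)]
    scored.sort(reverse=True)
    return scored[:k]


# Hypothetical entity vectors, invented purely for illustration.
entity_vectors = {
    "entity_a": [1.0, 0.0, 0.0],
    "entity_b": [0.9, 0.1, 0.0],
    "entity_c": [0.0, 1.0, 0.0],
}

best = top_entity_pairs(entity_vectors, k=1)
print(best)
```

With these toy vectors the closest pair is entity_a and entity_b, since their vectors point in nearly the same direction.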
