Document (#39134)

Author
Mandalka, M.
Title
Open semantic search zum unabhängigen und datenschutzfreundlichen Erschliessen von Dokumenten
Issue
[07.07.2015].
Source
http://www.linux-community.de/Internal/Nachrichten/Open-Semantic-Search-zum-unabhaengigen-und-datenschutzfreundlichen-Erschliessen-von-Dokumenten
Year
2015
Abstract
Whether it is a major leak, the pooling or (re)indexing of extensive (collaborative) research, or archives: journalists increasingly have to make large piles of data and documents accessible. Analysis tools integrated into a search engine help (semi-)automatically.
Content
"Open Semantic Desktop Search: To coincide with the Netzwerk Recherche conference, the desktop search engine Open Semantic Desktop Search, built for independent and privacy-friendly indexing and analysis of large document collections, is now available in a German-language version for the first time. Thanks to its powerful open source foundation, this free software, which is based on Debian GNU/Linux and Apache Solr, can be downloaded, used, shared and developed further free of charge as a virtual machine that runs under Linux, Windows or Mac. Indexing mountains of documents: Whether it is a major leak, the pooling or (re)indexing of extensive (collaborative) research, or archives: every now and then, large piles of data or documents have to be made accessible that contain so many documents that no person can look through and classify them all one after another. Continuous research on specialist topics also accumulates, over time, large quantities of digitized or born-digital documents; these collections keep growing, and a search engine for the archive keeps their information easy to find. Modern data analysis tools, combined with enterprise search solutions and the research tools built on them, help (semi-)automatically.
Independent searching and analysis of large volumes of data: This lets investigative journalists make hundreds, thousands, hundreds of thousands or even millions of documents, or hundreds of megabytes, gigabytes or even a few terabytes of data, full-text searchable on their own, on their own hardware and in a privacy-friendly way. Automatic data enrichment and indexing using background knowledge: In addition, configurable background knowledge is used to automatically generate an interactive navigation for names of members of the Bundestag or places in Germany that occur in the documents, and text patterns are used to extract structured information such as monetary amounts. With the Named Entities Manager for persons, organizations, terms and places, users can configure their own research priorities, from which an interactive navigation (faceted search) and aggregated overviews are then generated automatically. Automatic data visualization: These can also be visualized, for example the distribution of search results over time as a trend chart, or connections derived from co-occurrence in documents as a network or graph.
Automatic text recognition (OCR): Documents that are available not as text but as images, such as scans, are automatically enriched by optical character recognition (OCR), which makes the extracted text searchable as well. This also applies to image files or scans embedded in PDF files. Fuzzy search with lists: The research tool or search application "Suche mit Listen" (search with lists) is also integrated; it makes it quick and convenient to check whether each entry in a list has hits in the searchable document collection. Using fuzzy search, the tool also finds results that appear in misspelled or variant spellings. Semantic search and text mining: The tutorial on research, text analysis and document mining, which covers the included research tools and various combined methods for data analysis, enrichment and search, describes in more detail how even a large, heterogeneous and unstructured document collection, or a large number of documents in different formats, can be searched and analyzed easily.
Virtual machine for greater platform independence: The virtual machine Open Semantic Desktop Search, now also available in German and preconfigured with German data such as place names and members of the Bundestag, makes it possible to search and analyze documents with the Open Semantic Search engine on individual desktop computers or notebooks running Windows or OS X (Mac) as well. As a virtual machine (VM), Open Semantic Search can not only be installed for particularly sensitive documents as a sealed-off system on encrypted external storage media, together with the encrypted live system InvestigateIX; as a desktop virtual machine it can also simply be integrated under Windows or on a Mac into an existing system environment with its other software and data, without depending on a search engine server (which remains an option for joint research in a team or for the newsroom). Data protection & independence: Greater independence from central IT infrastructures for independent investigative data journalism. This makes investigative research possible with the greatest possible independence: without expensive, central servers that depend on administrators, without expensive software licenses priced by document count, without the Internet and without snooping cloud services. Data analysis and search take place on your own computer, not in the so-called cloud as with many other solutions."
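The list check attributed above to the "Suche mit Listen" tool can be illustrated with a minimal sketch. It is not the tool's actual implementation (which, per the text, builds on Apache Solr); it uses Python's standard `difflib` instead, handles single-word list entries only, and the names `tokenize`, `fuzzy_list_hits` and the `cutoff` parameter are hypothetical.

```python
import difflib
import re


def tokenize(text):
    """Split a document into lowercase word tokens."""
    return re.findall(r"\w+", text.lower())


def fuzzy_list_hits(entries, documents, cutoff=0.8):
    """For each list entry, return the IDs of documents containing a similar word.

    `documents` maps a document ID to its plain text. `cutoff` is the minimum
    difflib similarity ratio (0..1) that counts as a hit; 0.8 is an assumed
    default, not a value taken from Open Semantic Search.
    """
    hits = {}
    for entry in entries:
        needle = entry.lower()
        matched = []
        for doc_id, text in documents.items():
            # get_close_matches returns tokens whose ratio to needle >= cutoff
            if difflib.get_close_matches(needle, tokenize(text), n=1, cutoff=cutoff):
                matched.append(doc_id)
        hits[entry] = matched
    return hits


# Illustrative data only: "Schmitt" is a spelling variant of the entry "Schmidt".
sample_documents = {
    "scan1.txt": "Invoice from Schmitt in Hamburg",
    "memo.txt": "Meeting notes without names",
}
sample_hits = fuzzy_list_hits(["Schmidt", "Gazprom"], sample_documents)
```

A real deployment would more likely delegate the fuzzy comparison to the search engine's own fuzzy query support rather than compare every token in Python, which does not scale to large collections.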
Footnote
See also: http://www.opensemanticsearch.org/de/.
Theme
Search engines
Semantic context in indexing and retrieval

Similar documents (content)

  1. Krischker, U.: Formale Analyse von Dokumenten (1997) 0.22
    0.21565601 = sum of:
      0.21565601 = product of:
        0.59904444 = sum of:
          0.017639214 = weight(abstract_txt:eine in 3925) [ClassicSimilarity], result of:
            0.017639214 = score(doc=3925,freq=2.0), product of:
              0.045755863 = queryWeight, product of:
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.013113521 = queryNorm
              0.3855072 = fieldWeight in 3925, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.078125 = fieldNorm(doc=3925)
          0.021921653 = weight(abstract_txt:werden in 3925) [ClassicSimilarity], result of:
            0.021921653 = score(doc=3925,freq=3.0), product of:
              0.046203945 = queryWeight, product of:
                1.0048845 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.013113521 = queryNorm
              0.47445413 = fieldWeight in 3925, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.078125 = fieldNorm(doc=3925)
          0.036945533 = weight(abstract_txt:immer in 3925) [ClassicSimilarity], result of:
            0.036945533 = score(doc=3925,freq=1.0), product of:
              0.09437245 = queryWeight, product of:
                1.4361482 = boost
                5.0110264 = idf(docFreq=800, maxDocs=44218)
                0.013113521 = queryNorm
              0.39148644 = fieldWeight in 3925, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0110264 = idf(docFreq=800, maxDocs=44218)
                0.078125 = fieldNorm(doc=3925)
          0.05197018 = weight(abstract_txt:müssen in 3925) [ClassicSimilarity], result of:
            0.05197018 = score(doc=3925,freq=1.0), product of:
              0.11847865 = queryWeight, product of:
                1.6091505 = boost
                5.6146684 = idf(docFreq=437, maxDocs=44218)
                0.013113521 = queryNorm
              0.43864596 = fieldWeight in 3925, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6146684 = idf(docFreq=437, maxDocs=44218)
                0.078125 = fieldNorm(doc=3925)
          0.11343571 = weight(abstract_txt:analyse in 3925) [ClassicSimilarity], result of:
            0.11343571 = score(doc=3925,freq=4.0), product of:
              0.12558867 = queryWeight, product of:
                1.6567304 = boost
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.013113521 = queryNorm
              0.90323204 = fieldWeight in 3925, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.078125 = fieldNorm(doc=3925)
          0.05968487 = weight(abstract_txt:wieder in 3925) [ClassicSimilarity], result of:
            0.05968487 = score(doc=3925,freq=1.0), product of:
              0.1299312 = queryWeight, product of:
                1.6851296 = boost
                5.879776 = idf(docFreq=335, maxDocs=44218)
                0.013113521 = queryNorm
              0.4593575 = fieldWeight in 3925, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.879776 = idf(docFreq=335, maxDocs=44218)
                0.078125 = fieldNorm(doc=3925)
          0.06982937 = weight(abstract_txt:archive in 3925) [ClassicSimilarity], result of:
            0.06982937 = score(doc=3925,freq=1.0), product of:
              0.14426556 = queryWeight, product of:
                1.7756524 = boost
                6.195629 = idf(docFreq=244, maxDocs=44218)
                0.013113521 = queryNorm
              0.48403352 = fieldWeight in 3925, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.195629 = idf(docFreq=244, maxDocs=44218)
                0.078125 = fieldNorm(doc=3925)
          0.1111444 = weight(abstract_txt:dokumenten in 3925) [ClassicSimilarity], result of:
            0.1111444 = score(doc=3925,freq=2.0), product of:
              0.1560938 = queryWeight, product of:
                1.8470109 = boost
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.013113521 = queryNorm
              0.71203595 = fieldWeight in 3925, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.078125 = fieldNorm(doc=3925)
          0.116473526 = weight(abstract_txt:oder in 3925) [ClassicSimilarity], result of:
            0.116473526 = score(doc=3925,freq=3.0), product of:
              0.20290314 = queryWeight, product of:
                3.6473851 = boost
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.013113521 = queryNorm
              0.5740351 = fieldWeight in 3925, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.078125 = fieldNorm(doc=3925)
        0.36 = coord(9/25)
    
  2. Jörn, F.: Wie Google für uns nach der ominösen Gluonenkraft stöbert : Software-Krabbler machen sich vor der Anfrage auf die Suche - Das Netz ist etwa fünfhundertmal größer als alles Durchforschte (2001) 0.12
    0.11580812 = sum of:
      0.11580812 = product of:
        0.2895203 = sum of:
          0.00986062 = weight(abstract_txt:eine in 3684) [ClassicSimilarity], result of:
            0.00986062 = score(doc=3684,freq=10.0), product of:
              0.045755863 = queryWeight, product of:
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.013113521 = queryNorm
              0.21550506 = fieldWeight in 3684, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.010494193 = weight(abstract_txt:werden in 3684) [ClassicSimilarity], result of:
            0.010494193 = score(doc=3684,freq=11.0), product of:
              0.046203945 = queryWeight, product of:
                1.0048845 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.013113521 = queryNorm
              0.22712764 = fieldWeight in 3684, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.0035931198 = weight(abstract_txt:search in 3684) [ClassicSimilarity], result of:
            0.0035931198 = score(doc=3684,freq=1.0), product of:
              0.05029117 = queryWeight, product of:
                1.0483891 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.013113521 = queryNorm
              0.07144634 = fieldWeight in 3684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.018472767 = weight(abstract_txt:immer in 3684) [ClassicSimilarity], result of:
            0.018472767 = score(doc=3684,freq=4.0), product of:
              0.09437245 = queryWeight, product of:
                1.4361482 = boost
                5.0110264 = idf(docFreq=800, maxDocs=44218)
                0.013113521 = queryNorm
              0.19574322 = fieldWeight in 3684, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.0110264 = idf(docFreq=800, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.047550328 = weight(abstract_txt:suchmaschine in 3684) [ClassicSimilarity], result of:
            0.047550328 = score(doc=3684,freq=6.0), product of:
              0.15484452 = queryWeight, product of:
                1.8396049 = boost
                6.4187727 = idf(docFreq=195, maxDocs=44218)
                0.013113521 = queryNorm
              0.30708435 = fieldWeight in 3684, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.4187727 = idf(docFreq=195, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.034030885 = weight(abstract_txt:dokumenten in 3684) [ClassicSimilarity], result of:
            0.034030885 = score(doc=3684,freq=3.0), product of:
              0.1560938 = queryWeight, product of:
                1.8470109 = boost
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.013113521 = queryNorm
              0.2180156 = fieldWeight in 3684, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.025138948 = weight(abstract_txt:automatisch in 3684) [ClassicSimilarity], result of:
            0.025138948 = score(doc=3684,freq=1.0), product of:
              0.18396787 = queryWeight, product of:
                2.0051534 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.013113521 = queryNorm
              0.13664858 = fieldWeight in 3684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.03626573 = weight(abstract_txt:helfen in 3684) [ClassicSimilarity], result of:
            0.03626573 = score(doc=3684,freq=2.0), product of:
              0.18642247 = queryWeight, product of:
                2.018486 = boost
                7.042927 = idf(docFreq=104, maxDocs=44218)
                0.013113521 = queryNorm
              0.1945352 = fieldWeight in 3684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.042927 = idf(docFreq=104, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.048356246 = weight(abstract_txt:öfter in 3684) [ClassicSimilarity], result of:
            0.048356246 = score(doc=3684,freq=1.0), product of:
              0.28454152 = queryWeight, product of:
                2.49373 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.013113521 = queryNorm
              0.16994444 = fieldWeight in 3684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
          0.055757456 = weight(abstract_txt:oder in 3684) [ClassicSimilarity], result of:
            0.055757456 = score(doc=3684,freq=11.0), product of:
              0.20290314 = queryWeight, product of:
                3.6473851 = boost
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.013113521 = queryNorm
              0.2747984 = fieldWeight in 3684, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.01953125 = fieldNorm(doc=3684)
        0.4 = coord(10/25)
    
  3. Schetsche, M.; Lehmann, K.; Krug, T.: ¬Die Google-Gesellschaft : Zehn Prinzipien der neuen Wissensordnung (2005) 0.12
    0.115657724 = sum of:
      0.115657724 = product of:
        0.4130633 = sum of:
          0.014319164 = weight(abstract_txt:werden in 3488) [ClassicSimilarity], result of:
            0.014319164 = score(doc=3488,freq=2.0), product of:
              0.046203945 = queryWeight, product of:
                1.0048845 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.013113521 = queryNorm
              0.30991215 = fieldWeight in 3488, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0625 = fieldNorm(doc=3488)
          0.041799102 = weight(abstract_txt:immer in 3488) [ClassicSimilarity], result of:
            0.041799102 = score(doc=3488,freq=2.0), product of:
              0.09437245 = queryWeight, product of:
                1.4361482 = boost
                5.0110264 = idf(docFreq=800, maxDocs=44218)
                0.013113521 = queryNorm
              0.44291633 = fieldWeight in 3488, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0110264 = idf(docFreq=800, maxDocs=44218)
                0.0625 = fieldNorm(doc=3488)
          0.047747895 = weight(abstract_txt:wieder in 3488) [ClassicSimilarity], result of:
            0.047747895 = score(doc=3488,freq=1.0), product of:
              0.1299312 = queryWeight, product of:
                1.6851296 = boost
                5.879776 = idf(docFreq=335, maxDocs=44218)
                0.013113521 = queryNorm
              0.367486 = fieldWeight in 3488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.879776 = idf(docFreq=335, maxDocs=44218)
                0.0625 = fieldNorm(doc=3488)
          0.054476622 = weight(abstract_txt:recherche in 3488) [ClassicSimilarity], result of:
            0.054476622 = score(doc=3488,freq=1.0), product of:
              0.14186788 = queryWeight, product of:
                1.7608349 = boost
                6.1439276 = idf(docFreq=257, maxDocs=44218)
                0.013113521 = queryNorm
              0.38399547 = fieldWeight in 3488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1439276 = idf(docFreq=257, maxDocs=44218)
                0.0625 = fieldNorm(doc=3488)
          0.08923692 = weight(abstract_txt:erschlossen in 3488) [ClassicSimilarity], result of:
            0.08923692 = score(doc=3488,freq=1.0), product of:
              0.1971395 = queryWeight, product of:
                2.0756946 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.013113521 = queryNorm
              0.45265874 = fieldWeight in 3488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.0625 = fieldNorm(doc=3488)
          0.11168678 = weight(abstract_txt:größere in 3488) [ClassicSimilarity], result of:
            0.11168678 = score(doc=3488,freq=1.0), product of:
              0.22895236 = queryWeight, product of:
                2.2369134 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.013113521 = queryNorm
              0.4878167 = fieldWeight in 3488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.0625 = fieldNorm(doc=3488)
          0.053796817 = weight(abstract_txt:oder in 3488) [ClassicSimilarity], result of:
            0.053796817 = score(doc=3488,freq=1.0), product of:
              0.20290314 = queryWeight, product of:
                3.6473851 = boost
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.013113521 = queryNorm
              0.26513547 = fieldWeight in 3488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.0625 = fieldNorm(doc=3488)
        0.28 = coord(7/25)
    
  4. Leyh, M.: ¬Das Google File System (2005) 0.11
    0.113015905 = sum of:
      0.113015905 = product of:
        0.35317472 = sum of:
          0.01833121 = weight(abstract_txt:eine in 863) [ClassicSimilarity], result of:
            0.01833121 = score(doc=863,freq=6.0), product of:
              0.045755863 = queryWeight, product of:
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.013113521 = queryNorm
              0.40063083 = fieldWeight in 863, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.046875 = fieldNorm(doc=863)
          0.0151877655 = weight(abstract_txt:werden in 863) [ClassicSimilarity], result of:
            0.0151877655 = score(doc=863,freq=4.0), product of:
              0.046203945 = queryWeight, product of:
                1.0048845 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.013113521 = queryNorm
              0.32871145 = fieldWeight in 863, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.046875 = fieldNorm(doc=863)
          0.022167321 = weight(abstract_txt:immer in 863) [ClassicSimilarity], result of:
            0.022167321 = score(doc=863,freq=1.0), product of:
              0.09437245 = queryWeight, product of:
                1.4361482 = boost
                5.0110264 = idf(docFreq=800, maxDocs=44218)
                0.013113521 = queryNorm
              0.23489186 = fieldWeight in 863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0110264 = idf(docFreq=800, maxDocs=44218)
                0.046875 = fieldNorm(doc=863)
          0.06588767 = weight(abstract_txt:suchmaschine in 863) [ClassicSimilarity], result of:
            0.06588767 = score(doc=863,freq=2.0), product of:
              0.15484452 = queryWeight, product of:
                1.8396049 = boost
                6.4187727 = idf(docFreq=195, maxDocs=44218)
                0.013113521 = queryNorm
              0.42550856 = fieldWeight in 863, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4187727 = idf(docFreq=195, maxDocs=44218)
                0.046875 = fieldNorm(doc=863)
          0.047154576 = weight(abstract_txt:dokumenten in 863) [ClassicSimilarity], result of:
            0.047154576 = score(doc=863,freq=1.0), product of:
              0.1560938 = queryWeight, product of:
                1.8470109 = boost
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.013113521 = queryNorm
              0.30209127 = fieldWeight in 863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.046875 = fieldNorm(doc=863)
          0.060333475 = weight(abstract_txt:automatisch in 863) [ClassicSimilarity], result of:
            0.060333475 = score(doc=863,freq=1.0), product of:
              0.18396787 = queryWeight, product of:
                2.0051534 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.013113521 = queryNorm
              0.3279566 = fieldWeight in 863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.046875 = fieldNorm(doc=863)
          0.08376509 = weight(abstract_txt:größere in 863) [ClassicSimilarity], result of:
            0.08376509 = score(doc=863,freq=1.0), product of:
              0.22895236 = queryWeight, product of:
                2.2369134 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.013113521 = queryNorm
              0.36586252 = fieldWeight in 863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.046875 = fieldNorm(doc=863)
          0.040347613 = weight(abstract_txt:oder in 863) [ClassicSimilarity], result of:
            0.040347613 = score(doc=863,freq=1.0), product of:
              0.20290314 = queryWeight, product of:
                3.6473851 = boost
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.013113521 = queryNorm
              0.1988516 = fieldWeight in 863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.046875 = fieldNorm(doc=863)
        0.32 = coord(8/25)
    
  5. Hänger, C.; Krätzsch, C.; Niemann, C.: Was vom Tagging übrig blieb : Erkenntnisse und Einsichten aus zwei Jahren Projektarbeit (2011) 0.11
    0.111957446 = sum of:
      0.111957446 = product of:
        0.34986702 = sum of:
          0.01234745 = weight(abstract_txt:eine in 4519) [ClassicSimilarity], result of:
            0.01234745 = score(doc=4519,freq=2.0), product of:
              0.045755863 = queryWeight, product of:
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.013113521 = queryNorm
              0.26985502 = fieldWeight in 4519, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4892128 = idf(docFreq=3668, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4519)
          0.012529268 = weight(abstract_txt:werden in 4519) [ClassicSimilarity], result of:
            0.012529268 = score(doc=4519,freq=2.0), product of:
              0.046203945 = queryWeight, product of:
                1.0048845 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.013113521 = queryNorm
              0.27117312 = fieldWeight in 4519, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4519)
          0.03657421 = weight(abstract_txt:immer in 4519) [ClassicSimilarity], result of:
            0.03657421 = score(doc=4519,freq=2.0), product of:
              0.09437245 = queryWeight, product of:
                1.4361482 = boost
                5.0110264 = idf(docFreq=800, maxDocs=44218)
                0.013113521 = queryNorm
              0.38755178 = fieldWeight in 4519, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0110264 = idf(docFreq=800, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4519)
          0.039702497 = weight(abstract_txt:analyse in 4519) [ClassicSimilarity], result of:
            0.039702497 = score(doc=4519,freq=1.0), product of:
              0.12558867 = queryWeight, product of:
                1.6567304 = boost
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.013113521 = queryNorm
              0.3161312 = fieldWeight in 4519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4519)
          0.04177941 = weight(abstract_txt:wieder in 4519) [ClassicSimilarity], result of:
            0.04177941 = score(doc=4519,freq=1.0), product of:
              0.1299312 = queryWeight, product of:
                1.6851296 = boost
                5.879776 = idf(docFreq=335, maxDocs=44218)
                0.013113521 = queryNorm
              0.32155025 = fieldWeight in 4519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.879776 = idf(docFreq=335, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4519)
          0.05501367 = weight(abstract_txt:dokumenten in 4519) [ClassicSimilarity], result of:
            0.05501367 = score(doc=4519,freq=1.0), product of:
              0.1560938 = queryWeight, product of:
                1.8470109 = boost
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.013113521 = queryNorm
              0.35243982 = fieldWeight in 4519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4519)
          0.070389055 = weight(abstract_txt:automatisch in 4519) [ClassicSimilarity], result of:
            0.070389055 = score(doc=4519,freq=1.0), product of:
              0.18396787 = queryWeight, product of:
                2.0051534 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.013113521 = queryNorm
              0.382616 = fieldWeight in 4519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4519)
          0.08153147 = weight(abstract_txt:oder in 4519) [ClassicSimilarity], result of:
            0.08153147 = score(doc=4519,freq=3.0), product of:
              0.20290314 = queryWeight, product of:
                3.6473851 = boost
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.013113521 = queryNorm
              0.4018246 = fieldWeight in 4519, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2421675 = idf(docFreq=1727, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4519)
        0.32 = coord(8/25)
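
The score breakdowns above follow the TF-IDF scheme of Lucene's ClassicSimilarity: each term contributes queryWeight × fieldWeight, where queryWeight = boost × idf × queryNorm, fieldWeight = tf × idf × fieldNorm, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the summed term weights are then multiplied by coord, the fraction of query terms that matched. The following sketch recomputes one line of the first listing from these formulas. The function names are illustrative, and queryNorm and fieldNorm are simply copied from the listing rather than derived.

```python
import math


def idf(doc_freq, max_docs):
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    """Weight of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(term_scores, query_term_count):
    """Sum of the term weights times coord (matched terms / query terms)."""
    coord = len(term_scores) / query_term_count
    return sum(term_scores) * coord


# Term "analyse" in doc 3925, with the values shown in the first listing.
analyse = term_score(freq=4.0, doc_freq=370, max_docs=44218,
                     query_norm=0.013113521, field_norm=0.078125,
                     boost=1.6567304)

# The nine term weights listed for doc 3925, combined with coord(9/25).
doc_3925 = document_score(
    [0.017639214, 0.021921653, 0.036945533, 0.05197018, 0.11343571,
     0.05968487, 0.06982937, 0.1111444, 0.116473526],
    query_term_count=25,
)
```

Because the listing reports single-precision values, results computed in double precision agree with it only to about six significant digits.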