Document (#34819)

Author
Puzicha, J.
Title
Informationen finden! : Intelligente Suchmaschinentechnologie & automatische Kategorisierung
Imprint
Rheinbach : recommind
Year
2007
Pages
15 S
Abstract
Wie in diesem Text erläutert wurde, ist die Effektivität von Such- und Klassifizierungssystemen durch folgendes bestimmt: 1) den Arbeitsauftrag, 2) die Genauigkeit des Systems, 3) den zu erreichenden Automatisierungsgrad, 4) die Einfachheit der Integration in bereits vorhandene Systeme. Diese Kriterien gehen davon aus, dass jedes System, unabhängig von der Technologie, in der Lage ist, Grundvoraussetzungen des Produkts in Bezug auf Funktionalität, Skalierbarkeit und Input-Methode zu erfüllen. Diese Produkteigenschaften sind in der Recommind Produktliteratur genauer erläutert. Von diesen Fähigkeiten ausgehend sollte die vorhergehende Diskussion jedoch einige klare Trends aufgezeigt haben. Es ist nicht überraschend, dass jüngere Entwicklungen im Maschine Learning und anderen Bereichen der Informatik einen theoretischen Ausgangspunkt für die Entwicklung von Suchmaschinen- und Klassifizierungstechnologie haben. Besonders jüngste Fortschritte bei den statistischen Methoden (PLSA) und anderen mathematischen Werkzeugen (SVMs) haben eine Ergebnisqualität auf Durchbruchsniveau erreicht. Dazu kommt noch die Flexibilität in der Anwendung durch Selbsttraining und Kategorienerkennen von PLSA-Systemen, wie auch eine neue Generation von vorher unerreichten Produktivitätsverbesserungen.
Content
Technical Whitepaper - Grundlagen der Informationsgewinnung
Footnote
Vgl. auch: http://www.recommind.de/?id=mindserver_categorization.
Theme
Automatisches Klassifizieren
Object
Latent Semantic Indexing

Similar documents (content)

  1. Burblies, C.; Wolff, J.E.: Vascoda - Effiziente Vermittlung wissenschaftlicher information (2009) 0.09
    0.08827885 = sum of:
      0.08827885 = product of:
        0.44139424 = sum of:
          0.033171732 = weight(abstract_txt:durch in 4784) [ClassicSimilarity], result of:
            0.033171732 = score(doc=4784,freq=2.0), product of:
              0.10048477 = queryWeight, product of:
                1.0896761 = boost
                4.2683973 = idf(docFreq=1626, maxDocs=42740)
                0.02160419 = queryNorm
              0.330117 = fieldWeight in 4784, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2683973 = idf(docFreq=1626, maxDocs=42740)
                0.0546875 = fieldNorm(doc=4784)
          0.13432816 = weight(abstract_txt:suchmaschinentechnologie in 4784) [ClassicSimilarity], result of:
            0.13432816 = score(doc=4784,freq=2.0), product of:
              0.20262337 = queryWeight, product of:
                1.0941505 = boost
                8.571848 = idf(docFreq=21, maxDocs=42740)
                0.02160419 = queryNorm
              0.66294503 = fieldWeight in 4784, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.571848 = idf(docFreq=21, maxDocs=42740)
                0.0546875 = fieldNorm(doc=4784)
          0.16488667 = weight(abstract_txt:einfachheit in 4784) [ClassicSimilarity], result of:
            0.16488667 = score(doc=4784,freq=2.0), product of:
              0.2322925 = queryWeight, product of:
                1.1715206 = boost
                9.177984 = idf(docFreq=11, maxDocs=42740)
                0.02160419 = queryNorm
              0.7098235 = fieldWeight in 4784, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.177984 = idf(docFreq=11, maxDocs=42740)
                0.0546875 = fieldNorm(doc=4784)
          0.043186907 = weight(abstract_txt:anderen in 4784) [ClassicSimilarity], result of:
            0.043186907 = score(doc=4784,freq=1.0), product of:
              0.15094993 = queryWeight, product of:
                1.3355612 = boost
                5.2315593 = idf(docFreq=620, maxDocs=42740)
                0.02160419 = queryNorm
              0.2861009 = fieldWeight in 4784, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2315593 = idf(docFreq=620, maxDocs=42740)
                0.0546875 = fieldNorm(doc=4784)
          0.06582079 = weight(abstract_txt:haben in 4784) [ClassicSimilarity], result of:
            0.06582079 = score(doc=4784,freq=2.0), product of:
              0.18163268 = queryWeight, product of:
                1.7942789 = boost
                4.6856093 = idf(docFreq=1071, maxDocs=42740)
                0.02160419 = queryNorm
              0.36238408 = fieldWeight in 4784, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6856093 = idf(docFreq=1071, maxDocs=42740)
                0.0546875 = fieldNorm(doc=4784)
        0.2 = coord(5/25)
    
  2. Raicher, E.: Möglichkeiten und Grenzen von Primo bei der Einführung in deutschsprachigen Bibliotheken und Bibliotheksverbünden (2010) 0.08
    0.08484192 = sum of:
      0.08484192 = product of:
        0.30300686 = sum of:
          0.03791055 = weight(abstract_txt:durch in 1312) [ClassicSimilarity], result of:
            0.03791055 = score(doc=1312,freq=8.0), product of:
              0.10048477 = queryWeight, product of:
                1.0896761 = boost
                4.2683973 = idf(docFreq=1626, maxDocs=42740)
                0.02160419 = queryNorm
              0.37727657 = fieldWeight in 1312, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.2683973 = idf(docFreq=1626, maxDocs=42740)
                0.03125 = fieldNorm(doc=1312)
          0.05427677 = weight(abstract_txt:suchmaschinentechnologie in 1312) [ClassicSimilarity], result of:
            0.05427677 = score(doc=1312,freq=1.0), product of:
              0.20262337 = queryWeight, product of:
                1.0941505 = boost
                8.571848 = idf(docFreq=21, maxDocs=42740)
                0.02160419 = queryNorm
              0.26787025 = fieldWeight in 1312, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.571848 = idf(docFreq=21, maxDocs=42740)
                0.03125 = fieldNorm(doc=1312)
          0.05710954 = weight(abstract_txt:überraschend in 1312) [ClassicSimilarity], result of:
            0.05710954 = score(doc=1312,freq=1.0), product of:
              0.20961352 = queryWeight, product of:
                1.1128637 = boost
                8.7184515 = idf(docFreq=18, maxDocs=42740)
                0.02160419 = queryNorm
              0.2724516 = fieldWeight in 1312, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.7184515 = idf(docFreq=18, maxDocs=42740)
                0.03125 = fieldNorm(doc=1312)
          0.020282755 = weight(abstract_txt:diese in 1312) [ClassicSimilarity], result of:
            0.020282755 = score(doc=1312,freq=2.0), product of:
              0.105123095 = queryWeight, product of:
                1.1145419 = boost
                4.3657994 = idf(docFreq=1475, maxDocs=42740)
                0.02160419 = queryNorm
              0.1929429 = fieldWeight in 1312, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3657994 = idf(docFreq=1475, maxDocs=42740)
                0.03125 = fieldNorm(doc=1312)
          0.06332334 = weight(abstract_txt:produkts in 1312) [ClassicSimilarity], result of:
            0.06332334 = score(doc=1312,freq=1.0), product of:
              0.224555 = queryWeight, product of:
                1.151844 = boost
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.02160419 = queryNorm
              0.2819948 = fieldWeight in 1312, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.023833 = idf(docFreq=13, maxDocs=42740)
                0.03125 = fieldNorm(doc=1312)
          0.04350826 = weight(abstract_txt:dass in 1312) [ClassicSimilarity], result of:
            0.04350826 = score(doc=1312,freq=7.0), product of:
              0.11516098 = queryWeight, product of:
                1.166541 = boost
                4.569486 = idf(docFreq=1203, maxDocs=42740)
                0.02160419 = queryNorm
              0.37780386 = fieldWeight in 1312, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.569486 = idf(docFreq=1203, maxDocs=42740)
                0.03125 = fieldNorm(doc=1312)
          0.026595619 = weight(abstract_txt:haben in 1312) [ClassicSimilarity], result of:
            0.026595619 = score(doc=1312,freq=1.0), product of:
              0.18163268 = queryWeight, product of:
                1.7942789 = boost
                4.6856093 = idf(docFreq=1071, maxDocs=42740)
                0.02160419 = queryNorm
              0.14642529 = fieldWeight in 1312, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6856093 = idf(docFreq=1071, maxDocs=42740)
                0.03125 = fieldNorm(doc=1312)
        0.28 = coord(7/25)
    
  3. Weishaupt, K.: Alephino : ein neues Bibliothekssystem für kleine und mittlere Bibliotheken (2004) 0.08
    0.0844057 = sum of:
      0.0844057 = product of:
        0.3516904 = sum of:
          0.073755175 = weight(abstract_txt:funktionalität in 3287) [ClassicSimilarity], result of:
            0.073755175 = score(doc=3287,freq=1.0), product of:
              0.17117874 = queryWeight, product of:
                1.005674 = boost
                7.8787007 = idf(docFreq=43, maxDocs=42740)
                0.02160419 = queryNorm
              0.43086645 = fieldWeight in 3287, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8787007 = idf(docFreq=43, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3287)
          0.088337764 = weight(abstract_txt:vorher in 3287) [ClassicSimilarity], result of:
            0.088337764 = score(doc=3287,freq=1.0), product of:
              0.19305709 = queryWeight, product of:
                1.0680097 = boost
                8.367054 = idf(docFreq=26, maxDocs=42740)
                0.02160419 = queryNorm
              0.45757326 = fieldWeight in 3287, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.367054 = idf(docFreq=26, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3287)
          0.02509863 = weight(abstract_txt:diese in 3287) [ClassicSimilarity], result of:
            0.02509863 = score(doc=3287,freq=1.0), product of:
              0.105123095 = queryWeight, product of:
                1.1145419 = boost
                4.3657994 = idf(docFreq=1475, maxDocs=42740)
                0.02160419 = queryNorm
              0.23875466 = fieldWeight in 3287, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3657994 = idf(docFreq=1475, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3287)
          0.040698253 = weight(abstract_txt:dass in 3287) [ClassicSimilarity], result of:
            0.040698253 = score(doc=3287,freq=2.0), product of:
              0.11516098 = queryWeight, product of:
                1.166541 = boost
                4.569486 = idf(docFreq=1203, maxDocs=42740)
                0.02160419 = queryNorm
              0.35340315 = fieldWeight in 3287, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.569486 = idf(docFreq=1203, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3287)
          0.043186907 = weight(abstract_txt:anderen in 3287) [ClassicSimilarity], result of:
            0.043186907 = score(doc=3287,freq=1.0), product of:
              0.15094993 = queryWeight, product of:
                1.3355612 = boost
                5.2315593 = idf(docFreq=620, maxDocs=42740)
                0.02160419 = queryNorm
              0.2861009 = fieldWeight in 3287, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2315593 = idf(docFreq=620, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3287)
          0.08061368 = weight(abstract_txt:haben in 3287) [ClassicSimilarity], result of:
            0.08061368 = score(doc=3287,freq=3.0), product of:
              0.18163268 = queryWeight, product of:
                1.7942789 = boost
                4.6856093 = idf(docFreq=1071, maxDocs=42740)
                0.02160419 = queryNorm
              0.44382805 = fieldWeight in 3287, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6856093 = idf(docFreq=1071, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3287)
        0.24 = coord(6/25)
    
  4. Lochmann, D.: Vom Wesen der Information : eine allgemeinverständliche Betrachtung über Information in der Gesellschaft, in der Natur und in der Informationstheorie (2004) 0.08
    0.075289264 = sum of:
      0.075289264 = product of:
        0.26889023 = sum of:
          0.04143647 = weight(abstract_txt:fortschritte in 3582) [ClassicSimilarity], result of:
            0.04143647 = score(doc=3582,freq=1.0), product of:
              0.1692526 = queryWeight, product of:
                7.834249 = idf(docFreq=45, maxDocs=42740)
                0.02160419 = queryNorm
              0.24482028 = fieldWeight in 3582, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.834249 = idf(docFreq=45, maxDocs=42740)
                0.03125 = fieldNorm(doc=3582)
          0.04411145 = weight(abstract_txt:statistischen in 3582) [ClassicSimilarity], result of:
            0.04411145 = score(doc=3582,freq=1.0), product of:
              0.1764606 = queryWeight, product of:
                1.0210716 = boost
                7.999329 = idf(docFreq=38, maxDocs=42740)
                0.02160419 = queryNorm
              0.24997903 = fieldWeight in 3582, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.999329 = idf(docFreq=38, maxDocs=42740)
                0.03125 = fieldNorm(doc=3582)
          0.018955275 = weight(abstract_txt:durch in 3582) [ClassicSimilarity], result of:
            0.018955275 = score(doc=3582,freq=2.0), product of:
              0.10048477 = queryWeight, product of:
                1.0896761 = boost
                4.2683973 = idf(docFreq=1626, maxDocs=42740)
                0.02160419 = queryNorm
              0.18863828 = fieldWeight in 3582, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2683973 = idf(docFreq=1626, maxDocs=42740)
                0.03125 = fieldNorm(doc=3582)
          0.03206985 = weight(abstract_txt:diese in 3582) [ClassicSimilarity], result of:
            0.03206985 = score(doc=3582,freq=5.0), product of:
              0.105123095 = queryWeight, product of:
                1.1145419 = boost
                4.3657994 = idf(docFreq=1475, maxDocs=42740)
                0.02160419 = queryNorm
              0.3050695 = fieldWeight in 3582, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.3657994 = idf(docFreq=1475, maxDocs=42740)
                0.03125 = fieldNorm(doc=3582)
          0.04350826 = weight(abstract_txt:dass in 3582) [ClassicSimilarity], result of:
            0.04350826 = score(doc=3582,freq=7.0), product of:
              0.11516098 = queryWeight, product of:
                1.166541 = boost
                4.569486 = idf(docFreq=1203, maxDocs=42740)
                0.02160419 = queryNorm
              0.37780386 = fieldWeight in 3582, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.569486 = idf(docFreq=1203, maxDocs=42740)
                0.03125 = fieldNorm(doc=3582)
          0.04274395 = weight(abstract_txt:anderen in 3582) [ClassicSimilarity], result of:
            0.04274395 = score(doc=3582,freq=3.0), product of:
              0.15094993 = queryWeight, product of:
                1.3355612 = boost
                5.2315593 = idf(docFreq=620, maxDocs=42740)
                0.02160419 = queryNorm
              0.28316644 = fieldWeight in 3582, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2315593 = idf(docFreq=620, maxDocs=42740)
                0.03125 = fieldNorm(doc=3582)
          0.04606496 = weight(abstract_txt:haben in 3582) [ClassicSimilarity], result of:
            0.04606496 = score(doc=3582,freq=3.0), product of:
              0.18163268 = queryWeight, product of:
                1.7942789 = boost
                4.6856093 = idf(docFreq=1071, maxDocs=42740)
                0.02160419 = queryNorm
              0.25361603 = fieldWeight in 3582, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6856093 = idf(docFreq=1071, maxDocs=42740)
                0.03125 = fieldNorm(doc=3582)
        0.28 = coord(7/25)
    
  5. Bisig, U.: Kriterien der Benutzerfreundlichkeit von Sachkatalogen : ein Überblick auf dem Hintergrund von RSWK und OPACs (1995) 0.07
    0.0719635 = sum of:
      0.0719635 = product of:
        0.89954376 = sum of:
          0.43317387 = weight(abstract_txt:genauigkeit in 2352) [ClassicSimilarity], result of:
            0.43317387 = score(doc=2352,freq=1.0), product of:
              0.22113441 = queryWeight, product of:
                1.1430376 = boost
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.02160419 = queryNorm
              1.9588714 = fieldWeight in 2352, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.954841 = idf(docFreq=14, maxDocs=42740)
                0.21875 = fieldNorm(doc=2352)
          0.46636993 = weight(abstract_txt:einfachheit in 2352) [ClassicSimilarity], result of:
            0.46636993 = score(doc=2352,freq=1.0), product of:
              0.2322925 = queryWeight, product of:
                1.1715206 = boost
                9.177984 = idf(docFreq=11, maxDocs=42740)
                0.02160419 = queryNorm
              2.007684 = fieldWeight in 2352, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.177984 = idf(docFreq=11, maxDocs=42740)
                0.21875 = fieldNorm(doc=2352)
        0.08 = coord(2/25)