Document (#30978)

Author
Pfister, J.
Title
Clustering von Patent-Dokumenten am Beispiel der Datenbanken des Fachinformationszentrums Karlsruhe
Source
Effektive Information Retrieval Verfahren in Theorie und Praxis: ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005. Hrsg.: T. Mandl u. C. Womser-Hacker
Imprint
Konstanz : UVK Verlagsgesellschaft
Year
2006
Pages
S.129-146
Series
Schriften zur Informationswissenschaft; Bd.45
Abstract
In diesem Artikel, der im Anwendungsbereich der Patentrecherche und Patentinformation angesiedelt ist, wird das automatische Gruppieren von Patentdokumenten - das so genannte Clustering - als ein Werkzeug zur Aufbereitung der Ergebnismenge einer Datenbankanfrage untersucht. Der Schwerpunkt liegt dabei auf der Evaluierung von drei Clustering-Verfahren mittels Nutzerbewertungen.
Theme
Automatisches Klassifizieren
Field
Patentinformation

Similar documents (author)

  1. Pfister, D. Schmidt- => Schmidt-Pfister, D.: 5.02
    5.018137 = sum of:
      5.018137 = weight(author_txt:pfister in 6983) [ClassicSimilarity], result of:
        5.018137 = score(doc=6983,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.10568265 = queryNorm
          5.0181375 = fieldWeight in 6983, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.375 = fieldNorm(doc=6983)
    
  2. Pfister, R.-D.: Ware oder öffentliches Gut? : Über den Charakter von Information; am Beispiel Internet (1994) 4.73
    4.731145 = sum of:
      4.731145 = weight(author_txt:pfister in 128) [ClassicSimilarity], result of:
        4.731145 = score(doc=128,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.10568265 = queryNorm
          4.7311454 = fieldWeight in 128, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.5 = fieldNorm(doc=128)
    
  3. Pfister, R.-D.: Neue Produkte auf der Basis von Multimedia (1995) 4.73
    4.731145 = sum of:
      4.731145 = weight(author_txt:pfister in 1460) [ClassicSimilarity], result of:
        4.731145 = score(doc=1460,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.10568265 = queryNorm
          4.7311454 = fieldWeight in 1460, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.5 = fieldNorm(doc=1460)
    
  4. Pfister, H.-R.: Eröffnung des CSCL-Kompetenzzentrums am GMD-IPSI in Darmstadt : Kooperatives computerunterstütztes Lernen (CSCL) - Was ist das und wozu nützt es? (2000) 4.73
    4.731145 = sum of:
      4.731145 = weight(author_txt:pfister in 5834) [ClassicSimilarity], result of:
        4.731145 = score(doc=5834,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.10568265 = queryNorm
          4.7311454 = fieldWeight in 5834, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.5 = fieldNorm(doc=5834)
    
  5. Hangel, N.; Schmidt-Pfister, D.: Why do you publish? : on the tensions between generating scientific knowledge and publication pressure (2017) 4.14
    4.139752 = sum of:
      4.139752 = weight(author_txt:pfister in 55) [ClassicSimilarity], result of:
        4.139752 = score(doc=55,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.10568265 = queryNorm
          4.1397524 = fieldWeight in 55, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            9.462291 = idf(docFreq=8, maxDocs=42596)
            0.4375 = fieldNorm(doc=55)
    

Similar documents (content)

  1. Schramm, R.: Patentinformation (2004) 0.23
    0.2299488 = sum of:
      0.2299488 = product of:
        1.9162401 = sum of:
          0.043680083 = weight(abstract_txt:verfahren in 3956) [ClassicSimilarity], result of:
            0.043680083 = score(doc=3956,freq=2.0), product of:
              0.11401862 = queryWeight, product of:
                1.2235383 = boost
                5.7789826 = idf(docFreq=357, maxDocs=42596)
                0.016125264 = queryNorm
              0.38309604 = fieldWeight in 3956, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7789826 = idf(docFreq=357, maxDocs=42596)
                0.046875 = fieldNorm(doc=3956)
          0.07426335 = weight(abstract_txt:patent in 3956) [ClassicSimilarity], result of:
            0.07426335 = score(doc=3956,freq=2.0), product of:
              0.16241887 = queryWeight, product of:
                1.4603196 = boost
                6.8973417 = idf(docFreq=116, maxDocs=42596)
                0.016125264 = queryNorm
              0.4572335 = fieldWeight in 3956, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8973417 = idf(docFreq=116, maxDocs=42596)
                0.046875 = fieldNorm(doc=3956)
          1.7982967 = weight(title_txt:patentinformation in 3956) [ClassicSimilarity], result of:
            1.7982967 = score(doc=3956,freq=1.0), product of:
              0.2226718 = queryWeight, product of:
                1.7098669 = boost
                8.075996 = idf(docFreq=35, maxDocs=42596)
                0.016125264 = queryNorm
              8.075996 = fieldWeight in 3956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.075996 = idf(docFreq=35, maxDocs=42596)
                1.0 = fieldNorm(doc=3956)
        0.12 = coord(3/25)
    
  2. STN baut Patentinformation aus (2004) 0.17
    0.16833898 = sum of:
      0.16833898 = product of:
        1.0521187 = sum of:
          0.03593012 = weight(abstract_txt:datenbanken in 3305) [ClassicSimilarity], result of:
            0.03593012 = score(doc=3305,freq=1.0), product of:
              0.1137989 = queryWeight, product of:
                1.2223588 = boost
                5.7734118 = idf(docFreq=359, maxDocs=42596)
                0.016125264 = queryNorm
              0.31573346 = fieldWeight in 3305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7734118 = idf(docFreq=359, maxDocs=42596)
                0.0546875 = fieldNorm(doc=3305)
          0.036730587 = weight(abstract_txt:beispiel in 3305) [ClassicSimilarity], result of:
            0.036730587 = score(doc=3305,freq=1.0), product of:
              0.11548286 = queryWeight, product of:
                1.2313696 = boost
                5.8159714 = idf(docFreq=344, maxDocs=42596)
                0.016125264 = queryNorm
              0.31806093 = fieldWeight in 3305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8159714 = idf(docFreq=344, maxDocs=42596)
                0.0546875 = fieldNorm(doc=3305)
          0.080309615 = weight(abstract_txt:karlsruhe in 3305) [ClassicSimilarity], result of:
            0.080309615 = score(doc=3305,freq=1.0), product of:
              0.19454078 = queryWeight, product of:
                1.5982143 = boost
                7.5486417 = idf(docFreq=60, maxDocs=42596)
                0.016125264 = queryNorm
              0.41281635 = fieldWeight in 3305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5486417 = idf(docFreq=60, maxDocs=42596)
                0.0546875 = fieldNorm(doc=3305)
          0.89914834 = weight(title_txt:patentinformation in 3305) [ClassicSimilarity], result of:
            0.89914834 = score(doc=3305,freq=1.0), product of:
              0.2226718 = queryWeight, product of:
                1.7098669 = boost
                8.075996 = idf(docFreq=35, maxDocs=42596)
                0.016125264 = queryNorm
              4.037998 = fieldWeight in 3305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.075996 = idf(docFreq=35, maxDocs=42596)
                0.5 = fieldNorm(doc=3305)
        0.16 = coord(4/25)
    
  3. Gerick, T.: Content-based Information Retrieval auf Basis semantischer Abfragenetze : Kooperative Technologien am Beispsiel der Dokumentenrecherche in GENIOS Wirtschaftsdatenbanken (1999) 0.13
    0.13072565 = sum of:
      0.13072565 = product of:
        0.46687734 = sum of:
          0.028103765 = weight(abstract_txt:dabei in 4875) [ClassicSimilarity], result of:
            0.028103765 = score(doc=4875,freq=1.0), product of:
              0.076162405 = queryWeight, product of:
                4.7231727 = idf(docFreq=1028, maxDocs=42596)
                0.016125264 = queryNorm
              0.36899787 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7231727 = idf(docFreq=1028, maxDocs=42596)
                0.078125 = fieldNorm(doc=4875)
          0.05132874 = weight(abstract_txt:datenbanken in 4875) [ClassicSimilarity], result of:
            0.05132874 = score(doc=4875,freq=1.0), product of:
              0.1137989 = queryWeight, product of:
                1.2223588 = boost
                5.7734118 = idf(docFreq=359, maxDocs=42596)
                0.016125264 = queryNorm
              0.45104778 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7734118 = idf(docFreq=359, maxDocs=42596)
                0.078125 = fieldNorm(doc=4875)
          0.05147747 = weight(abstract_txt:verfahren in 4875) [ClassicSimilarity], result of:
            0.05147747 = score(doc=4875,freq=1.0), product of:
              0.11401862 = queryWeight, product of:
                1.2235383 = boost
                5.7789826 = idf(docFreq=357, maxDocs=42596)
                0.016125264 = queryNorm
              0.451483 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7789826 = idf(docFreq=357, maxDocs=42596)
                0.078125 = fieldNorm(doc=4875)
          0.052472267 = weight(abstract_txt:beispiel in 4875) [ClassicSimilarity], result of:
            0.052472267 = score(doc=4875,freq=1.0), product of:
              0.11548286 = queryWeight, product of:
                1.2313696 = boost
                5.8159714 = idf(docFreq=344, maxDocs=42596)
                0.016125264 = queryNorm
              0.45437276 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8159714 = idf(docFreq=344, maxDocs=42596)
                0.078125 = fieldNorm(doc=4875)
          0.07193927 = weight(abstract_txt:dokumenten in 4875) [ClassicSimilarity], result of:
            0.07193927 = score(doc=4875,freq=1.0), product of:
              0.14251973 = queryWeight, product of:
                1.3679404 = boost
                6.4610186 = idf(docFreq=180, maxDocs=42596)
                0.016125264 = queryNorm
              0.50476706 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4610186 = idf(docFreq=180, maxDocs=42596)
                0.078125 = fieldNorm(doc=4875)
          0.0875202 = weight(abstract_txt:patent in 4875) [ClassicSimilarity], result of:
            0.0875202 = score(doc=4875,freq=1.0), product of:
              0.16241887 = queryWeight, product of:
                1.4603196 = boost
                6.8973417 = idf(docFreq=116, maxDocs=42596)
                0.016125264 = queryNorm
              0.53885484 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8973417 = idf(docFreq=116, maxDocs=42596)
                0.078125 = fieldNorm(doc=4875)
          0.12403567 = weight(abstract_txt:werkzeug in 4875) [ClassicSimilarity], result of:
            0.12403567 = score(doc=4875,freq=1.0), product of:
              0.20492521 = queryWeight, product of:
                1.6403154 = boost
                7.747493 = idf(docFreq=49, maxDocs=42596)
                0.016125264 = queryNorm
              0.6052729 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.747493 = idf(docFreq=49, maxDocs=42596)
                0.078125 = fieldNorm(doc=4875)
        0.28 = coord(7/25)
    
  4. Panyr, J.: Vektorraum-Modell und Clusteranalyse in Information-Retrieval-Systemen (1987) 0.09
    0.09237484 = sum of:
      0.09237484 = product of:
        0.57734275 = sum of:
          0.040839422 = weight(abstract_txt:diesem in 2322) [ClassicSimilarity], result of:
            0.040839422 = score(doc=2322,freq=1.0), product of:
              0.0780786 = queryWeight, product of:
                1.0125015 = boost
                4.7822194 = idf(docFreq=969, maxDocs=42596)
                0.016125264 = queryNorm
              0.52305526 = fieldWeight in 2322, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7822194 = idf(docFreq=969, maxDocs=42596)
                0.109375 = fieldNorm(doc=2322)
          0.14243247 = weight(abstract_txt:dokumenten in 2322) [ClassicSimilarity], result of:
            0.14243247 = score(doc=2322,freq=2.0), product of:
              0.14251973 = queryWeight, product of:
                1.3679404 = boost
                6.4610186 = idf(docFreq=180, maxDocs=42596)
                0.016125264 = queryNorm
              0.99938774 = fieldWeight in 2322, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4610186 = idf(docFreq=180, maxDocs=42596)
                0.109375 = fieldNorm(doc=2322)
          0.12487063 = weight(abstract_txt:automatische in 2322) [ClassicSimilarity], result of:
            0.12487063 = score(doc=2322,freq=1.0), product of:
              0.1644823 = queryWeight, product of:
                1.4695666 = boost
                6.9410167 = idf(docFreq=111, maxDocs=42596)
                0.016125264 = queryNorm
              0.7591737 = fieldWeight in 2322, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9410167 = idf(docFreq=111, maxDocs=42596)
                0.109375 = fieldNorm(doc=2322)
          0.26920024 = weight(abstract_txt:clustering in 2322) [ClassicSimilarity], result of:
            0.26920024 = score(doc=2322,freq=1.0), product of:
              0.39588556 = queryWeight, product of:
                3.9488907 = boost
                6.2170978 = idf(docFreq=230, maxDocs=42596)
                0.016125264 = queryNorm
              0.67999506 = fieldWeight in 2322, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2170978 = idf(docFreq=230, maxDocs=42596)
                0.109375 = fieldNorm(doc=2322)
        0.16 = coord(4/25)
    
  5. CAS und CSA vereinbaren die Bereitstellung weiterer Datenbanken auf STN International (2004) 0.09
    0.08609906 = sum of:
      0.08609906 = product of:
        0.35874608 = sum of:
          0.06286462 = weight(abstract_txt:datenbanken in 3357) [ClassicSimilarity], result of:
            0.06286462 = score(doc=3357,freq=6.0), product of:
              0.1137989 = queryWeight, product of:
                1.2223588 = boost
                5.7734118 = idf(docFreq=359, maxDocs=42596)
                0.016125264 = queryNorm
              0.5524185 = fieldWeight in 3357, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.7734118 = idf(docFreq=359, maxDocs=42596)
                0.0390625 = fieldNorm(doc=3357)
          0.026236134 = weight(abstract_txt:beispiel in 3357) [ClassicSimilarity], result of:
            0.026236134 = score(doc=3357,freq=1.0), product of:
              0.11548286 = queryWeight, product of:
                1.2313696 = boost
                5.8159714 = idf(docFreq=344, maxDocs=42596)
                0.016125264 = queryNorm
              0.22718638 = fieldWeight in 3357, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8159714 = idf(docFreq=344, maxDocs=42596)
                0.0390625 = fieldNorm(doc=3357)
          0.0266771 = weight(abstract_txt:drei in 3357) [ClassicSimilarity], result of:
            0.0266771 = score(doc=3357,freq=1.0), product of:
              0.116773255 = queryWeight, product of:
                1.2382301 = boost
                5.848375 = idf(docFreq=333, maxDocs=42596)
                0.016125264 = queryNorm
              0.22845215 = fieldWeight in 3357, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.848375 = idf(docFreq=333, maxDocs=42596)
                0.0390625 = fieldNorm(doc=3357)
          0.058695585 = weight(abstract_txt:schwerpunkt in 3357) [ClassicSimilarity], result of:
            0.058695585 = score(doc=3357,freq=2.0), product of:
              0.15678744 = queryWeight, product of:
                1.43478 = boost
                6.776714 = idf(docFreq=131, maxDocs=42596)
                0.016125264 = queryNorm
              0.37436408 = fieldWeight in 3357, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.776714 = idf(docFreq=131, maxDocs=42596)
                0.0390625 = fieldNorm(doc=3357)
          0.0437601 = weight(abstract_txt:patent in 3357) [ClassicSimilarity], result of:
            0.0437601 = score(doc=3357,freq=1.0), product of:
              0.16241887 = queryWeight, product of:
                1.4603196 = boost
                6.8973417 = idf(docFreq=116, maxDocs=42596)
                0.016125264 = queryNorm
              0.26942742 = fieldWeight in 3357, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8973417 = idf(docFreq=116, maxDocs=42596)
                0.0390625 = fieldNorm(doc=3357)
          0.14051256 = weight(abstract_txt:karlsruhe in 3357) [ClassicSimilarity], result of:
            0.14051256 = score(doc=3357,freq=6.0), product of:
              0.19454078 = queryWeight, product of:
                1.5982143 = boost
                7.5486417 = idf(docFreq=60, maxDocs=42596)
                0.016125264 = queryNorm
              0.7222781 = fieldWeight in 3357, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.5486417 = idf(docFreq=60, maxDocs=42596)
                0.0390625 = fieldNorm(doc=3357)
        0.24 = coord(6/25)