Document (#30978)

Author
Pfister, J.
Title
Clustering von Patent-Dokumenten am Beispiel der Datenbanken des Fachinformationszentrums Karlsruhe
Source
Effektive Information Retrieval Verfahren in Theorie und Praxis: ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005. Hrsg.: T. Mandl u. C. Womser-Hacker
Imprint
Konstanz : UVK Verlagsgesellschaft
Year
2006
Pages
S.129-146
Series
Schriften zur Informationswissenschaft; Bd.45
Abstract
In diesem Artikel, der im Anwendungsbereich der Patentrecherche und Patentinformation angesiedelt ist, wird das automatische Gruppieren von Patentdokumenten - das so genannte Clustering - als ein Werkzeug zur Aufbereitung der Ergebnismenge einer Datenbankanfrage untersucht. Der Schwerpunkt liegt dabei auf der Evaluierung von drei Clustering-Verfahren mittels Nutzerbewertungen.
Theme
Automatisches Klassifizieren
Field
Patentinformation

Similar documents (author)

  1. Pfister, D. Schmidt- => Schmidt-Pfister, D.: 4.96
    4.9640517 = sum of:
      4.9640517 = weight(author_txt:pfister in 898) [ClassicSimilarity], result of:
        4.9640517 = fieldWeight in 898, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          9.360306 = idf(docFreq=9, maxDocs=42740)
          0.375 = fieldNorm(doc=898)
    
  2. Pfister, R.-D.: Ware oder öffentliches Gut? : Über den Charakter von Information; am Beispiel Internet (1994) 4.68
    4.680153 = sum of:
      4.680153 = weight(author_txt:pfister in 128) [ClassicSimilarity], result of:
        4.680153 = fieldWeight in 128, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.360306 = idf(docFreq=9, maxDocs=42740)
          0.5 = fieldNorm(doc=128)
    
  3. Pfister, R.-D.: Neue Produkte auf der Basis von Multimedia (1995) 4.68
    4.680153 = sum of:
      4.680153 = weight(author_txt:pfister in 1460) [ClassicSimilarity], result of:
        4.680153 = fieldWeight in 1460, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.360306 = idf(docFreq=9, maxDocs=42740)
          0.5 = fieldNorm(doc=1460)
    
  4. Pfister, H.-R.: Eröffnung des CSCL-Kompetenzzentrums am GMD-IPSI in Darmstadt : Kooperatives computerunterstütztes Lernen (CSCL) - Was ist das und wozu nützt es? (2000) 4.68
    4.680153 = sum of:
      4.680153 = weight(author_txt:pfister in 5834) [ClassicSimilarity], result of:
        4.680153 = fieldWeight in 5834, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.360306 = idf(docFreq=9, maxDocs=42740)
          0.5 = fieldNorm(doc=5834)
    
  5. Hangel, N.; Schmidt-Pfister, D.: Why do you publish? : on the tensions between generating scientific knowledge and publication pressure (2017) 4.10
    4.095134 = sum of:
      4.095134 = weight(author_txt:pfister in 55) [ClassicSimilarity], result of:
        4.095134 = fieldWeight in 55, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.360306 = idf(docFreq=9, maxDocs=42740)
          0.4375 = fieldNorm(doc=55)
    

Similar documents (content)

  1. Schramm, R.: Patentinformation (2004) 0.23
    0.23011862 = sum of:
      0.23011862 = product of:
        1.9176552 = sum of:
          0.043548472 = weight(abstract_txt:verfahren in 3956) [ClassicSimilarity], result of:
            0.043548472 = score(doc=3956,freq=2.0), product of:
              0.113772936 = queryWeight, product of:
                1.2226167 = boost
                5.7740126 = idf(docFreq=360, maxDocs=42740)
                0.016116507 = queryNorm
              0.38276654 = fieldWeight in 3956, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7740126 = idf(docFreq=360, maxDocs=42740)
                0.046875 = fieldNorm(doc=3956)
          0.07433997 = weight(abstract_txt:patent in 3956) [ClassicSimilarity], result of:
            0.07433997 = score(doc=3956,freq=2.0), product of:
              0.16250692 = queryWeight, product of:
                1.4611902 = boost
                6.900717 = idf(docFreq=116, maxDocs=42740)
                0.016116507 = queryNorm
              0.45745724 = fieldWeight in 3956, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.900717 = idf(docFreq=116, maxDocs=42740)
                0.046875 = fieldNorm(doc=3956)
          1.7997668 = weight(title_txt:patentinformation in 3956) [ClassicSimilarity], result of:
            1.7997668 = score(doc=3956,freq=1.0), product of:
              0.22276074 = queryWeight, product of:
                1.7107642 = boost
                8.079371 = idf(docFreq=35, maxDocs=42740)
                0.016116507 = queryNorm
              8.079371 = fieldWeight in 3956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.079371 = idf(docFreq=35, maxDocs=42740)
                1.0 = fieldNorm(doc=3956)
        0.12 = coord(3/25)
    
  2. STN baut Patentinformation aus (2004) 0.17
    0.16844171 = sum of:
      0.16844171 = product of:
        1.0527607 = sum of:
          0.035771403 = weight(abstract_txt:datenbanken in 3305) [ClassicSimilarity], result of:
            0.035771403 = score(doc=3305,freq=1.0), product of:
              0.11344702 = queryWeight, product of:
                1.2208642 = boost
                5.7657366 = idf(docFreq=363, maxDocs=42740)
                0.016116507 = queryNorm
              0.31531373 = fieldWeight in 3305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7657366 = idf(docFreq=363, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3305)
          0.036723655 = weight(abstract_txt:beispiel in 3305) [ClassicSimilarity], result of:
            0.036723655 = score(doc=3305,freq=1.0), product of:
              0.11545154 = queryWeight, product of:
                1.2316028 = boost
                5.8164515 = idf(docFreq=345, maxDocs=42740)
                0.016116507 = queryNorm
              0.3180872 = fieldWeight in 3305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8164515 = idf(docFreq=345, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3305)
          0.080382295 = weight(abstract_txt:karlsruhe in 3305) [ClassicSimilarity], result of:
            0.080382295 = score(doc=3305,freq=1.0), product of:
              0.19462983 = queryWeight, product of:
                1.5990996 = boost
                7.5520167 = idf(docFreq=60, maxDocs=42740)
                0.016116507 = queryNorm
              0.4130009 = fieldWeight in 3305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5520167 = idf(docFreq=60, maxDocs=42740)
                0.0546875 = fieldNorm(doc=3305)
          0.8998834 = weight(title_txt:patentinformation in 3305) [ClassicSimilarity], result of:
            0.8998834 = score(doc=3305,freq=1.0), product of:
              0.22276074 = queryWeight, product of:
                1.7107642 = boost
                8.079371 = idf(docFreq=35, maxDocs=42740)
                0.016116507 = queryNorm
              4.0396857 = fieldWeight in 3305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.079371 = idf(docFreq=35, maxDocs=42740)
                0.5 = fieldNorm(doc=3305)
        0.16 = coord(4/25)
    
  3. Gerick, T.: Content-based Information Retrieval auf Basis semantischer Abfragenetze : Kooperative Technologien am Beispsiel der Dokumentenrecherche in GENIOS Wirtschaftsdatenbanken (1999) 0.13
    0.1306368 = sum of:
      0.1306368 = product of:
        0.46655998 = sum of:
          0.028082505 = weight(abstract_txt:dabei in 4875) [ClassicSimilarity], result of:
            0.028082505 = score(doc=4875,freq=1.0), product of:
              0.07611292 = queryWeight, product of:
                4.722668 = idf(docFreq=1032, maxDocs=42740)
                0.016116507 = queryNorm
              0.36895844 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.722668 = idf(docFreq=1032, maxDocs=42740)
                0.078125 = fieldNorm(doc=4875)
          0.051102 = weight(abstract_txt:datenbanken in 4875) [ClassicSimilarity], result of:
            0.051102 = score(doc=4875,freq=1.0), product of:
              0.11344702 = queryWeight, product of:
                1.2208642 = boost
                5.7657366 = idf(docFreq=363, maxDocs=42740)
                0.016116507 = queryNorm
              0.45044816 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7657366 = idf(docFreq=363, maxDocs=42740)
                0.078125 = fieldNorm(doc=4875)
          0.051322374 = weight(abstract_txt:verfahren in 4875) [ClassicSimilarity], result of:
            0.051322374 = score(doc=4875,freq=1.0), product of:
              0.113772936 = queryWeight, product of:
                1.2226167 = boost
                5.7740126 = idf(docFreq=360, maxDocs=42740)
                0.016116507 = queryNorm
              0.45109475 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7740126 = idf(docFreq=360, maxDocs=42740)
                0.078125 = fieldNorm(doc=4875)
          0.052462365 = weight(abstract_txt:beispiel in 4875) [ClassicSimilarity], result of:
            0.052462365 = score(doc=4875,freq=1.0), product of:
              0.11545154 = queryWeight, product of:
                1.2316028 = boost
                5.8164515 = idf(docFreq=345, maxDocs=42740)
                0.016116507 = queryNorm
              0.45441028 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8164515 = idf(docFreq=345, maxDocs=42740)
                0.078125 = fieldNorm(doc=4875)
          0.07183663 = weight(abstract_txt:dokumenten in 4875) [ClassicSimilarity], result of:
            0.07183663 = score(doc=4875,freq=1.0), product of:
              0.14236343 = queryWeight, product of:
                1.3676344 = boost
                6.458884 = idf(docFreq=181, maxDocs=42740)
                0.016116507 = queryNorm
              0.5046003 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.458884 = idf(docFreq=181, maxDocs=42740)
                0.078125 = fieldNorm(doc=4875)
          0.08761049 = weight(abstract_txt:patent in 4875) [ClassicSimilarity], result of:
            0.08761049 = score(doc=4875,freq=1.0), product of:
              0.16250692 = queryWeight, product of:
                1.4611902 = boost
                6.900717 = idf(docFreq=116, maxDocs=42740)
                0.016116507 = queryNorm
              0.5391185 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.900717 = idf(docFreq=116, maxDocs=42740)
                0.078125 = fieldNorm(doc=4875)
          0.12414364 = weight(abstract_txt:werkzeug in 4875) [ClassicSimilarity], result of:
            0.12414364 = score(doc=4875,freq=1.0), product of:
              0.20501429 = queryWeight, product of:
                1.6412052 = boost
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.016116507 = queryNorm
              0.6055365 = fieldWeight in 4875, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7508674 = idf(docFreq=49, maxDocs=42740)
                0.078125 = fieldNorm(doc=4875)
        0.28 = coord(7/25)
    
  4. Panyr, J.: Vektorraum-Modell und Clusteranalyse in Information-Retrieval-Systemen (1987) 0.09
    0.09231058 = sum of:
      0.09231058 = product of:
        0.57694113 = sum of:
          0.040671714 = weight(abstract_txt:diesem in 2322) [ClassicSimilarity], result of:
            0.040671714 = score(doc=2322,freq=1.0), product of:
              0.077853374 = queryWeight, product of:
                1.0113688 = boost
                4.776359 = idf(docFreq=978, maxDocs=42740)
                0.016116507 = queryNorm
              0.52241427 = fieldWeight in 2322, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.776359 = idf(docFreq=978, maxDocs=42740)
                0.109375 = fieldNorm(doc=2322)
          0.14222927 = weight(abstract_txt:dokumenten in 2322) [ClassicSimilarity], result of:
            0.14222927 = score(doc=2322,freq=2.0), product of:
              0.14236343 = queryWeight, product of:
                1.3676344 = boost
                6.458884 = idf(docFreq=181, maxDocs=42740)
                0.016116507 = queryNorm
              0.99905765 = fieldWeight in 2322, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.458884 = idf(docFreq=181, maxDocs=42740)
                0.109375 = fieldNorm(doc=2322)
          0.12451892 = weight(abstract_txt:automatische in 2322) [ClassicSimilarity], result of:
            0.12451892 = score(doc=2322,freq=1.0), product of:
              0.16414942 = queryWeight, product of:
                1.4685559 = boost
                6.9355025 = idf(docFreq=112, maxDocs=42740)
                0.016116507 = queryNorm
              0.7585706 = fieldWeight in 2322, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9355025 = idf(docFreq=112, maxDocs=42740)
                0.109375 = fieldNorm(doc=2322)
          0.26952124 = weight(abstract_txt:clustering in 2322) [ClassicSimilarity], result of:
            0.26952124 = score(doc=2322,freq=1.0), product of:
              0.3961426 = queryWeight, product of:
                3.9514568 = boost
                6.220473 = idf(docFreq=230, maxDocs=42740)
                0.016116507 = queryNorm
              0.6803642 = fieldWeight in 2322, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.220473 = idf(docFreq=230, maxDocs=42740)
                0.109375 = fieldNorm(doc=2322)
        0.16 = coord(4/25)
    
  5. CAS und CSA vereinbaren die Bereitstellung weiterer Datenbanken auf STN International (2004) 0.09
    0.08597281 = sum of:
      0.08597281 = product of:
        0.35822004 = sum of:
          0.06258691 = weight(abstract_txt:datenbanken in 3357) [ClassicSimilarity], result of:
            0.06258691 = score(doc=3357,freq=6.0), product of:
              0.11344702 = queryWeight, product of:
                1.2208642 = boost
                5.7657366 = idf(docFreq=363, maxDocs=42740)
                0.016116507 = queryNorm
              0.5516841 = fieldWeight in 3357, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.7657366 = idf(docFreq=363, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3357)
          0.026231183 = weight(abstract_txt:beispiel in 3357) [ClassicSimilarity], result of:
            0.026231183 = score(doc=3357,freq=1.0), product of:
              0.11545154 = queryWeight, product of:
                1.2316028 = boost
                5.8164515 = idf(docFreq=345, maxDocs=42740)
                0.016116507 = queryNorm
              0.22720514 = fieldWeight in 3357, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8164515 = idf(docFreq=345, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3357)
          0.026589388 = weight(abstract_txt:drei in 3357) [ClassicSimilarity], result of:
            0.026589388 = score(doc=3357,freq=1.0), product of:
              0.11650021 = queryWeight, product of:
                1.2371837 = boost
                5.842808 = idf(docFreq=336, maxDocs=42740)
                0.016116507 = queryNorm
              0.22823468 = fieldWeight in 3357, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.842808 = idf(docFreq=336, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3357)
          0.058367588 = weight(abstract_txt:schwerpunkt in 3357) [ClassicSimilarity], result of:
            0.058367588 = score(doc=3357,freq=2.0), product of:
              0.15618008 = queryWeight, product of:
                1.4324638 = boost
                6.765051 = idf(docFreq=133, maxDocs=42740)
                0.016116507 = queryNorm
              0.37371978 = fieldWeight in 3357, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.765051 = idf(docFreq=133, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3357)
          0.043805245 = weight(abstract_txt:patent in 3357) [ClassicSimilarity], result of:
            0.043805245 = score(doc=3357,freq=1.0), product of:
              0.16250692 = queryWeight, product of:
                1.4611902 = boost
                6.900717 = idf(docFreq=116, maxDocs=42740)
                0.016116507 = queryNorm
              0.26955926 = fieldWeight in 3357, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.900717 = idf(docFreq=116, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3357)
          0.14063974 = weight(abstract_txt:karlsruhe in 3357) [ClassicSimilarity], result of:
            0.14063974 = score(doc=3357,freq=6.0), product of:
              0.19462983 = queryWeight, product of:
                1.5990996 = boost
                7.5520167 = idf(docFreq=60, maxDocs=42740)
                0.016116507 = queryNorm
              0.7226011 = fieldWeight in 3357, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.5520167 = idf(docFreq=60, maxDocs=42740)
                0.0390625 = fieldNorm(doc=3357)
        0.24 = coord(6/25)