Document (#30977)

Author
Pfister, J.
Title
Clustering von Patent-Dokumenten am Beispiel der Datenbanken des Fachinformationszentrums Karlsruhe
Source
Effektive Information Retrieval Verfahren in Theorie und Praxis: ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005. Ed.: T. Mandl and C. Womser-Hacker
Imprint
Konstanz : UVK Verlagsgesellschaft
Year
2006
Pages
pp. 129-146
Series
Schriften zur Informationswissenschaft; Vol. 45
Abstract
This article, situated in the application area of patent searching and patent information, examines the automatic grouping of patent documents (so-called clustering) as a tool for preparing the result set of a database query. The focus is on evaluating three clustering methods by means of user ratings.
Theme
Automatisches Klassifizieren
Field
Patentinformation

Similar documents (author)

  1. Pfister, D. Schmidt- => Schmidt-Pfister, D.: 4.98
    4.982081 = sum of:
      4.982081 = weight(author_txt:pfister in 5982) [ClassicSimilarity], result of:
        4.982081 = fieldWeight in 5982, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.375 = fieldNorm(doc=5982)
    
  2. Pfister, R.-D.: Ware oder öffentliches Gut? : Über den Charakter von Information; am Beispiel Internet (1994) 4.70
    4.697151 = sum of:
      4.697151 = weight(author_txt:pfister in 59) [ClassicSimilarity], result of:
        4.697151 = fieldWeight in 59, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.5 = fieldNorm(doc=59)
    
  3. Pfister, R.-D.: Neue Produkte auf der Basis von Multimedia (1995) 4.70
    4.697151 = sum of:
      4.697151 = weight(author_txt:pfister in 1391) [ClassicSimilarity], result of:
        4.697151 = fieldWeight in 1391, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.5 = fieldNorm(doc=1391)
    
  4. Pfister, H.-R.: Eröffnung des CSCL-Kompetenzzentrums am GMD-IPSI in Darmstadt : Kooperatives computerunterstütztes Lernen (CSCL) - Was ist das und wozu nützt es? (2000) 4.70
    4.697151 = sum of:
      4.697151 = weight(author_txt:pfister in 4833) [ClassicSimilarity], result of:
        4.697151 = fieldWeight in 4833, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.5 = fieldNorm(doc=4833)
    
  5. Hangel, N.; Schmidt-Pfister, D.: Why do you publish? : on the tensions between generating scientific knowledge and publication pressure (2017) 4.11
    4.1100073 = sum of:
      4.1100073 = weight(author_txt:pfister in 4054) [ClassicSimilarity], result of:
        4.1100073 = fieldWeight in 4054, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.4375 = fieldNorm(doc=4054)
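
The author scores above can be reproduced by hand. The following is a minimal sketch, assuming the classic Lucene TF-IDF definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which match the values printed in the explain trees. The function names are hypothetical and are not part of any Lucene API; the fieldNorm values are taken directly from the listing rather than recomputed.

```python
import math

def classic_tf(freq: float) -> float:
    # Term-frequency factor: square root of the raw term frequency.
    return math.sqrt(freq)

def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Inverse document frequency, in the form that reproduces the listing
    # (assumed formula: 1 + ln(maxDocs / (docFreq + 1))).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm, as in the explain tree.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Hit 1 (doc 5982): "pfister" occurs twice in the author field, fieldNorm 0.375.
hit1 = field_weight(freq=2.0, doc_freq=9, max_docs=44218, field_norm=0.375)

# Hit 2 (doc 59): "pfister" occurs once, fieldNorm 0.5.
hit2 = field_weight(freq=1.0, doc_freq=9, max_docs=44218, field_norm=0.5)

print(round(hit1, 4), round(hit2, 4))
```

Because the author query has a single term, the document score equals the fieldWeight; no queryNorm or coord factor appears in these trees.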
    

Similar documents (content)

  1. Schramm, R.: Patentinformation (2004) 0.23
    0.2325191 = sum of:
      0.2325191 = product of:
        1.9376593 = sum of:
          0.043202017 = weight(abstract_txt:verfahren in 2955) [ClassicSimilarity], result of:
            0.043202017 = score(doc=2955,freq=2.0), product of:
              0.11310324 = queryWeight, product of:
                1.2292309 = boost
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.015968675 = queryNorm
              0.38196975 = fieldWeight in 2955, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.046875 = fieldNorm(doc=2955)
          0.07503632 = weight(abstract_txt:patent in 2955) [ClassicSimilarity], result of:
            0.07503632 = score(doc=2955,freq=2.0), product of:
              0.16342558 = queryWeight, product of:
                1.4775969 = boost
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.015968675 = queryNorm
              0.4591467 = fieldWeight in 2955, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.046875 = fieldNorm(doc=2955)
          1.8194209 = weight(title_txt:patentinformation in 2955) [ClassicSimilarity], result of:
            1.8194209 = score(doc=2955,freq=1.0), product of:
              0.22424977 = queryWeight, product of:
                1.73086 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.015968675 = queryNorm
              8.113368 = fieldWeight in 2955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                1.0 = fieldNorm(doc=2955)
        0.12 = coord(3/25)
    
  2. STN baut Patentinformation aus (2004) 0.17
    0.17019778 = sum of:
      0.17019778 = product of:
        1.0637362 = sum of:
          0.036088873 = weight(abstract_txt:datenbanken in 2304) [ClassicSimilarity], result of:
            0.036088873 = score(doc=2304,freq=1.0), product of:
              0.11405125 = queryWeight, product of:
                1.2343718 = boost
                5.7860904 = idf(docFreq=368, maxDocs=44218)
                0.015968675 = queryNorm
              0.3164268 = fieldWeight in 2304, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7860904 = idf(docFreq=368, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2304)
          0.03660541 = weight(abstract_txt:beispiel in 2304) [ClassicSimilarity], result of:
            0.03660541 = score(doc=2304,freq=1.0), product of:
              0.11513694 = queryWeight, product of:
                1.240233 = boost
                5.813565 = idf(docFreq=358, maxDocs=44218)
                0.015968675 = queryNorm
              0.31792933 = fieldWeight in 2304, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.813565 = idf(docFreq=358, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2304)
          0.08133145 = weight(abstract_txt:karlsruhe in 2304) [ClassicSimilarity], result of:
            0.08133145 = score(doc=2304,freq=1.0), product of:
              0.19604547 = queryWeight, product of:
                1.6183571 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.015968675 = queryNorm
              0.4148601 = fieldWeight in 2304, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2304)
          0.90971047 = weight(title_txt:patentinformation in 2304) [ClassicSimilarity], result of:
            0.90971047 = score(doc=2304,freq=1.0), product of:
              0.22424977 = queryWeight, product of:
                1.73086 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.015968675 = queryNorm
              4.056684 = fieldWeight in 2304, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.5 = fieldNorm(doc=2304)
        0.16 = coord(4/25)
    
  3. Gerick, T.: Content-based Information Retrieval auf Basis semantischer Abfragenetze : Kooperative Technologien am Beispiel der Dokumentenrecherche in GENIOS Wirtschaftsdatenbanken (1999) 0.13
    0.13060716 = sum of:
      0.13060716 = product of:
        0.46645415 = sum of:
          0.027411789 = weight(abstract_txt:dabei in 3874) [ClassicSimilarity], result of:
            0.027411789 = score(doc=3874,freq=1.0), product of:
              0.07485281 = queryWeight, product of:
                4.687478 = idf(docFreq=1106, maxDocs=44218)
                0.015968675 = queryNorm
              0.3662092 = fieldWeight in 3874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.687478 = idf(docFreq=1106, maxDocs=44218)
                0.078125 = fieldNorm(doc=3874)
          0.050914068 = weight(abstract_txt:verfahren in 3874) [ClassicSimilarity], result of:
            0.050914068 = score(doc=3874,freq=1.0), product of:
              0.11310324 = queryWeight, product of:
                1.2292309 = boost
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.015968675 = queryNorm
              0.4501557 = fieldWeight in 3874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.761993 = idf(docFreq=377, maxDocs=44218)
                0.078125 = fieldNorm(doc=3874)
          0.051555537 = weight(abstract_txt:datenbanken in 3874) [ClassicSimilarity], result of:
            0.051555537 = score(doc=3874,freq=1.0), product of:
              0.11405125 = queryWeight, product of:
                1.2343718 = boost
                5.7860904 = idf(docFreq=368, maxDocs=44218)
                0.015968675 = queryNorm
              0.45203832 = fieldWeight in 3874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7860904 = idf(docFreq=368, maxDocs=44218)
                0.078125 = fieldNorm(doc=3874)
          0.05229344 = weight(abstract_txt:beispiel in 3874) [ClassicSimilarity], result of:
            0.05229344 = score(doc=3874,freq=1.0), product of:
              0.11513694 = queryWeight, product of:
                1.240233 = boost
                5.813565 = idf(docFreq=358, maxDocs=44218)
                0.015968675 = queryNorm
              0.45418474 = fieldWeight in 3874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.813565 = idf(docFreq=358, maxDocs=44218)
                0.078125 = fieldNorm(doc=3874)
          0.07123778 = weight(abstract_txt:dokumenten in 3874) [ClassicSimilarity], result of:
            0.07123778 = score(doc=3874,freq=1.0), product of:
              0.14148925 = queryWeight, product of:
                1.3748574 = boost
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.015968675 = queryNorm
              0.50348544 = fieldWeight in 3874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.078125 = fieldNorm(doc=3874)
          0.08843114 = weight(abstract_txt:patent in 3874) [ClassicSimilarity], result of:
            0.08843114 = score(doc=3874,freq=1.0), product of:
              0.16342558 = queryWeight, product of:
                1.4775969 = boost
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.015968675 = queryNorm
              0.54110956 = fieldWeight in 3874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.078125 = fieldNorm(doc=3874)
          0.12461041 = weight(abstract_txt:werkzeug in 3874) [ClassicSimilarity], result of:
            0.12461041 = score(doc=3874,freq=1.0), product of:
              0.20540898 = queryWeight, product of:
                1.6565542 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.015968675 = queryNorm
              0.6066454 = fieldWeight in 3874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.078125 = fieldNorm(doc=3874)
        0.28 = coord(7/25)
    
  4. Panyr, J.: Vektorraum-Modell und Clusteranalyse in Information-Retrieval-Systemen (1987) 0.09
    0.09166864 = sum of:
      0.09166864 = product of:
        0.572929 = sum of:
          0.040027708 = weight(abstract_txt:diesem in 2322) [ClassicSimilarity], result of:
            0.040027708 = score(doc=2322,freq=1.0), product of:
              0.0769848 = queryWeight, product of:
                1.0141412 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.015968675 = queryNorm
              0.519943 = fieldWeight in 2322, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.109375 = fieldNorm(doc=2322)
          0.14104362 = weight(abstract_txt:dokumenten in 2322) [ClassicSimilarity], result of:
            0.14104362 = score(doc=2322,freq=2.0), product of:
              0.14148925 = queryWeight, product of:
                1.3748574 = boost
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.015968675 = queryNorm
              0.9968504 = fieldWeight in 2322, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.109375 = fieldNorm(doc=2322)
          0.12335162 = weight(abstract_txt:automatische in 2322) [ClassicSimilarity], result of:
            0.12335162 = score(doc=2322,freq=1.0), product of:
              0.16302757 = queryWeight, product of:
                1.4757965 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.015968675 = queryNorm
              0.7566304 = fieldWeight in 2322, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.109375 = fieldNorm(doc=2322)
          0.26850605 = weight(abstract_txt:clustering in 2322) [ClassicSimilarity], result of:
            0.26850605 = score(doc=2322,freq=1.0), product of:
              0.39491865 = queryWeight, product of:
                3.9784179 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.015968675 = queryNorm
              0.6799022 = fieldWeight in 2322, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.109375 = fieldNorm(doc=2322)
        0.16 = coord(4/25)
    
  5. Geiß, D.: Aus der Praxis der Patentinformation : Teil 1: Übersicht über die Entwicklung der elektronischen Medien bei Patentbehörden (2004) 0.09
    0.08664251 = sum of:
      0.08664251 = product of:
        0.5415157 = sum of:
          0.016447073 = weight(abstract_txt:dabei in 2366) [ClassicSimilarity], result of:
            0.016447073 = score(doc=2366,freq=1.0), product of:
              0.07485281 = queryWeight, product of:
                4.687478 = idf(docFreq=1106, maxDocs=44218)
                0.015968675 = queryNorm
              0.21972553 = fieldWeight in 2366, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.687478 = idf(docFreq=1106, maxDocs=44218)
                0.046875 = fieldNorm(doc=2366)
          0.01715473 = weight(abstract_txt:diesem in 2366) [ClassicSimilarity], result of:
            0.01715473 = score(doc=2366,freq=1.0), product of:
              0.0769848 = queryWeight, product of:
                1.0141412 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.015968675 = queryNorm
              0.22283271 = fieldWeight in 2366, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.046875 = fieldNorm(doc=2366)
          0.053058688 = weight(abstract_txt:patent in 2366) [ClassicSimilarity], result of:
            0.053058688 = score(doc=2366,freq=1.0), product of:
              0.16342558 = queryWeight, product of:
                1.4775969 = boost
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.015968675 = queryNorm
              0.32466576 = fieldWeight in 2366, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.046875 = fieldNorm(doc=2366)
          0.45485523 = weight(title_txt:patentinformation in 2366) [ClassicSimilarity], result of:
            0.45485523 = score(doc=2366,freq=1.0), product of:
              0.22424977 = queryWeight, product of:
                1.73086 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.015968675 = queryNorm
              2.028342 = fieldWeight in 2366, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.25 = fieldNorm(doc=2366)
        0.16 = coord(4/25)
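
The content scores combine several matching terms. As a sketch under the same assumed ClassicSimilarity definitions, each term contributes queryWeight * fieldWeight, where queryWeight = boost * idf * queryNorm and fieldWeight = tf * idf * fieldNorm; the sum is then multiplied by coord, the fraction of the 25 query terms that matched. The helper names below are hypothetical, and the boost, queryNorm and fieldNorm values are copied from the first content hit (doc 2955) rather than derived.

```python
import math

QUERY_NORM = 0.015968675   # queryNorm as printed in the listing
MAX_DOCS = 44218
QUERY_TERMS = 25           # denominator of coord(n/25)

def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    # Assumed formula that reproduces the listed idf values.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    # Score of one matching term: queryWeight * fieldWeight.
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight

# (boost, docFreq, freq, fieldNorm) for the three terms matched in doc 2955.
matched_terms = [
    (1.2292309, 377, 2.0, 0.046875),  # abstract_txt:verfahren
    (1.4775969, 117, 2.0, 0.046875),  # abstract_txt:patent
    (1.73086, 35, 1.0, 1.0),          # title_txt:patentinformation
]

coord = len(matched_terms) / QUERY_TERMS
score = coord * sum(term_score(*t) for t in matched_terms)

print(round(score, 4))
```

The title match dominates the sum because its fieldNorm is 1.0, while the abstract matches are damped by a fieldNorm of 0.046875; the coord factor of 3/25 then scales the total down to the listed 0.23.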