Search (29 results, page 1 of 2)

  • × theme_ss:"Automatisches Klassifizieren"
  1. Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.06
    0.06387635 = sum of:
      0.052038666 = product of:
        0.20815466 = sum of:
          0.20815466 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
            0.20815466 = score(doc=562,freq=2.0), product of:
              0.3703701 = queryWeight, product of:
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.043685965 = queryNorm
              0.56201804 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.046875 = fieldNorm(doc=562)
        0.25 = coord(1/4)
      0.01183769 = product of:
        0.03551307 = sum of:
          0.03551307 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
            0.03551307 = score(doc=562,freq=2.0), product of:
              0.1529808 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.043685965 = queryNorm
              0.23214069 = fieldWeight in 562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.046875 = fieldNorm(doc=562)
        0.33333334 = coord(1/3)
    
    Content
    Cf.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
    Date
    8. 1.2013 10:22:32
  2. Liu, R.-L.: Context recognition for hierarchical text classification (2009) 0.03
    0.02708788 = product of:
      0.05417576 = sum of:
        0.05417576 = product of:
          0.08126364 = sum of:
            0.04575057 = weight(_text_:l in 2760) [ClassicSimilarity], result of:
              0.04575057 = score(doc=2760,freq=2.0), product of:
                0.17363653 = queryWeight, product of:
                  3.9746525 = idf(docFreq=2257, maxDocs=44218)
                  0.043685965 = queryNorm
                0.26348472 = fieldWeight in 2760, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.9746525 = idf(docFreq=2257, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2760)
            0.03551307 = weight(_text_:22 in 2760) [ClassicSimilarity], result of:
              0.03551307 = score(doc=2760,freq=2.0), product of:
                0.1529808 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.043685965 = queryNorm
                0.23214069 = fieldWeight in 2760, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2760)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    22. 3.2009 19:11:54
  3. Panyr, J.: Vektorraum-Modell und Clusteranalyse in Information-Retrieval-Systemen (1987) 0.02
    0.024510827 = product of:
      0.049021654 = sum of:
        0.049021654 = product of:
          0.14706495 = sum of:
            0.14706495 = weight(_text_:d.h in 2322) [ClassicSimilarity], result of:
              0.14706495 = score(doc=2322,freq=2.0), product of:
                0.26960507 = queryWeight, product of:
                  6.1714344 = idf(docFreq=250, maxDocs=44218)
                  0.043685965 = queryNorm
                0.5454829 = fieldWeight in 2322, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  6.1714344 = idf(docFreq=250, maxDocs=44218)
                  0.0625 = fieldNorm(doc=2322)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    Starting from theoretical approaches to indexing, the classical vector space model for automatic indexing (together with the discrimination value model) is explained. Clustering in information retrieval systems is treated as a natural logical consequence of this model and is covered in all its forms (i.e., as document classification, term classification, or combined document and term classification). The search strategies in pre-classified document collections (cluster search) are then described in detail. Finally, the meaningful application of cluster analysis in information retrieval systems is briefly discussed.
  4. Liu, R.-L.: ¬A passage extractor for classification of disease aspect information (2013) 0.02
    0.022573236 = product of:
      0.045146473 = sum of:
        0.045146473 = product of:
          0.067719705 = sum of:
            0.038125478 = weight(_text_:l in 1107) [ClassicSimilarity], result of:
              0.038125478 = score(doc=1107,freq=2.0), product of:
                0.17363653 = queryWeight, product of:
                  3.9746525 = idf(docFreq=2257, maxDocs=44218)
                  0.043685965 = queryNorm
                0.2195706 = fieldWeight in 1107, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.9746525 = idf(docFreq=2257, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1107)
            0.029594226 = weight(_text_:22 in 1107) [ClassicSimilarity], result of:
              0.029594226 = score(doc=1107,freq=2.0), product of:
                0.1529808 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.043685965 = queryNorm
                0.19345059 = fieldWeight in 1107, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1107)
          0.6666667 = coord(2/3)
      0.5 = coord(1/2)
    
    Date
    28.10.2013 19:22:57
  5. Zhou, G.D.; Zhang, M.; Ji, D.H.; Zhu, Q.M.: Hierarchical learning strategy in semantic relation extraction (2008) 0.02
    0.01838312 = product of:
      0.03676624 = sum of:
        0.03676624 = product of:
          0.11029871 = sum of:
            0.11029871 = weight(_text_:d.h in 2077) [ClassicSimilarity], result of:
              0.11029871 = score(doc=2077,freq=2.0), product of:
                0.26960507 = queryWeight, product of:
                  6.1714344 = idf(docFreq=250, maxDocs=44218)
                  0.043685965 = queryNorm
                0.40911216 = fieldWeight in 2077, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  6.1714344 = idf(docFreq=250, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2077)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  6. Subramanian, S.; Shafer, K.E.: Clustering (2001) 0.01
    0.01183769 = product of:
      0.02367538 = sum of:
        0.02367538 = product of:
          0.07102614 = sum of:
            0.07102614 = weight(_text_:22 in 1046) [ClassicSimilarity], result of:
              0.07102614 = score(doc=1046,freq=2.0), product of:
                0.1529808 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.043685965 = queryNorm
                0.46428138 = fieldWeight in 1046, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=1046)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    5. 5.2003 14:17:22
  7. Schek, M.: Automatische Klassifizierung und Visualisierung im Archiv der Süddeutschen Zeitung (2005) 0.01
    0.010723486 = product of:
      0.021446971 = sum of:
        0.021446971 = product of:
          0.06434091 = sum of:
            0.06434091 = weight(_text_:d.h in 4884) [ClassicSimilarity], result of:
              0.06434091 = score(doc=4884,freq=2.0), product of:
                0.26960507 = queryWeight, product of:
                  6.1714344 = idf(docFreq=250, maxDocs=44218)
                  0.043685965 = queryNorm
                0.23864876 = fieldWeight in 4884, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  6.1714344 = idf(docFreq=250, maxDocs=44218)
                  0.02734375 = fieldNorm(doc=4884)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    Since its founding in 1945, the Süddeutsche Zeitung (SZ) has maintained a press archive that documents the texts of its own editors and of numerous national and international publications and makes them available on request for research purposes. The introduction of electronic data processing began in the early 1990s with the digital storage of, initially, the SZ data. Further technical development from the mid-1990s onward served two goals: (1) the complete switch from paper filing to digital storage and (2) the transformation from an in-house documentation and information office into an information service provider that is also present on the market. To spread the resulting costs and at the same time exploit synergies between archives with related content, the Süddeutsche Verlag and the Bayerische Rundfunk founded the Dokumentations- und Informationszentrum (DIZ) München GmbH in 1998, in which the press archives of the two shareholders and the picture archive of the Süddeutsche Verlag were merged. The jointly developed press database made possible cross-site indexing, browser-based research for editors and external customers on the intranet and the Internet, and customer-specific content feeds for publishers, broadcasters, and portals. The DIZ press database currently contains 6.9 million articles, each retrievable as HTML or PDF. About 3,500 articles are added daily, of which about 1,000 are indexed. Indexing at the DIZ is done not by assigning subject headings to the document but by linking the articles to "virtual folders", the dossiers. These are the electronic representation of a paper folder and are the central object of indexing. In contrast to static classification systems, the dossier structure is dynamic and depends on the volume of coverage, i.e., new dossiers are created mainly on the basis of current reporting.
In total, the DIZ press database contains about 90,000 dossiers, of which 68,000 are subject topics (Topics), persons, and institutions. The dossiers are linked to one another to form the "DIZ-Wissensnetz" (DIZ knowledge network).
  8. Reiner, U.: Automatische DDC-Klassifizierung von bibliografischen Titeldatensätzen (2009) 0.01
    0.009864742 = product of:
      0.019729484 = sum of:
        0.019729484 = product of:
          0.05918845 = sum of:
            0.05918845 = weight(_text_:22 in 611) [ClassicSimilarity], result of:
              0.05918845 = score(doc=611,freq=2.0), product of:
                0.1529808 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.043685965 = queryNorm
                0.38690117 = fieldWeight in 611, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=611)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    22. 8.2009 12:54:24
  9. HaCohen-Kerner, Y. et al.: Classification using various machine learning methods and combinations of key-phrases and visual features (2016) 0.01
    0.009864742 = product of:
      0.019729484 = sum of:
        0.019729484 = product of:
          0.05918845 = sum of:
            0.05918845 = weight(_text_:22 in 2748) [ClassicSimilarity], result of:
              0.05918845 = score(doc=2748,freq=2.0), product of:
                0.1529808 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.043685965 = queryNorm
                0.38690117 = fieldWeight in 2748, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2748)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    1. 2.2016 18:25:22
  10. Huang, Y.-L.: ¬A theoretic and empirical research of cluster indexing for Mandarine Chinese full text document (1998) 0.01
    0.008895945 = product of:
      0.01779189 = sum of:
        0.01779189 = product of:
          0.05337567 = sum of:
            0.05337567 = weight(_text_:l in 513) [ClassicSimilarity], result of:
              0.05337567 = score(doc=513,freq=2.0), product of:
                0.17363653 = queryWeight, product of:
                  3.9746525 = idf(docFreq=2257, maxDocs=44218)
                  0.043685965 = queryNorm
                0.30739886 = fieldWeight in 513, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.9746525 = idf(docFreq=2257, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=513)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  11. Koch, T.; Ardö, A.; Noodén, L.: ¬The construction of a robot-generated subject index : DESIRE II D3.6a, Working Paper 1 (1999) 0.01
    0.007625095 = product of:
      0.01525019 = sum of:
        0.01525019 = product of:
          0.04575057 = sum of:
            0.04575057 = weight(_text_:l in 1668) [ClassicSimilarity], result of:
              0.04575057 = score(doc=1668,freq=2.0), product of:
                0.17363653 = queryWeight, product of:
                  3.9746525 = idf(docFreq=2257, maxDocs=44218)
                  0.043685965 = queryNorm
                0.26348472 = fieldWeight in 1668, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.9746525 = idf(docFreq=2257, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1668)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  12. Hung, C.-M.; Chien, L.-F.: Web-based text classification in the absence of manually labeled training documents (2007) 0.01
    0.007625095 = product of:
      0.01525019 = sum of:
        0.01525019 = product of:
          0.04575057 = sum of:
            0.04575057 = weight(_text_:l in 87) [ClassicSimilarity], result of:
              0.04575057 = score(doc=87,freq=2.0), product of:
                0.17363653 = queryWeight, product of:
                  3.9746525 = idf(docFreq=2257, maxDocs=44218)
                  0.043685965 = queryNorm
                0.26348472 = fieldWeight in 87, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.9746525 = idf(docFreq=2257, maxDocs=44218)
                  0.046875 = fieldNorm(doc=87)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  13. Liu, R.-L.: Dynamic category profiling for text filtering and classification (2007) 0.01
    0.007625095 = product of:
      0.01525019 = sum of:
        0.01525019 = product of:
          0.04575057 = sum of:
            0.04575057 = weight(_text_:l in 900) [ClassicSimilarity], result of:
              0.04575057 = score(doc=900,freq=2.0), product of:
                0.17363653 = queryWeight, product of:
                  3.9746525 = idf(docFreq=2257, maxDocs=44218)
                  0.043685965 = queryNorm
                0.26348472 = fieldWeight in 900, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.9746525 = idf(docFreq=2257, maxDocs=44218)
                  0.046875 = fieldNorm(doc=900)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  14. Denoyer, L.; Gallinari, P.: Bayesian network model for semi-structured document classification (2004) 0.01
    0.007625095 = product of:
      0.01525019 = sum of:
        0.01525019 = product of:
          0.04575057 = sum of:
            0.04575057 = weight(_text_:l in 995) [ClassicSimilarity], result of:
              0.04575057 = score(doc=995,freq=2.0), product of:
                0.17363653 = queryWeight, product of:
                  3.9746525 = idf(docFreq=2257, maxDocs=44218)
                  0.043685965 = queryNorm
                0.26348472 = fieldWeight in 995, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.9746525 = idf(docFreq=2257, maxDocs=44218)
                  0.046875 = fieldNorm(doc=995)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  15. Liu, R.-L.: Context-based term frequency assessment for text classification (2010) 0.01
    0.007625095 = product of:
      0.01525019 = sum of:
        0.01525019 = product of:
          0.04575057 = sum of:
            0.04575057 = weight(_text_:l in 3331) [ClassicSimilarity], result of:
              0.04575057 = score(doc=3331,freq=2.0), product of:
                0.17363653 = queryWeight, product of:
                  3.9746525 = idf(docFreq=2257, maxDocs=44218)
                  0.043685965 = queryNorm
                0.26348472 = fieldWeight in 3331, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.9746525 = idf(docFreq=2257, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3331)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
  16. Bock, H.-H.: Datenanalyse zur Strukturierung und Ordnung von Information (1989) 0.01
    0.006905319 = product of:
      0.013810638 = sum of:
        0.013810638 = product of:
          0.041431915 = sum of:
            0.041431915 = weight(_text_:22 in 141) [ClassicSimilarity], result of:
              0.041431915 = score(doc=141,freq=2.0), product of:
                0.1529808 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.043685965 = queryNorm
                0.2708308 = fieldWeight in 141, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=141)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Pages
    pp. 1-22
  17. Dubin, D.: Dimensions and discriminability (1998) 0.01
    0.006905319 = product of:
      0.013810638 = sum of:
        0.013810638 = product of:
          0.041431915 = sum of:
            0.041431915 = weight(_text_:22 in 2338) [ClassicSimilarity], result of:
              0.041431915 = score(doc=2338,freq=2.0), product of:
                0.1529808 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.043685965 = queryNorm
                0.2708308 = fieldWeight in 2338, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2338)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    22. 9.1997 19:16:05
  18. Automatic classification research at OCLC (2002) 0.01
    0.006905319 = product of:
      0.013810638 = sum of:
        0.013810638 = product of:
          0.041431915 = sum of:
            0.041431915 = weight(_text_:22 in 1563) [ClassicSimilarity], result of:
              0.041431915 = score(doc=1563,freq=2.0), product of:
                0.1529808 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.043685965 = queryNorm
                0.2708308 = fieldWeight in 1563, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1563)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    5. 5.2003 9:22:09
  19. Jenkins, C.: Automatic classification of Web resources using Java and Dewey Decimal Classification (1998) 0.01
    0.006905319 = product of:
      0.013810638 = sum of:
        0.013810638 = product of:
          0.041431915 = sum of:
            0.041431915 = weight(_text_:22 in 1673) [ClassicSimilarity], result of:
              0.041431915 = score(doc=1673,freq=2.0), product of:
                0.1529808 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.043685965 = queryNorm
                0.2708308 = fieldWeight in 1673, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1673)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    1. 8.1996 22:08:06
  20. Yoon, Y.; Lee, C.; Lee, G.G.: ¬An effective procedure for constructing a hierarchical text classification system (2006) 0.01
    0.006905319 = product of:
      0.013810638 = sum of:
        0.013810638 = product of:
          0.041431915 = sum of:
            0.041431915 = weight(_text_:22 in 5273) [ClassicSimilarity], result of:
              0.041431915 = score(doc=5273,freq=2.0), product of:
                0.1529808 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.043685965 = queryNorm
                0.2708308 = fieldWeight in 5273, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5273)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Date
    22. 7.2006 16:24:52
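
The score trees in the list above all follow the same arithmetic: a term score is queryWeight times fieldWeight, where queryWeight = idf * queryNorm and fieldWeight = tf * idf * fieldNorm, and each group of clauses is then scaled by its coord factor. The following is a minimal Python sketch that recomputes the score of result 1 (doc 562) from the numbers printed in its tree. The function names are hypothetical, and the formulas idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq) are assumptions chosen because they reproduce the listed idf and tf values; this is not taken from the search engine's source. Lucene works in 32-bit floats, so the last digits may differ slightly.

```python
import math

MAX_DOCS = 44218          # maxDocs as printed in every idf(...) line
QUERY_NORM = 0.043685965  # queryNorm as printed in the trees


def idf(doc_freq: int, max_docs: int) -> float:
    """Assumed idf formula; it reproduces e.g. 8.478011 for docFreq=24."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, field_norm: float) -> float:
    """score = queryWeight * fieldWeight for a single term in a single doc."""
    term_idf = idf(doc_freq, MAX_DOCS)
    query_weight = term_idf * QUERY_NORM                   # idf * queryNorm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# Result 1 (doc 562): two clause groups, each scaled by its coord factor.
clause_3a = term_score(freq=2.0, doc_freq=24, field_norm=0.046875) * (1 / 4)
clause_22 = term_score(freq=2.0, doc_freq=3622, field_norm=0.046875) * (1 / 3)
total = clause_3a + clause_22

print(f"_text_:3a clause: {clause_3a:.6f}")
print(f"_text_:22 clause: {clause_22:.6f}")
print(f"total score:      {total:.6f}")
```

Under these assumptions the total should land very close to the 0.06387635 shown for result 1. The same function can be reused for the other hits by substituting their docFreq, fieldNorm, and coord values.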