Search (23 results, page 1 of 2)

  • × theme_ss:"Automatisches Klassifizieren"
  1. Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004) 0.38
    0.38179284 = product of:
      0.47724104 = sum of:
        0.065772705 = product of:
          0.1973181 = sum of:
            0.1973181 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
              0.1973181 = score(doc=562,freq=2.0), product of:
                0.35108855 = queryWeight, product of:
                  8.478011 = idf(docFreq=24, maxDocs=44218)
                  0.041411664 = queryNorm
                0.56201804 = fieldWeight in 562, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  8.478011 = idf(docFreq=24, maxDocs=44218)
                  0.046875 = fieldNorm(doc=562)
          0.33333334 = coord(1/3)
        0.1973181 = weight(_text_:2f in 562) [ClassicSimilarity], result of:
          0.1973181 = score(doc=562,freq=2.0), product of:
            0.35108855 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.041411664 = queryNorm
            0.56201804 = fieldWeight in 562, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.046875 = fieldNorm(doc=562)
        0.1973181 = weight(_text_:2f in 562) [ClassicSimilarity], result of:
          0.1973181 = score(doc=562,freq=2.0), product of:
            0.35108855 = queryWeight, product of:
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.041411664 = queryNorm
            0.56201804 = fieldWeight in 562, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              8.478011 = idf(docFreq=24, maxDocs=44218)
              0.046875 = fieldNorm(doc=562)
        0.016832126 = product of:
          0.033664253 = sum of:
            0.033664253 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
              0.033664253 = score(doc=562,freq=2.0), product of:
                0.1450166 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041411664 = queryNorm
                0.23214069 = fieldWeight in 562, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=562)
          0.5 = coord(1/2)
      0.8 = coord(4/5)
    
    Content
    Vgl.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
    Date
    8. 1.2013 10:22:32
  2. Losee, R.M.; Haas, S.W.: Sublanguage terms : dictionaries, usage, and automatic classification (1995) 0.05
    0.04954435 = product of:
      0.24772175 = sum of:
        0.24772175 = weight(_text_:dictionaries in 2650) [ClassicSimilarity], result of:
          0.24772175 = score(doc=2650,freq=4.0), product of:
            0.2864761 = queryWeight, product of:
              6.9177637 = idf(docFreq=118, maxDocs=44218)
              0.041411664 = queryNorm
            0.86472046 = fieldWeight in 2650, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              6.9177637 = idf(docFreq=118, maxDocs=44218)
              0.0625 = fieldNorm(doc=2650)
      0.2 = coord(1/5)
    
    Abstract
    The use of terms from natural and social science titles and abstracts is studied from the perspective of sublanguages and their specialized dictionaries. Explores different notions of sublanguage distinctiveness. Object methods for separating hard and soft sciences are suggested based on measures of sublanguage use, dictionary characteristics, and sublanguage distinctiveness. Abstracts were automatically classified with a high degree of accuracy by using a formula that condsiders the degree of uniqueness of terms in each sublanguage. This may prove useful for text filtering of information retrieval systems
  3. Reiner, U.: DDC-based search in the data of the German National Bibliography (2008) 0.02
    0.020707065 = product of:
      0.103535324 = sum of:
        0.103535324 = product of:
          0.20707065 = sum of:
            0.20707065 = weight(_text_:german in 2166) [ClassicSimilarity], result of:
              0.20707065 = score(doc=2166,freq=10.0), product of:
                0.24051933 = queryWeight, product of:
                  5.808009 = idf(docFreq=360, maxDocs=44218)
                  0.041411664 = queryNorm
                0.8609314 = fieldWeight in 2166, product of:
                  3.1622777 = tf(freq=10.0), with freq of:
                    10.0 = termFreq=10.0
                  5.808009 = idf(docFreq=360, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2166)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Abstract
    In 2004, the German National Library began to classify title records of the German National Bibliography according to subject groups based on the divisions of the Dewey Decimal Classification (DDC). Since 2006, all titles of the main series of the German National Bibliography are classified in strict compliance with the DDC. On this basis, an enhanced DDC-based search can be realized - e.g., searching the data of the German National Bibliography for title records using number components of synthesized classification numbers or searching for DDC numbers using unclassified title records. This paper gives an account of the current research and development of the DDC-based search. The work is conducted in the VZG project Colibri that focuses on the automatic analysis of DDC-synthesized numbers and the automatic classification of bibliographic title records.
  4. Wätjen, H.-J.; Diekmann, B.; Möller, G.; Carstensen, K.-U.: Bericht zum DFG-Projekt: GERHARD : German Harvest Automated Retrieval and Directory (1998) 0.02
    0.015434134 = product of:
      0.07717067 = sum of:
        0.07717067 = product of:
          0.15434134 = sum of:
            0.15434134 = weight(_text_:german in 3065) [ClassicSimilarity], result of:
              0.15434134 = score(doc=3065,freq=2.0), product of:
                0.24051933 = queryWeight, product of:
                  5.808009 = idf(docFreq=360, maxDocs=44218)
                  0.041411664 = queryNorm
                0.6417004 = fieldWeight in 3065, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.808009 = idf(docFreq=360, maxDocs=44218)
                  0.078125 = fieldNorm(doc=3065)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
  5. Wartena, C.; Sommer, M.: Automatic classification of scientific records using the German Subject Heading Authority File (SWD) (2012) 0.02
    0.015434134 = product of:
      0.07717067 = sum of:
        0.07717067 = product of:
          0.15434134 = sum of:
            0.15434134 = weight(_text_:german in 472) [ClassicSimilarity], result of:
              0.15434134 = score(doc=472,freq=8.0), product of:
                0.24051933 = queryWeight, product of:
                  5.808009 = idf(docFreq=360, maxDocs=44218)
                  0.041411664 = queryNorm
                0.6417004 = fieldWeight in 472, product of:
                  2.828427 = tf(freq=8.0), with freq of:
                    8.0 = termFreq=8.0
                  5.808009 = idf(docFreq=360, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=472)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Abstract
    The following paper deals with an automatic text classification method which does not require training documents. For this method the German Subject Heading Authority File (SWD), provided by the linked data service of the German National Library is used. Recently the SWD was enriched with notations of the Dewey Decimal Classification (DDC). In consequence it became possible to utilize the subject headings as textual representations for the notations of the DDC. Basically, we we derive the classification of a text from the classification of the words in the text given by the thesaurus. The method was tested by classifying 3826 OAI-Records from 7 different repositories. Mean reciprocal rank and recall were chosen as evaluation measure. Direct comparison to a machine learning method has shown that this method is definitely competitive. Thus we can conclude that the enriched version of the SWD provides high quality information with a broad coverage for classification of German scientific articles.
  6. Krüger, C.: Evaluation des WWW-Suchdienstes GERHARD unter besonderer Beachtung automatischer Indexierung (1999) 0.01
    0.0109135825 = product of:
      0.05456791 = sum of:
        0.05456791 = product of:
          0.10913582 = sum of:
            0.10913582 = weight(_text_:german in 1777) [ClassicSimilarity], result of:
              0.10913582 = score(doc=1777,freq=4.0), product of:
                0.24051933 = queryWeight, product of:
                  5.808009 = idf(docFreq=360, maxDocs=44218)
                  0.041411664 = queryNorm
                0.45375073 = fieldWeight in 1777, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  5.808009 = idf(docFreq=360, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1777)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Abstract
    Die vorliegende Arbeit beinhaltet eine Beschreibung und Evaluation des WWW - Suchdienstes GERHARD (German Harvest Automated Retrieval and Directory). GERHARD ist ein Such- und Navigationssystem für das deutsche World Wide Web, weiches ausschließlich wissenschaftlich relevante Dokumente sammelt, und diese auf der Basis computerlinguistischer und statistischer Methoden automatisch mit Hilfe eines bibliothekarischen Klassifikationssystems klassifiziert. Mit dem DFG - Projekt GERHARD ist der Versuch unternommen worden, mit einem auf einem automatischen Klassifizierungsverfahren basierenden World Wide Web - Dienst eine Alternative zu herkömmlichen Methoden der Interneterschließung zu entwickeln. GERHARD ist im deutschsprachigen Raum das einzige Verzeichnis von Internetressourcen, dessen Erstellung und Aktualisierung vollständig automatisch (also maschinell) erfolgt. GERHARD beschränkt sich dabei auf den Nachweis von Dokumenten auf wissenschaftlichen WWW - Servern. Die Grundidee dabei war, kostenintensive intellektuelle Erschließung und Klassifizierung von lnternetseiten durch computerlinguistische und statistische Methoden zu ersetzen, um auf diese Weise die nachgewiesenen Internetressourcen automatisch auf das Vokabular eines bibliothekarischen Klassifikationssystems abzubilden. GERHARD steht für German Harvest Automated Retrieval and Directory. Die WWW - Adresse (URL) von GERHARD lautet: http://www.gerhard.de. Im Rahmen der vorliegenden Diplomarbeit soll eine Beschreibung des Dienstes mit besonderem Schwerpunkt auf dem zugrundeliegenden Indexierungs- bzw. Klassifizierungssystem erfolgen und anschließend mit Hilfe eines kleinen Retrievaltests die Effektivität von GERHARD überprüft werden.
  7. Subramanian, S.; Shafer, K.E.: Clustering (2001) 0.01
    0.006732851 = product of:
      0.033664253 = sum of:
        0.033664253 = product of:
          0.067328505 = sum of:
            0.067328505 = weight(_text_:22 in 1046) [ClassicSimilarity], result of:
              0.067328505 = score(doc=1046,freq=2.0), product of:
                0.1450166 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041411664 = queryNorm
                0.46428138 = fieldWeight in 1046, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=1046)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Date
    5. 5.2003 14:17:22
  8. Reiner, U.: Automatische DDC-Klassifizierung von bibliografischen Titeldatensätzen (2009) 0.01
    0.005610709 = product of:
      0.028053544 = sum of:
        0.028053544 = product of:
          0.05610709 = sum of:
            0.05610709 = weight(_text_:22 in 611) [ClassicSimilarity], result of:
              0.05610709 = score(doc=611,freq=2.0), product of:
                0.1450166 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041411664 = queryNorm
                0.38690117 = fieldWeight in 611, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=611)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Date
    22. 8.2009 12:54:24
  9. HaCohen-Kerner, Y. et al.: Classification using various machine learning methods and combinations of key-phrases and visual features (2016) 0.01
    0.005610709 = product of:
      0.028053544 = sum of:
        0.028053544 = product of:
          0.05610709 = sum of:
            0.05610709 = weight(_text_:22 in 2748) [ClassicSimilarity], result of:
              0.05610709 = score(doc=2748,freq=2.0), product of:
                0.1450166 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041411664 = queryNorm
                0.38690117 = fieldWeight in 2748, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2748)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Date
    1. 2.2016 18:25:22
  10. Bock, H.-H.: Datenanalyse zur Strukturierung und Ordnung von Information (1989) 0.00
    0.0039274963 = product of:
      0.01963748 = sum of:
        0.01963748 = product of:
          0.03927496 = sum of:
            0.03927496 = weight(_text_:22 in 141) [ClassicSimilarity], result of:
              0.03927496 = score(doc=141,freq=2.0), product of:
                0.1450166 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041411664 = queryNorm
                0.2708308 = fieldWeight in 141, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=141)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Pages
    S.1-22
  11. Dubin, D.: Dimensions and discriminability (1998) 0.00
    0.0039274963 = product of:
      0.01963748 = sum of:
        0.01963748 = product of:
          0.03927496 = sum of:
            0.03927496 = weight(_text_:22 in 2338) [ClassicSimilarity], result of:
              0.03927496 = score(doc=2338,freq=2.0), product of:
                0.1450166 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041411664 = queryNorm
                0.2708308 = fieldWeight in 2338, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2338)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Date
    22. 9.1997 19:16:05
  12. Automatic classification research at OCLC (2002) 0.00
    0.0039274963 = product of:
      0.01963748 = sum of:
        0.01963748 = product of:
          0.03927496 = sum of:
            0.03927496 = weight(_text_:22 in 1563) [ClassicSimilarity], result of:
              0.03927496 = score(doc=1563,freq=2.0), product of:
                0.1450166 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041411664 = queryNorm
                0.2708308 = fieldWeight in 1563, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1563)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Date
    5. 5.2003 9:22:09
  13. Jenkins, C.: Automatic classification of Web resources using Java and Dewey Decimal Classification (1998) 0.00
    0.0039274963 = product of:
      0.01963748 = sum of:
        0.01963748 = product of:
          0.03927496 = sum of:
            0.03927496 = weight(_text_:22 in 1673) [ClassicSimilarity], result of:
              0.03927496 = score(doc=1673,freq=2.0), product of:
                0.1450166 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041411664 = queryNorm
                0.2708308 = fieldWeight in 1673, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1673)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Date
    1. 8.1996 22:08:06
  14. Yoon, Y.; Lee, C.; Lee, G.G.: ¬An effective procedure for constructing a hierarchical text classification system (2006) 0.00
    0.0039274963 = product of:
      0.01963748 = sum of:
        0.01963748 = product of:
          0.03927496 = sum of:
            0.03927496 = weight(_text_:22 in 5273) [ClassicSimilarity], result of:
              0.03927496 = score(doc=5273,freq=2.0), product of:
                0.1450166 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041411664 = queryNorm
                0.2708308 = fieldWeight in 5273, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5273)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Date
    22. 7.2006 16:24:52
  15. Yi, K.: Automatic text classification using library classification schemes : trends, issues and challenges (2007) 0.00
    0.0039274963 = product of:
      0.01963748 = sum of:
        0.01963748 = product of:
          0.03927496 = sum of:
            0.03927496 = weight(_text_:22 in 2560) [ClassicSimilarity], result of:
              0.03927496 = score(doc=2560,freq=2.0), product of:
                0.1450166 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041411664 = queryNorm
                0.2708308 = fieldWeight in 2560, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2560)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Date
    22. 9.2008 18:31:54
  16. Liu, R.-L.: Context recognition for hierarchical text classification (2009) 0.00
    0.0033664254 = product of:
      0.016832126 = sum of:
        0.016832126 = product of:
          0.033664253 = sum of:
            0.033664253 = weight(_text_:22 in 2760) [ClassicSimilarity], result of:
              0.033664253 = score(doc=2760,freq=2.0), product of:
                0.1450166 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041411664 = queryNorm
                0.23214069 = fieldWeight in 2760, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2760)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Date
    22. 3.2009 19:11:54
  17. Pfeffer, M.: Automatische Vergabe von RVK-Notationen mittels fallbasiertem Schließen (2009) 0.00
    0.0033664254 = product of:
      0.016832126 = sum of:
        0.016832126 = product of:
          0.033664253 = sum of:
            0.033664253 = weight(_text_:22 in 3051) [ClassicSimilarity], result of:
              0.033664253 = score(doc=3051,freq=2.0), product of:
                0.1450166 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041411664 = queryNorm
                0.23214069 = fieldWeight in 3051, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3051)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Date
    22. 8.2009 19:51:28
  18. Zhu, W.Z.; Allen, R.B.: Document clustering using the LSI subspace signature model (2013) 0.00
    0.0033664254 = product of:
      0.016832126 = sum of:
        0.016832126 = product of:
          0.033664253 = sum of:
            0.033664253 = weight(_text_:22 in 690) [ClassicSimilarity], result of:
              0.033664253 = score(doc=690,freq=2.0), product of:
                0.1450166 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041411664 = queryNorm
                0.23214069 = fieldWeight in 690, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=690)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Date
    23. 3.2013 13:22:36
  19. Egbert, J.; Biber, D.; Davies, M.: Developing a bottom-up, user-based method of web register classification (2015) 0.00
    0.0033664254 = product of:
      0.016832126 = sum of:
        0.016832126 = product of:
          0.033664253 = sum of:
            0.033664253 = weight(_text_:22 in 2158) [ClassicSimilarity], result of:
              0.033664253 = score(doc=2158,freq=2.0), product of:
                0.1450166 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041411664 = queryNorm
                0.23214069 = fieldWeight in 2158, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2158)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Date
    4. 8.2015 19:22:04
  20. Mengle, S.; Goharian, N.: Passage detection using text classification (2009) 0.00
    0.0028053545 = product of:
      0.014026772 = sum of:
        0.014026772 = product of:
          0.028053544 = sum of:
            0.028053544 = weight(_text_:22 in 2765) [ClassicSimilarity], result of:
              0.028053544 = score(doc=2765,freq=2.0), product of:
                0.1450166 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.041411664 = queryNorm
                0.19345059 = fieldWeight in 2765, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2765)
          0.5 = coord(1/2)
      0.2 = coord(1/5)
    
    Date
    22. 3.2009 19:14:43