Search (55 results, page 3 of 3)

Automatische Klassifikation und Extraktion in Documentum (2005) 0.00

0.0039061487 = product of:
  0.0078122974 = sum of:
    0.0078122974 = product of:
      0.015624595 = sum of:
        0.015624595 = weight(_text_:h in 3974) [ClassicSimilarity], result of:
          0.015624595 = score(doc=3974,freq=2.0), product of:
            0.113842286 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045821942 = queryNorm
            0.13724773 = fieldWeight in 3974, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3974)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Information - Wissenschaft und Praxis. 56(2005) H.5/6, S.276

Yao, H.; Etzkorn, L.H.; Virani, S.: Automated classification and retrieval of reusable software components (2008) 0.00

0.0039061487 = product of:
  0.0078122974 = sum of:
    0.0078122974 = product of:
      0.015624595 = sum of:
        0.015624595 = weight(_text_:h in 1382) [ClassicSimilarity], result of:
          0.015624595 = score(doc=1382,freq=2.0), product of:
            0.113842286 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045821942 = queryNorm
            0.13724773 = fieldWeight in 1382, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1382)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Pong, J.Y.-H.; Kwok, R.C.-W.; Lau, R.Y.-K.; Hao, J.-X.; Wong, P.C.-C.: ¬A comparative study of two automatic document classification methods in a library setting (2008) 0.00

0.0039061487 = product of:
  0.0078122974 = sum of:
    0.0078122974 = product of:
      0.015624595 = sum of:
        0.015624595 = weight(_text_:h in 2532) [ClassicSimilarity], result of:
          0.015624595 = score(doc=2532,freq=2.0), product of:
            0.113842286 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045821942 = queryNorm
            0.13724773 = fieldWeight in 2532, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2532)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Reiner, U.: VZG-Projekt Colibri : Bewertung von automatisch DDC-klassifizierten Titeldatensätzen der Deutschen Nationalbibliothek (DNB) (2009) 0.00
```
0.0039061487 = product of:
  0.0078122974 = sum of:
    0.0078122974 = product of:
      0.015624595 = sum of:
        0.015624595 = weight(_text_:h in 2675) [ClassicSimilarity], result of:
          0.015624595 = score(doc=2675,freq=2.0), product of:
            0.113842286 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045821942 = queryNorm
            0.13724773 = fieldWeight in 2675, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2675)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Das VZG-Projekt Colibri/DDC beschäftigt sich seit 2003 mit automatischen Verfahren zur Dewey-Dezimalklassifikation (Dewey Decimal Classification, kurz DDC). Ziel des Projektes ist eine einheitliche DDC-Erschließung von bibliografischen Titeldatensätzen und eine Unterstützung der DDC-Expert(inn)en und DDC-Laien, z. B. bei der Analyse und Synthese von DDC-Notationen und deren Qualitätskontrolle und der DDC-basierten Suche. Der vorliegende Bericht konzentriert sich auf die erste größere automatische DDC-Klassifizierung und erste automatische und intellektuelle Bewertung mit der Klassifizierungskomponente vc_dcl1. Grundlage hierfür waren die von der Deutschen Nationabibliothek (DNB) im November 2007 zur Verfügung gestellten 25.653 Titeldatensätze (12 Wochen-/Monatslieferungen) der Deutschen Nationalbibliografie der Reihen A, B und H. Nach Erläuterung der automatischen DDC-Klassifizierung und automatischen Bewertung in Kapitel 2 wird in Kapitel 3 auf den DNB-Bericht "Colibri_Auswertung_DDC_Endbericht_Sommer_2008" eingegangen. Es werden Sachverhalte geklärt und Fragen gestellt, deren Antworten die Weichen für den Verlauf der weiteren Klassifizierungstests stellen werden. Über das Kapitel 3 hinaus führende weitergehende Betrachtungen und Gedanken zur Fortführung der automatischen DDC-Klassifizierung werden in Kapitel 4 angestellt. Der Bericht dient dem vertieften Verständnis für die automatischen Verfahren.

HaCohen-Kerner, Y.; Beck, H.; Yehudai, E.; Rosenstein, M.; Mughaz, D.: Cuisine : classification using stylistic feature sets and/or name-based feature sets (2010) 0.00

0.0039061487 = product of:
  0.0078122974 = sum of:
    0.0078122974 = product of:
      0.015624595 = sum of:
        0.015624595 = weight(_text_:h in 3706) [ClassicSimilarity], result of:
          0.015624595 = score(doc=3706,freq=2.0), product of:
            0.113842286 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045821942 = queryNorm
            0.13724773 = fieldWeight in 3706, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3706)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Qu, B.; Cong, G.; Li, C.; Sun, A.; Chen, H.: ¬An evaluation of classification models for question topic categorization (2012) 0.00

0.0039061487 = product of:
  0.0078122974 = sum of:
    0.0078122974 = product of:
      0.015624595 = sum of:
        0.015624595 = weight(_text_:h in 237) [ClassicSimilarity], result of:
          0.015624595 = score(doc=237,freq=2.0), product of:
            0.113842286 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045821942 = queryNorm
            0.13724773 = fieldWeight in 237, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0390625 = fieldNorm(doc=237)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Fang, H.: Classifying research articles in multidisciplinary sciences journals into subject categories (2015) 0.00

0.0039061487 = product of:
  0.0078122974 = sum of:
    0.0078122974 = product of:
      0.015624595 = sum of:
        0.015624595 = weight(_text_:h in 2194) [ClassicSimilarity], result of:
          0.015624595 = score(doc=2194,freq=2.0), product of:
            0.113842286 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045821942 = queryNorm
            0.13724773 = fieldWeight in 2194, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2194)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

AlQenaei, Z.M.; Monarchi, D.E.: ¬The use of learning techniques to analyze the results of a manual classification system (2016) 0.00
```
0.0039061487 = product of:
  0.0078122974 = sum of:
    0.0078122974 = product of:
      0.015624595 = sum of:
        0.015624595 = weight(_text_:h in 2836) [ClassicSimilarity], result of:
          0.015624595 = score(doc=2836,freq=2.0), product of:
            0.113842286 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045821942 = queryNorm
            0.13724773 = fieldWeight in 2836, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2836)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Classification is the process of assigning objects to pre-defined classes based on observations or characteristics of those objects, and there are many approaches to performing this task. The overall objective of this study is to demonstrate the use of two learning techniques to analyze the results of a manual classification system. Our sample consisted of 1,026 documents, from the ACM Computing Classification System, classified by their authors as belonging to one of the groups of the classification system: "H.3 Information Storage and Retrieval." A singular value decomposition of the documents' weighted term-frequency matrix was used to represent each document in a 50-dimensional vector space. The analysis of the representation using both supervised (decision tree) and unsupervised (clustering) techniques suggests that two pairs of the ACM classes are closely related to each other in the vector space. Class 1 (Content Analysis and Indexing) is closely related to Class 3 (Information Search and Retrieval), and Class 4 (Systems and Software) is closely related to Class 5 (Online Information Services). Further analysis was performed to test the diffusion of the words in the two classes using both cosine and Euclidean distance.

Suominen, A.; Toivanen, H.: Map of science with topic modeling : comparison of unsupervised learning and human-assigned subject classification (2016) 0.00

0.0039061487 = product of:
  0.0078122974 = sum of:
    0.0078122974 = product of:
      0.015624595 = sum of:
        0.015624595 = weight(_text_:h in 3121) [ClassicSimilarity], result of:
          0.015624595 = score(doc=3121,freq=2.0), product of:
            0.113842286 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045821942 = queryNorm
            0.13724773 = fieldWeight in 3121, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3121)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Wang, H.; Hong, M.: Supervised Hebb rule based feature selection for text classification (2019) 0.00

0.0039061487 = product of:
  0.0078122974 = sum of:
    0.0078122974 = product of:
      0.015624595 = sum of:
        0.015624595 = weight(_text_:h in 5036) [ClassicSimilarity], result of:
          0.015624595 = score(doc=5036,freq=2.0), product of:
            0.113842286 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045821942 = queryNorm
            0.13724773 = fieldWeight in 5036, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5036)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Groß, T.; Faden, M.: Automatische Indexierung elektronischer Dokumente an der Deutschen Zentralbibliothek für Wirtschaftswissenschaften : Bericht über die Jahrestagung der Internationalen Buchwissenschaftlichen Gesellschaft (2010) 0.00

0.0031249188 = product of:
  0.0062498376 = sum of:
    0.0062498376 = product of:
      0.012499675 = sum of:
        0.012499675 = weight(_text_:h in 4051) [ClassicSimilarity], result of:
          0.012499675 = score(doc=4051,freq=2.0), product of:
            0.113842286 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045821942 = queryNorm
            0.10979818 = fieldWeight in 4051, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.03125 = fieldNorm(doc=4051)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Bibliotheksdienst. 44(2010) H.12, S.1120-1135

Billal, B.; Fonseca, A.; Sadat, F.; Lounis, H.: Semi-supervised learning and social media text analysis towards multi-labeling categorization (2017) 0.00

0.0031249188 = product of:
  0.0062498376 = sum of:
    0.0062498376 = product of:
      0.012499675 = sum of:
        0.012499675 = weight(_text_:h in 4095) [ClassicSimilarity], result of:
          0.012499675 = score(doc=4095,freq=2.0), product of:
            0.113842286 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045821942 = queryNorm
            0.10979818 = fieldWeight in 4095, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.03125 = fieldNorm(doc=4095)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Oberhauser, O.: Automatisches Klassifizieren : Entwicklungsstand - Methodik - Anwendungsbereiche (2005) 0.00
```
0.0027620643 = product of:
  0.0055241287 = sum of:
    0.0055241287 = product of:
      0.011048257 = sum of:
        0.011048257 = weight(_text_:h in 38) [ClassicSimilarity], result of:
          0.011048257 = score(doc=38,freq=4.0), product of:
            0.113842286 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045821942 = queryNorm
            0.0970488 = fieldWeight in 38, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.01953125 = fieldNorm(doc=38)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Footnote

Rez. in: VÖB-Mitteilungen 58(2005) H.3, S.102-104 (R.F. Müller); ZfBB 53(2006) H.5, S.282-283 (L. Svensson): "Das Sammeln und Verzeichnen elektronischer Ressourcen gehört in wissenschaftlichen Bibliotheken längst zum Alltag. Parallel dazu kündigt sich ein Paradigmenwechsel bei den Findmitteln an: Um einen effizienten und benutzerorientierten Zugang zu den gemischten Kollektionen bieten zu können, experimentieren einige bibliothekarische Diensteanbieter wie z. B. das hbz (http://suchen.hbz-nrw.de/dreilaender/), die Bibliothek der North Carolina State University (www.lib.ncsu.edu/) und demnächst vascoda (www.vascoda.de/) und der Librarians-Internet Index (www.lii.org/) zunehmend mit Suchmaschinentechnologie. Dabei wird angestrebt, nicht nur einen vollinvertierten Suchindex anzubieten, sondern auch das Browsing durch eine hierarchisch geordnete Klassifikation. Von den Daten in den deutschen Verbunddatenbanken ist jedoch nur ein kleiner Teil schon klassifikatorisch erschlossen. Fremddaten aus dem angloamerikanischen Bereich sind oft mit LCC und/oder DDC erschlossen, wobei die Library of Congress sich bei der DDCErschließung auf Titel, die hauptsächlich für die Public Libraries interessant sind, konzentriert. Die Deutsche Nationalbibliothek wird ab 2007 Printmedien und Hochschulschriften flächendeckend mit DDC erschließen. Es ist aber schon offensichtlich, dass v. a. im Bereich der elektronischen Publikationen die anfallenden Dokumentenmengen mit immer knapperen Personalressourcen nicht intellektuell erschlossen werden können, sondern dass neue Verfahren entwickelt werden müssen. Hier kommt Oberhausers Buch gerade richtig. Seit Anfang der 1990er Jahre sind mehrere Projekte zum Thema automatisches Klassifizieren durchgeführt worden. Wer sich in diese Thematik einarbeiten wollte oder sich für die Ergebnisse der größeren Projekte interessierte, konnte bislang auf keine Überblicksdarstellung zurückgreifen, sondern war auf eine Vielzahl von Einzeluntersuchungen sowie die Projektdokumentationen angewiesen. Oberhausers Darstellung, die auf einer Fülle von publizierter und grauer Literatur fußt, schließt diese Lücke. Das selbst gesetzte Ziel, einen guten Überblick über den momentanen Kenntnisstand und die Ergebnisse der einschlägigen Projekte verständlich zu vermitteln, erfüllt der Autor mit Bravour. Dabei ist anzumerken, dass er ein bibliothekarisches Grundwissen und mindestens grundlegende Kenntnisse über informationswissenschaftliche Grundbegriffe und Fragestellungen voraussetzt, wobei hier für den Einsteiger einige Hinweise auf einführende Darstellungen wünschenswert gewesen wären.

Borko, H.: Research in computer based classification systems (1985) 0.00

0.002734304 = product of:
  0.005468608 = sum of:
    0.005468608 = product of:
      0.010937216 = sum of:
        0.010937216 = weight(_text_:h in 3647) [ClassicSimilarity], result of:
          0.010937216 = score(doc=3647,freq=2.0), product of:
            0.113842286 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045821942 = queryNorm
            0.096073404 = fieldWeight in 3647, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.02734375 = fieldNorm(doc=3647)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Schek, M.: Automatische Klassifizierung und Visualisierung im Archiv der Süddeutschen Zeitung (2005) 0.00

0.002734304 = product of:
  0.005468608 = sum of:
    0.005468608 = product of:
      0.010937216 = sum of:
        0.010937216 = weight(_text_:h in 4884) [ClassicSimilarity], result of:
          0.010937216 = score(doc=4884,freq=2.0), product of:
            0.113842286 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045821942 = queryNorm
            0.096073404 = fieldWeight in 4884, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.02734375 = fieldNorm(doc=4884)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Medienwirtschaft. 2(2005) H.1, S.20-24

Search (55 results, page 3 of 3)

Authors

Years

Languages

Types

Themes