Search (86 results, page 1 of 5)

  • × theme_ss:"Automatisches Indexieren"
  1. Gibb, F.; Smart, G.: Knowledge-based indexing : the view from SIMPR (1991) 0.22
    0.22487332 = product of:
      0.2998311 = sum of:
        0.08032228 = weight(_text_:c in 4424) [ClassicSimilarity], result of:
          0.08032228 = score(doc=4424,freq=2.0), product of:
            0.1505424 = queryWeight, product of:
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.043643 = queryNorm
            0.5335526 = fieldWeight in 4424, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.109375 = fieldNorm(doc=4424)
        0.14861567 = weight(_text_:et in 4424) [ClassicSimilarity], result of:
          0.14861567 = score(doc=4424,freq=2.0), product of:
            0.20477319 = queryWeight, product of:
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.043643 = queryNorm
            0.7257575 = fieldWeight in 4424, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.109375 = fieldNorm(doc=4424)
        0.07089315 = product of:
          0.1417863 = sum of:
            0.1417863 = weight(_text_:al in 4424) [ClassicSimilarity], result of:
              0.1417863 = score(doc=4424,freq=2.0), product of:
                0.20001286 = queryWeight, product of:
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.043643 = queryNorm
                0.70888597 = fieldWeight in 4424, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.109375 = fieldNorm(doc=4424)
          0.5 = coord(1/2)
      0.75 = coord(3/4)
    
    Source
    Libraries and expert systems. Ed. C. MacDonald et al
  2. Stankovic, R. et al.: Indexing of textual databases based on lexical resources : a case study for Serbian (2016) 0.18
    0.17624028 = product of:
      0.35248056 = sum of:
        0.1501245 = weight(_text_:et in 2759) [ClassicSimilarity], result of:
          0.1501245 = score(doc=2759,freq=4.0), product of:
            0.20477319 = queryWeight, product of:
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.043643 = queryNorm
            0.7331258 = fieldWeight in 2759, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.078125 = fieldNorm(doc=2759)
        0.20235606 = sum of:
          0.1432258 = weight(_text_:al in 2759) [ClassicSimilarity], result of:
            0.1432258 = score(doc=2759,freq=4.0), product of:
              0.20001286 = queryWeight, product of:
                4.582931 = idf(docFreq=1228, maxDocs=44218)
                0.043643 = queryNorm
              0.716083 = fieldWeight in 2759, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.582931 = idf(docFreq=1228, maxDocs=44218)
                0.078125 = fieldNorm(doc=2759)
          0.059130248 = weight(_text_:22 in 2759) [ClassicSimilarity], result of:
            0.059130248 = score(doc=2759,freq=2.0), product of:
              0.15283036 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.043643 = queryNorm
              0.38690117 = fieldWeight in 2759, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.078125 = fieldNorm(doc=2759)
      0.5 = coord(2/4)
    
    Date
    1. 2.2016 18:25:22
    Source
    Semantic keyword-based search on structured data sources: First COST Action IC1302 International KEYSTONE Conference, IKC 2015, Coimbra, Portugal, September 8-9, 2015. Revised Selected Papers. Eds.: J. Cardoso et al
  3. Renouf, A.: Making sense of text : automated approaches to meaning extraction (1993) 0.11
    0.10975441 = product of:
      0.21950883 = sum of:
        0.14861567 = weight(_text_:et in 7111) [ClassicSimilarity], result of:
          0.14861567 = score(doc=7111,freq=2.0), product of:
            0.20477319 = queryWeight, product of:
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.043643 = queryNorm
            0.7257575 = fieldWeight in 7111, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.109375 = fieldNorm(doc=7111)
        0.07089315 = product of:
          0.1417863 = sum of:
            0.1417863 = weight(_text_:al in 7111) [ClassicSimilarity], result of:
              0.1417863 = score(doc=7111,freq=2.0), product of:
                0.20001286 = queryWeight, product of:
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.043643 = queryNorm
                0.70888597 = fieldWeight in 7111, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.109375 = fieldNorm(doc=7111)
          0.5 = coord(1/2)
      0.5 = coord(2/4)
    
    Source
    Online information 93: 17th International Online Meeting Proceedings, London, 7.-9.12.1993. Ed. by D.I. Raitt et al
  4. Silvester, J.P.; Genuardi, M.T.: Machine-aided indexing from the analysis of natural language text (1994) 0.09
    0.09407521 = product of:
      0.18815042 = sum of:
        0.12738486 = weight(_text_:et in 2989) [ClassicSimilarity], result of:
          0.12738486 = score(doc=2989,freq=2.0), product of:
            0.20477319 = queryWeight, product of:
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.043643 = queryNorm
            0.6220778 = fieldWeight in 2989, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.09375 = fieldNorm(doc=2989)
        0.06076556 = product of:
          0.12153112 = sum of:
            0.12153112 = weight(_text_:al in 2989) [ClassicSimilarity], result of:
              0.12153112 = score(doc=2989,freq=2.0), product of:
                0.20001286 = queryWeight, product of:
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.043643 = queryNorm
                0.60761654 = fieldWeight in 2989, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.09375 = fieldNorm(doc=2989)
          0.5 = coord(1/2)
      0.5 = coord(2/4)
    
    Source
    Challenges in indexing electronic text and images. Ed.: R. Fidel et al
  5. Smith, P.J.; Normore, L.F.; Denning, R.; Johnson, W.P.: Computerized tools to support document analysis (1994) 0.09
    0.09407521 = product of:
      0.18815042 = sum of:
        0.12738486 = weight(_text_:et in 2990) [ClassicSimilarity], result of:
          0.12738486 = score(doc=2990,freq=2.0), product of:
            0.20477319 = queryWeight, product of:
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.043643 = queryNorm
            0.6220778 = fieldWeight in 2990, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.09375 = fieldNorm(doc=2990)
        0.06076556 = product of:
          0.12153112 = sum of:
            0.12153112 = weight(_text_:al in 2990) [ClassicSimilarity], result of:
              0.12153112 = score(doc=2990,freq=2.0), product of:
                0.20001286 = queryWeight, product of:
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.043643 = queryNorm
                0.60761654 = fieldWeight in 2990, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.09375 = fieldNorm(doc=2990)
          0.5 = coord(1/2)
      0.5 = coord(2/4)
    
    Source
    Challenges in indexing electronic text and images. Ed.: R. Fidel et al
  6. Chevallet, J.-P.; Bruandet, M.F.: Impact de l'utilisation de multi terms sur la qualité des réponses d'un système de recherche d'information à indexation automatique (1999) 0.09
    0.09380088 = product of:
      0.18760176 = sum of:
        0.14709139 = weight(_text_:et in 6253) [ClassicSimilarity], result of:
          0.14709139 = score(doc=6253,freq=6.0), product of:
            0.20477319 = queryWeight, product of:
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.043643 = queryNorm
            0.7183137 = fieldWeight in 6253, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.0625 = fieldNorm(doc=6253)
        0.040510375 = product of:
          0.08102075 = sum of:
            0.08102075 = weight(_text_:al in 6253) [ClassicSimilarity], result of:
              0.08102075 = score(doc=6253,freq=2.0), product of:
                0.20001286 = queryWeight, product of:
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.043643 = queryNorm
                0.4050777 = fieldWeight in 6253, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.0625 = fieldNorm(doc=6253)
          0.5 = coord(1/2)
      0.5 = coord(2/4)
    
    Series
    Collection travaux et recherches; UL3
    Source
    Organisation des connaissances en vue de leur intégration dans les systèmes de représentation et de recherche d'information. Ed.: J. Maniez, et al
  7. Milstead, J.L.: Thesauri in a full-text world (1998) 0.07
    0.066640064 = product of:
      0.13328013 = sum of:
        0.053077027 = weight(_text_:et in 2337) [ClassicSimilarity], result of:
          0.053077027 = score(doc=2337,freq=2.0), product of:
            0.20477319 = queryWeight, product of:
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.043643 = queryNorm
            0.2591991 = fieldWeight in 2337, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2337)
        0.08020309 = sum of:
          0.050637968 = weight(_text_:al in 2337) [ClassicSimilarity], result of:
            0.050637968 = score(doc=2337,freq=2.0), product of:
              0.20001286 = queryWeight, product of:
                4.582931 = idf(docFreq=1228, maxDocs=44218)
                0.043643 = queryNorm
              0.25317356 = fieldWeight in 2337, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.582931 = idf(docFreq=1228, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2337)
          0.029565124 = weight(_text_:22 in 2337) [ClassicSimilarity], result of:
            0.029565124 = score(doc=2337,freq=2.0), product of:
              0.15283036 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.043643 = queryNorm
              0.19345059 = fieldWeight in 2337, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0390625 = fieldNorm(doc=2337)
      0.5 = coord(2/4)
    
    Date
    22. 9.1997 19:16:05
    Source
    Visualizing subject access for 21st century information resources: Papers presented at the 1997 Clinic on Library Applications of Data Processing, 2-4 Mar 1997, Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign. Ed.: P.A. Cochrane et al
  8. Harman, D.: Automatic indexing (1994) 0.06
    0.06271681 = product of:
      0.12543362 = sum of:
        0.084923245 = weight(_text_:et in 7729) [ClassicSimilarity], result of:
          0.084923245 = score(doc=7729,freq=2.0), product of:
            0.20477319 = queryWeight, product of:
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.043643 = queryNorm
            0.41471857 = fieldWeight in 7729, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.0625 = fieldNorm(doc=7729)
        0.040510375 = product of:
          0.08102075 = sum of:
            0.08102075 = weight(_text_:al in 7729) [ClassicSimilarity], result of:
              0.08102075 = score(doc=7729,freq=2.0), product of:
                0.20001286 = queryWeight, product of:
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.043643 = queryNorm
                0.4050777 = fieldWeight in 7729, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.0625 = fieldNorm(doc=7729)
          0.5 = coord(1/2)
      0.5 = coord(2/4)
    
    Source
    Challenges in indexing electronic text and images. Ed.: R. Fidel et al
  9. Gaus, W.; Kaluscha, R.: Maschinelle inhaltliche Erschließung von Arztbriefen und Auswertung von Reha-Entlassungsberichten (2006) 0.04
    0.04434748 = product of:
      0.08869496 = sum of:
        0.060049802 = weight(_text_:et in 6078) [ClassicSimilarity], result of:
          0.060049802 = score(doc=6078,freq=4.0), product of:
            0.20477319 = queryWeight, product of:
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.043643 = queryNorm
            0.29325032 = fieldWeight in 6078, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.03125 = fieldNorm(doc=6078)
        0.028645162 = product of:
          0.057290323 = sum of:
            0.057290323 = weight(_text_:al in 6078) [ClassicSimilarity], result of:
              0.057290323 = score(doc=6078,freq=4.0), product of:
                0.20001286 = queryWeight, product of:
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.043643 = queryNorm
                0.2864332 = fieldWeight in 6078, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.03125 = fieldNorm(doc=6078)
          0.5 = coord(1/2)
      0.5 = coord(2/4)
    
    Abstract
    Even Hippocrates advised physicians to keep patient records. Today, detailed documentation is a professional duty of physicians [Gaus et al 1999]. These records of medical treatments hold a vast and valuable store of experience. Information on therapies and treatment outcomes that would otherwise have to be collected laboriously in studies is already abundantly available in routine documentation such as surgical reports, discharge reports, and physicians' letters. With the introduction of electronic data processing in medicine, this information has also been available in machine-readable form for some years, so that a major obstacle to using these records, namely the laborious manual preparation of paper files, no longer applies. While formal indexing by patient attributes such as name and date of birth is now handled well by hospital and practice information systems, subject indexing of these records remains difficult, because little of the information is available in structured or intellectually indexed form [Leiner et al. 2003]. Even though diagnoses have been coded according to ICD-10 since hospitals introduced case-based flat rates (diagnosis related groups, DRG), most of the information still consists of free text, whose computer-based indexing is not trivial given the complexity of human language. These medical texts include expert opinions, verbally described (differential) diagnoses, a wide variety of examination and findings reports, ward-round sheets, surgical reports, and the physician's letter or discharge report. The physician's letter and the discharge report inform the referring or follow-up physician (e.g. the general practitioner) about what has happened to the patient and give recommendations for further treatment. They summarize an (inpatient) treatment epicritically, that is, in retrospect after the illness has been overcome, and give an overview of the anamnesis (medical history), complaints and symptoms, the diagnostic procedures used, the diagnosis or diagnoses made, therapy, course, complications, and the outcome achieved. They thus serve a function similar to that of the abstract in literature documentation; a copy is often filed on top of the patient record. At least in university hospitals, physicians engaged in research also want to access patient records by content, for example to view the records of all patients with a particular diagnosis, excerpt them, and analyze the excerpted data. Subject indexing also helps when searching for similar cases or in education and continuing training. For example, a resident who will soon have to perform knee-joint sonographies as part of his specialist training could look at existing reports of such sonographies and thus learn in advance about relevant examination techniques and findings.
  10. Husevag, A.-S.R.: Named entities in indexing : a case study of TV subtitles and metadata records (2016) 0.04
    0.039198004 = product of:
      0.07839601 = sum of:
        0.053077027 = weight(_text_:et in 3105) [ClassicSimilarity], result of:
          0.053077027 = score(doc=3105,freq=2.0), product of:
            0.20477319 = queryWeight, product of:
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.043643 = queryNorm
            0.2591991 = fieldWeight in 3105, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3105)
        0.025318984 = product of:
          0.050637968 = sum of:
            0.050637968 = weight(_text_:al in 3105) [ClassicSimilarity], result of:
              0.050637968 = score(doc=3105,freq=2.0), product of:
                0.20001286 = queryWeight, product of:
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.043643 = queryNorm
                0.25317356 = fieldWeight in 3105, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3105)
          0.5 = coord(1/2)
      0.5 = coord(2/4)
    
    Source
    Proceedings of the 15th European Networked Knowledge Organization Systems Workshop (NKOS 2016) co-located with the 20th International Conference on Theory and Practice of Digital Libraries 2016 (TPDL 2016), Hannover, Germany, September 9, 2016. Ed. by Philipp Mayr et al. [http://ceur-ws.org/Vol-1676/=urn:nbn:de:0074-1676-5]
  11. Strobel, S.; Marín-Arraiza, P.: Metadata for scientific audiovisual media : current practices and perspectives of the TIB / AV-portal (2015) 0.04
    0.039198004 = product of:
      0.07839601 = sum of:
        0.053077027 = weight(_text_:et in 3667) [ClassicSimilarity], result of:
          0.053077027 = score(doc=3667,freq=2.0), product of:
            0.20477319 = queryWeight, product of:
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.043643 = queryNorm
            0.2591991 = fieldWeight in 3667, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3667)
        0.025318984 = product of:
          0.050637968 = sum of:
            0.050637968 = weight(_text_:al in 3667) [ClassicSimilarity], result of:
              0.050637968 = score(doc=3667,freq=2.0), product of:
                0.20001286 = queryWeight, product of:
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.043643 = queryNorm
                0.25317356 = fieldWeight in 3667, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3667)
          0.5 = coord(1/2)
      0.5 = coord(2/4)
    
    Source
    Metadata and semantics research: 9th Research Conference, MTSR 2015, Manchester, UK, September 9-11, 2015, Proceedings. Eds.: E. Garoufallou et al
  12. Ma, N.; Zheng, H.T.; Xiao, X.: ¬An ontology-based latent semantic indexing approach using long short-term memory networks (2017) 0.04
    0.039198004 = product of:
      0.07839601 = sum of:
        0.053077027 = weight(_text_:et in 3810) [ClassicSimilarity], result of:
          0.053077027 = score(doc=3810,freq=2.0), product of:
            0.20477319 = queryWeight, product of:
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.043643 = queryNorm
            0.2591991 = fieldWeight in 3810, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3810)
        0.025318984 = product of:
          0.050637968 = sum of:
            0.050637968 = weight(_text_:al in 3810) [ClassicSimilarity], result of:
              0.050637968 = score(doc=3810,freq=2.0), product of:
                0.20001286 = queryWeight, product of:
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.043643 = queryNorm
                0.25317356 = fieldWeight in 3810, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3810)
          0.5 = coord(1/2)
      0.5 = coord(2/4)
    
    Source
    Web and Big Data: First International Joint Conference, APWeb-WAIM 2017, Beijing, China, July 7-9, 2017, Proceedings, Part I. Eds.: L. Chen et al
  13. Salton, G.: Automatic processing of foreign language documents (1985) 0.03
    0.031358406 = product of:
      0.06271681 = sum of:
        0.042461623 = weight(_text_:et in 3650) [ClassicSimilarity], result of:
          0.042461623 = score(doc=3650,freq=2.0), product of:
            0.20477319 = queryWeight, product of:
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.043643 = queryNorm
            0.20735928 = fieldWeight in 3650, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.03125 = fieldNorm(doc=3650)
        0.020255188 = product of:
          0.040510375 = sum of:
            0.040510375 = weight(_text_:al in 3650) [ClassicSimilarity], result of:
              0.040510375 = score(doc=3650,freq=2.0), product of:
                0.20001286 = queryWeight, product of:
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.043643 = queryNorm
                0.20253885 = fieldWeight in 3650, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.03125 = fieldNorm(doc=3650)
          0.5 = coord(1/2)
      0.5 = coord(2/4)
    
    Source
    Theory of subject analysis: a sourcebook. Ed.: L.M. Chan, et al
  14. Needham, R.M.; Sparck Jones, K.: Keywords and clumps (1985) 0.03
    0.027438603 = product of:
      0.054877207 = sum of:
        0.03715392 = weight(_text_:et in 3645) [ClassicSimilarity], result of:
          0.03715392 = score(doc=3645,freq=2.0), product of:
            0.20477319 = queryWeight, product of:
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.043643 = queryNorm
            0.18143937 = fieldWeight in 3645, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.692005 = idf(docFreq=1101, maxDocs=44218)
              0.02734375 = fieldNorm(doc=3645)
        0.017723288 = product of:
          0.035446577 = sum of:
            0.035446577 = weight(_text_:al in 3645) [ClassicSimilarity], result of:
              0.035446577 = score(doc=3645,freq=2.0), product of:
                0.20001286 = queryWeight, product of:
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.043643 = queryNorm
                0.17722149 = fieldWeight in 3645, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  4.582931 = idf(docFreq=1228, maxDocs=44218)
                  0.02734375 = fieldNorm(doc=3645)
          0.5 = coord(1/2)
      0.5 = coord(2/4)
    
    Source
    Theory of subject analysis: a sourcebook. Ed.: L.M. Chan, et al
  15. Schneider, C.; Womser-Hacker, C.: Inhaltserschließungssysteme für Patenttexte : Test und Systemvergleich im Projekt PADOK (1986) 0.02
    0.02434133 = product of:
      0.09736532 = sum of:
        0.09736532 = weight(_text_:c in 2648) [ClassicSimilarity], result of:
          0.09736532 = score(doc=2648,freq=4.0), product of:
            0.1505424 = queryWeight, product of:
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.043643 = queryNorm
            0.64676344 = fieldWeight in 2648, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.09375 = fieldNorm(doc=2648)
      0.25 = coord(1/4)
    
  16. Jones, K.P.: Natural-language processing and automatic indexing : a reply (1990) 0.02
    0.022949224 = product of:
      0.0917969 = sum of:
        0.0917969 = weight(_text_:c in 394) [ClassicSimilarity], result of:
          0.0917969 = score(doc=394,freq=2.0), product of:
            0.1505424 = queryWeight, product of:
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.043643 = queryNorm
            0.6097744 = fieldWeight in 394, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.125 = fieldNorm(doc=394)
      0.25 = coord(1/4)
    
    Footnote
    Reply to: Korycinski, C. and A.F. Newell
  17. Martins, A.L.; Souza, R.R.; Ribeiro de Mello, H.: ¬The use of noun phrases in information retrieval : proposing a mechanism for automatic classification (2014) 0.02
    0.022140577 = product of:
      0.044281155 = sum of:
        0.032455105 = weight(_text_:c in 1441) [ClassicSimilarity], result of:
          0.032455105 = score(doc=1441,freq=4.0), product of:
            0.1505424 = queryWeight, product of:
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.043643 = queryNorm
            0.21558782 = fieldWeight in 1441, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.03125 = fieldNorm(doc=1441)
        0.011826049 = product of:
          0.023652097 = sum of:
            0.023652097 = weight(_text_:22 in 1441) [ClassicSimilarity], result of:
              0.023652097 = score(doc=1441,freq=2.0), product of:
                0.15283036 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.043643 = queryNorm
                0.15476047 = fieldWeight in 1441, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=1441)
          0.5 = coord(1/2)
      0.5 = coord(2/4)
    
    Abstract
    This paper presents research on syntactic structures known as noun phrases (NP), applied to increase the effectiveness and efficiency of document classification mechanisms. Our hypothesis is that NPs can be used instead of single words as semantic aggregators, reducing the number of words used by the classification system without losing semantic coverage and thereby increasing its efficiency. The experiment divided the document classification process into three phases: a) NP preprocessing; b) system training; and c) classification experiments. In the first phase, a corpus of digitized texts was submitted to a natural language processing platform in which part-of-speech tagging was done, and then Perl scripts pertaining to the PALAVRAS package were used to extract the noun phrases. The preprocessing also involved a) removing low-meaning NP pre-modifiers, such as quantifiers; b) identifying synonyms and substituting common hyperonyms for them; and c) stemming the relevant words contained in the NPs, for similarity checking against other NPs. The first tests with the resulting documents have demonstrated the effectiveness of this preprocessing. We compared the structural similarity of the documents before and after all the preprocessing steps of phase one. The texts remained consistent with the originals and kept their readability. The second phase involves submitting the modified documents to an SVM algorithm to identify clusters and classify the documents. The classification rules are to be established using a machine learning approach. Finally, tests will be conducted to check the effectiveness of the whole process.
    Source
    Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik
  18. Plaunt, C.; Norgard, B.A.: ¬An association-based method for automatic indexing with a controlled vocabulary (1998) 0.02
    0.021734547 = product of:
      0.043469094 = sum of:
        0.02868653 = weight(_text_:c in 1794) [ClassicSimilarity], result of:
          0.02868653 = score(doc=1794,freq=2.0), product of:
            0.1505424 = queryWeight, product of:
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.043643 = queryNorm
            0.1905545 = fieldWeight in 1794, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1794)
        0.014782562 = product of:
          0.029565124 = sum of:
            0.029565124 = weight(_text_:22 in 1794) [ClassicSimilarity], result of:
              0.029565124 = score(doc=1794,freq=2.0), product of:
                0.15283036 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.043643 = queryNorm
                0.19345059 = fieldWeight in 1794, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1794)
          0.5 = coord(1/2)
      0.5 = coord(2/4)
    
    Date
    11. 9.2000 19:53:22
  19. Schwarz, C.: Komplexe Nominalgruppen als Indexierungseinheiten am Beispiel des Projekte CONDOR (1982) 0.02
    0.02008057 = product of:
      0.08032228 = sum of:
        0.08032228 = weight(_text_:c in 435) [ClassicSimilarity], result of:
          0.08032228 = score(doc=435,freq=2.0), product of:
            0.1505424 = queryWeight, product of:
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.043643 = queryNorm
            0.5335526 = fieldWeight in 435, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.109375 = fieldNorm(doc=435)
      0.25 = coord(1/4)
    
  20. Salton, G.; Allen, J.; Buckley, C.; Singhal, A.: Automatic analysis, theme generation, and summarization of machine-readable data (1994) 0.02
    0.02008057 = product of:
      0.08032228 = sum of:
        0.08032228 = weight(_text_:c in 1168) [ClassicSimilarity], result of:
          0.08032228 = score(doc=1168,freq=2.0), product of:
            0.1505424 = queryWeight, product of:
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.043643 = queryNorm
            0.5335526 = fieldWeight in 1168, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.4494052 = idf(docFreq=3817, maxDocs=44218)
              0.109375 = fieldNorm(doc=1168)
      0.25 = coord(1/4)
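
The score breakdowns in the list above follow the classic Lucene TF-IDF pattern: tf is the square root of the term frequency, idf is 1 + ln(maxDocs / (docFreq + 1)), each term score is queryWeight times fieldWeight, and the sum is scaled by the coord factors. As a minimal sketch, the Python below recomputes the score of result 1 (doc 4424) from the figures printed in its breakdown. The function and variable names are illustrative assumptions, not part of Lucene, and the arithmetic uses 64-bit floats, so it matches the listed 32-bit values only to a few decimal places.

```python
import math

def idf(doc_freq, max_docs):
    # Inverse document frequency as shown in the breakdowns:
    # 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, query_norm, field_norm):
    # tf = sqrt(freq); score = queryWeight * fieldWeight
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight

# Values copied from the breakdown of result 1 (doc 4424).
MAX_DOCS = 44218
QUERY_NORM = 0.043643
FIELD_NORM = 0.109375

c = term_score(2.0, 3817, MAX_DOCS, QUERY_NORM, FIELD_NORM)
et = term_score(2.0, 1101, MAX_DOCS, QUERY_NORM, FIELD_NORM)
# The "al" clause sits in a nested group scaled by coord(1/2).
al = term_score(2.0, 1228, MAX_DOCS, QUERY_NORM, FIELD_NORM) * 0.5

# Three of the four top-level clauses matched, hence coord(3/4).
total = (c + et + al) * 0.75
print(f"c={c:.6f} et={et:.6f} al={al:.6f} total={total:.6f}")
```

The same functions apply to the other results by swapping in their freq, docFreq, fieldNorm, and coord values.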
    

Languages

  • e 54
  • d 29
  • f 2
  • ru 1

Types

  • a 78
  • el 5
  • x 4
  • m 2
  • s 2
  • p 1