Search (53 results, page 1 of 3)

Fuhr, N.; Niewelt, B.: ¬Ein Retrievaltest mit automatisch indexierten Dokumenten (1984) 0.10

0.096148506 = product of:
  0.14422275 = sum of:
    0.09689408 = weight(_text_:b in 262) [ClassicSimilarity], result of:
      0.09689408 = score(doc=262,freq=2.0), product of:
        0.17680629 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.049903523 = queryNorm
        0.54802394 = fieldWeight in 262, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.109375 = fieldNorm(doc=262)
    0.04732867 = product of:
      0.09465734 = sum of:
        0.09465734 = weight(_text_:22 in 262) [ClassicSimilarity], result of:
          0.09465734 = score(doc=262,freq=2.0), product of:
            0.17475364 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049903523 = queryNorm
            0.5416616 = fieldWeight in 262, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=262)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Date: 20.10.2000 12:22:23

Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998) 0.07

0.06867751 = product of:
  0.10301626 = sum of:
    0.06921006 = weight(_text_:b in 4157) [ClassicSimilarity], result of:
      0.06921006 = score(doc=4157,freq=2.0), product of:
        0.17680629 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.049903523 = queryNorm
        0.3914457 = fieldWeight in 4157, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.078125 = fieldNorm(doc=4157)
    0.033806194 = product of:
      0.06761239 = sum of:
        0.06761239 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
          0.06761239 = score(doc=4157,freq=2.0), product of:
            0.17475364 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049903523 = queryNorm
            0.38690117 = fieldWeight in 4157, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=4157)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Source: Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Hrsg. von Marlies Ockenfeld u. Gerhard J. Mantwill

Martins, A.L.; Souza, R.R.; Ribeiro de Mello, H.: ¬The use of noun phrases in information retrieval : proposing a mechanism for automatic classification (2014) 0.04
```
0.035115734 = product of:
  0.0526736 = sum of:
    0.039151125 = weight(_text_:b in 1441) [ClassicSimilarity], result of:
      0.039151125 = score(doc=1441,freq=4.0), product of:
        0.17680629 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.049903523 = queryNorm
        0.22143513 = fieldWeight in 1441, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.03125 = fieldNorm(doc=1441)
    0.013522477 = product of:
      0.027044954 = sum of:
        0.027044954 = weight(_text_:22 in 1441) [ClassicSimilarity], result of:
          0.027044954 = score(doc=1441,freq=2.0), product of:
            0.17475364 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049903523 = queryNorm
            0.15476047 = fieldWeight in 1441, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=1441)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)
```
Abstract

This paper presents a research on syntactic structures known as noun phrases (NP) being applied to increase the effectiveness and efficiency of the mechanisms for the document's classification. Our hypothesis is the fact that the NP can be used instead of single words as a semantic aggregator to reduce the number of words that will be used for the classification system without losing its semantic coverage, increasing its efficiency. The experiment divided the documents classification process in three phases: a) NP preprocessing b) system training; and c) classification experiments. In the first step, a corpus of digitalized texts was submitted to a natural language processing platform1 in which the part-of-speech tagging was done, and them PERL scripts pertaining to the PALAVRAS package were used to extract the Noun Phrases. The preprocessing also involved the tasks of a) removing NP low meaning pre-modifiers, as quantifiers; b) identification of synonyms and corresponding substitution for common hyperonyms; and c) stemming of the relevant words contained in the NP, for similitude checking with other NPs. The first tests with the resulting documents have demonstrated its effectiveness. We have compared the structural similarity of the documents before and after the whole pre-processing steps of phase one. The texts maintained the consistency with the original and have kept the readability. The second phase involves submitting the modified documents to a SVM algorithm to identify clusters and classify the documents. The classification rules are to be established using a machine learning approach. Finally, tests will be conducted to check the effectiveness of the whole process.

Source

Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik

Yusuff, A.: Automatisches Indexing and Abstracting : Grundlagen und Beispiele (2002) 0.03

0.03229803 = product of:
  0.09689408 = sum of:
    0.09689408 = weight(_text_:b in 1577) [ClassicSimilarity], result of:
      0.09689408 = score(doc=1577,freq=2.0), product of:
        0.17680629 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.049903523 = queryNorm
        0.54802394 = fieldWeight in 1577, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.109375 = fieldNorm(doc=1577)
  0.33333334 = coord(1/3)

Imprint: Potsdam : Fachhochschule, FB A-B-D

Olsgaard, J.N.; Evans, E.J.: Improving keyword indexing (1981) 0.03

0.028690744 = product of:
  0.08607223 = sum of:
    0.08607223 = product of:
      0.17214446 = sum of:
        0.17214446 = weight(_text_:72 in 4996) [ClassicSimilarity], result of:
          0.17214446 = score(doc=4996,freq=2.0), product of:
            0.27884293 = queryWeight, product of:
              5.58764 = idf(docFreq=449, maxDocs=44218)
              0.049903523 = queryNorm
            0.6173528 = fieldWeight in 4996, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.58764 = idf(docFreq=449, maxDocs=44218)
              0.078125 = fieldNorm(doc=4996)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Source: Journal of the American society for information science. 32(1981), S.71-72

Thirion, B.; Leroy, J.P.; Baudic, F.; Douyère, M.; Piot, J.; Darmoni, S.J.: SDI selecting, decribing, and indexing : did you mean automatically? (2001) 0.03

0.027684024 = product of:
  0.08305207 = sum of:
    0.08305207 = weight(_text_:b in 6198) [ClassicSimilarity], result of:
      0.08305207 = score(doc=6198,freq=2.0), product of:
        0.17680629 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.049903523 = queryNorm
        0.46973482 = fieldWeight in 6198, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.09375 = fieldNorm(doc=6198)
  0.33333334 = coord(1/3)

Greiner-Petter, A.; Schubotz, M.; Cohl, H.S.; Gipp, B.: Semantic preserving bijective mappings for expressions involving special functions between computer algebra systems and document preparation systems (2019) 0.03

0.027471002 = product of:
  0.0412065 = sum of:
    0.027684024 = weight(_text_:b in 5499) [ClassicSimilarity], result of:
      0.027684024 = score(doc=5499,freq=2.0), product of:
        0.17680629 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.049903523 = queryNorm
        0.15657827 = fieldWeight in 5499, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.03125 = fieldNorm(doc=5499)
    0.013522477 = product of:
      0.027044954 = sum of:
        0.027044954 = weight(_text_:22 in 5499) [ClassicSimilarity], result of:
          0.027044954 = score(doc=5499,freq=2.0), product of:
            0.17475364 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049903523 = queryNorm
            0.15476047 = fieldWeight in 5499, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=5499)
      0.5 = coord(1/2)
  0.6666667 = coord(2/3)

Date: 20. 1.2015 18:30:22

Wiesenmüller, H.: DNB-Sacherschließung : Neues für die Reihen A und B (2019) 0.02
```
0.02397507 = product of:
  0.07192521 = sum of:
    0.07192521 = weight(_text_:b in 5212) [ClassicSimilarity], result of:
      0.07192521 = score(doc=5212,freq=6.0), product of:
        0.17680629 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.049903523 = queryNorm
        0.40680233 = fieldWeight in 5212, product of:
          2.4494898 = tf(freq=6.0), with freq of:
            6.0 = termFreq=6.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.046875 = fieldNorm(doc=5212)
  0.33333334 = coord(1/3)
```
Abstract

"Alle paar Jahre wird die Bibliothekscommunity mit Veränderungen in der inhaltlichen Erschließung durch die Deutsche Nationalbibliothek konfrontiert. Sicher werden sich viele noch an die Einschnitte des Jahres 2014 für die Reihe A erinnern: Seither werden u.a. Ratgeber, Sprachwörterbücher, Reiseführer und Kochbücher nicht mehr mit Schlagwörtern erschlossen (vgl. das DNB-Konzept von 2014). Das Jahr 2017 brachte die Einführung der maschinellen Indexierung für die Reihen B und H bei gleichzeitigem Verlust der DDC-Tiefenerschließung (vgl. DNB-Informationen von 2017). Virulent war seither die Frage, was mit der Reihe A passieren würde. Seit wenigen Tagen kann man dies nun auf der Website der DNB nachlesen. (Nebenbei: Es ist zu befürchten, dass viele Links in diesem Blog-Beitrag in absehbarer Zeit nicht mehr funktionieren werden, da ein Relaunch der DNB-Website angekündigt ist. Wie beim letzten Mal wird es vermutlich auch diesmal keine Weiterleitungen von den alten auf die neuen URLs geben.)"

Source

https://www.basiswissen-rda.de/dnb-sacherschliessung-reihen-a-und-b/

Thönssen, B.: Automatische Indexierung und Schnittstellen zu Thesauri (1988) 0.02

0.02307002 = product of:
  0.06921006 = sum of:
    0.06921006 = weight(_text_:b in 30) [ClassicSimilarity], result of:
      0.06921006 = score(doc=30,freq=2.0), product of:
        0.17680629 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.049903523 = queryNorm
        0.3914457 = fieldWeight in 30, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.078125 = fieldNorm(doc=30)
  0.33333334 = coord(1/3)

Biebricher, P.; Fuhr, N.; Niewelt, B.: ¬Der AIR-Retrievaltest (1986) 0.02

0.02307002 = product of:
  0.06921006 = sum of:
    0.06921006 = weight(_text_:b in 4040) [ClassicSimilarity], result of:
      0.06921006 = score(doc=4040,freq=2.0), product of:
        0.17680629 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.049903523 = queryNorm
        0.3914457 = fieldWeight in 4040, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.078125 = fieldNorm(doc=4040)
  0.33333334 = coord(1/3)

SIGIR'92 : Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (1992) 0.02
```
0.018055148 = product of:
  0.05416544 = sum of:
    0.05416544 = weight(_text_:b in 6671) [ClassicSimilarity], result of:
      0.05416544 = score(doc=6671,freq=10.0), product of:
        0.17680629 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.049903523 = queryNorm
        0.30635473 = fieldWeight in 6671, product of:
          3.1622777 = tf(freq=10.0), with freq of:
            10.0 = termFreq=10.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.02734375 = fieldNorm(doc=6671)
  0.33333334 = coord(1/3)
```
Content

HARMAN, D.: Relevance feedback revisited; AALBERSBERG, I.J.: Incremental relevance feedback; TAGUE-SUTCLIFFE, J.: Measuring the informativeness of a retrieval process; LEWIS, D.D.: An evaluation of phrasal and clustered representations on a text categorization task; BLOSSEVILLE, M.J., G. HÉBRAIL, M.G. MONTEIL u. N. PÉNOT: Automatic document classification: natural language processing, statistical analysis, and expert system techniques used together; MASAND, B., G. LINOFF u. D. WALTZ: Classifying news stories using memory based reasoning; KEEN, E.M.: Term position ranking: some new test results; CROUCH, C.J. u. B. YANG: Experiments in automatic statistical thesaurus construction; GREFENSTETTE, G.: Use of syntactic context to produce term association lists for text retrieval; ANICK, P.G. u. R.A. FLYNN: Versioning of full-text information retrieval system; BURKOWSKI, F.J.: Retrieval activities in a database consisting of heterogeneous collections; DEERWESTER, S.C., K. WACLENA u. M. LaMAR: A textual object management system; NIE, J.-Y.:Towards a probabilistic modal logic for semantic-based information retrieval; WANG, A.W., S.K.M. WONG u. Y.Y. YAO: An analysis of vector space models based on computational geometry; BARTELL, B.T., G.W. COTTRELL u. R.K. BELEW: Latent semantic indexing is an optimal special case of multidimensional scaling; GLAVITSCH, U. u. P. SCHÄUBLE: A system for retrieving speech documents; MARGULIS, E.L.: N-Poisson document modelling; HESS, M.: An incrementally extensible document retrieval system based on linguistics and logical principles; COOPER, W.S., F.C. GEY u. D.P. DABNEY: Probabilistic retrieval based on staged logistic regression; FUHR, N.: Integration of probabilistic fact and text retrieval; CROFT, B., L.A. SMITH u. H. TURTLE: A loosely-coupled integration of a text retrieval system and an object-oriented database system; DUMAIS, S.T. u. J. NIELSEN: Automating the assignement of submitted manuscripts to reviewers; GOST, M.A. u. M. MASOTTI: Design of an OPAC database to permit different subject searching accesses; ROBERTSON, A.M. u. P. WILLETT: Searching for historical word forms in a database of 17th century English text using spelling correction methods; FAX, E.A., Q.F. CHEN u. L.S. HEATH: A faster algorithm for constructing minimal perfect hash functions; MOFFAT, A. u. J. ZOBEL: Parameterised compression for sparse bitmaps; GRANDI, F., P. TIBERIO u. P. Zezula: Frame-sliced patitioned parallel signature files; ALLEN, B.: Cognitive differences in end user searching of a CD-ROM index; SONNENWALD, D.H.: Developing a theory to guide the process of designing information retrieval systems; CUTTING, D.R., J.O. PEDERSEN, D. KARGER, u. J.W. TUKEY: Scatter/ Gather: a cluster-based approach to browsing large document collections; CHALMERS, M. u. P. CHITSON: Bead: Explorations in information visualization; WILLIAMSON, C. u. B. SHNEIDERMAN: The dynamic HomeFinder: evaluating dynamic queries in a real-estate information exploring system

Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986) 0.02

0.01802997 = product of:
  0.054089908 = sum of:
    0.054089908 = product of:
      0.108179815 = sum of:
        0.108179815 = weight(_text_:22 in 402) [ClassicSimilarity], result of:
          0.108179815 = score(doc=402,freq=2.0), product of:
            0.17475364 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049903523 = queryNorm
            0.61904186 = fieldWeight in 402, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.125 = fieldNorm(doc=402)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Source: Information processing and management. 22(1986) no.6, S.465-476

Matthews, P.; Glitre, K.: Genre analysis of movies using a topic model of plot summaries (2021) 0.02

0.017214447 = product of:
  0.05164334 = sum of:
    0.05164334 = product of:
      0.10328668 = sum of:
        0.10328668 = weight(_text_:72 in 412) [ClassicSimilarity], result of:
          0.10328668 = score(doc=412,freq=2.0), product of:
            0.27884293 = queryWeight, product of:
              5.58764 = idf(docFreq=449, maxDocs=44218)
              0.049903523 = queryNorm
            0.3704117 = fieldWeight in 412, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.58764 = idf(docFreq=449, maxDocs=44218)
              0.046875 = fieldNorm(doc=412)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Source: Journal of the Association for Information Science and Technology. 72(2021) no.12, S.1511-1527

Krutulis, J.D.; Jacob, E.K.: ¬A theoretical model for the study of emergent structure in adaptive information networks (1995) 0.02

0.016149014 = product of:
  0.04844704 = sum of:
    0.04844704 = weight(_text_:b in 3353) [ClassicSimilarity], result of:
      0.04844704 = score(doc=3353,freq=2.0), product of:
        0.17680629 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.049903523 = queryNorm
        0.27401197 = fieldWeight in 3353, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.0546875 = fieldNorm(doc=3353)
  0.33333334 = coord(1/3)

Source: Connectedness: information, systems, people, organizations. Proceedings of CAIS/ACSI 95, the proceedings of the 23rd Annual Conference of the Canadian Association for Information Science. Ed. by Hope A. Olson and Denis B. Ward

Siebenkäs, A.; Markscheffel, B.: Conception of a workflow for the semi-automatic construction of a thesaurus for the German printing industry (2015) 0.02

0.016149014 = product of:
  0.04844704 = sum of:
    0.04844704 = weight(_text_:b in 2091) [ClassicSimilarity], result of:
      0.04844704 = score(doc=2091,freq=2.0), product of:
        0.17680629 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.049903523 = queryNorm
        0.27401197 = fieldWeight in 2091, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2091)
  0.33333334 = coord(1/3)

Wiesenmüller, H.: Maschinelle Indexierung am Beispiel der DNB : Analyse und Entwicklungmöglichkeiten (2018) 0.02
```
0.016149014 = product of:
  0.04844704 = sum of:
    0.04844704 = weight(_text_:b in 5209) [ClassicSimilarity], result of:
      0.04844704 = score(doc=5209,freq=2.0), product of:
        0.17680629 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.049903523 = queryNorm
        0.27401197 = fieldWeight in 5209, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.0546875 = fieldNorm(doc=5209)
  0.33333334 = coord(1/3)
```
Abstract

Der Beitrag untersucht die Ergebnisse des bei der Deutschen Nationalbibliothek (DNB) eingesetzten Verfahrens zur automatischen Vergabe von Schlagwörtern. Seit 2017 kommt dieses auch bei Printausgaben der Reihen B und H der Deutschen Nationalbibliografie zum Einsatz. Die zentralen Problembereiche werden dargestellt und an Beispielen illustriert - beispielsweise dass nicht alle im Inhaltsverzeichnis vorkommenden Wörter tatsächlich thematische Aspekte ausdrücken und dass die Software sehr häufig Körperschaften und andere "Named entities" nicht erkennt. Die maschinell generierten Ergebnisse sind derzeit sehr unbefriedigend. Es werden Überlegungen für mögliche Verbesserungen und sinnvolle Strategien angestellt.

Hlava, M.M.K.: Automatic indexing : comparing rule-based and statistics-based indexing systems (2005) 0.02

0.015776224 = product of:
  0.04732867 = sum of:
    0.04732867 = product of:
      0.09465734 = sum of:
        0.09465734 = weight(_text_:22 in 6265) [ClassicSimilarity], result of:
          0.09465734 = score(doc=6265,freq=2.0), product of:
            0.17475364 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.049903523 = queryNorm
            0.5416616 = fieldWeight in 6265, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.109375 = fieldNorm(doc=6265)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Source: Information outlook. 9(2005) no.8, S.22-23

Yang, T.-H.; Hsieh, Y.-L.; Liu, S.-H.; Chang, Y.-C.; Hsu, W.-L.: ¬A flexible template generation and matching method with applications for publication reference metadata extraction (2021) 0.01

0.014345372 = product of:
  0.043036114 = sum of:
    0.043036114 = product of:
      0.08607223 = sum of:
        0.08607223 = weight(_text_:72 in 63) [ClassicSimilarity], result of:
          0.08607223 = score(doc=63,freq=2.0), product of:
            0.27884293 = queryWeight, product of:
              5.58764 = idf(docFreq=449, maxDocs=44218)
              0.049903523 = queryNorm
            0.3086764 = fieldWeight in 63, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.58764 = idf(docFreq=449, maxDocs=44218)
              0.0390625 = fieldNorm(doc=63)
      0.5 = coord(1/2)
  0.33333334 = coord(1/3)

Source: Journal of the Association for Information Science and Technology. 72(2021) no.1, S.32-45

Experimentelles und praktisches Information Retrieval : Festschrift für Gerhard Lustig (1992) 0.01
```
0.013842012 = product of:
  0.041526034 = sum of:
    0.041526034 = weight(_text_:b in 4) [ClassicSimilarity], result of:
      0.041526034 = score(doc=4,freq=2.0), product of:
        0.17680629 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.049903523 = queryNorm
        0.23486741 = fieldWeight in 4, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.046875 = fieldNorm(doc=4)
  0.33333334 = coord(1/3)
```
Content

Enthält die Beiträge: SALTON, G.: Effective text understanding in information retrieval; KRAUSE, J.: Intelligentes Information retrieval; FUHR, N.: Konzepte zur Gestaltung zukünftiger Information-Retrieval-Systeme; HÜTHER, H.: Überlegungen zu einem mathematischen Modell für die Type-Token-, die Grundform-Token und die Grundform-Type-Relation; KNORZ, G.: Automatische Generierung inferentieller Links in und zwischen Hyperdokumenten; KONRAD, E.: Zur Effektivitätsbewertung von Information-Retrieval-Systemen; HENRICHS, N.: Retrievalunterstützung durch automatisch generierte Wortfelder; LÜCK, W., W. RITTBERGER u. M. SCHWANTNER: Der Einsatz des Automatischen Indexierungs- und Retrieval-System (AIR) im Fachinformationszentrum Karlsruhe; REIMER, U.: Verfahren der Automatischen Indexierung. Benötigtes Vorwissen und Ansätze zu seiner automatischen Akquisition: Ein Überblick; ENDRES-NIGGEMEYER, B.: Dokumentrepräsentation: Ein individuelles prozedurales Modell des Abstracting, des Indexierens und Klassifizierens; SEELBACH, D.: Zur Entwicklung von zwei- und mehrsprachigen lexikalischen Datenbanken und Terminologiedatenbanken; ZIMMERMANN, H.: Der Einfluß der Sprachbarrieren in Europa und Möglichkeiten zu ihrer Minderung; LENDERS, W.: Wörter zwischen Welt und Wissen; PANYR, J.: Frames, Thesauri und automatische Klassifikation (Clusteranalyse): HAHN, U.: Forschungsstrategien und Erkenntnisinteressen in der anwendungsorientierten automatischen Sprachverarbeitung. Überlegungen zu einer ingenieurorientierten Computerlinguistik; KUHLEN, R.: Hypertext und Information Retrieval - mehr als Browsing und Suche.
Cui, H.; Boufford, D.; Selden, P.: Semantic annotation of biosystematics literature without training examples (2010) 0.01
```
0.013842012 = product of:
  0.041526034 = sum of:
    0.041526034 = weight(_text_:b in 3422) [ClassicSimilarity], result of:
      0.041526034 = score(doc=3422,freq=2.0), product of:
        0.17680629 = queryWeight, product of:
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.049903523 = queryNorm
        0.23486741 = fieldWeight in 3422, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.542962 = idf(docFreq=3476, maxDocs=44218)
          0.046875 = fieldNorm(doc=3422)
  0.33333334 = coord(1/3)
```
Abstract

This article presents an unsupervised algorithm for semantic annotation of morphological descriptions of whole organisms. The algorithm is able to annotate plain text descriptions with high accuracy at the clause level by exploiting the corpus itself. In other words, the algorithm does not need lexicons, syntactic parsers, training examples, or annotation templates. The evaluation on two real-life description collections in botany and paleontology shows that the algorithm has the following desirable features: (a) reduces/eliminates manual labor required to compile dictionaries and prepare source documents; (b) improves annotation coverage: the algorithm annotates what appears in documents and is not limited by predefined and often incomplete templates; (c) learns clean and reusable concepts: the algorithm learns organ names and character states that can be used to construct reusable domain lexicons, as opposed to collection-dependent patterns whose applicability is often limited to a particular collection; (d) insensitive to collection size; and (e) runs in linear time with respect to the number of clauses to be annotated.

Search (53 results, page 1 of 3)

Authors

Years

Languages

Types

Themes