Search (36 results, page 1 of 2)

Hodges, P.R.: Keyword in title indexes : effectiveness of retrieval in computer searches (1983) 0.09

0.09078212 = product of:
  0.18156424 = sum of:
    0.12103168 = weight(_text_:fields in 5001) [ClassicSimilarity], result of:
      0.12103168 = score(doc=5001,freq=2.0), product of:
        0.31604284 = queryWeight, product of:
          4.951651 = idf(docFreq=849, maxDocs=44218)
          0.06382575 = queryNorm
        0.38295972 = fieldWeight in 5001, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.951651 = idf(docFreq=849, maxDocs=44218)
          0.0546875 = fieldNorm(doc=5001)
    0.060532555 = weight(_text_:22 in 5001) [ClassicSimilarity], result of:
      0.060532555 = score(doc=5001,freq=2.0), product of:
        0.2235069 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.06382575 = queryNorm
        0.2708308 = fieldWeight in 5001, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.0546875 = fieldNorm(doc=5001)
  0.5 = coord(2/4)

Abstract: A study was done to test the effectiveness of retrieval using title word searching. It was based on actual search profiles used in the Mechanized Information Center at Ohio State University, in order ro replicate as closely as possible actual searching conditions. Fewer than 50% of the relevant titles were retrieved by keywords in titles. The low rate of retrieval can be attributes to three sources: titles themselves, user and information specialist ignorance of the subject vocabulary in use, and to general language problems. Across fields it was found that the social sciences had the best retrieval rate, with science having the next best, and arts and humanities the lowest. Ways to enhance and supplement keyword in title searching on the computer and in printed indexes are discussed.
Date: 14. 3.1996 13:22:21

Daudaravicius, V.: ¬A framework for keyphrase extraction from scientific journals (2016) 0.04
```
0.042791158 = product of:
  0.17116463 = sum of:
    0.17116463 = weight(_text_:fields in 2930) [ClassicSimilarity], result of:
      0.17116463 = score(doc=2930,freq=4.0), product of:
        0.31604284 = queryWeight, product of:
          4.951651 = idf(docFreq=849, maxDocs=44218)
          0.06382575 = queryNorm
        0.5415868 = fieldWeight in 2930, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.951651 = idf(docFreq=849, maxDocs=44218)
          0.0546875 = fieldNorm(doc=2930)
  0.25 = coord(1/4)
```
Abstract

We present a framework for keyphrase extraction from scientific journals in diverse research fields. While journal articles are often provided with manually assigned keywords, it is not clear how to automatically extract keywords and measure their significance for a set of journal articles. We compare extracted keyphrases from journals in the fields of astrophysics, mathematics, physics, and computer science. We show that the presented statistics-based framework is able to demonstrate differences among journals, and that the extracted keyphrases can be used to represent journal or conference research topics, dynamics, and specificity.

Voorhees, E.M.: Implementing agglomerative hierarchic clustering algorithms for use in document retrieval (1986) 0.03

0.034590032 = product of:
  0.13836013 = sum of:
    0.13836013 = weight(_text_:22 in 402) [ClassicSimilarity], result of:
      0.13836013 = score(doc=402,freq=2.0), product of:
        0.2235069 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.06382575 = queryNorm
        0.61904186 = fieldWeight in 402, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.125 = fieldNorm(doc=402)
  0.25 = coord(1/4)

Source: Information processing and management. 22(1986) no.6, S.465-476

Oliver, C.T.: One-eyed king: automated indexing (1989) 0.03

0.03458048 = product of:
  0.13832192 = sum of:
    0.13832192 = weight(_text_:fields in 2316) [ClassicSimilarity], result of:
      0.13832192 = score(doc=2316,freq=2.0), product of:
        0.31604284 = queryWeight, product of:
          4.951651 = idf(docFreq=849, maxDocs=44218)
          0.06382575 = queryNorm
        0.43766826 = fieldWeight in 2316, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.951651 = idf(docFreq=849, maxDocs=44218)
          0.0625 = fieldNorm(doc=2316)
  0.25 = coord(1/4)

Abstract: In a work entitled 'Adagia' published in 1508, Erasmus collected ancient Greek and Roman proverbs. He included this proverb: "Among the blind, the one-eyed man is king". In a field where there is little interest in the theoretical research of related fields, and in understanding the theoretical assumptions on which practical activity is based, a one-eyed man, such as autumatic or mechanical indexing, easily appears respectable and becomes widely practiced despite its obvious deficiencies

Fuhr, N.; Niewelt, B.: ¬Ein Retrievaltest mit automatisch indexierten Dokumenten (1984) 0.03

0.030266277 = product of:
  0.12106511 = sum of:
    0.12106511 = weight(_text_:22 in 262) [ClassicSimilarity], result of:
      0.12106511 = score(doc=262,freq=2.0), product of:
        0.2235069 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.06382575 = queryNorm
        0.5416616 = fieldWeight in 262, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.109375 = fieldNorm(doc=262)
  0.25 = coord(1/4)

Date: 20.10.2000 12:22:23

Hlava, M.M.K.: Automatic indexing : comparing rule-based and statistics-based indexing systems (2005) 0.03

0.030266277 = product of:
  0.12106511 = sum of:
    0.12106511 = weight(_text_:22 in 6265) [ClassicSimilarity], result of:
      0.12106511 = score(doc=6265,freq=2.0), product of:
        0.2235069 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.06382575 = queryNorm
        0.5416616 = fieldWeight in 6265, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.109375 = fieldNorm(doc=6265)
  0.25 = coord(1/4)

Source: Information outlook. 9(2005) no.8, S.22-23

Fuhr, N.: Ranking-Experimente mit gewichteter Indexierung (1986) 0.03

0.025942523 = product of:
  0.10377009 = sum of:
    0.10377009 = weight(_text_:22 in 58) [ClassicSimilarity], result of:
      0.10377009 = score(doc=58,freq=2.0), product of:
        0.2235069 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.06382575 = queryNorm
        0.46428138 = fieldWeight in 58, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.09375 = fieldNorm(doc=58)
  0.25 = coord(1/4)

Date: 14. 6.2015 22:12:44

Hauer, M.: Automatische Indexierung (2000) 0.03

0.025942523 = product of:
  0.10377009 = sum of:
    0.10377009 = weight(_text_:22 in 5887) [ClassicSimilarity], result of:
      0.10377009 = score(doc=5887,freq=2.0), product of:
        0.2235069 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.06382575 = queryNorm
        0.46428138 = fieldWeight in 5887, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.09375 = fieldNorm(doc=5887)
  0.25 = coord(1/4)

Source: Wissen in Aktion: Wege des Knowledge Managements. 22. Online-Tagung der DGI, Frankfurt am Main, 2.-4.5.2000. Proceedings. Hrsg.: R. Schmidt

Fuhr, N.: Rankingexperimente mit gewichteter Indexierung (1986) 0.03

0.025942523 = product of:
  0.10377009 = sum of:
    0.10377009 = weight(_text_:22 in 2051) [ClassicSimilarity], result of:
      0.10377009 = score(doc=2051,freq=2.0), product of:
        0.2235069 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.06382575 = queryNorm
        0.46428138 = fieldWeight in 2051, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.09375 = fieldNorm(doc=2051)
  0.25 = coord(1/4)

Date: 14. 6.2015 22:12:56

Hauer, M.: Tiefenindexierung im Bibliothekskatalog : 17 Jahre intelligentCAPTURE (2019) 0.03

0.025942523 = product of:
  0.10377009 = sum of:
    0.10377009 = weight(_text_:22 in 5629) [ClassicSimilarity], result of:
      0.10377009 = score(doc=5629,freq=2.0), product of:
        0.2235069 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.06382575 = queryNorm
        0.46428138 = fieldWeight in 5629, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.09375 = fieldNorm(doc=5629)
  0.25 = coord(1/4)

Source: B.I.T.online. 22(2019) H.2, S.163-166

Biebricher, N.; Fuhr, N.; Lustig, G.; Schwantner, M.; Knorz, G.: ¬The automatic indexing system AIR/PHYS : from research to application (1988) 0.02

0.02161877 = product of:
  0.08647508 = sum of:
    0.08647508 = weight(_text_:22 in 1952) [ClassicSimilarity], result of:
      0.08647508 = score(doc=1952,freq=2.0), product of:
        0.2235069 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.06382575 = queryNorm
        0.38690117 = fieldWeight in 1952, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.078125 = fieldNorm(doc=1952)
  0.25 = coord(1/4)

Date: 16. 8.1998 12:51:22

Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998) 0.02

0.02161877 = product of:
  0.08647508 = sum of:
    0.08647508 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
      0.08647508 = score(doc=4157,freq=2.0), product of:
        0.2235069 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.06382575 = queryNorm
        0.38690117 = fieldWeight in 4157, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.078125 = fieldNorm(doc=4157)
  0.25 = coord(1/4)

Source: Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Hrsg. von Marlies Ockenfeld u. Gerhard J. Mantwill

Tsareva, P.V.: Algoritmy dlya raspoznavaniya pozitivnykh i negativnykh vkhozdenii deskriptorov v tekst i protsedura avtomaticheskoi klassifikatsii tekstov (1999) 0.02

0.02161877 = product of:
  0.08647508 = sum of:
    0.08647508 = weight(_text_:22 in 374) [ClassicSimilarity], result of:
      0.08647508 = score(doc=374,freq=2.0), product of:
        0.2235069 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.06382575 = queryNorm
        0.38690117 = fieldWeight in 374, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.078125 = fieldNorm(doc=374)
  0.25 = coord(1/4)

Date: 1. 4.2002 10:22:41

Stankovic, R. et al.: Indexing of textual databases based on lexical resources : a case study for Serbian (2016) 0.02

0.02161877 = product of:
  0.08647508 = sum of:
    0.08647508 = weight(_text_:22 in 2759) [ClassicSimilarity], result of:
      0.08647508 = score(doc=2759,freq=2.0), product of:
        0.2235069 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.06382575 = queryNorm
        0.38690117 = fieldWeight in 2759, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.078125 = fieldNorm(doc=2759)
  0.25 = coord(1/4)

Date: 1. 2.2016 18:25:22

Yang, T.-H.; Hsieh, Y.-L.; Liu, S.-H.; Chang, Y.-C.; Hsu, W.-L.: ¬A flexible template generation and matching method with applications for publication reference metadata extraction (2021) 0.02
```
0.0216128 = product of:
  0.0864512 = sum of:
    0.0864512 = weight(_text_:fields in 63) [ClassicSimilarity], result of:
      0.0864512 = score(doc=63,freq=2.0), product of:
        0.31604284 = queryWeight, product of:
          4.951651 = idf(docFreq=849, maxDocs=44218)
          0.06382575 = queryNorm
        0.27354267 = fieldWeight in 63, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.951651 = idf(docFreq=849, maxDocs=44218)
          0.0390625 = fieldNorm(doc=63)
  0.25 = coord(1/4)
```
Abstract

Conventional rule-based approaches use exact template matching to capture linguistic information and necessarily need to enumerate all variations. We propose a novel flexible template generation and matching scheme called the principle-based approach (PBA) based on sequence alignment, and employ it for reference metadata extraction (RME) to demonstrate its effectiveness. The main contributions of this research are threefold. First, we propose an automatic template generation that can capture prominent patterns using the dominating set algorithm. Second, we devise an alignment-based template-matching technique that uses a logistic regression model, which makes it more general and flexible than pure rule-based approaches. Last, we apply PBA to RME on extensive cross-domain corpora and demonstrate its robustness and generality. Experiments reveal that the same set of templates produced by the PBA framework not only deliver consistent performance on various unseen domains, but also surpass hand-crafted knowledge (templates). We use four independent journal style test sets and one conference style test set in the experiments. When compared to renowned machine learning methods, such as conditional random fields (CRF), as well as recent deep learning methods (i.e., bi-directional long short-term memory with a CRF layer, Bi-LSTM-CRF), PBA has the best performance for all datasets.

Tsujii, J.-I.: Automatic acquisition of semantic collocation from corpora (1995) 0.02

0.017295016 = product of:
  0.069180064 = sum of:
    0.069180064 = weight(_text_:22 in 4709) [ClassicSimilarity], result of:
      0.069180064 = score(doc=4709,freq=2.0), product of:
        0.2235069 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.06382575 = queryNorm
        0.30952093 = fieldWeight in 4709, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.0625 = fieldNorm(doc=4709)
  0.25 = coord(1/4)

Date: 31. 7.1996 9:22:19

Riloff, E.: ¬An empirical study of automated dictionary construction for information extraction in three domains (1996) 0.02

0.017295016 = product of:
  0.069180064 = sum of:
    0.069180064 = weight(_text_:22 in 6752) [ClassicSimilarity], result of:
      0.069180064 = score(doc=6752,freq=2.0), product of:
        0.2235069 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.06382575 = queryNorm
        0.30952093 = fieldWeight in 6752, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.0625 = fieldNorm(doc=6752)
  0.25 = coord(1/4)

Date: 6. 3.1997 16:22:15

Lepsky, K.; Vorhauer, J.: Lingo - ein open source System für die Automatische Indexierung deutschsprachiger Dokumente (2006) 0.02

0.017295016 = product of:
  0.069180064 = sum of:
    0.069180064 = weight(_text_:22 in 3581) [ClassicSimilarity], result of:
      0.069180064 = score(doc=3581,freq=2.0), product of:
        0.2235069 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.06382575 = queryNorm
        0.30952093 = fieldWeight in 3581, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.0625 = fieldNorm(doc=3581)
  0.25 = coord(1/4)

Date: 24. 3.2006 12:22:02

Probst, M.; Mittelbach, J.: Maschinelle Indexierung in der Sacherschließung wissenschaftlicher Bibliotheken (2006) 0.02

0.017295016 = product of:
  0.069180064 = sum of:
    0.069180064 = weight(_text_:22 in 1755) [ClassicSimilarity], result of:
      0.069180064 = score(doc=1755,freq=2.0), product of:
        0.2235069 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.06382575 = queryNorm
        0.30952093 = fieldWeight in 1755, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.0625 = fieldNorm(doc=1755)
  0.25 = coord(1/4)

Date: 22. 3.2008 12:35:19

Glaesener, L.: Automatisches Indexieren einer informationswissenschaftlichen Datenbank mit Mehrwortgruppen (2012) 0.02

0.017295016 = product of:
  0.069180064 = sum of:
    0.069180064 = weight(_text_:22 in 401) [ClassicSimilarity], result of:
      0.069180064 = score(doc=401,freq=2.0), product of:
        0.2235069 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.06382575 = queryNorm
        0.30952093 = fieldWeight in 401, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.0625 = fieldNorm(doc=401)
  0.25 = coord(1/4)

Date: 11. 9.2012 19:43:22

Search (36 results, page 1 of 2)

Authors

Years

Languages

Types

Themes