Search (40 results, page 2 of 2)

  • × theme_ss:"Automatisches Indexieren"
  • × year_i:[2010 TO 2020}
  1. Schöneberg, U.; Gödert, W.: Erschließung mathematischer Publikationen mittels linguistischer Verfahren (2012) 0.00
    0.0013376063 = product of:
      0.009363244 = sum of:
        0.009363244 = product of:
          0.028089732 = sum of:
            0.028089732 = weight(_text_:f in 1055) [ClassicSimilarity], result of:
              0.028089732 = score(doc=1055,freq=2.0), product of:
                0.10631079 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.026672479 = queryNorm
                0.26422277 = fieldWeight in 1055, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1055)
          0.33333334 = coord(1/3)
      0.14285715 = coord(1/7)
    
    Source
    http://at.yorku.ca/c/b/f/j/99.htm
  2. Gil-Leiva, I.: SISA-automatic indexing system for scientific articles : experiments with location heuristics rules versus TF-IDF rules (2017) 0.00
    0.0013376063 = product of:
      0.009363244 = sum of:
        0.009363244 = product of:
          0.028089732 = sum of:
            0.028089732 = weight(_text_:f in 3622) [ClassicSimilarity], result of:
              0.028089732 = score(doc=3622,freq=2.0), product of:
                0.10631079 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.026672479 = queryNorm
                0.26422277 = fieldWeight in 3622, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3622)
          0.33333334 = coord(1/3)
      0.14285715 = coord(1/7)
    
    Abstract
    Indexing is contextualized and a brief description is provided of some of the most used automatic indexing systems. We describe SISA, a system which uses location heuristics rules, statistical rules like term frequency (TF) or TF-IDF to obtain automatic or semi-automatic indexing, depending on the user's preference. The aim of this research is to ascertain which rules (location heuristics rules or TF-IDF rules) provide the best indexing terms. SISA is used to obtain the automatic indexing of 200 scientific articles on fruit growing written in Portuguese. It uses, on the one hand, location heuristics rules founded on the value of certain parts of the articles for indexing such as titles, abstracts, keywords, headings, first paragraph, conclusions and references and, on the other, TF-IDF rules. The indexing is then evaluated to ascertain retrieval performance through recall, precision and f-measure. Automatic indexing of the articles with location heuristics rules provided the best results with the evaluation measures.
  3. Martins, A.L.; Souza, R.R.; Ribeiro de Mello, H.: ¬The use of noun phrases in information retrieval : proposing a mechanism for automatic classification (2014) 0.00
    0.0010325008 = product of:
      0.007227505 = sum of:
        0.007227505 = product of:
          0.01445501 = sum of:
            0.01445501 = weight(_text_:22 in 1441) [ClassicSimilarity], result of:
              0.01445501 = score(doc=1441,freq=2.0), product of:
                0.093402475 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.026672479 = queryNorm
                0.15476047 = fieldWeight in 1441, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=1441)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Source
    Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik
  4. Mesquita, L.A.P.; Souza, R.R.; Baracho Porto, R.M.A.: Noun phrases in automatic indexing: : a structural analysis of the distribution of relevant terms in doctoral theses (2014) 0.00
    0.0010325008 = product of:
      0.007227505 = sum of:
        0.007227505 = product of:
          0.01445501 = sum of:
            0.01445501 = weight(_text_:22 in 1442) [ClassicSimilarity], result of:
              0.01445501 = score(doc=1442,freq=2.0), product of:
                0.093402475 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.026672479 = queryNorm
                0.15476047 = fieldWeight in 1442, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.03125 = fieldNorm(doc=1442)
          0.5 = coord(1/2)
      0.14285715 = coord(1/7)
    
    Source
    Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik
  5. Chung, E.-K.; Miksa, S.; Hastings, S.K.: ¬A framework of automatic subject term assignment for text categorization : an indexing conception-based approach (2010) 0.00
    8.9173764E-4 = product of:
      0.006242163 = sum of:
        0.006242163 = product of:
          0.018726489 = sum of:
            0.018726489 = weight(_text_:f in 3434) [ClassicSimilarity], result of:
              0.018726489 = score(doc=3434,freq=2.0), product of:
                0.10631079 = queryWeight, product of:
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.026672479 = queryNorm
                0.17614852 = fieldWeight in 3434, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.985786 = idf(docFreq=2232, maxDocs=44218)
                  0.03125 = fieldNorm(doc=3434)
          0.33333334 = coord(1/3)
      0.14285715 = coord(1/7)
    
    Abstract
    The purpose of this study is to examine whether the understandings of subject-indexing processes conducted by human indexers have a positive impact on the effectiveness of automatic subject term assignment through text categorization (TC). More specifically, human indexers' subject-indexing approaches, or conceptions, in conjunction with semantic sources were explored in the context of a typical scientific journal article dataset. Based on the premise that subject indexing approaches or conceptions with semantic sources are important for automatic subject term assignment through TC, this study proposed an indexing conception-based framework. For the purpose of this study, two research questions were explored: To what extent are semantic sources effective? To what extent are indexing conceptions effective? The experiments were conducted using a Support Vector Machine implementation in WEKA (I.H. Witten & E. Frank, [2000]). Using F-measure, the experiment results showed that cited works, source title, and title were as effective as the full text while a keyword was found more effective than the full text. In addition, the findings showed that an indexing conception-based framework was more effective than the full text. The content-oriented and the document-oriented indexing approaches especially were found more effective than the full text. Among three indexing conception-based approaches, the content-oriented approach and the document-oriented approach were more effective than the domain-oriented approach. In other words, in the context of a typical scientific journal article dataset, the objective contents and authors' intentions were more desirable for automatic subject term assignment via TC than the possible users' needs. The findings of this study support that incorporation of human indexers' indexing approaches or conception in conjunction with semantic sources has a positive impact on the effectiveness of automatic subject term assignment.
  6. Williams, R.V.: Hans Peter Luhn and Herbert M. Ohlman : their roles in the origins of keyword-in-context/permutation automatic indexing (2010) 0.00
    7.373484E-4 = product of:
      0.0051614386 = sum of:
        0.0051614386 = product of:
          0.020645754 = sum of:
            0.020645754 = weight(_text_:m in 3440) [ClassicSimilarity], result of:
              0.020645754 = score(doc=3440,freq=4.0), product of:
                0.066373095 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.026672479 = queryNorm
                0.31105608 = fieldWeight in 3440, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3440)
          0.25 = coord(1/4)
      0.14285715 = coord(1/7)
    
    Biographed
    Ohlmann, Herbert M.
  7. Mao, J.; Xu, W.; Yang, Y.; Wang, J.; Yuille, A.L.: Explain images with multimodal recurrent neural networks (2014) 0.00
    6.7729776E-4 = product of:
      0.0047410843 = sum of:
        0.0047410843 = product of:
          0.018964337 = sum of:
            0.018964337 = weight(_text_:m in 1557) [ClassicSimilarity], result of:
              0.018964337 = score(doc=1557,freq=6.0), product of:
                0.066373095 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.026672479 = queryNorm
                0.28572327 = fieldWeight in 1557, product of:
                  2.4494898 = tf(freq=6.0), with freq of:
                    6.0 = termFreq=6.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1557)
          0.25 = coord(1/4)
      0.14285715 = coord(1/7)
    
    Abstract
    In this paper, we present a multimodal Recurrent Neural Network (m-RNN) model for generating novel sentence descriptions to explain the content of images. It directly models the probability distribution of generating a word given previous words and the image. Image descriptions are generated by sampling from this distribution. The model consists of two sub-networks: a deep recurrent neural network for sentences and a deep convolutional network for images. These two sub-networks interact with each other in a multimodal layer to form the whole m-RNN model. The effectiveness of our model is validated on three benchmark datasets (IAPR TC-12 [8], Flickr 8K [28], and Flickr 30K [13]). Our model outperforms the state-of-the-art generative method. In addition, the m-RNN model can be applied to retrieval tasks for retrieving images or sentences, and achieves significant performance improvement over the state-of-the-art methods which directly optimize the ranking objective function for retrieval.
  8. Moreno, J.M.T.: Automatic text summarization (2014) 0.00
    4.6084277E-4 = product of:
      0.0032258993 = sum of:
        0.0032258993 = product of:
          0.012903597 = sum of:
            0.012903597 = weight(_text_:m in 1518) [ClassicSimilarity], result of:
              0.012903597 = score(doc=1518,freq=4.0), product of:
                0.066373095 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.026672479 = queryNorm
                0.19441006 = fieldWeight in 1518, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1518)
          0.25 = coord(1/4)
      0.14285715 = coord(1/7)
    
    Language
    m
    Type
    m
  9. Lichtenstein, A.; Plank, M.; Neumann, J.: TIB's portal for audiovisual media : combining manual and automatic indexing (2014) 0.00
    4.5621104E-4 = product of:
      0.0031934772 = sum of:
        0.0031934772 = product of:
          0.012773909 = sum of:
            0.012773909 = weight(_text_:m in 1981) [ClassicSimilarity], result of:
              0.012773909 = score(doc=1981,freq=2.0), product of:
                0.066373095 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.026672479 = queryNorm
                0.19245613 = fieldWeight in 1981, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1981)
          0.25 = coord(1/4)
      0.14285715 = coord(1/7)
    
  10. Böhm, A.; Seifert, C.; Schlötterer, J.; Granitzer, M.: Identifying tweets from the economic domain (2017) 0.00
    4.5621104E-4 = product of:
      0.0031934772 = sum of:
        0.0031934772 = product of:
          0.012773909 = sum of:
            0.012773909 = weight(_text_:m in 3495) [ClassicSimilarity], result of:
              0.012773909 = score(doc=3495,freq=2.0), product of:
                0.066373095 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.026672479 = queryNorm
                0.19245613 = fieldWeight in 3495, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3495)
          0.25 = coord(1/4)
      0.14285715 = coord(1/7)
    
  11. Short, M.: Text mining and subject analysis for fiction; or, using machine learning and information extraction to assign subject headings to dime novels (2019) 0.00
    4.5621104E-4 = product of:
      0.0031934772 = sum of:
        0.0031934772 = product of:
          0.012773909 = sum of:
            0.012773909 = weight(_text_:m in 5481) [ClassicSimilarity], result of:
              0.012773909 = score(doc=5481,freq=2.0), product of:
                0.066373095 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.026672479 = queryNorm
                0.19245613 = fieldWeight in 5481, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5481)
          0.25 = coord(1/4)
      0.14285715 = coord(1/7)
    
  12. Banerjee, K.; Johnson, M.: Improving access to archival collections with automated entity extraction (2015) 0.00
    3.9103805E-4 = product of:
      0.0027372662 = sum of:
        0.0027372662 = product of:
          0.010949065 = sum of:
            0.010949065 = weight(_text_:m in 2144) [ClassicSimilarity], result of:
              0.010949065 = score(doc=2144,freq=2.0), product of:
                0.066373095 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.026672479 = queryNorm
                0.1649624 = fieldWeight in 2144, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2144)
          0.25 = coord(1/4)
      0.14285715 = coord(1/7)
    
  13. Zhitomirsky-Geffet, M.; Prebor, G.; Bloch, O.: Improving proverb search and retrieval with a generic multidimensional ontology (2017) 0.00
    3.9103805E-4 = product of:
      0.0027372662 = sum of:
        0.0027372662 = product of:
          0.010949065 = sum of:
            0.010949065 = weight(_text_:m in 3320) [ClassicSimilarity], result of:
              0.010949065 = score(doc=3320,freq=2.0), product of:
                0.066373095 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.026672479 = queryNorm
                0.1649624 = fieldWeight in 3320, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.046875 = fieldNorm(doc=3320)
          0.25 = coord(1/4)
      0.14285715 = coord(1/7)
    
  14. Golub, K.; Lykke, M.; Tudhope, D.: Enhancing social tagging with automated keywords from the Dewey Decimal Classification (2014) 0.00
    3.2586502E-4 = product of:
      0.002281055 = sum of:
        0.002281055 = product of:
          0.00912422 = sum of:
            0.00912422 = weight(_text_:m in 2918) [ClassicSimilarity], result of:
              0.00912422 = score(doc=2918,freq=2.0), product of:
                0.066373095 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.026672479 = queryNorm
                0.13746867 = fieldWeight in 2918, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2918)
          0.25 = coord(1/4)
      0.14285715 = coord(1/7)
    
  15. Groß, T.: Automatische Indexierung von Dokumenten in einer wissenschaftlichen Bibliothek : Implementierung und Evaluierung am Beispiel der Deutschen Zentralbibliothek für Wirtschaftswissenschaften (2011) 0.00
    3.2586502E-4 = product of:
      0.002281055 = sum of:
        0.002281055 = product of:
          0.00912422 = sum of:
            0.00912422 = weight(_text_:m in 1083) [ClassicSimilarity], result of:
              0.00912422 = score(doc=1083,freq=2.0), product of:
                0.066373095 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.026672479 = queryNorm
                0.13746867 = fieldWeight in 1083, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1083)
          0.25 = coord(1/4)
      0.14285715 = coord(1/7)
    
    Type
    m
  16. Donahue, J.; Hendricks, L.A.; Guadarrama, S.; Rohrbach, M.; Venugopalan, S.; Saenko, K.; Darrell, T.: Long-term recurrent convolutional networks for visual recognition and description (2014) 0.00
    3.2586502E-4 = product of:
      0.002281055 = sum of:
        0.002281055 = product of:
          0.00912422 = sum of:
            0.00912422 = weight(_text_:m in 1873) [ClassicSimilarity], result of:
              0.00912422 = score(doc=1873,freq=2.0), product of:
                0.066373095 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.026672479 = queryNorm
                0.13746867 = fieldWeight in 1873, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1873)
          0.25 = coord(1/4)
      0.14285715 = coord(1/7)
    
  17. Toepfer, M.; Kempf, A.O.: Automatische Indexierung auf Basis von Titeln und Autoren-Keywords : ein Werkstattbericht (2016) 0.00
    3.2586502E-4 = product of:
      0.002281055 = sum of:
        0.002281055 = product of:
          0.00912422 = sum of:
            0.00912422 = weight(_text_:m in 3209) [ClassicSimilarity], result of:
              0.00912422 = score(doc=3209,freq=2.0), product of:
                0.066373095 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.026672479 = queryNorm
                0.13746867 = fieldWeight in 3209, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3209)
          0.25 = coord(1/4)
      0.14285715 = coord(1/7)
    
  18. Golub, K.; Soergel, D.; Buchanan, G.; Tudhope, D.; Lykke, M.; Hiom, D.: ¬A framework for evaluating automatic indexing or classification in the context of retrieval (2016) 0.00
    3.2586502E-4 = product of:
      0.002281055 = sum of:
        0.002281055 = product of:
          0.00912422 = sum of:
            0.00912422 = weight(_text_:m in 3311) [ClassicSimilarity], result of:
              0.00912422 = score(doc=3311,freq=2.0), product of:
                0.066373095 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.026672479 = queryNorm
                0.13746867 = fieldWeight in 3311, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3311)
          0.25 = coord(1/4)
      0.14285715 = coord(1/7)
    
  19. Pollmeier, M.: Verlagsschlagwörter als Grundlage für den Einsatz eines maschinellen Verfahrens zur verbalen Erschließung der Kinder- und Jugendliteratur durch die Deutsche Nationalbibliothek : eine Datenanalyse (2019) 0.00
    3.2586502E-4 = product of:
      0.002281055 = sum of:
        0.002281055 = product of:
          0.00912422 = sum of:
            0.00912422 = weight(_text_:m in 1081) [ClassicSimilarity], result of:
              0.00912422 = score(doc=1081,freq=2.0), product of:
                0.066373095 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.026672479 = queryNorm
                0.13746867 = fieldWeight in 1081, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1081)
          0.25 = coord(1/4)
      0.14285715 = coord(1/7)
    
  20. Groß, T.; Faden, M.: Automatische Indexierung elektronischer Dokumente an der Deutschen Zentralbibliothek für Wirtschaftswissenschaften : Bericht über die Jahrestagung der Internationalen Buchwissenschaftlichen Gesellschaft (2010) 0.00
    2.6069203E-4 = product of:
      0.001824844 = sum of:
        0.001824844 = product of:
          0.007299376 = sum of:
            0.007299376 = weight(_text_:m in 4051) [ClassicSimilarity], result of:
              0.007299376 = score(doc=4051,freq=2.0), product of:
                0.066373095 = queryWeight, product of:
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.026672479 = queryNorm
                0.10997493 = fieldWeight in 4051, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.4884486 = idf(docFreq=9980, maxDocs=44218)
                  0.03125 = fieldNorm(doc=4051)
          0.25 = coord(1/4)
      0.14285715 = coord(1/7)
    

Languages

  • d 20
  • e 19
  • m 1
  • More… Less…

Types

  • a 31
  • el 9
  • x 6
  • m 2
  • p 1
  • More… Less…