Search (16 results, page 1 of 1)

  • × theme_ss:"Automatisches Indexieren"
  • × type_ss:"a"
  • × year_i:[2000 TO 2010}
  1. Newman, D.J.; Block, S.: Probabilistic topic decomposition of an eighteenth-century American newspaper (2006) 0.04
    0.03888039 = product of:
      0.07776078 = sum of:
        0.07776078 = sum of:
          0.028099505 = weight(_text_:science in 5291) [ClassicSimilarity], result of:
            0.028099505 = score(doc=5291,freq=2.0), product of:
              0.13793045 = queryWeight, product of:
                2.6341193 = idf(docFreq=8627, maxDocs=44218)
                0.052363027 = queryNorm
              0.20372227 = fieldWeight in 5291, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.6341193 = idf(docFreq=8627, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5291)
          0.04966127 = weight(_text_:22 in 5291) [ClassicSimilarity], result of:
            0.04966127 = score(doc=5291,freq=2.0), product of:
              0.1833664 = queryWeight, product of:
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.052363027 = queryNorm
              0.2708308 = fieldWeight in 5291, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5018296 = idf(docFreq=3622, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5291)
      0.5 = coord(1/2)
    
    Date
    22. 7.2006 17:32:00
    Source
    Journal of the American Society for Information Science and Technology. 57(2006) no.6, S.753-767
  2. Hlava, M.M.K.: Automatic indexing : comparing rule-based and statistics-based indexing systems (2005) 0.02
    0.024830636 = product of:
      0.04966127 = sum of:
        0.04966127 = product of:
          0.09932254 = sum of:
            0.09932254 = weight(_text_:22 in 6265) [ClassicSimilarity], result of:
              0.09932254 = score(doc=6265,freq=2.0), product of:
                0.1833664 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.052363027 = queryNorm
                0.5416616 = fieldWeight in 6265, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=6265)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information outlook. 9(2005) no.8, S.22-23
  3. Hauer, M.: Automatische Indexierung (2000) 0.02
    0.021283401 = product of:
      0.042566802 = sum of:
        0.042566802 = product of:
          0.085133605 = sum of:
            0.085133605 = weight(_text_:22 in 5887) [ClassicSimilarity], result of:
              0.085133605 = score(doc=5887,freq=2.0), product of:
                0.1833664 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.052363027 = queryNorm
                0.46428138 = fieldWeight in 5887, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.09375 = fieldNorm(doc=5887)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Wissen in Aktion: Wege des Knowledge Managements. 22. Online-Tagung der DGI, Frankfurt am Main, 2.-4.5.2000. Proceedings. Hrsg.: R. Schmidt
  4. Lepsky, K.; Vorhauer, J.: Lingo - ein open source System für die Automatische Indexierung deutschsprachiger Dokumente (2006) 0.01
    0.014188935 = product of:
      0.02837787 = sum of:
        0.02837787 = product of:
          0.05675574 = sum of:
            0.05675574 = weight(_text_:22 in 3581) [ClassicSimilarity], result of:
              0.05675574 = score(doc=3581,freq=2.0), product of:
                0.1833664 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.052363027 = queryNorm
                0.30952093 = fieldWeight in 3581, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=3581)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    24. 3.2006 12:22:02
  5. Probst, M.; Mittelbach, J.: Maschinelle Indexierung in der Sacherschließung wissenschaftlicher Bibliotheken (2006) 0.01
    0.014188935 = product of:
      0.02837787 = sum of:
        0.02837787 = product of:
          0.05675574 = sum of:
            0.05675574 = weight(_text_:22 in 1755) [ClassicSimilarity], result of:
              0.05675574 = score(doc=1755,freq=2.0), product of:
                0.1833664 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.052363027 = queryNorm
                0.30952093 = fieldWeight in 1755, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=1755)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 3.2008 12:35:19
  6. Hlava, M.M.: Automatic indexing : a matter of degree (2002) 0.01
    0.014049753 = product of:
      0.028099505 = sum of:
        0.028099505 = product of:
          0.05619901 = sum of:
            0.05619901 = weight(_text_:science in 2501) [ClassicSimilarity], result of:
              0.05619901 = score(doc=2501,freq=2.0), product of:
                0.13793045 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.052363027 = queryNorm
                0.40744454 = fieldWeight in 2501, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.109375 = fieldNorm(doc=2501)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Bulletin of the American Society for Information Science. 28(2002) no.1, S.12-15
  7. Renz, M.: Automatische Inhaltserschließung im Zeichen von Wissensmanagement (2001) 0.01
    0.012415318 = product of:
      0.024830636 = sum of:
        0.024830636 = product of:
          0.04966127 = sum of:
            0.04966127 = weight(_text_:22 in 5671) [ClassicSimilarity], result of:
              0.04966127 = score(doc=5671,freq=2.0), product of:
                0.1833664 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.052363027 = queryNorm
                0.2708308 = fieldWeight in 5671, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5671)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 3.2001 13:14:48
  8. Mongin, L.; Fu, Y.Y.; Mostafa, J.: Open Archives data Service prototype and automated subject indexing using D-Lib archive content as a testbed (2003) 0.01
    0.008515436 = product of:
      0.017030872 = sum of:
        0.017030872 = product of:
          0.034061745 = sum of:
            0.034061745 = weight(_text_:science in 1167) [ClassicSimilarity], result of:
              0.034061745 = score(doc=1167,freq=4.0), product of:
                0.13793045 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.052363027 = queryNorm
                0.24694869 = fieldWeight in 1167, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1167)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    The Indiana University School of Library and Information Science opened a new research laboratory in January 2003; The Indiana University School of Library and Information Science Information Processing Laboratory [IU IP Lab]. The purpose of the new laboratory is to facilitate collaboration between scientists in the department in the areas of information retrieval (IR) and information visualization (IV) research. The lab has several areas of focus. These include grid and cluster computing, and a standard Java-based software platform to support plug and play research datasets, a selection of standard IR modules and standard IV algorithms. Future development includes software to enable researchers to contribute datasets, IR algorithms, and visualization algorithms into the standard environment. We decided early on to use OAI-PMH as a resource discovery tool because it is consistent with our mission.
  9. Humphrey, S.M.; Névéol, A.; Browne, A.; Gobeil, J.; Ruch, P.; Darmoni, S.J.: Comparing a rule-based versus statistical system for automatic categorization of MEDLINE documents according to biomedical specialty (2009) 0.01
    0.0070961965 = product of:
      0.014192393 = sum of:
        0.014192393 = product of:
          0.028384786 = sum of:
            0.028384786 = weight(_text_:science in 3300) [ClassicSimilarity], result of:
              0.028384786 = score(doc=3300,freq=4.0), product of:
                0.13793045 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.052363027 = queryNorm
                0.20579056 = fieldWeight in 3300, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3300)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    Automatic document categorization is an important research problem in Information Science and Natural Language Processing. Many applications, including, Word Sense Disambiguation and Information Retrieval in large collections, can benefit from such categorization. This paper focuses on automatic categorization of documents from the biomedical literature into broad discipline-based categories. Two different systems are described and contrasted: CISMeF, which uses rules based on human indexing of the documents by the Medical Subject Headings (MeSH) controlled vocabulary in order to assign metaterms (MTs), and Journal Descriptor Indexing (JDI), based on human categorization of about 4,000 journals and statistical associations between journal descriptors (JDs) and textwords in the documents. We evaluate and compare the performance of these systems against a gold standard of humanly assigned categories for 100 MEDLINE documents, using six measures selected from trec_eval. The results show that for five of the measures performance is comparable, and for one measure JDI is superior. We conclude that these results favor JDI, given the significantly greater intellectual overhead involved in human indexing and maintaining a rule base for mapping MeSH terms to MTs. We also note a JDI method that associates JDs with MeSH indexing rather than textwords, and it may be worthwhile to investigate whether this JDI method (statistical) and CISMeF (rule-based) might be combined and then evaluated showing they are complementary to one another.
    Source
    Journal of the American Society for Information Science and Technology. 60(2009) no.12, S.2530-2539
  10. Dolamic, L.; Savoy, J.: When stopword lists make the difference (2009) 0.01
    0.0070248763 = product of:
      0.014049753 = sum of:
        0.014049753 = product of:
          0.028099505 = sum of:
            0.028099505 = weight(_text_:science in 3319) [ClassicSimilarity], result of:
              0.028099505 = score(doc=3319,freq=2.0), product of:
                0.13793045 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.052363027 = queryNorm
                0.20372227 = fieldWeight in 3319, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=3319)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of the American Society for Information Science and Technology. 61(2010) no.1, S.200-203
  11. Medelyan, O.; Witten, I.H.: Domain-independent automatic keyphrase indexing with small training sets (2008) 0.01
    0.006021322 = product of:
      0.012042644 = sum of:
        0.012042644 = product of:
          0.024085289 = sum of:
            0.024085289 = weight(_text_:science in 1871) [ClassicSimilarity], result of:
              0.024085289 = score(doc=1871,freq=2.0), product of:
                0.13793045 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.052363027 = queryNorm
                0.17461908 = fieldWeight in 1871, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1871)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of the American Society for Information Science and Technology. 59(2008) no.7, S.1026-1040
  12. Chung, Y.M.; Lee, J.Y.: ¬A corpus-based approach to comparative evaluation of statistical term association measures (2001) 0.01
    0.0050177686 = product of:
      0.010035537 = sum of:
        0.010035537 = product of:
          0.020071074 = sum of:
            0.020071074 = weight(_text_:science in 5769) [ClassicSimilarity], result of:
              0.020071074 = score(doc=5769,freq=2.0), product of:
                0.13793045 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.052363027 = queryNorm
                0.1455159 = fieldWeight in 5769, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5769)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of the American Society for Information Science and technology. 52(2001) no.4, S.283-296
  13. Li, W.; Wong, K.-F.; Yuan, C.: Toward automatic Chinese temporal information extraction (2001) 0.01
    0.0050177686 = product of:
      0.010035537 = sum of:
        0.010035537 = product of:
          0.020071074 = sum of:
            0.020071074 = weight(_text_:science in 6029) [ClassicSimilarity], result of:
              0.020071074 = score(doc=6029,freq=2.0), product of:
                0.13793045 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.052363027 = queryNorm
                0.1455159 = fieldWeight in 6029, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=6029)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of the American Society for Information Science and technology. 52(2001) no.9, S.748-762
  14. Jones, S.; Paynter, G.W.: Automatic extractionof document keyphrases for use in digital libraries : evaluations and applications (2002) 0.01
    0.0050177686 = product of:
      0.010035537 = sum of:
        0.010035537 = product of:
          0.020071074 = sum of:
            0.020071074 = weight(_text_:science in 601) [ClassicSimilarity], result of:
              0.020071074 = score(doc=601,freq=2.0), product of:
                0.13793045 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.052363027 = queryNorm
                0.1455159 = fieldWeight in 601, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=601)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of the American Society for Information Science and technology. 53(2002) no.8, S.653-677
  15. Dolamic, L.; Savoy, J.: Indexing and searching strategies for the Russian language (2009) 0.01
    0.0050177686 = product of:
      0.010035537 = sum of:
        0.010035537 = product of:
          0.020071074 = sum of:
            0.020071074 = weight(_text_:science in 3301) [ClassicSimilarity], result of:
              0.020071074 = score(doc=3301,freq=2.0), product of:
                0.13793045 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.052363027 = queryNorm
                0.1455159 = fieldWeight in 3301, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3301)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Journal of the American Society for Information Science and Technology. 60(2009) no.12, S.2540-2547
  16. Rasmussen, E.M.: Indexing and retrieval for the Web (2002) 0.00
    0.0035124382 = product of:
      0.0070248763 = sum of:
        0.0070248763 = product of:
          0.014049753 = sum of:
            0.014049753 = weight(_text_:science in 4285) [ClassicSimilarity], result of:
              0.014049753 = score(doc=4285,freq=2.0), product of:
                0.13793045 = queryWeight, product of:
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.052363027 = queryNorm
                0.101861134 = fieldWeight in 4285, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.6341193 = idf(docFreq=8627, maxDocs=44218)
                  0.02734375 = fieldNorm(doc=4285)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Annual review of information science and technology. 37(2003), S.91-126