Search (32 results, page 1 of 2)

  • × theme_ss:"Semantisches Umfeld in Indexierung u. Retrieval"
  1. Smeaton, A.F.; Kelledy, L.; O'Donnell, R.: TREC-4 experiments at Dublin City University : thresholding posting lists, query expansion with WordNet and POS tagging of Spanish (1996) 0.06
    0.055851527 = product of:
      0.11170305 = sum of:
        0.11170305 = product of:
          0.2234061 = sum of:
            0.2234061 = weight(_text_:tagging in 7000) [ClassicSimilarity], result of:
              0.2234061 = score(doc=7000,freq=2.0), product of:
                0.342494 = queryWeight, product of:
                  5.9038734 = idf(docFreq=327, maxDocs=44218)
                  0.058011748 = queryNorm
                0.652292 = fieldWeight in 7000, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.9038734 = idf(docFreq=327, maxDocs=44218)
                  0.078125 = fieldNorm(doc=7000)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
  2. Jiang, Y.; Bai, W.; Zhang, X.; Hu, J.: Wikipedia-based information content and semantic similarity computation (2017) 0.03
    0.027925763 = product of:
      0.055851527 = sum of:
        0.055851527 = product of:
          0.11170305 = sum of:
            0.11170305 = weight(_text_:tagging in 2877) [ClassicSimilarity], result of:
              0.11170305 = score(doc=2877,freq=2.0), product of:
                0.342494 = queryWeight, product of:
                  5.9038734 = idf(docFreq=327, maxDocs=44218)
                  0.058011748 = queryNorm
                0.326146 = fieldWeight in 2877, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  5.9038734 = idf(docFreq=327, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=2877)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Abstract
    The Information Content (IC) of a concept is a fundamental dimension in computational linguistics. It enables a better understanding of concept's semantics. In the past, several approaches to compute IC of a concept have been proposed. However, there are some limitations such as the facts of relying on corpora availability, manual tagging, or predefined ontologies and fitting non-dynamic domains in the existing methods. Wikipedia provides a very large domain-independent encyclopedic repository and semantic network for computing IC of concepts with more coverage than usual ontologies. In this paper, we propose some novel methods to IC computation of a concept to solve the shortcomings of existing approaches. The presented methods focus on the IC computation of a concept (i.e., Wikipedia category) drawn from the Wikipedia category structure. We propose several new IC-based measures to compute the semantic similarity between concepts. The evaluation, based on several widely used benchmarks and a benchmark developed in ourselves, sustains the intuitions with respect to human judgments. Overall, some methods proposed in this paper have a good human correlation and constitute some effective ways of determining IC values for concepts and semantic similarity between concepts.
  3. Boyack, K.W.; Wylie,B.N.; Davidson, G.S.: Information Visualization, Human-Computer Interaction, and Cognitive Psychology : Domain Visualizations (2002) 0.03
    0.027788557 = product of:
      0.055577114 = sum of:
        0.055577114 = product of:
          0.11115423 = sum of:
            0.11115423 = weight(_text_:22 in 1352) [ClassicSimilarity], result of:
              0.11115423 = score(doc=1352,freq=4.0), product of:
                0.20314726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.058011748 = queryNorm
                0.54716086 = fieldWeight in 1352, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=1352)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 2.2003 17:25:39
    22. 2.2003 18:17:40
  4. Smeaton, A.F.; Rijsbergen, C.J. van: ¬The retrieval effects of query expansion on a feedback document retrieval system (1983) 0.03
    0.027509268 = product of:
      0.055018537 = sum of:
        0.055018537 = product of:
          0.11003707 = sum of:
            0.11003707 = weight(_text_:22 in 2134) [ClassicSimilarity], result of:
              0.11003707 = score(doc=2134,freq=2.0), product of:
                0.20314726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.058011748 = queryNorm
                0.5416616 = fieldWeight in 2134, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.109375 = fieldNorm(doc=2134)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    30. 3.2001 13:32:22
  5. Zhang, W.; Yoshida, T.; Tang, X.: ¬A comparative study of TF*IDF, LSI and multi-words for text classification (2011) 0.03
    0.026491817 = product of:
      0.052983634 = sum of:
        0.052983634 = product of:
          0.1589509 = sum of:
            0.1589509 = weight(_text_:themes in 1165) [ClassicSimilarity], result of:
              0.1589509 = score(doc=1165,freq=2.0), product of:
                0.3729592 = queryWeight, product of:
                  6.429029 = idf(docFreq=193, maxDocs=44218)
                  0.058011748 = queryNorm
                0.42618844 = fieldWeight in 1165, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  6.429029 = idf(docFreq=193, maxDocs=44218)
                  0.046875 = fieldNorm(doc=1165)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Abstract
    One of the main themes in text mining is text representation, which is fundamental and indispensable for text-based intellegent information processing. Generally, text representation inludes two tasks: indexing and weighting. This paper has comparatively studied TF*IDF, LSI and multi-word for text representation. We used a Chinese and an English document collection to respectively evaluate the three methods in information retreival and text categorization. Experimental results have demonstrated that in text categorization, LSI has better performance than other methods in both document collections. Also, LSI has produced the best performance in retrieving English documents. This outcome has shown that LSI has both favorable semantic and statistical quality and is different with the claim that LSI can not produce discriminative power for indexing.
  6. Rekabsaz, N. et al.: Toward optimized multimodal concept indexing (2016) 0.02
    0.019649478 = product of:
      0.039298955 = sum of:
        0.039298955 = product of:
          0.07859791 = sum of:
            0.07859791 = weight(_text_:22 in 2751) [ClassicSimilarity], result of:
              0.07859791 = score(doc=2751,freq=2.0), product of:
                0.20314726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.058011748 = queryNorm
                0.38690117 = fieldWeight in 2751, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2751)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    1. 2.2016 18:25:22
  7. Kozikowski, P. et al.: Support of part-whole relations in query answering (2016) 0.02
    0.019649478 = product of:
      0.039298955 = sum of:
        0.039298955 = product of:
          0.07859791 = sum of:
            0.07859791 = weight(_text_:22 in 2754) [ClassicSimilarity], result of:
              0.07859791 = score(doc=2754,freq=2.0), product of:
                0.20314726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.058011748 = queryNorm
                0.38690117 = fieldWeight in 2754, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=2754)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    1. 2.2016 18:25:22
  8. Marx, E. et al.: Exploring term networks for semantic search over RDF knowledge graphs (2016) 0.02
    0.019649478 = product of:
      0.039298955 = sum of:
        0.039298955 = product of:
          0.07859791 = sum of:
            0.07859791 = weight(_text_:22 in 3279) [ClassicSimilarity], result of:
              0.07859791 = score(doc=3279,freq=2.0), product of:
                0.20314726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.058011748 = queryNorm
                0.38690117 = fieldWeight in 3279, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=3279)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Metadata and semantics research: 10th International Conference, MTSR 2016, Göttingen, Germany, November 22-25, 2016, Proceedings. Eds.: E. Garoufallou
  9. Kopácsi, S. et al.: Development of a classification server to support metadata harmonization in a long term preservation system (2016) 0.02
    0.019649478 = product of:
      0.039298955 = sum of:
        0.039298955 = product of:
          0.07859791 = sum of:
            0.07859791 = weight(_text_:22 in 3280) [ClassicSimilarity], result of:
              0.07859791 = score(doc=3280,freq=2.0), product of:
                0.20314726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.058011748 = queryNorm
                0.38690117 = fieldWeight in 3280, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.078125 = fieldNorm(doc=3280)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Metadata and semantics research: 10th International Conference, MTSR 2016, Göttingen, Germany, November 22-25, 2016, Proceedings. Eds.: E. Garoufallou
  10. Sacco, G.M.: Dynamic taxonomies and guided searches (2006) 0.02
    0.01945199 = product of:
      0.03890398 = sum of:
        0.03890398 = product of:
          0.07780796 = sum of:
            0.07780796 = weight(_text_:22 in 5295) [ClassicSimilarity], result of:
              0.07780796 = score(doc=5295,freq=4.0), product of:
                0.20314726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.058011748 = queryNorm
                0.38301262 = fieldWeight in 5295, product of:
                  2.0 = tf(freq=4.0), with freq of:
                    4.0 = termFreq=4.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=5295)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    22. 7.2006 17:56:22
  11. Efthimiadis, E.N.: End-users' understanding of thesaural knowledge structures in interactive query expansion (1994) 0.02
    0.015719583 = product of:
      0.031439167 = sum of:
        0.031439167 = product of:
          0.06287833 = sum of:
            0.06287833 = weight(_text_:22 in 5693) [ClassicSimilarity], result of:
              0.06287833 = score(doc=5693,freq=2.0), product of:
                0.20314726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.058011748 = queryNorm
                0.30952093 = fieldWeight in 5693, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0625 = fieldNorm(doc=5693)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    30. 3.2001 13:35:22
  12. Fieldhouse, M.; Hancock-Beaulieu, M.: ¬The design of a graphical user interface for a highly interactive information retrieval system (1996) 0.01
    0.013754634 = product of:
      0.027509268 = sum of:
        0.027509268 = product of:
          0.055018537 = sum of:
            0.055018537 = weight(_text_:22 in 6958) [ClassicSimilarity], result of:
              0.055018537 = score(doc=6958,freq=2.0), product of:
                0.20314726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.058011748 = queryNorm
                0.2708308 = fieldWeight in 6958, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=6958)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Information retrieval: new systems and current research. Proceedings of the 16th Research Colloquium of the British Computer Society Information Retrieval Specialist Group, Drymen, Scotland, 22-23 Mar 94. Ed.: R. Leon
  13. Chang, C.-H.; Hsu, C.-C.: Integrating query expansion and conceptual relevance feedback for personalized Web information retrieval (1998) 0.01
    0.013754634 = product of:
      0.027509268 = sum of:
        0.027509268 = product of:
          0.055018537 = sum of:
            0.055018537 = weight(_text_:22 in 1319) [ClassicSimilarity], result of:
              0.055018537 = score(doc=1319,freq=2.0), product of:
                0.20314726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.058011748 = queryNorm
                0.2708308 = fieldWeight in 1319, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1319)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    1. 8.1996 22:08:06
  14. Faaborg, A.; Lagoze, C.: Semantic browsing (2003) 0.01
    0.013754634 = product of:
      0.027509268 = sum of:
        0.027509268 = product of:
          0.055018537 = sum of:
            0.055018537 = weight(_text_:22 in 1026) [ClassicSimilarity], result of:
              0.055018537 = score(doc=1026,freq=2.0), product of:
                0.20314726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.058011748 = queryNorm
                0.2708308 = fieldWeight in 1026, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1026)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Research and advanced technology for digital libraries : 7th European Conference, proceedings / ECDL 2003, Trondheim, Norway, August 17-22, 2003
  15. Knorz, G.; Rein, B.: Semantische Suche in einer Hochschulontologie (2005) 0.01
    0.013754634 = product of:
      0.027509268 = sum of:
        0.027509268 = product of:
          0.055018537 = sum of:
            0.055018537 = weight(_text_:22 in 1852) [ClassicSimilarity], result of:
              0.055018537 = score(doc=1852,freq=2.0), product of:
                0.20314726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.058011748 = queryNorm
                0.2708308 = fieldWeight in 1852, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1852)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    11. 2.2011 18:22:58
  16. Knorz, G.; Rein, B.: Semantische Suche in einer Hochschulontologie : Ontologie-basiertes Information-Filtering und -Retrieval mit relationalen Datenbanken (2005) 0.01
    0.013754634 = product of:
      0.027509268 = sum of:
        0.027509268 = product of:
          0.055018537 = sum of:
            0.055018537 = weight(_text_:22 in 4324) [ClassicSimilarity], result of:
              0.055018537 = score(doc=4324,freq=2.0), product of:
                0.20314726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.058011748 = queryNorm
                0.2708308 = fieldWeight in 4324, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=4324)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Date
    11. 2.2011 18:22:25
  17. Salaba, A.; Zeng, M.L.: Extending the "Explore" user task beyond subject authority data into the linked data sphere (2014) 0.01
    0.013754634 = product of:
      0.027509268 = sum of:
        0.027509268 = product of:
          0.055018537 = sum of:
            0.055018537 = weight(_text_:22 in 1465) [ClassicSimilarity], result of:
              0.055018537 = score(doc=1465,freq=2.0), product of:
                0.20314726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.058011748 = queryNorm
                0.2708308 = fieldWeight in 1465, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1465)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik
  18. Mlodzka-Stybel, A.: Towards continuous improvement of users' access to a library catalogue (2014) 0.01
    0.013754634 = product of:
      0.027509268 = sum of:
        0.027509268 = product of:
          0.055018537 = sum of:
            0.055018537 = weight(_text_:22 in 1466) [ClassicSimilarity], result of:
              0.055018537 = score(doc=1466,freq=2.0), product of:
                0.20314726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.058011748 = queryNorm
                0.2708308 = fieldWeight in 1466, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=1466)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik
  19. Lund, K.; Burgess, C.; Atchley, R.A.: Semantic and associative priming in high-dimensional semantic space (1995) 0.01
    0.013754634 = product of:
      0.027509268 = sum of:
        0.027509268 = product of:
          0.055018537 = sum of:
            0.055018537 = weight(_text_:22 in 2151) [ClassicSimilarity], result of:
              0.055018537 = score(doc=2151,freq=2.0), product of:
                0.20314726 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.058011748 = queryNorm
                0.2708308 = fieldWeight in 2151, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.0546875 = fieldNorm(doc=2151)
          0.5 = coord(1/2)
      0.5 = coord(1/2)
    
    Source
    Proceedings of the Seventeenth Annual Conference of the Cognitive Science Society: July 22 - 25, 1995, University of Pittsburgh / ed. by Johanna D. Moore and Jill Fain Lehmann
  20. Case, D.O.: Looking for information : a survey on research on information seeking, needs, and behavior (2002) 0.01
    0.013245909 = product of:
      0.026491817 = sum of:
        0.026491817 = product of:
          0.07947545 = sum of:
            0.07947545 = weight(_text_:themes in 1270) [ClassicSimilarity], result of:
              0.07947545 = score(doc=1270,freq=2.0), product of:
                0.3729592 = queryWeight, product of:
                  6.429029 = idf(docFreq=193, maxDocs=44218)
                  0.058011748 = queryNorm
                0.21309422 = fieldWeight in 1270, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  6.429029 = idf(docFreq=193, maxDocs=44218)
                  0.0234375 = fieldNorm(doc=1270)
          0.33333334 = coord(1/3)
      0.5 = coord(1/2)
    
    Footnote
    Rez. in: JASIST 54(2003) no.7, S.695-697 (R. Savolainen): "Donald O. Case has written an ambitious book to create an overall picture of the major approaches to information needs and seeking (INS) studies. The aim to write an extensive review is reflected in the list of references containing about 700 items. The high ambitions are explained an p. 14, where Case states that he is aiming at a multidisciplinary understanding of the concept of information seeking. In the Preface, the author characterizes his book as an introduction to the topic for students at the graduate level, as well as as a review and handbook for scholars engagged in information behavior research. In my view, Looking for Information is particularly welcome as an academic textbook because the field of INS studies suffers from the lack of monographs. Along with the continuous growth of the number of journal articles and conference papers, there is a genuine need for a book that picks up the numerous pieces and puts them together. The use of the study as a textbook is facilitated by clearly delineated sections an major themes and the wealth of concrete examples of information seeking in everyday contexts. The book is lucidly written and it is accessible to novice readers, too. At first glance, the idea of providing a comprehensive review of INS studies may seem a mission impossible because the current number of articles, papers, and other contributions in this field is nearing the 10,000 range (p. 224). Donald Case is not alone in the task of coming to grips with an increasing number of studies; similar problems have been faced by those writing INS-related chapters for the Annual Review of Information Science and Technology (ARIST). Case has solved the problem of "too many publications to be reviewed" by concentrating an the INS literature published during the last two decades. Secondly, studies an library use and information retrieval are discussed only to a limited extent. In addition, Case is highly selective as to studies focusing an the use of specific sources and channels such as WWW. These delineations are reasonable, even though they beg some questions. First, how should one draw the line between studies an information seeking and information retrieval? Case does not discuss this question in greater detail, although in recent years, the overlapping areas of information seeking and retrieval studies have been broadened, along with the growing importance of WWW in information seeking/retrieval. Secondly, how can one define the concept of information searching (or, more specifically, Internet or Web searching) in relation to information seeking and information retrieval? In the field of Web searching studies, there is an increasing number of contributions that are of direct relevance to information-seeking studies. Clearly, the advent of the Internet, particularly, the Web, has blurred the previous lines between INS and IR literature, making them less clear cut. The book consists of five main sections, and comprises 13 chapters. There is an Appendix serving the needs of an INS textbook (questions for discussion and application). The structure of the book is meticulously planned and, as a whole, it offers a sufficiently balanced contribution to theoretical, methodological, and empirical issues of INS. The title, Looking for Information: A Survey of Research an Information Seeking, Needs, and Behavior aptly describes the main substance of the book. . . . It is easy to agree with Case about the significance of the problem of specialization and fragmentation. This problem seems to be concomitant with the broadening field of INS research. In itself, Case's book can be interpreted as a struggle against this fragmentation. His book suggests that this struggle is not hopeless and that it is still possible to draw an overall picture of the evolving research field. The major pieces of the puzzle were found and the book will provide a useful overview of INS studies for many years."