Search (10 results, page 1 of 1)

  • × author_ss:"Sormunen, E."
  1. Heinström, J.; Sormunen, E.; Savolainen, R.; Ek, S.: Developing an empirical measure of everyday information mastering (2020) 0.03
    0.03464484 = product of:
      0.057741396 = sum of:
        0.023073634 = weight(_text_:on in 5914) [ClassicSimilarity], result of:
          0.023073634 = score(doc=5914,freq=6.0), product of:
            0.109641045 = queryWeight, product of:
              2.199415 = idf(docFreq=13325, maxDocs=44218)
              0.049850095 = queryNorm
            0.21044704 = fieldWeight in 5914, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.199415 = idf(docFreq=13325, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5914)
        0.02245333 = weight(_text_:information in 5914) [ClassicSimilarity], result of:
          0.02245333 = score(doc=5914,freq=14.0), product of:
            0.08751074 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.049850095 = queryNorm
            0.256578 = fieldWeight in 5914, product of:
              3.7416575 = tf(freq=14.0), with freq of:
                14.0 = termFreq=14.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=5914)
        0.012214432 = product of:
          0.024428863 = sum of:
            0.024428863 = weight(_text_:technology in 5914) [ClassicSimilarity], result of:
              0.024428863 = score(doc=5914,freq=2.0), product of:
                0.14847288 = queryWeight, product of:
                  2.978387 = idf(docFreq=6114, maxDocs=44218)
                  0.049850095 = queryNorm
                0.16453418 = fieldWeight in 5914, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.978387 = idf(docFreq=6114, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=5914)
          0.5 = coord(1/2)
      0.6 = coord(3/5)
    
    Abstract
    The aim of the study was to develop an empirical measure for everyday information mastering (EIM). EIM describes the ways that individuals, based on their beliefs, attitudes, and expectations, orient themselves to information as a resource of everyday action. The key features of EIM were identified by conceptual analysis focusing on three EIM frameworks. Four modes of EIM-Proactive, Social, Reactive, and Passive-and their 12 constituents were identified. A survey of 39 items was developed in two pilot studies to operationalize the identified modes as measurable EIM constituents. The respondents in the main study were upper secondary school students (n = 412). Exploratory factor analysis (EFA) was applied to validate subscales for each EIM constituent. Seven subscales emerged: Inquiring and Scanning in the Proactive mode, Social media-centered, and Experiential in the Social mode, and Information poor, Overwhelmed, and Blunting in the Passive mode. Two constituents, Serendipitous and Intuitive, were not supported in the EFA. The findings highlight that the core constituents of an individual's everyday information mastering can be operationalized as psychometric scales. The instrument contributes to the systematic empirical study of EIM constituents and their relationships. The study further sheds light on key modes of EIM.
    Source
    Journal of the Association for Information Science and Technology. 71(2020) no.7, S.729-741
    Theme
    Information
  2. Järvelin, K.; Kristensen, J.; Niemi, T.; Sormunen, E.; Keskustalo, H.: ¬A deductive data model for query expansion (1996) 0.03
    0.031831995 = product of:
      0.05305332 = sum of:
        0.022607451 = weight(_text_:on in 2230) [ClassicSimilarity], result of:
          0.022607451 = score(doc=2230,freq=4.0), product of:
            0.109641045 = queryWeight, product of:
              2.199415 = idf(docFreq=13325, maxDocs=44218)
              0.049850095 = queryNorm
            0.20619515 = fieldWeight in 2230, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.199415 = idf(docFreq=13325, maxDocs=44218)
              0.046875 = fieldNorm(doc=2230)
        0.0101838745 = weight(_text_:information in 2230) [ClassicSimilarity], result of:
          0.0101838745 = score(doc=2230,freq=2.0), product of:
            0.08751074 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.049850095 = queryNorm
            0.116372846 = fieldWeight in 2230, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=2230)
        0.020261997 = product of:
          0.040523995 = sum of:
            0.040523995 = weight(_text_:22 in 2230) [ClassicSimilarity], result of:
              0.040523995 = score(doc=2230,freq=2.0), product of:
                0.17456654 = queryWeight, product of:
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.049850095 = queryNorm
                0.23214069 = fieldWeight in 2230, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  3.5018296 = idf(docFreq=3622, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2230)
          0.5 = coord(1/2)
      0.6 = coord(3/5)
    
    Abstract
    We present a deductive data model for concept-based query expansion. It is based on three abstraction levels: the conceptual, linguistic and occurrence levels. Concepts and relationships among them are represented at the conceptual level. The expression level represents natural language expressions for concepts. Each expression has one or more matching models at the occurrence level. Each model specifies the matching of the expression in database indices built in varying ways. The data model supports a concept-based query expansion and formulation tool, the ExpansionTool, for environments providing heterogeneous IR systems. Expansion is controlled by adjustable matching reliability.
    Source
    Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (ACM SIGIR '96), Zürich, Switzerland, August 18-22, 1996. Eds.: H.P. Frei et al
  3. Sormunen, E.; Tanni, M.; Alamettälä, T.; Heinström, J.: Students' group work strategies in source-based writing assignments (2014) 0.02
    0.024141094 = product of:
      0.040235154 = sum of:
        0.013321568 = weight(_text_:on in 1289) [ClassicSimilarity], result of:
          0.013321568 = score(doc=1289,freq=2.0), product of:
            0.109641045 = queryWeight, product of:
              2.199415 = idf(docFreq=13325, maxDocs=44218)
              0.049850095 = queryNorm
            0.121501654 = fieldWeight in 1289, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.199415 = idf(docFreq=13325, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1289)
        0.014699157 = weight(_text_:information in 1289) [ClassicSimilarity], result of:
          0.014699157 = score(doc=1289,freq=6.0), product of:
            0.08751074 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.049850095 = queryNorm
            0.16796975 = fieldWeight in 1289, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1289)
        0.012214432 = product of:
          0.024428863 = sum of:
            0.024428863 = weight(_text_:technology in 1289) [ClassicSimilarity], result of:
              0.024428863 = score(doc=1289,freq=2.0), product of:
                0.14847288 = queryWeight, product of:
                  2.978387 = idf(docFreq=6114, maxDocs=44218)
                  0.049850095 = queryNorm
                0.16453418 = fieldWeight in 1289, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.978387 = idf(docFreq=6114, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=1289)
          0.5 = coord(1/2)
      0.6 = coord(3/5)
    
    Abstract
    Source-based writing assignments conducted by groups of students are a common learning task used in information literacy instruction. The fundamental assumption in group assignments is that students' collaboration substantially enhances their learning. The present study focused on the group work strategies adopted by upper secondary school students in source-based writing assignments. Seventeen groups authored Wikipedia or Wikipedia-style articles and were interviewed during and after the assignment. Group work strategies were analyzed in 6 activities: planning, searching, assessing sources, reading, writing, and editing. The students used 2 cooperative strategies: delegation and division of work, and 2 collaborative strategies: pair and group collaboration. Division of work into independently conducted parts was the most popular group work strategy. Also group collaboration, where students worked together to complete an activity, was commonly applied. Division of work was justified by efficiency in completing the project and by ease of control in the fair division of contributions. The motivation behind collaboration was related to quality issues and shared responsibility. We suggest that the present designs of learning tasks lead students to avoid collaboration, increasing the risk of low learning outcomes in information literacy instruction.
    Source
    Journal of the Association for Information Science and Technology. 65(2014) no.6, S.1217-1231
  4. Järvelin, A.; Keskustalo, H.; Sormunen, E.; Saastamoinen, M.; Kettunen, K.: Information retrieval from historical newspaper collections in highly inflectional languages : a query expansion approach (2016) 0.02
    0.022522688 = product of:
      0.037537813 = sum of:
        0.013321568 = weight(_text_:on in 3223) [ClassicSimilarity], result of:
          0.013321568 = score(doc=3223,freq=2.0), product of:
            0.109641045 = queryWeight, product of:
              2.199415 = idf(docFreq=13325, maxDocs=44218)
              0.049850095 = queryNorm
            0.121501654 = fieldWeight in 3223, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.199415 = idf(docFreq=13325, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3223)
        0.012001811 = weight(_text_:information in 3223) [ClassicSimilarity], result of:
          0.012001811 = score(doc=3223,freq=4.0), product of:
            0.08751074 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.049850095 = queryNorm
            0.13714671 = fieldWeight in 3223, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=3223)
        0.012214432 = product of:
          0.024428863 = sum of:
            0.024428863 = weight(_text_:technology in 3223) [ClassicSimilarity], result of:
              0.024428863 = score(doc=3223,freq=2.0), product of:
                0.14847288 = queryWeight, product of:
                  2.978387 = idf(docFreq=6114, maxDocs=44218)
                  0.049850095 = queryNorm
                0.16453418 = fieldWeight in 3223, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.978387 = idf(docFreq=6114, maxDocs=44218)
                  0.0390625 = fieldNorm(doc=3223)
          0.5 = coord(1/2)
      0.6 = coord(3/5)
    
    Abstract
    The aim of the study was to test whether query expansion by approximate string matching methods is beneficial in retrieval from historical newspaper collections in a language rich with compounds and inflectional forms (Finnish). First, approximate string matching methods were used to generate lists of index words most similar to contemporary query terms in a digitized newspaper collection from the 1800s. Top index word variants were categorized to estimate the appropriate query expansion ranges in the retrieval test. Second, the effectiveness of approximate string matching methods, automatically generated inflectional forms, and their combinations were measured in a Cranfield-style test. Finally, a detailed topic-level analysis of test results was conducted. In the index of historical newspaper collection the occurrences of a word typically spread to many linguistic and historical variants along with optical character recognition (OCR) errors. All query expansion methods improved the baseline results. Extensive expansion of around 30 variants for each query word was required to achieve the highest performance improvement. Query expansion based on approximate string matching was superior to using the inflectional forms of the query words, showing that coverage of the different types of variation is more important than precision in handling one type of variation.
    Source
    Journal of the Association for Information Science and Technology. 67(2016) no.12, S.2928-2946
  5. Alkula, R.; Sormunen, E.: Problems and guidelines for database descriptions (1989) 0.02
    0.022426186 = product of:
      0.056065463 = sum of:
        0.03230309 = weight(_text_:on in 2397) [ClassicSimilarity], result of:
          0.03230309 = score(doc=2397,freq=6.0), product of:
            0.109641045 = queryWeight, product of:
              2.199415 = idf(docFreq=13325, maxDocs=44218)
              0.049850095 = queryNorm
            0.29462588 = fieldWeight in 2397, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              2.199415 = idf(docFreq=13325, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2397)
        0.023762373 = weight(_text_:information in 2397) [ClassicSimilarity], result of:
          0.023762373 = score(doc=2397,freq=8.0), product of:
            0.08751074 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.049850095 = queryNorm
            0.27153665 = fieldWeight in 2397, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2397)
      0.4 = coord(2/5)
    
    Abstract
    An essential part of information retrieval knowledge is the knowledge of data base contents and structures. Currently, the variety of data bases is so wide that it is difficult to know the contents and structure of a particular data base and how they differ from those of other data bases. Because of the lack of commonly acknowledged guidelines for data base descriptions, each on-line service designs and produces printed manuals, on-line help texts and other user documentation in its own manner. For the presentation of exact information and knowledge on a data base, common, structured principles for data base descriptions are needed. Requirements and some solutions for such description principles are presented.
    Source
    Information and innovation. Proceedings of the 7th Nordic Conference for Information and Documentation, Århus University, Denmark, 28-30 August 1989. Ed. by Helge Clausen
  6. Vakkari, P.; Sormunen, E.: ¬The influence of relevance levels an the effectiveness of interactive information retrieval (2004) 0.01
    0.0129185235 = product of:
      0.032296307 = sum of:
        0.017638987 = weight(_text_:information in 2884) [ClassicSimilarity], result of:
          0.017638987 = score(doc=2884,freq=6.0), product of:
            0.08751074 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.049850095 = queryNorm
            0.20156369 = fieldWeight in 2884, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.046875 = fieldNorm(doc=2884)
        0.014657319 = product of:
          0.029314637 = sum of:
            0.029314637 = weight(_text_:technology in 2884) [ClassicSimilarity], result of:
              0.029314637 = score(doc=2884,freq=2.0), product of:
                0.14847288 = queryWeight, product of:
                  2.978387 = idf(docFreq=6114, maxDocs=44218)
                  0.049850095 = queryNorm
                0.19744103 = fieldWeight in 2884, product of:
                  1.4142135 = tf(freq=2.0), with freq of:
                    2.0 = termFreq=2.0
                  2.978387 = idf(docFreq=6114, maxDocs=44218)
                  0.046875 = fieldNorm(doc=2884)
          0.5 = coord(1/2)
      0.4 = coord(2/5)
    
    Abstract
    In this paper, we focus an the effect of graded relevance an the results of interactive information retrieval (IR) experiments based an assigned search tasks in a test collection. A group of 26 subjects searched for four Text REtrieval Conference (TREC) topics using automatic and interactive query expansion based an relevance feedback. The TREC- and user-suggested pools of relevant documents were reassessed an a four-level relevance scale. The results show that the users could identify nearly all highly relevant documents and about half of the marginal ones. Users also selected a fair number of irrelevant documents for query expansion. The findings suggest that the effectiveness of query expansion is closely related to the searchers' success in retrieving and identifying highly relevant documents for feedback. The implications of the results an interpreting the findings of past experiments with liberal relevance thresholds are also discussed.
    Source
    Journal of the American Society for Information Science and Technology. 55(2004) no.11, S.963-969
  7. Sormunen, E.; Kekäläinen, J.; Koivisto, J.; Järvelin, K.: Document text characteristics affect the ranking of the most relevant documents by expanded structured queries (2001) 0.01
    0.012117877 = product of:
      0.03029469 = sum of:
        0.013321568 = weight(_text_:on in 4487) [ClassicSimilarity], result of:
          0.013321568 = score(doc=4487,freq=2.0), product of:
            0.109641045 = queryWeight, product of:
              2.199415 = idf(docFreq=13325, maxDocs=44218)
              0.049850095 = queryNorm
            0.121501654 = fieldWeight in 4487, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.199415 = idf(docFreq=13325, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4487)
        0.016973123 = weight(_text_:information in 4487) [ClassicSimilarity], result of:
          0.016973123 = score(doc=4487,freq=8.0), product of:
            0.08751074 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.049850095 = queryNorm
            0.19395474 = fieldWeight in 4487, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4487)
      0.4 = coord(2/5)
    
    Abstract
    The increasing flood of documentary information through the Internet and other information sources challenges the developers of information retrieval systems. It is not enough that an IR system is able to make a distinction between relevant and non-relevant documents. The reduction of information overload requires that IR systems provide the capability of screening the most valuable documents out of the mass of potentially or marginally relevant documents. This paper introduces a new concept-based method to analyse the text characteristics of documents at varying relevance levels. The results of the document analysis were applied in an experiment on query expansion (QE) in a probabilistic IR system. Statistical differences in textual characteristics of highly relevant and less relevant documents were investigated by applying a facet analysis technique. In highly relevant documents a larger number of aspects of the request were discussed, searchable expressions for the aspects were distributed over a larger set of text paragraphs, and a larger set of unique expressions were used per aspect than in marginally relevant documents. A query expansion experiment verified that the findings of the text analysis can be exploited in formulating more effective queries for best match retrieval in the search for highly relevant documents. The results revealed that expanded queries with concept-based structures performed better than unexpanded queries or Ñnatural languageÒ queries. Further, it was shown that highly relevant documents benefit essentially more from the concept-based QE in ranking than marginally relevant documents.
  8. Sormunen, E.: Free-text searching in full-text databases : probing system limits (1993) 0.01
    0.0067210137 = product of:
      0.03360507 = sum of:
        0.03360507 = weight(_text_:information in 7120) [ClassicSimilarity], result of:
          0.03360507 = score(doc=7120,freq=4.0), product of:
            0.08751074 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.049850095 = queryNorm
            0.3840108 = fieldWeight in 7120, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.109375 = fieldNorm(doc=7120)
      0.2 = coord(1/5)
    
    Imprint
    Oxford : Learned Information
    Source
    Online information 93: 17th International Online Meeting Proceedings, London, 7.-9.12.1993. Ed. by D.I. Raitt et al
  9. Halttunen, K.; Sormunen, E.: Learning information retrieval through an educational game : is gaming sufficient for learning? (2000) 0.01
    0.0067210137 = product of:
      0.03360507 = sum of:
        0.03360507 = weight(_text_:information in 3865) [ClassicSimilarity], result of:
          0.03360507 = score(doc=3865,freq=4.0), product of:
            0.08751074 = queryWeight, product of:
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.049850095 = queryNorm
            0.3840108 = fieldWeight in 3865, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.7554779 = idf(docFreq=20772, maxDocs=44218)
              0.109375 = fieldNorm(doc=3865)
      0.2 = coord(1/5)
    
    Source
    Education for information. 18(2000) no.4, S.289-311
  10. Vakkari, P.; Jones, S.; MacFarlane, A.; Sormunen, E.: Query exhaustivity, relevance feedback and search success in automatic and interactive query expansion (2004) 0.00
    0.0031971764 = product of:
      0.015985882 = sum of:
        0.015985882 = weight(_text_:on in 4435) [ClassicSimilarity], result of:
          0.015985882 = score(doc=4435,freq=2.0), product of:
            0.109641045 = queryWeight, product of:
              2.199415 = idf(docFreq=13325, maxDocs=44218)
              0.049850095 = queryNorm
            0.14580199 = fieldWeight in 4435, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.199415 = idf(docFreq=13325, maxDocs=44218)
              0.046875 = fieldNorm(doc=4435)
      0.2 = coord(1/5)
    
    Abstract
    This study explored how the expression of search facets and relevance feedback (RF) by users was related to search success in interactive and automatic query expansion in the course of the search process. Search success was measured both in the number of relevant documents retrieved, whether identified by users or not. Research design consisted of 26 users searching for four TREC topics in Okapi IR system, half of the searchers using interactive and half automatic query expansion based on RF. The search logs were recorded, and the users filled in questionnaires for each topic concerning various features of searching. The results showed that the exhaustivity of the query was the most significant predictor of search success. Interactive expansion led to better search success than automatic expansion if all retrieved relevant items were counted, but there was no difference between the methods if only those items recognised relevant by users were observed. The analysis showed that the difference was facilitated by the liberal relevance criterion used in TREC not favouring highly relevant documents in evaluation.