Search (65 results, page 1 of 4)

Frâncu, V.; Sabo, C.-N.: Implementation of a UDC-based multilingual thesaurus in a library catalogue : the case of BiblioPhil (2010) 0.13

0.13118336 = sum of:
  0.025232634 = product of:
    0.100930534 = sum of:
      0.100930534 = weight(_text_:authors in 3697) [ClassicSimilarity], result of:
        0.100930534 = score(doc=3697,freq=4.0), product of:
          0.23615624 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.05180212 = queryNorm
          0.42738882 = fieldWeight in 3697, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.046875 = fieldNorm(doc=3697)
    0.25 = coord(1/4)
  0.10595073 = sum of:
    0.06383989 = weight(_text_:n in 3697) [ClassicSimilarity], result of:
      0.06383989 = score(doc=3697,freq=2.0), product of:
        0.22335295 = queryWeight, product of:
          4.3116565 = idf(docFreq=1611, maxDocs=44218)
          0.05180212 = queryNorm
        0.28582513 = fieldWeight in 3697, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.3116565 = idf(docFreq=1611, maxDocs=44218)
          0.046875 = fieldNorm(doc=3697)
    0.042110834 = weight(_text_:22 in 3697) [ClassicSimilarity], result of:
      0.042110834 = score(doc=3697,freq=2.0), product of:
        0.1814022 = queryWeight, product of:
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.05180212 = queryNorm
        0.23214069 = fieldWeight in 3697, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          3.5018296 = idf(docFreq=3622, maxDocs=44218)
          0.046875 = fieldNorm(doc=3697)

Abstract: In order to enhance the use of Universal Decimal Classification (UDC) numbers in information retrieval, the authors have represented classification with multilingual thesaurus descriptors and implemented this solution in an automated way. The authors illustrate a solution implemented in a BiblioPhil library system. The standard formats used are UNIMARC for subject authority records (i.e. the UDC-based multilingual thesaurus) and MARC XML support for data transfer. The multilingual thesaurus was built according to existing standards, the constituent parts of the classification notations being used as the basis for search terms in the multilingual information retrieval. The verbal equivalents, descriptors and non-descriptors, are used to expand the number of concepts and are given in Romanian, English and French. This approach saves the time of the indexer and provides more user-friendly and easier access to the bibliographic information. The multilingual aspect of the thesaurus enhances information access for a greater number of online users
Date: 22. 7.2010 20:40:56

Mitchell, J.S.; Zeng, M.L.; Zumer, M.: Modeling classification systems in multicultural and multilingual contexts (2012) 0.06

0.05500949 = sum of:
  0.025232634 = product of:
    0.100930534 = sum of:
      0.100930534 = weight(_text_:authors in 1967) [ClassicSimilarity], result of:
        0.100930534 = score(doc=1967,freq=4.0), product of:
          0.23615624 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.05180212 = queryNorm
          0.42738882 = fieldWeight in 1967, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.046875 = fieldNorm(doc=1967)
    0.25 = coord(1/4)
  0.029776858 = product of:
    0.059553716 = sum of:
      0.059553716 = weight(_text_:22 in 1967) [ClassicSimilarity], result of:
        0.059553716 = score(doc=1967,freq=4.0), product of:
          0.1814022 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.05180212 = queryNorm
          0.32829654 = fieldWeight in 1967, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=1967)
    0.5 = coord(1/2)

Abstract: This paper reports on the second part of an initiative of the authors on researching classification systems with the conceptual model defined by the Functional Requirements for Subject Authority Data (FRSAD) final report. In an earlier study, the authors explored whether the FRSAD conceptual model could be extended beyond subject authority data to model classification data. The focus of the current study is to determine if classification data modeled using FRSAD can be used to solve real-world discovery problems in multicultural and multilingual contexts. The paper discusses the relationships between entities (same type or different types) in the context of classification systems that involve multiple translations and /or multicultural implementations. Results of two case studies are presented in detail: (a) two instances of the DDC (DDC 22 in English, and the Swedish-English mixed translation of DDC 22), and (b) Chinese Library Classification. The use cases of conceptual models in practice are also discussed.

Mitchell, J.S.; Zeng, M.L.; Zumer, M.: Modeling classification systems in multicultural and multilingual contexts (2014) 0.05

0.045841243 = sum of:
  0.021027196 = product of:
    0.084108785 = sum of:
      0.084108785 = weight(_text_:authors in 1962) [ClassicSimilarity], result of:
        0.084108785 = score(doc=1962,freq=4.0), product of:
          0.23615624 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.05180212 = queryNorm
          0.35615736 = fieldWeight in 1962, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1962)
    0.25 = coord(1/4)
  0.024814047 = product of:
    0.049628094 = sum of:
      0.049628094 = weight(_text_:22 in 1962) [ClassicSimilarity], result of:
        0.049628094 = score(doc=1962,freq=4.0), product of:
          0.1814022 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.05180212 = queryNorm
          0.27358043 = fieldWeight in 1962, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=1962)
    0.5 = coord(1/2)

Abstract: This article reports on the second part of an initiative of the authors on researching classification systems with the conceptual model defined by the Functional Requirements for Subject Authority Data (FRSAD) final report. In an earlier study, the authors explored whether the FRSAD conceptual model could be extended beyond subject authority data to model classification data. The focus of the current study is to determine if classification data modeled using FRSAD can be used to solve real-world discovery problems in multicultural and multilingual contexts. The article discusses the relationships between entities (same type or different types) in the context of classification systems that involve multiple translations and/or multicultural implementations. Results of two case studies are presented in detail: (a) two instances of the Dewey Decimal Classification [DDC] (DDC 22 in English, and the Swedish-English mixed translation of DDC 22), and (b) Chinese Library Classification. The use cases of conceptual models in practice are also discussed.

Ménard, E.; Khashman, N.; Kochkina, S.; Torres-Moreno, J.-M.; Velazquez-Morales, P.; Zhou, F.; Jourlin, P.; Rawat, P.; Peinl, P.; Linhares Pontes, E.; Brunetti., I.: ¬A second life for TIIARA : from bilingual to multilingual! (2016) 0.04

0.044146135 = product of:
  0.08829227 = sum of:
    0.08829227 = sum of:
      0.053199906 = weight(_text_:n in 2834) [ClassicSimilarity], result of:
        0.053199906 = score(doc=2834,freq=2.0), product of:
          0.22335295 = queryWeight, product of:
            4.3116565 = idf(docFreq=1611, maxDocs=44218)
            0.05180212 = queryNorm
          0.23818761 = fieldWeight in 2834, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.3116565 = idf(docFreq=1611, maxDocs=44218)
            0.0390625 = fieldNorm(doc=2834)
      0.03509236 = weight(_text_:22 in 2834) [ClassicSimilarity], result of:
        0.03509236 = score(doc=2834,freq=2.0), product of:
          0.1814022 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.05180212 = queryNorm
          0.19345059 = fieldWeight in 2834, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.0390625 = fieldNorm(doc=2834)
  0.5 = coord(1/2)

Source: Knowledge organization. 43(2016) no.1, S.22-34

Kulygina, N.: Authority control in a multilanguage catalogue : Russian experience (2005) 0.04
```
0.038101718 = sum of:
  0.016821755 = product of:
    0.06728702 = sum of:
      0.06728702 = weight(_text_:authors in 4360) [ClassicSimilarity], result of:
        0.06728702 = score(doc=4360,freq=4.0), product of:
          0.23615624 = queryWeight, product of:
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.05180212 = queryNorm
          0.28492588 = fieldWeight in 4360, product of:
            2.0 = tf(freq=4.0), with freq of:
              4.0 = termFreq=4.0
            4.558814 = idf(docFreq=1258, maxDocs=44218)
            0.03125 = fieldNorm(doc=4360)
    0.25 = coord(1/4)
  0.021279963 = product of:
    0.042559925 = sum of:
      0.042559925 = weight(_text_:n in 4360) [ClassicSimilarity], result of:
        0.042559925 = score(doc=4360,freq=2.0), product of:
          0.22335295 = queryWeight, product of:
            4.3116565 = idf(docFreq=1611, maxDocs=44218)
            0.05180212 = queryNorm
          0.19055009 = fieldWeight in 4360, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            4.3116565 = idf(docFreq=1611, maxDocs=44218)
            0.03125 = fieldNorm(doc=4360)
    0.5 = coord(1/2)
```
Abstract

Before speaking about authority control in a multilingual environment, it is necessary to describe briefly the context in which this control is carried out. The electronic catalogue of the Russian State Library (RSL) contains more than 3 million records. Cataloguing with the use of the Aleph software and ??RC 21 has been carried out for this database since 2003. Other bibliographic records included in the electronic catalogue are received as a result of converting (1) bibliographic records from card catalogues, and (2) the bibliographic records created by RSL and other organizations with the use of other software and other formats. Up to 2003 the electronic catalogue included only records in Russian and in European languages. In 2003 we started cataloguing books and serials in the languages of the peoples of the Russian Federation and former union republics of the USSR. This year we are preparing for the description of documents in oriental languages. This "motley world" demands ordering and normalization of access points. Until 2003, two authority files were maintained in the RSL: the foreign and international organizations file and a file of "authors of special categories." Authority records were created in a format based on USMARC. In 2003 the authority control group was reorganized and its function widened. Now we are supposed to carry out the authority control of headings (controlled access points) in new bibliographic records by formation of authority records for these categories: - Names of persons, creators or objects of the work translated to Russian from other languages when in the document there is an original form of the name - Names of persons in which the forms of the names on the document differ from the form established for the system of catalogues at RSL - Such names of persons, as "authors of special categories" - Names of corporate bodies, Russian, foreign or international (creators or objects of the work), occurring in the system for the first time - Names of the corporate bodies, Russian, foreign or international reflected in authority records (machine-readable and traditional) if the form of the name on the document differs from the form established for the system of catalogues at RSL - The uniform titles of the Bible and its parts, anonymous classical works

DIN 1463: Erstellung und Weiterentwicklung von Thesauri : Teil 1: Einsprachige Thesauri (1987); Teil 2: Mehrsprachige Thesauri (1993) (1987-93) 0.04

0.037239935 = product of:
  0.07447987 = sum of:
    0.07447987 = product of:
      0.14895974 = sum of:
        0.14895974 = weight(_text_:n in 8669) [ClassicSimilarity], result of:
          0.14895974 = score(doc=8669,freq=2.0), product of:
            0.22335295 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.05180212 = queryNorm
            0.6669253 = fieldWeight in 8669, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.109375 = fieldNorm(doc=8669)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Type: n

Austin, D.; Waters, J.: Guidelines for the establishment and development of multilingual thesauri (1980) 0.04

0.037239935 = product of:
  0.07447987 = sum of:
    0.07447987 = product of:
      0.14895974 = sum of:
        0.14895974 = weight(_text_:n in 0) [ClassicSimilarity], result of:
          0.14895974 = score(doc=0,freq=2.0), product of:
            0.22335295 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.05180212 = queryNorm
            0.6669253 = fieldWeight in 0, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.109375 = fieldNorm(doc=0)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Type: n

Gonzalo, J.; Verdejo, F.; Peters, C.; Calzolari, N.: Applying EuroWordNet to cross-language text retrieval (1998) 0.04

0.037239935 = product of:
  0.07447987 = sum of:
    0.07447987 = product of:
      0.14895974 = sum of:
        0.14895974 = weight(_text_:n in 6445) [ClassicSimilarity], result of:
          0.14895974 = score(doc=6445,freq=2.0), product of:
            0.22335295 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.05180212 = queryNorm
            0.6669253 = fieldWeight in 6445, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.109375 = fieldNorm(doc=6445)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Maurice, N.: Terminologie et information multilingue : aperçu des problèmes posés et recherche de solutions (1997) 0.03

0.031919945 = product of:
  0.06383989 = sum of:
    0.06383989 = product of:
      0.12767978 = sum of:
        0.12767978 = weight(_text_:n in 4548) [ClassicSimilarity], result of:
          0.12767978 = score(doc=4548,freq=2.0), product of:
            0.22335295 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.05180212 = queryNorm
            0.57165027 = fieldWeight in 4548, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.09375 = fieldNorm(doc=4548)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Weihs, J.: Three tales of multilingual cataloguing (1998) 0.03

0.02807389 = product of:
  0.05614778 = sum of:
    0.05614778 = product of:
      0.11229556 = sum of:
        0.11229556 = weight(_text_:22 in 6063) [ClassicSimilarity], result of:
          0.11229556 = score(doc=6063,freq=2.0), product of:
            0.1814022 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05180212 = queryNorm
            0.61904186 = fieldWeight in 6063, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.125 = fieldNorm(doc=6063)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Date: 2. 8.2001 8:55:22

Freire, N.; Charles, V.; Isaac, A.: Subject information and multilingualism in European bibliographic datasets : experiences with Universal Decimal Classification (2015) 0.03

0.026599953 = product of:
  0.053199906 = sum of:
    0.053199906 = product of:
      0.10639981 = sum of:
        0.10639981 = weight(_text_:n in 2289) [ClassicSimilarity], result of:
          0.10639981 = score(doc=2289,freq=2.0), product of:
            0.22335295 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.05180212 = queryNorm
            0.47637522 = fieldWeight in 2289, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.078125 = fieldNorm(doc=2289)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Bellaachia, A.; Amor-Tijani, G.: Proper nouns in English-Arabic cross language information retrieval (2008) 0.02
```
0.023036236 = product of:
  0.046072472 = sum of:
    0.046072472 = product of:
      0.092144944 = sum of:
        0.092144944 = weight(_text_:n in 2372) [ClassicSimilarity], result of:
          0.092144944 = score(doc=2372,freq=6.0), product of:
            0.22335295 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.05180212 = queryNorm
            0.41255307 = fieldWeight in 2372, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2372)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Out of vocabulary words, mostly proper nouns and technical terms, are one main source of performance degradation in Cross Language Information Retrieval (CLIR) systems. Those are words not found in the dictionary. Bilingual dictionaries in general do not cover most proper nouns, which are usually primary keys in the query. As they are spelling variants of each other in most languages, using an approximate string matching technique against the target database index is the common approach taken to find the target language correspondents of the original query key. N-gram technique proved to be the most effective among other string matching techniques. The issue arises when the languages dealt with have different alphabets. Transliteration is then applied based on phonetic similarities between the languages involved. In this study, both transliteration and the n-gram technique are combined to generate possible transliterations in an English-Arabic CLIR system. We refer to this technique as Transliteration N-Gram (TNG). We further enhance TNG by applying Part Of Speech disambiguation on the set of transliterations so that words with a similar spelling, but a different meaning, are excluded. Experimental results show that TNG gives promising results, and enhanced TNG further improves performance.
Pearce, C.; Nicholas, C.: TELLTALE: Experiments in a dynamic hypertext environment for degraded and multilingual data (1996) 0.02
```
0.02257081 = product of:
  0.04514162 = sum of:
    0.04514162 = product of:
      0.09028324 = sum of:
        0.09028324 = weight(_text_:n in 4071) [ClassicSimilarity], result of:
          0.09028324 = score(doc=4071,freq=4.0), product of:
            0.22335295 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.05180212 = queryNorm
            0.40421778 = fieldWeight in 4071, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.046875 = fieldNorm(doc=4071)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Methods and tools for finding documents relevant to a user's needs in a document corpora can be found in the information retrieval, library science, and hypertext communities. Typically, these systems provide retrieval capabilities for fairly static copora, their algorithms are dependent on the language for which they are written, e.g. English, and they do not perform well when presented with misspelled words or text that has been degraded by OCR techniques. In this article, we present experimentation results for the TELLTALE system. TELLTALE is a dynamic hypertext environment that provides full-text search from a hypertext-style user interface for text corpora that may be garbled by OCR or transmission errors, and that may contain languages other than English. TELLTALE uses several techniques based on n-grams (n character sequences of text). With these results we show that the dynamic linkage mechanisms in TELLTALE are tolerant of garbles in up to 30% of the characters in the body of the texts
Pirkola, A.; Puolamäki, D.; Järvelin, K.: Applying query structuring in cross-language retrieval (2003) 0.02
```
0.02257081 = product of:
  0.04514162 = sum of:
    0.04514162 = product of:
      0.09028324 = sum of:
        0.09028324 = weight(_text_:n in 1074) [ClassicSimilarity], result of:
          0.09028324 = score(doc=1074,freq=4.0), product of:
            0.22335295 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.05180212 = queryNorm
            0.40421778 = fieldWeight in 1074, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.046875 = fieldNorm(doc=1074)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

We will explore various ways to apply query structuring in cross-language information retrieval. In the first test, English queries were translated into Finnish using an electronic dictionary, and were run in a Finnish newspaper database of 55,000 articles. Queries were structured by combining the Finnish translation equivalents of the same English query key using the syn-operator of the InQuery retrieval system. Structured queries performed markedly better than unstructured queries. Second, the effects of compound-based structuring using a proximity operator for the translation equivalents of query language compound components were tested. The method was not useful in syn-based queries but resulted in decrease in retrieval effectiveness. Proper names are often non-identical spelling variants in different languages. This allows n-gram based translation of names not included in a dictionary. In the third test, a query structuring method where the Boolean and-operator was used to assign more weight to keys translated through n-gram matching gave good results.

Zimmermann, H.H.: Überlegungen zu einem multilingualen Thesaurus-Konzept (1995) 0.02

0.021279963 = product of:
  0.042559925 = sum of:
    0.042559925 = product of:
      0.08511985 = sum of:
        0.08511985 = weight(_text_:n in 2076) [ClassicSimilarity], result of:
          0.08511985 = score(doc=2076,freq=2.0), product of:
            0.22335295 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.05180212 = queryNorm
            0.38110018 = fieldWeight in 2076, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.0625 = fieldNorm(doc=2076)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Konstruktion und Retrieval von Wissen: 3. Tagung der Deutschen ISKO-Sektion einschließlich der Vorträge des Workshops "Thesauri als terminologische Lexika", Weilburg, 27.-29.10.1993. Hrsg.: N. Meder u.a

Dini, L.: CACAO : multilingual access to bibliographic records (2007) 0.02

0.021055417 = product of:
  0.042110834 = sum of:
    0.042110834 = product of:
      0.08422167 = sum of:
        0.08422167 = weight(_text_:22 in 126) [ClassicSimilarity], result of:
          0.08422167 = score(doc=126,freq=2.0), product of:
            0.1814022 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05180212 = queryNorm
            0.46428138 = fieldWeight in 126, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.09375 = fieldNorm(doc=126)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Content: Vortrag anlässlich des Workshops: "Extending the multilingual capacity of The European Library in the EDL project Stockholm, Swedish National Library, 22-23 November 2007".

Yu, L.-C.; Wu, C.-H.; Chang, R.-Y.; Liu, C.-H.; Hovy, E.H.: Annotation and verification of sense pools in OntoNotes (2010) 0.02
```
0.01880901 = product of:
  0.03761802 = sum of:
    0.03761802 = product of:
      0.07523604 = sum of:
        0.07523604 = weight(_text_:n in 4236) [ClassicSimilarity], result of:
          0.07523604 = score(doc=4236,freq=4.0), product of:
            0.22335295 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.05180212 = queryNorm
            0.33684817 = fieldWeight in 4236, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4236)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

The paper describes the OntoNotes, a multilingual (English, Chinese and Arabic) corpus with large-scale semantic annotations, including predicate-argument structure, word senses, ontology linking, and coreference. The underlying semantic model of OntoNotes involves word senses that are grouped into so-called sense pools, i.e., sets of near-synonymous senses of words. Such information is useful for many applications, including query expansion for information retrieval (IR) systems, (near-)duplicate detection for text summarization systems, and alternative word selection for writing support systems. Although a sense pool provides a set of near-synonymous senses of words, there is still no knowledge about whether two words in a pool are interchangeable in practical use. Therefore, this paper devises an unsupervised algorithm that incorporates Google n-grams and a statistical test to determine whether a word in a pool can be substituted by other words in the same pool. The n-gram features are used to measure the degree of context mismatch for a substitution. The statistical test is then applied to determine whether the substitution is adequate based on the degree of mismatch. The proposed method is compared with a supervised method, namely Linear Discriminant Analysis (LDA). Experimental results show that the proposed unsupervised method can achieve comparable performance with the supervised method.
Ulrich, P.S.: Multilinguale kollaborative Auskunftsdienste mit QuestionPoint (2005) 0.02
```
0.018619968 = product of:
  0.037239935 = sum of:
    0.037239935 = product of:
      0.07447987 = sum of:
        0.07447987 = weight(_text_:n in 4306) [ClassicSimilarity], result of:
          0.07447987 = score(doc=4306,freq=2.0), product of:
            0.22335295 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.05180212 = queryNorm
            0.33346266 = fieldWeight in 4306, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4306)
      0.5 = coord(1/2)
  0.5 = coord(1/2)
```
Abstract

Besucht jemand eine Bibliothekswebseite und möchte dort eine Frage stellen, wird er in der ganzen WeIt mit einer ähnlichen Situation konfrontiert: die Seiten werden nur in der/(den) Landessprache(n) der Bibliothek und vielleicht in einer anderen Sprache - meist Englisch - angeboten. Wenn der Besucher die auf diesen Seiten verwendeten Sprachen nicht kennt, ist der vermittelte Eindruck: "Wir können Deine Frage nicht verstehen. Wenn Du nicht diese Sprache für Deine Frage verwendest, wird Deine Frage nicht beantwortet." In einer WeIt, in der man zunehmend versucht Barrieren zu reduzieren bzw. abzuschaffen, sollten sich Bibliotheken fragen, ob es nicht Möglichkeiten gibt, die vorhandene sprachliche Barriere abzubauen und Dienste in mehreren Sprachen anzubieten.

Agosti, M.; Braschler, M.; Ferro, N.; Peters, C.; Siebinga, S.: Roadmap for multiLingual information access in the European Library (2007) 0.02

0.018619968 = product of:
  0.037239935 = sum of:
    0.037239935 = product of:
      0.07447987 = sum of:
        0.07447987 = weight(_text_:n in 2431) [ClassicSimilarity], result of:
          0.07447987 = score(doc=2431,freq=2.0), product of:
            0.22335295 = queryWeight, product of:
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.05180212 = queryNorm
            0.33346266 = fieldWeight in 2431, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              4.3116565 = idf(docFreq=1611, maxDocs=44218)
              0.0546875 = fieldNorm(doc=2431)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Kutschekmanesch, S.; Lutes, B.; Moelle, K.; Thiel, U.; Tzeras, K.: Automated multilingual indexing : a synthesis of rule-based and thesaurus-based methods (1998) 0.02

0.01754618 = product of:
  0.03509236 = sum of:
    0.03509236 = product of:
      0.07018472 = sum of:
        0.07018472 = weight(_text_:22 in 4157) [ClassicSimilarity], result of:
          0.07018472 = score(doc=4157,freq=2.0), product of:
            0.1814022 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.05180212 = queryNorm
            0.38690117 = fieldWeight in 4157, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=4157)
      0.5 = coord(1/2)
  0.5 = coord(1/2)

Source: Information und Märkte: 50. Deutscher Dokumentartag 1998, Kongreß der Deutschen Gesellschaft für Dokumentation e.V. (DGD), Rheinische Friedrich-Wilhelms-Universität Bonn, 22.-24. September 1998. Hrsg. von Marlies Ockenfeld u. Gerhard J. Mantwill

Search (65 results, page 1 of 4)

Authors

Years

Languages

Types

Themes