Search (45 results, page 1 of 3)

Luca, E.W. de; Dahlberg, I.: ¬Die Multilingual Lexical Linked Data Cloud : eine mögliche Zugangsoptimierung? (2014) 0.03

0.029976932 = product of:
  0.059953865 = sum of:
    0.059953865 = sum of:
      0.004032909 = weight(_text_:a in 1736) [ClassicSimilarity], result of:
        0.004032909 = score(doc=1736,freq=2.0), product of:
          0.052761257 = queryWeight, product of:
            1.153047 = idf(docFreq=37942, maxDocs=44218)
            0.045758117 = queryNorm
          0.07643694 = fieldWeight in 1736, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            1.153047 = idf(docFreq=37942, maxDocs=44218)
            0.046875 = fieldNorm(doc=1736)
      0.018723397 = weight(_text_:h in 1736) [ClassicSimilarity], result of:
        0.018723397 = score(doc=1736,freq=2.0), product of:
          0.113683715 = queryWeight, product of:
            2.4844491 = idf(docFreq=10020, maxDocs=44218)
            0.045758117 = queryNorm
          0.16469726 = fieldWeight in 1736, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            2.4844491 = idf(docFreq=10020, maxDocs=44218)
            0.046875 = fieldNorm(doc=1736)
      0.03719756 = weight(_text_:22 in 1736) [ClassicSimilarity], result of:
        0.03719756 = score(doc=1736,freq=2.0), product of:
          0.16023713 = queryWeight, product of:
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.045758117 = queryNorm
          0.23214069 = fieldWeight in 1736, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            3.5018296 = idf(docFreq=3622, maxDocs=44218)
            0.046875 = fieldNorm(doc=1736)
  0.5 = coord(1/2)

Date: 22. 9.2014 19:00:13
Source: Information - Wissenschaft und Praxis. 65(2014) H.4/5, S.279-287
Type: a

Zhou, Y. et al.: Analysing entity context in multilingual Wikipedia to support entity-centric retrieval applications (2016) 0.02

0.022905817 = product of:
  0.045811635 = sum of:
    0.045811635 = product of:
      0.06871745 = sum of:
        0.0067215143 = weight(_text_:a in 2758) [ClassicSimilarity], result of:
          0.0067215143 = score(doc=2758,freq=2.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.12739488 = fieldWeight in 2758, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.078125 = fieldNorm(doc=2758)
        0.061995935 = weight(_text_:22 in 2758) [ClassicSimilarity], result of:
          0.061995935 = score(doc=2758,freq=2.0), product of:
            0.16023713 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045758117 = queryNorm
            0.38690117 = fieldWeight in 2758, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=2758)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Date: 1. 2.2016 18:25:22
Type: a

Celli, F. et al.: Enabling multilingual search through controlled vocabularies : the AGRIS approach (2016) 0.02

0.022905817 = product of:
  0.045811635 = sum of:
    0.045811635 = product of:
      0.06871745 = sum of:
        0.0067215143 = weight(_text_:a in 3278) [ClassicSimilarity], result of:
          0.0067215143 = score(doc=3278,freq=2.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.12739488 = fieldWeight in 3278, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.078125 = fieldNorm(doc=3278)
        0.061995935 = weight(_text_:22 in 3278) [ClassicSimilarity], result of:
          0.061995935 = score(doc=3278,freq=2.0), product of:
            0.16023713 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045758117 = queryNorm
            0.38690117 = fieldWeight in 3278, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.078125 = fieldNorm(doc=3278)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Source: Metadata and semantics research: 10th International Conference, MTSR 2016, Göttingen, Germany, November 22-25, 2016, Proceedings. Eds.: E. Garoufallou
Type: a

Mitchell, J.S.; Zeng, M.L.; Zumer, M.: Modeling classification systems in multicultural and multilingual contexts (2012) 0.02

0.01943623 = product of:
  0.03887246 = sum of:
    0.03887246 = product of:
      0.05830869 = sum of:
        0.0057033943 = weight(_text_:a in 1967) [ClassicSimilarity], result of:
          0.0057033943 = score(doc=1967,freq=4.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.10809815 = fieldWeight in 1967, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=1967)
        0.052605297 = weight(_text_:22 in 1967) [ClassicSimilarity], result of:
          0.052605297 = score(doc=1967,freq=4.0), product of:
            0.16023713 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045758117 = queryNorm
            0.32829654 = fieldWeight in 1967, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=1967)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Abstract: This paper reports on the second part of an initiative of the authors on researching classification systems with the conceptual model defined by the Functional Requirements for Subject Authority Data (FRSAD) final report. In an earlier study, the authors explored whether the FRSAD conceptual model could be extended beyond subject authority data to model classification data. The focus of the current study is to determine if classification data modeled using FRSAD can be used to solve real-world discovery problems in multicultural and multilingual contexts. The paper discusses the relationships between entities (same type or different types) in the context of classification systems that involve multiple translations and /or multicultural implementations. Results of two case studies are presented in detail: (a) two instances of the DDC (DDC 22 in English, and the Swedish-English mixed translation of DDC 22), and (b) Chinese Library Classification. The use cases of conceptual models in practice are also discussed.
Type: a

De Luca, E.W.; Dahlberg, I.: Including knowledge domains from the ICC into the multilingual lexical linked data cloud (2014) 0.02
```
0.016853087 = product of:
  0.033706173 = sum of:
    0.033706173 = product of:
      0.05055926 = sum of:
        0.0067215143 = weight(_text_:a in 1493) [ClassicSimilarity], result of:
          0.0067215143 = score(doc=1493,freq=8.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.12739488 = fieldWeight in 1493, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1493)
        0.043837745 = weight(_text_:22 in 1493) [ClassicSimilarity], result of:
          0.043837745 = score(doc=1493,freq=4.0), product of:
            0.16023713 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045758117 = queryNorm
            0.27358043 = fieldWeight in 1493, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1493)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)
```
Abstract

A lot of information that is already available on the Web, or retrieved from local information systems and social networks is structured in data silos that are not semantically related. Semantic technologies make it emerge that the use of typed links that directly express their relations are an advantage for every application that can reuse the incorporated knowledge about the data. For this reason, data integration, through reengineering (e.g. triplify), or querying (e.g. D2R) is an important task in order to make information available for everyone. Thus, in order to build a semantic map of the data, we need knowledge about data items itself and the relation between heterogeneous data items. In this paper, we present our work of providing Lexical Linked Data (LLD) through a meta-model that contains all the resources and gives the possibility to retrieve and navigate them from different perspectives. We combine the existing work done on knowledge domains (based on the Information Coding Classification) within the Multilingual Lexical Linked Data Cloud (based on the RDF/OWL EurowordNet and the related integrated lexical resources (MultiWordNet, EuroWordNet, MEMODATA Lexicon, Hamburg Methaphor DB).

Date

22. 9.2014 19:01:18

Source

Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik

Type

a
Mitchell, J.S.; Zeng, M.L.; Zumer, M.: Modeling classification systems in multicultural and multilingual contexts (2014) 0.02
```
0.016552916 = product of:
  0.03310583 = sum of:
    0.03310583 = product of:
      0.049658746 = sum of:
        0.0058210026 = weight(_text_:a in 1962) [ClassicSimilarity], result of:
          0.0058210026 = score(doc=1962,freq=6.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.11032722 = fieldWeight in 1962, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1962)
        0.043837745 = weight(_text_:22 in 1962) [ClassicSimilarity], result of:
          0.043837745 = score(doc=1962,freq=4.0), product of:
            0.16023713 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045758117 = queryNorm
            0.27358043 = fieldWeight in 1962, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1962)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)
```
Abstract

This article reports on the second part of an initiative of the authors on researching classification systems with the conceptual model defined by the Functional Requirements for Subject Authority Data (FRSAD) final report. In an earlier study, the authors explored whether the FRSAD conceptual model could be extended beyond subject authority data to model classification data. The focus of the current study is to determine if classification data modeled using FRSAD can be used to solve real-world discovery problems in multicultural and multilingual contexts. The article discusses the relationships between entities (same type or different types) in the context of classification systems that involve multiple translations and/or multicultural implementations. Results of two case studies are presented in detail: (a) two instances of the Dewey Decimal Classification [DDC] (DDC 22 in English, and the Swedish-English mixed translation of DDC 22), and (b) Chinese Library Classification. The use cases of conceptual models in practice are also discussed.

Footnote

Contribution in a special issue "Beyond libraries: Subject metadata in the digital environment and Semantic Web" - Enthält Beiträge der gleichnamigen IFLA Satellite Post-Conference, 17-18 August 2012, Tallinn.

Type

a

Frâncu, V.; Sabo, C.-N.: Implementation of a UDC-based multilingual thesaurus in a library catalogue : the case of BiblioPhil (2010) 0.02

0.015692044 = product of:
  0.031384088 = sum of:
    0.031384088 = product of:
      0.04707613 = sum of:
        0.009878568 = weight(_text_:a in 3697) [ClassicSimilarity], result of:
          0.009878568 = score(doc=3697,freq=12.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.18723148 = fieldWeight in 3697, product of:
              3.4641016 = tf(freq=12.0), with freq of:
                12.0 = termFreq=12.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=3697)
        0.03719756 = weight(_text_:22 in 3697) [ClassicSimilarity], result of:
          0.03719756 = score(doc=3697,freq=2.0), product of:
            0.16023713 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045758117 = queryNorm
            0.23214069 = fieldWeight in 3697, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=3697)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Abstract: In order to enhance the use of Universal Decimal Classification (UDC) numbers in information retrieval, the authors have represented classification with multilingual thesaurus descriptors and implemented this solution in an automated way. The authors illustrate a solution implemented in a BiblioPhil library system. The standard formats used are UNIMARC for subject authority records (i.e. the UDC-based multilingual thesaurus) and MARC XML support for data transfer. The multilingual thesaurus was built according to existing standards, the constituent parts of the classification notations being used as the basis for search terms in the multilingual information retrieval. The verbal equivalents, descriptors and non-descriptors, are used to expand the number of concepts and are given in Romanian, English and French. This approach saves the time of the indexer and provides more user-friendly and easier access to the bibliographic information. The multilingual aspect of the thesaurus enhances information access for a greater number of online users
Date: 22. 7.2010 20:40:56
Type: a

Fluhr, C.: Crosslingual access to photo databases (2012) 0.02

0.015087794 = product of:
  0.030175587 = sum of:
    0.030175587 = product of:
      0.04526338 = sum of:
        0.008065818 = weight(_text_:a in 93) [ClassicSimilarity], result of:
          0.008065818 = score(doc=93,freq=8.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.15287387 = fieldWeight in 93, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=93)
        0.03719756 = weight(_text_:22 in 93) [ClassicSimilarity], result of:
          0.03719756 = score(doc=93,freq=2.0), product of:
            0.16023713 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045758117 = queryNorm
            0.23214069 = fieldWeight in 93, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.046875 = fieldNorm(doc=93)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Abstract: This paper is about search of photos in photo databases of agencies which sell photos over the Internet. The problem is far from the behavior of photo databases managed by librarians and also far from the corpora generally used for research purposes. The descriptions use mainly single words and it is well known that it is not the best way to have a good search. This increases the problem of semantic ambiguity. This problem of semantic ambiguity is crucial for cross-language querying. On the other hand, users are not aware of documentation techniques and use generally very simple queries but want to get precise answers. This paper gives the experience gained in a 3 year use (2006-2008) of a cross-language access to several of the main international commercial photo databases. The languages used were French, English, and German.
Date: 17. 4.2012 14:25:22
Type: a

Ménard, E.; Khashman, N.; Kochkina, S.; Torres-Moreno, J.-M.; Velazquez-Morales, P.; Zhou, F.; Jourlin, P.; Rawat, P.; Peinl, P.; Linhares Pontes, E.; Brunetti., I.: ¬A second life for TIIARA : from bilingual to multilingual! (2016) 0.01
```
0.012837617 = product of:
  0.025675233 = sum of:
    0.025675233 = product of:
      0.03851285 = sum of:
        0.007514882 = weight(_text_:a in 2834) [ClassicSimilarity], result of:
          0.007514882 = score(doc=2834,freq=10.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.14243183 = fieldWeight in 2834, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2834)
        0.030997967 = weight(_text_:22 in 2834) [ClassicSimilarity], result of:
          0.030997967 = score(doc=2834,freq=2.0), product of:
            0.16023713 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045758117 = queryNorm
            0.19345059 = fieldWeight in 2834, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2834)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)
```
Abstract

Multilingual controlled vocabularies are rare and often very limited in the choice of languages offered. TIIARA (Taxonomy for Image Indexing and RetrievAl) is a bilingual taxonomy developed for image indexing and retrieval. This controlled vocabulary offers indexers and image searchers innovative and coherent access points for ordinary images. The preliminary steps of the elaboration of the bilingual structure are presented. For its initial development, TIIARA included only two languages, French and English. As a logical follow-up, TIIARA was translated into eight languages-Arabic, Spanish, Brazilian Portuguese, Mandarin Chinese, Italian, German, Hindi and Russian-in order to increase its international scope. This paper briefly describes the different stages of the development of the bilingual structure. The processes used in the translations are subsequently presented, as well as the main difficulties encountered by the translators. Adding more languages in TIIARA constitutes an added value for a controlled vocabulary meant to be used by image searchers, who are often limited by their lack of knowledge of multiple languages.

Source

Knowledge organization. 43(2016) no.1, S.22-34

Type

a

Jahns, Y.: Sacherschließung - zeitgemäß und zukunftsfähig (2010) 0.01

0.012642393 = product of:
  0.025284786 = sum of:
    0.025284786 = product of:
      0.037927177 = sum of:
        0.0067215143 = weight(_text_:a in 3278) [ClassicSimilarity], result of:
          0.0067215143 = score(doc=3278,freq=2.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.12739488 = fieldWeight in 3278, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.078125 = fieldNorm(doc=3278)
        0.031205663 = weight(_text_:h in 3278) [ClassicSimilarity], result of:
          0.031205663 = score(doc=3278,freq=2.0), product of:
            0.113683715 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045758117 = queryNorm
            0.27449545 = fieldWeight in 3278, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.078125 = fieldNorm(doc=3278)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Source: Bibliotheksdienst. 44(2010) H.5, S.461-468
Type: a

Hubrich, J.: Multilinguale Wissensorganisation im Zeitalter der Globalisierung : das Projekt CrissCross (2010) 0.01

0.011452909 = product of:
  0.022905817 = sum of:
    0.022905817 = product of:
      0.034358725 = sum of:
        0.0033607571 = weight(_text_:a in 4793) [ClassicSimilarity], result of:
          0.0033607571 = score(doc=4793,freq=2.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.06369744 = fieldWeight in 4793, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4793)
        0.030997967 = weight(_text_:22 in 4793) [ClassicSimilarity], result of:
          0.030997967 = score(doc=4793,freq=2.0), product of:
            0.16023713 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.045758117 = queryNorm
            0.19345059 = fieldWeight in 4793, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4793)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Source: Wissensspeicher in digitalen Räumen: Nachhaltigkeit - Verfügbarkeit - semantische Interoperabilität. Proceedings der 11. Tagung der Deutschen Sektion der Internationalen Gesellschaft für Wissensorganisation, Konstanz, 20. bis 22. Februar 2008. Hrsg.: J. Sieglerschmidt u. H.P.Ohly
Type: a

Yu, L.-C.; Wu, C.-H.; Chang, R.-Y.; Liu, C.-H.; Hovy, E.H.: Annotation and verification of sense pools in OntoNotes (2010) 0.01
```
0.010897796 = product of:
  0.021795591 = sum of:
    0.021795591 = product of:
      0.032693386 = sum of:
        0.010627648 = weight(_text_:a in 4236) [ClassicSimilarity], result of:
          0.010627648 = score(doc=4236,freq=20.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.20142901 = fieldWeight in 4236, product of:
              4.472136 = tf(freq=20.0), with freq of:
                20.0 = termFreq=20.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4236)
        0.022065736 = weight(_text_:h in 4236) [ClassicSimilarity], result of:
          0.022065736 = score(doc=4236,freq=4.0), product of:
            0.113683715 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045758117 = queryNorm
            0.1940976 = fieldWeight in 4236, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0390625 = fieldNorm(doc=4236)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)
```
Abstract

The paper describes the OntoNotes, a multilingual (English, Chinese and Arabic) corpus with large-scale semantic annotations, including predicate-argument structure, word senses, ontology linking, and coreference. The underlying semantic model of OntoNotes involves word senses that are grouped into so-called sense pools, i.e., sets of near-synonymous senses of words. Such information is useful for many applications, including query expansion for information retrieval (IR) systems, (near-)duplicate detection for text summarization systems, and alternative word selection for writing support systems. Although a sense pool provides a set of near-synonymous senses of words, there is still no knowledge about whether two words in a pool are interchangeable in practical use. Therefore, this paper devises an unsupervised algorithm that incorporates Google n-grams and a statistical test to determine whether a word in a pool can be substituted by other words in the same pool. The n-gram features are used to measure the degree of context mismatch for a substitution. The statistical test is then applied to determine whether the substitution is adequate based on the degree of mismatch. The proposed method is compared with a supervised method, namely Linear Discriminant Analysis (LDA). Experimental results show that the proposed unsupervised method can achieve comparable performance with the supervised method.

Type

a
Tsai, M.-.F.; Chen, H.-H.; Wang, Y.-T.: Learning a merge model for multilingual information retrieval (2011) 0.01
```
0.010523799 = product of:
  0.021047598 = sum of:
    0.021047598 = product of:
      0.031571396 = sum of:
        0.0095056575 = weight(_text_:a in 2750) [ClassicSimilarity], result of:
          0.0095056575 = score(doc=2750,freq=16.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.18016359 = fieldWeight in 2750, product of:
              4.0 = tf(freq=16.0), with freq of:
                16.0 = termFreq=16.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2750)
        0.022065736 = weight(_text_:h in 2750) [ClassicSimilarity], result of:
          0.022065736 = score(doc=2750,freq=4.0), product of:
            0.113683715 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045758117 = queryNorm
            0.1940976 = fieldWeight in 2750, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0390625 = fieldNorm(doc=2750)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)
```
Abstract

This paper proposes a learning approach for the merging process in multilingual information retrieval (MLIR). To conduct the learning approach, we present a number of features that may influence the MLIR merging process. These features are mainly extracted from three levels: query, document, and translation. After the feature extraction, we then use the FRank ranking algorithm to construct a merge model. To the best of our knowledge, this practice is the first attempt to use a learning-based ranking algorithm to construct a merge model for MLIR merging. In our experiments, three test collections for the task of crosslingual information retrieval (CLIR) in NTCIR3, 4, and 5 are employed to assess the performance of our proposed method. Moreover, several merging methods are also carried out for a comparison, including traditional merging methods, the 2-step merging strategy, and the merging method based on logistic regression. The experimental results show that our proposed method can significantly improve merging quality on two different types of datasets. In addition to the effectiveness, through the merge model generated by FRank, our method can further identify key factors that influence the merging process. This information might provide us more insight and understanding into MLIR merging.

Type

a

Franz, G.: Interlingualer Wissensaustausch in der Wikipedia : Warum das Projekt noch kein (Welt-)Erfolg ist und von Möglichkeiten dies zu ändernStrategien im Angesicht der Globalisierung (2011) 0.01

0.008849675 = product of:
  0.01769935 = sum of:
    0.01769935 = product of:
      0.026549023 = sum of:
        0.0047050603 = weight(_text_:a in 4506) [ClassicSimilarity], result of:
          0.0047050603 = score(doc=4506,freq=2.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.089176424 = fieldWeight in 4506, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4506)
        0.021843962 = weight(_text_:h in 4506) [ClassicSimilarity], result of:
          0.021843962 = score(doc=4506,freq=2.0), product of:
            0.113683715 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045758117 = queryNorm
            0.19214681 = fieldWeight in 4506, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0546875 = fieldNorm(doc=4506)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Source: Information - Wissenschaft und Praxis. 62(2011) H.4, S.183-190
Type: a

Stiller, J.; Gäde, M.; Petras, V.: Multilingual access to digital libraries : the Europeana use case (2013) 0.01

0.008849675 = product of:
  0.01769935 = sum of:
    0.01769935 = product of:
      0.026549023 = sum of:
        0.0047050603 = weight(_text_:a in 902) [ClassicSimilarity], result of:
          0.0047050603 = score(doc=902,freq=2.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.089176424 = fieldWeight in 902, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=902)
        0.021843962 = weight(_text_:h in 902) [ClassicSimilarity], result of:
          0.021843962 = score(doc=902,freq=2.0), product of:
            0.113683715 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045758117 = queryNorm
            0.19214681 = fieldWeight in 902, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0546875 = fieldNorm(doc=902)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Source: Information - Wissenschaft und Praxis. 64(2013) H.2/3, S.86-95
Type: a

Hauer, M.: Zur Bedeutung normierter Terminologien in Zeiten moderner Sprach- und Information-Retrieval-Technologien (2013) 0.01

0.008849675 = product of:
  0.01769935 = sum of:
    0.01769935 = product of:
      0.026549023 = sum of:
        0.0047050603 = weight(_text_:a in 995) [ClassicSimilarity], result of:
          0.0047050603 = score(doc=995,freq=2.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.089176424 = fieldWeight in 995, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0546875 = fieldNorm(doc=995)
        0.021843962 = weight(_text_:h in 995) [ClassicSimilarity], result of:
          0.021843962 = score(doc=995,freq=2.0), product of:
            0.113683715 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045758117 = queryNorm
            0.19214681 = fieldWeight in 995, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0546875 = fieldNorm(doc=995)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Source: ABI-Technik. 33(2013) H.1, S.2-6
Type: a

Ye, Z.; Huang, J.X.; He, B.; Lin, H.: Mining a multilingual association dictionary from Wikipedia for cross-language information retrieval (2012) 0.01
```
0.0085617015 = product of:
  0.017123403 = sum of:
    0.017123403 = product of:
      0.025685104 = sum of:
        0.010082272 = weight(_text_:a in 513) [ClassicSimilarity], result of:
          0.010082272 = score(doc=513,freq=18.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.19109234 = fieldWeight in 513, product of:
              4.2426405 = tf(freq=18.0), with freq of:
                18.0 = termFreq=18.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.0390625 = fieldNorm(doc=513)
        0.015602832 = weight(_text_:h in 513) [ClassicSimilarity], result of:
          0.015602832 = score(doc=513,freq=2.0), product of:
            0.113683715 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045758117 = queryNorm
            0.13724773 = fieldWeight in 513, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.0390625 = fieldNorm(doc=513)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)
```
Abstract

Wikipedia is characterized by its dense link structure and a large number of articles in different languages, which make it a notable Web corpus for knowledge extraction and mining, in particular for mining the multilingual associations. In this paper, motivated by a psychological theory of word meaning, we propose a graph-based approach to constructing a cross-language association dictionary (CLAD) from Wikipedia, which can be used in a variety of cross-language accessing and processing applications. In order to evaluate the quality of the mined CLAD, and to demonstrate how the mined CLAD can be used in practice, we explore two different applications of the mined CLAD to cross-language information retrieval (CLIR). First, we use the mined CLAD to conduct cross-language query expansion; and, second, we use it to filter out translation candidates with low translation probabilities. Experimental results on a variety of standard CLIR test collections show that the CLIR retrieval performance can be substantially improved with the above two applications of CLAD, which indicates that the mined CLAD is of sound quality.

Type

a

Huckstorf, A.; Petras, V.: Mind the lexical gap : EuroVoc Building Block of the Semantic Web (2011) 0.01

0.008142265 = product of:
  0.01628453 = sum of:
    0.01628453 = product of:
      0.024426792 = sum of:
        0.0057033943 = weight(_text_:a in 2782) [ClassicSimilarity], result of:
          0.0057033943 = score(doc=2782,freq=4.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.10809815 = fieldWeight in 2782, product of:
              2.0 = tf(freq=4.0), with freq of:
                4.0 = termFreq=4.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.046875 = fieldNorm(doc=2782)
        0.018723397 = weight(_text_:h in 2782) [ClassicSimilarity], result of:
          0.018723397 = score(doc=2782,freq=2.0), product of:
            0.113683715 = queryWeight, product of:
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.045758117 = queryNorm
            0.16469726 = fieldWeight in 2782, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              2.4844491 = idf(docFreq=10020, maxDocs=44218)
              0.046875 = fieldNorm(doc=2782)
      0.6666667 = coord(2/3)
  0.5 = coord(1/2)

Source: Information - Wissenschaft und Praxis. 62(2011) H.2/3, S.125-126
Type: a

Rettinger, A.; Schumilin, A.; Thoma, S.; Ell, B.: Learning a cross-lingual semantic representation of relations expressed in text (2015) 0.00

0.0022405048 = product of:
  0.0044810097 = sum of:
    0.0044810097 = product of:
      0.013443029 = sum of:
        0.013443029 = weight(_text_:a in 2027) [ClassicSimilarity], result of:
          0.013443029 = score(doc=2027,freq=8.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.25478977 = fieldWeight in 2027, product of:
              2.828427 = tf(freq=8.0), with freq of:
                8.0 = termFreq=8.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.078125 = fieldNorm(doc=2027)
      0.33333334 = coord(1/3)
  0.5 = coord(1/2)

Type: a

Freire, N.; Charles, V.; Isaac, A.: Subject information and multilingualism in European bibliographic datasets : experiences with Universal Decimal Classification (2015) 0.00

0.0019403342 = product of:
  0.0038806684 = sum of:
    0.0038806684 = product of:
      0.011642005 = sum of:
        0.011642005 = weight(_text_:a in 2289) [ClassicSimilarity], result of:
          0.011642005 = score(doc=2289,freq=6.0), product of:
            0.052761257 = queryWeight, product of:
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.045758117 = queryNorm
            0.22065444 = fieldWeight in 2289, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              1.153047 = idf(docFreq=37942, maxDocs=44218)
              0.078125 = fieldNorm(doc=2289)
      0.33333334 = coord(1/3)
  0.5 = coord(1/2)

Source: Classification and authority control: expanding resource discovery: proceedings of the International UDC Seminar 2015, 29-30 October 2015, Lisbon, Portugal. Eds.: Slavic, A. u. M.I. Cordeiro
Type: a

Search (45 results, page 1 of 3)

Authors

Languages

Types

Themes

Classifications