-
Huo, W.: Automatic multi-word term extraction and its application to Web-page summarization (2012)
0.29
0.28526202 = product of:
0.3993668 = sum of:
0.17932567 = weight(_text_:2f in 563) [ClassicSimilarity], result of:
0.17932567 = score(doc=563,freq=2.0), product of:
0.31907457 = queryWeight, product of:
8.478011 = idf(docFreq=24, maxDocs=44218)
0.037635546 = queryNorm
0.56201804 = fieldWeight in 563, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
8.478011 = idf(docFreq=24, maxDocs=44218)
0.046875 = fieldNorm(doc=563)
0.007688564 = weight(_text_:information in 563) [ClassicSimilarity], result of:
0.007688564 = score(doc=563,freq=2.0), product of:
0.066068366 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.037635546 = queryNorm
0.116372846 = fieldWeight in 563, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.046875 = fieldNorm(doc=563)
0.022828683 = weight(_text_:retrieval in 563) [ClassicSimilarity], result of:
0.022828683 = score(doc=563,freq=2.0), product of:
0.11384433 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.037635546 = queryNorm
0.20052543 = fieldWeight in 563, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.046875 = fieldNorm(doc=563)
0.17932567 = weight(_text_:2f in 563) [ClassicSimilarity], result of:
0.17932567 = score(doc=563,freq=2.0), product of:
0.31907457 = queryWeight, product of:
8.478011 = idf(docFreq=24, maxDocs=44218)
0.037635546 = queryNorm
0.56201804 = fieldWeight in 563, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
8.478011 = idf(docFreq=24, maxDocs=44218)
0.046875 = fieldNorm(doc=563)
0.0101981945 = product of:
0.030594582 = sum of:
0.030594582 = weight(_text_:22 in 563) [ClassicSimilarity], result of:
0.030594582 = score(doc=563,freq=2.0), product of:
0.13179328 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.037635546 = queryNorm
0.23214069 = fieldWeight in 563, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=563)
0.33333334 = coord(1/3)
0.71428573 = coord(5/7)
- Abstract
- In this thesis we propose three new word association measures for multi-word term extraction. We combine these association measures with LocalMaxs algorithm in our extraction model and compare the results of different multi-word term extraction methods. Our approach is language and domain independent and requires no training data. It can be applied to such tasks as text summarization, information retrieval, and document classification. We further explore the potential of using multi-word terms as an effective representation for general web-page summarization. We extract multi-word terms from human written summaries in a large collection of web-pages, and generate the summaries by aligning document words with these multi-word terms. Our system applies machine translation technology to learn the aligning process from a training set and focuses on selecting high quality multi-word terms from human written summaries to generate suitable results for web-page summarization.
- Content
- A Thesis presented to The University of Guelph In partial fulfilment of requirements for the degree of Master of Science in Computer Science. Vgl. Unter: http://www.inf.ufrgs.br%2F~ceramisch%2Fdownload_files%2Fpublications%2F2009%2Fp01.pdf.
- Date
- 10. 1.2013 19:22:47
-
Hotho, A.; Bloehdorn, S.: Data Mining 2004 : Text classification by boosting weak learners based on terms and concepts (2004)
0.24
0.24492845 = product of:
0.42862478 = sum of:
0.059775226 = product of:
0.17932567 = sum of:
0.17932567 = weight(_text_:3a in 562) [ClassicSimilarity], result of:
0.17932567 = score(doc=562,freq=2.0), product of:
0.31907457 = queryWeight, product of:
8.478011 = idf(docFreq=24, maxDocs=44218)
0.037635546 = queryNorm
0.56201804 = fieldWeight in 562, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
8.478011 = idf(docFreq=24, maxDocs=44218)
0.046875 = fieldNorm(doc=562)
0.33333334 = coord(1/3)
0.17932567 = weight(_text_:2f in 562) [ClassicSimilarity], result of:
0.17932567 = score(doc=562,freq=2.0), product of:
0.31907457 = queryWeight, product of:
8.478011 = idf(docFreq=24, maxDocs=44218)
0.037635546 = queryNorm
0.56201804 = fieldWeight in 562, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
8.478011 = idf(docFreq=24, maxDocs=44218)
0.046875 = fieldNorm(doc=562)
0.17932567 = weight(_text_:2f in 562) [ClassicSimilarity], result of:
0.17932567 = score(doc=562,freq=2.0), product of:
0.31907457 = queryWeight, product of:
8.478011 = idf(docFreq=24, maxDocs=44218)
0.037635546 = queryNorm
0.56201804 = fieldWeight in 562, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
8.478011 = idf(docFreq=24, maxDocs=44218)
0.046875 = fieldNorm(doc=562)
0.0101981945 = product of:
0.030594582 = sum of:
0.030594582 = weight(_text_:22 in 562) [ClassicSimilarity], result of:
0.030594582 = score(doc=562,freq=2.0), product of:
0.13179328 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.037635546 = queryNorm
0.23214069 = fieldWeight in 562, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.046875 = fieldNorm(doc=562)
0.33333334 = coord(1/3)
0.5714286 = coord(4/7)
- Content
- Vgl.: http://www.google.de/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&cad=rja&ved=0CEAQFjAA&url=http%3A%2F%2Fciteseerx.ist.psu.edu%2Fviewdoc%2Fdownload%3Fdoi%3D10.1.1.91.4940%26rep%3Drep1%26type%3Dpdf&ei=dOXrUMeIDYHDtQahsIGACg&usg=AFQjCNHFWVh6gNPvnOrOS9R3rkrXCNVD-A&sig2=5I2F5evRfMnsttSgFF9g7Q&bvm=bv.1357316858,d.Yms.
- Date
- 8. 1.2013 10:22:32
-
Noever, D.; Ciolino, M.: ¬The Turing deception (2022)
0.18
0.17932567 = product of:
0.41842657 = sum of:
0.059775226 = product of:
0.17932567 = sum of:
0.17932567 = weight(_text_:3a in 862) [ClassicSimilarity], result of:
0.17932567 = score(doc=862,freq=2.0), product of:
0.31907457 = queryWeight, product of:
8.478011 = idf(docFreq=24, maxDocs=44218)
0.037635546 = queryNorm
0.56201804 = fieldWeight in 862, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
8.478011 = idf(docFreq=24, maxDocs=44218)
0.046875 = fieldNorm(doc=862)
0.33333334 = coord(1/3)
0.17932567 = weight(_text_:2f in 862) [ClassicSimilarity], result of:
0.17932567 = score(doc=862,freq=2.0), product of:
0.31907457 = queryWeight, product of:
8.478011 = idf(docFreq=24, maxDocs=44218)
0.037635546 = queryNorm
0.56201804 = fieldWeight in 862, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
8.478011 = idf(docFreq=24, maxDocs=44218)
0.046875 = fieldNorm(doc=862)
0.17932567 = weight(_text_:2f in 862) [ClassicSimilarity], result of:
0.17932567 = score(doc=862,freq=2.0), product of:
0.31907457 = queryWeight, product of:
8.478011 = idf(docFreq=24, maxDocs=44218)
0.037635546 = queryNorm
0.56201804 = fieldWeight in 862, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
8.478011 = idf(docFreq=24, maxDocs=44218)
0.046875 = fieldNorm(doc=862)
0.42857143 = coord(3/7)
- Source
- https%3A%2F%2Farxiv.org%2Fabs%2F2212.06721&usg=AOvVaw3i_9pZm9y_dQWoHi6uv0EN
-
Semantik, Lexikographie und Computeranwendungen : Workshop ... (Bonn) : 1995.01.27-28 (1996)
0.06
0.06063357 = product of:
0.14147833 = sum of:
0.006407136 = weight(_text_:information in 190) [ClassicSimilarity], result of:
0.006407136 = score(doc=190,freq=2.0), product of:
0.066068366 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.037635546 = queryNorm
0.09697737 = fieldWeight in 190, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0390625 = fieldNorm(doc=190)
0.1265727 = weight(_text_:kongress in 190) [ClassicSimilarity], result of:
0.1265727 = score(doc=190,freq=4.0), product of:
0.24693015 = queryWeight, product of:
6.5610886 = idf(docFreq=169, maxDocs=44218)
0.037635546 = queryNorm
0.51258504 = fieldWeight in 190, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
6.5610886 = idf(docFreq=169, maxDocs=44218)
0.0390625 = fieldNorm(doc=190)
0.008498495 = product of:
0.025495486 = sum of:
0.025495486 = weight(_text_:22 in 190) [ClassicSimilarity], result of:
0.025495486 = score(doc=190,freq=2.0), product of:
0.13179328 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.037635546 = queryNorm
0.19345059 = fieldWeight in 190, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0390625 = fieldNorm(doc=190)
0.33333334 = coord(1/3)
0.42857143 = coord(3/7)
- Date
- 14. 4.2007 10:04:22
- RSWK
- Lexikographie / Semantik / Kongress / Bonn <1995>
- Series
- Sprache und Information ; 33
- Subject
- Lexikographie / Semantik / Kongress / Bonn <1995>
-
Linguistik und neue Medien (1998)
0.06
0.057179723 = product of:
0.40025803 = sum of:
0.40025803 = weight(_text_:kongress in 5770) [ClassicSimilarity], result of:
0.40025803 = score(doc=5770,freq=10.0), product of:
0.24693015 = queryWeight, product of:
6.5610886 = idf(docFreq=169, maxDocs=44218)
0.037635546 = queryNorm
1.6209363 = fieldWeight in 5770, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
6.5610886 = idf(docFreq=169, maxDocs=44218)
0.078125 = fieldNorm(doc=5770)
0.14285715 = coord(1/7)
- Footnote
- Publikation zu einem Kongress 1997 in Leipzig
- RSWK
- Lexikographie / Neue Medien / Kongress / Leipzig <1997> (2134)
Syntaktische Analyse / Neue Medien / Kongress / Leipzig <1997> (2134)
- Subject
- Lexikographie / Neue Medien / Kongress / Leipzig <1997> (2134)
Syntaktische Analyse / Neue Medien / Kongress / Leipzig <1997> (2134)
-
Rahmstorf, G.: Rückkehr von Ordnung in die Informationstechnik? (2000)
0.05
0.04559309 = product of:
0.1595758 = sum of:
0.007688564 = weight(_text_:information in 5504) [ClassicSimilarity], result of:
0.007688564 = score(doc=5504,freq=2.0), product of:
0.066068366 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.037635546 = queryNorm
0.116372846 = fieldWeight in 5504, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.046875 = fieldNorm(doc=5504)
0.15188724 = weight(_text_:kongress in 5504) [ClassicSimilarity], result of:
0.15188724 = score(doc=5504,freq=4.0), product of:
0.24693015 = queryWeight, product of:
6.5610886 = idf(docFreq=169, maxDocs=44218)
0.037635546 = queryNorm
0.61510205 = fieldWeight in 5504, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
6.5610886 = idf(docFreq=169, maxDocs=44218)
0.046875 = fieldNorm(doc=5504)
0.2857143 = coord(2/7)
- Series
- Gemeinsamer Kongress der Bundesvereinigung Deutscher Bibliotheksverbände e.V. (BDB) und der Deutschen Gesellschaft für Informationswissenschaft und Informationspraxis e.V. (DGI); Bd.1)(Tagungen der Deutschen Gesellschaft für Informationswissenschaft und Informationspraxis e.V.; Bd.3
- Source
- Information und Öffentlichkeit: 1. Gemeinsamer Kongress der Bundesvereinigung Deutscher Bibliotheksverbände e.V. (BDB) und der Deutschen Gesellschaft für Informationswissenschaft und Informationspraxis e.V. (DGI), Leipzig, 20.-23.3.2000. Zugleich 90. Deutscher Bibliothekartag, 52. Jahrestagung der Deutschen Gesellschaft für Informationswissenschaft und Informationspraxis e.V. (DGI). Hrsg.: G. Ruppelt u. H. Neißer
-
Wenzel, F.: Semantische Eingrenzung im Freitext-Retrieval auf der Basis morphologischer Segmentierungen (1980)
0.04
0.041085556 = product of:
0.09586629 = sum of:
0.012814272 = weight(_text_:information in 2037) [ClassicSimilarity], result of:
0.012814272 = score(doc=2037,freq=2.0), product of:
0.066068366 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.037635546 = queryNorm
0.19395474 = fieldWeight in 2037, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.078125 = fieldNorm(doc=2037)
0.065900736 = weight(_text_:retrieval in 2037) [ClassicSimilarity], result of:
0.065900736 = score(doc=2037,freq=6.0), product of:
0.11384433 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.037635546 = queryNorm
0.5788671 = fieldWeight in 2037, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.078125 = fieldNorm(doc=2037)
0.017151278 = product of:
0.051453833 = sum of:
0.051453833 = weight(_text_:29 in 2037) [ClassicSimilarity], result of:
0.051453833 = score(doc=2037,freq=2.0), product of:
0.13239008 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.037635546 = queryNorm
0.38865322 = fieldWeight in 2037, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.078125 = fieldNorm(doc=2037)
0.33333334 = coord(1/3)
0.42857143 = coord(3/7)
- Abstract
- The basic problem in freetext retrieval is that the retrieval language is not properly adapted to that of the author. Morphological segmentation, where words with the same root are grouped together in the inverted file, is a good eliminator of noise and information loss, providing high recall but low precision
- Source
- Nachrichten für Dokumentation. 31(1980) H.1, S.29-35
-
Mauldin, M.L.: Conceptual information retrieval : a case study in adaptive partial parsing (1991)
0.04
0.038557813 = product of:
0.13495234 = sum of:
0.034000106 = weight(_text_:information in 121) [ClassicSimilarity], result of:
0.034000106 = score(doc=121,freq=22.0), product of:
0.066068366 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.037635546 = queryNorm
0.51462007 = fieldWeight in 121, product of:
4.690416 = tf(freq=22.0), with freq of:
22.0 = termFreq=22.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0625 = fieldNorm(doc=121)
0.10095224 = weight(_text_:retrieval in 121) [ClassicSimilarity], result of:
0.10095224 = score(doc=121,freq=22.0), product of:
0.11384433 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.037635546 = queryNorm
0.88675684 = fieldWeight in 121, product of:
4.690416 = tf(freq=22.0), with freq of:
22.0 = termFreq=22.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.0625 = fieldNorm(doc=121)
0.2857143 = coord(2/7)
- LCSH
- FERRET (Information retrieval system)
Information storage and retrieval
- RSWK
- Freitextsuche / Information Retrieval
Information Retrieval / Expertensystem
Syntaktische Analyse Information Retrieval
- Subject
- Freitextsuche / Information Retrieval
Information Retrieval / Expertensystem
Syntaktische Analyse Information Retrieval
FERRET (Information retrieval system)
Information storage and retrieval
-
Rindflesch, T.C.; Aronson, A.R.: Semantic processing in information retrieval (1993)
0.04
0.03835719 = product of:
0.089500114 = sum of:
0.017939983 = weight(_text_:information in 4121) [ClassicSimilarity], result of:
0.017939983 = score(doc=4121,freq=8.0), product of:
0.066068366 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.037635546 = queryNorm
0.27153665 = fieldWeight in 4121, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=4121)
0.059554238 = weight(_text_:retrieval in 4121) [ClassicSimilarity], result of:
0.059554238 = score(doc=4121,freq=10.0), product of:
0.11384433 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.037635546 = queryNorm
0.5231199 = fieldWeight in 4121, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.0546875 = fieldNorm(doc=4121)
0.0120058935 = product of:
0.03601768 = sum of:
0.03601768 = weight(_text_:29 in 4121) [ClassicSimilarity], result of:
0.03601768 = score(doc=4121,freq=2.0), product of:
0.13239008 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.037635546 = queryNorm
0.27205724 = fieldWeight in 4121, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0546875 = fieldNorm(doc=4121)
0.33333334 = coord(1/3)
0.42857143 = coord(3/7)
- Abstract
- Intuition suggests that one way to enhance the information retrieval process would be the use of phrases to characterize the contents of text. A number of researchers, however, have noted that phrases alone do not improve retrieval effectiveness. In this paper we briefly review the use of phrases in information retrieval and then suggest extensions to this paradigm using semantic information. We claim that semantic processing, which can be viewed as expressing relations between the concepts represented by phrases, will in fact enhance retrieval effectiveness. The availability of the UMLS® domain model, which we exploit extensively, significantly contributes to the feasibility of this processing.
- Date
- 29. 6.2015 14:51:28
-
Rau, L.F.: Conceptual information extraction and retrieval from natural language input (198)
0.04
0.038177624 = product of:
0.08908112 = sum of:
0.018122118 = weight(_text_:information in 1955) [ClassicSimilarity], result of:
0.018122118 = score(doc=1955,freq=4.0), product of:
0.066068366 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.037635546 = queryNorm
0.27429342 = fieldWeight in 1955, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.078125 = fieldNorm(doc=1955)
0.05380772 = weight(_text_:retrieval in 1955) [ClassicSimilarity], result of:
0.05380772 = score(doc=1955,freq=4.0), product of:
0.11384433 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.037635546 = queryNorm
0.47264296 = fieldWeight in 1955, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.078125 = fieldNorm(doc=1955)
0.017151278 = product of:
0.051453833 = sum of:
0.051453833 = weight(_text_:29 in 1955) [ClassicSimilarity], result of:
0.051453833 = score(doc=1955,freq=2.0), product of:
0.13239008 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.037635546 = queryNorm
0.38865322 = fieldWeight in 1955, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.078125 = fieldNorm(doc=1955)
0.33333334 = coord(1/3)
0.42857143 = coord(3/7)
- Date
- 16. 8.1998 13:29:20
- Footnote
- Wiederabgedruckt in: Readings in information retrieval. Ed.: K. Sparck Jones u. P. Willett. San Francisco: Morgan Kaufmann 1997. S.527-533
-
Byrne, C.C.; McCracken, S.A.: ¬An adaptive thesaurus employing semantic distance, relational inheritance and nominal compound interpretation for linguistic support of information retrieval (1999)
0.04
0.037628703 = product of:
0.0878003 = sum of:
0.021746542 = weight(_text_:information in 4483) [ClassicSimilarity], result of:
0.021746542 = score(doc=4483,freq=4.0), product of:
0.066068366 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.037635546 = queryNorm
0.3291521 = fieldWeight in 4483, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.09375 = fieldNorm(doc=4483)
0.045657367 = weight(_text_:retrieval in 4483) [ClassicSimilarity], result of:
0.045657367 = score(doc=4483,freq=2.0), product of:
0.11384433 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.037635546 = queryNorm
0.40105087 = fieldWeight in 4483, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.09375 = fieldNorm(doc=4483)
0.020396389 = product of:
0.061189163 = sum of:
0.061189163 = weight(_text_:22 in 4483) [ClassicSimilarity], result of:
0.061189163 = score(doc=4483,freq=2.0), product of:
0.13179328 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.037635546 = queryNorm
0.46428138 = fieldWeight in 4483, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.09375 = fieldNorm(doc=4483)
0.33333334 = coord(1/3)
0.42857143 = coord(3/7)
- Date
- 15. 3.2000 10:22:37
- Source
- Journal of information science. 25(1999) no.2, S.113-131
-
Liu, S.; Liu, F.; Yu, C.; Meng, W.: ¬An effective approach to document retrieval via utilizing WordNet and recognizing phrases (2004)
0.04
0.03590283 = product of:
0.08377327 = sum of:
0.012814272 = weight(_text_:information in 4078) [ClassicSimilarity], result of:
0.012814272 = score(doc=4078,freq=2.0), product of:
0.066068366 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.037635546 = queryNorm
0.19395474 = fieldWeight in 4078, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.078125 = fieldNorm(doc=4078)
0.05380772 = weight(_text_:retrieval in 4078) [ClassicSimilarity], result of:
0.05380772 = score(doc=4078,freq=4.0), product of:
0.11384433 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.037635546 = queryNorm
0.47264296 = fieldWeight in 4078, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.078125 = fieldNorm(doc=4078)
0.017151278 = product of:
0.051453833 = sum of:
0.051453833 = weight(_text_:29 in 4078) [ClassicSimilarity], result of:
0.051453833 = score(doc=4078,freq=2.0), product of:
0.13239008 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.037635546 = queryNorm
0.38865322 = fieldWeight in 4078, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.078125 = fieldNorm(doc=4078)
0.33333334 = coord(1/3)
0.42857143 = coord(3/7)
- Date
- 10.10.2005 10:29:08
- Source
- SIGIR'04: Proceedings of the 27th Annual International ACM-SIGIR Conference an Research and Development in Information Retrieval. Ed.: K. Järvelin, u.a
-
Doszkocs, T.E.; Zamora, A.: Dictionary services and spelling aids for Web searching (2004)
0.03
0.03306582 = product of:
0.07715358 = sum of:
0.009061059 = weight(_text_:information in 2541) [ClassicSimilarity], result of:
0.009061059 = score(doc=2541,freq=4.0), product of:
0.066068366 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.037635546 = queryNorm
0.13714671 = fieldWeight in 2541, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0390625 = fieldNorm(doc=2541)
0.02690386 = weight(_text_:retrieval in 2541) [ClassicSimilarity], result of:
0.02690386 = score(doc=2541,freq=4.0), product of:
0.11384433 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.037635546 = queryNorm
0.23632148 = fieldWeight in 2541, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.0390625 = fieldNorm(doc=2541)
0.041188654 = product of:
0.06178298 = sum of:
0.025726916 = weight(_text_:29 in 2541) [ClassicSimilarity], result of:
0.025726916 = score(doc=2541,freq=2.0), product of:
0.13239008 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.037635546 = queryNorm
0.19432661 = fieldWeight in 2541, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0390625 = fieldNorm(doc=2541)
0.03605606 = weight(_text_:22 in 2541) [ClassicSimilarity], result of:
0.03605606 = score(doc=2541,freq=4.0), product of:
0.13179328 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.037635546 = queryNorm
0.27358043 = fieldWeight in 2541, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0390625 = fieldNorm(doc=2541)
0.6666667 = coord(2/3)
0.42857143 = coord(3/7)
- Abstract
- The Specialized Information Services Division (SIS) of the National Library of Medicine (NLM) provides Web access to more than a dozen scientific databases on toxicology and the environment on TOXNET . Search queries on TOXNET often include misspelled or variant English words, medical and scientific jargon and chemical names. Following the example of search engines like Google and ClinicalTrials.gov, we set out to develop a spelling "suggestion" system for increased recall and precision in TOXNET searching. This paper describes development of dictionary technology that can be used in a variety of applications such as orthographic verification, writing aid, natural language processing, and information storage and retrieval. The design of the technology allows building complex applications using the components developed in the earlier phases of the work in a modular fashion without extensive rewriting of computer code. Since many of the potential applications envisioned for this work have on-line or web-based interfaces, the dictionaries and other computer components must have fast response, and must be adaptable to open-ended database vocabularies, including chemical nomenclature. The dictionary vocabulary for this work was derived from SIS and other databases and specialized resources, such as NLM's Unified Medical Language Systems (UMLS) . The resulting technology, A-Z Dictionary (AZdict), has three major constituents: 1) the vocabulary list, 2) the word attributes that define part of speech and morphological relationships between words in the list, and 3) a set of programs that implements the retrieval of words and their attributes, and determines similarity between words (ChemSpell). These three components can be used in various applications such as spelling verification, spelling aid, part-of-speech tagging, paraphrasing, and many other natural language processing functions.
- Date
- 14. 8.2004 17:22:56
- Source
- Online. 28(2004) no.3, S.22-29
-
Chen, K.-H.: Evaluating Chinese text retrieval with multilingual queries (2002)
0.03
0.03181835 = product of:
0.074242815 = sum of:
0.0089699915 = weight(_text_:information in 1851) [ClassicSimilarity], result of:
0.0089699915 = score(doc=1851,freq=2.0), product of:
0.066068366 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.037635546 = queryNorm
0.13576832 = fieldWeight in 1851, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=1851)
0.053266928 = weight(_text_:retrieval in 1851) [ClassicSimilarity], result of:
0.053266928 = score(doc=1851,freq=8.0), product of:
0.11384433 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.037635546 = queryNorm
0.46789268 = fieldWeight in 1851, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.0546875 = fieldNorm(doc=1851)
0.0120058935 = product of:
0.03601768 = sum of:
0.03601768 = weight(_text_:29 in 1851) [ClassicSimilarity], result of:
0.03601768 = score(doc=1851,freq=2.0), product of:
0.13239008 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.037635546 = queryNorm
0.27205724 = fieldWeight in 1851, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0546875 = fieldNorm(doc=1851)
0.33333334 = coord(1/3)
0.42857143 = coord(3/7)
- Abstract
- This paper reports the design of a Chinese test collection with multilingual queries and the application of this test collection to evaluate information retrieval Systems. The effective indexing units, IR models, translation techniques, and query expansion for Chinese text retrieval are identified. The collaboration of East Asian countries for construction of test collections for cross-language multilingual text retrieval is also discussed in this paper. As well, a tool is designed to help assessors judge relevante and gather the events of relevante judgment. The log file created by this tool will be used to analyze the behaviors of assessors in the future.
- Source
- Knowledge organization. 29(2002) nos.3/4, S.156-170
-
Czejdo. B.D.; Tucci, R.P.: ¬A dataflow graphical language for database applications (1994)
0.03
0.03142337 = product of:
0.0733212 = sum of:
0.018122118 = weight(_text_:information in 559) [ClassicSimilarity], result of:
0.018122118 = score(doc=559,freq=4.0), product of:
0.066068366 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.037635546 = queryNorm
0.27429342 = fieldWeight in 559, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.078125 = fieldNorm(doc=559)
0.038047805 = weight(_text_:retrieval in 559) [ClassicSimilarity], result of:
0.038047805 = score(doc=559,freq=2.0), product of:
0.11384433 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.037635546 = queryNorm
0.33420905 = fieldWeight in 559, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.078125 = fieldNorm(doc=559)
0.017151278 = product of:
0.051453833 = sum of:
0.051453833 = weight(_text_:29 in 559) [ClassicSimilarity], result of:
0.051453833 = score(doc=559,freq=2.0), product of:
0.13239008 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.037635546 = queryNorm
0.38865322 = fieldWeight in 559, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.078125 = fieldNorm(doc=559)
0.33333334 = coord(1/3)
0.42857143 = coord(3/7)
- Abstract
- Discusses a graphical language for information retrieval and processing. A lot of recent activity has occured in the area of improving access to database systems. However, current results are restricted to simple interfacing of database systems. Proposes a graphical language for specifying complex applications
- Date
- 20.10.2000 13:29:46
- Source
- CIT - Journal of computing and information technology. 2(1994) no.1, S.39-50
-
Gachot, D.A.; Lange, E.; Yang, J.: ¬The SYSTRAN NLP browser : an application of machine translation technology in cross-language information retrieval (1998)
0.03
0.030204242 = product of:
0.10571484 = sum of:
0.026633967 = weight(_text_:information in 6213) [ClassicSimilarity], result of:
0.026633967 = score(doc=6213,freq=6.0), product of:
0.066068366 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.037635546 = queryNorm
0.40312737 = fieldWeight in 6213, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.09375 = fieldNorm(doc=6213)
0.07908088 = weight(_text_:retrieval in 6213) [ClassicSimilarity], result of:
0.07908088 = score(doc=6213,freq=6.0), product of:
0.11384433 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.037635546 = queryNorm
0.6946405 = fieldWeight in 6213, product of:
2.4494898 = tf(freq=6.0), with freq of:
6.0 = termFreq=6.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.09375 = fieldNorm(doc=6213)
0.2857143 = coord(2/7)
- Series
- The Kluwer International series on information retrieval
- Source
- Cross-language information retrieval. Ed.: G. Grefenstette
-
Liddy, E.D.: Natural language processing for information retrieval and knowledge discovery (1998)
0.03
0.029837491 = product of:
0.06962081 = sum of:
0.02005751 = weight(_text_:information in 2345) [ClassicSimilarity], result of:
0.02005751 = score(doc=2345,freq=10.0), product of:
0.066068366 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.037635546 = queryNorm
0.3035872 = fieldWeight in 2345, product of:
3.1622777 = tf(freq=10.0), with freq of:
10.0 = termFreq=10.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=2345)
0.037665404 = weight(_text_:retrieval in 2345) [ClassicSimilarity], result of:
0.037665404 = score(doc=2345,freq=4.0), product of:
0.11384433 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.037635546 = queryNorm
0.33085006 = fieldWeight in 2345, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.0546875 = fieldNorm(doc=2345)
0.011897894 = product of:
0.03569368 = sum of:
0.03569368 = weight(_text_:22 in 2345) [ClassicSimilarity], result of:
0.03569368 = score(doc=2345,freq=2.0), product of:
0.13179328 = queryWeight, product of:
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.037635546 = queryNorm
0.2708308 = fieldWeight in 2345, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5018296 = idf(docFreq=3622, maxDocs=44218)
0.0546875 = fieldNorm(doc=2345)
0.33333334 = coord(1/3)
0.42857143 = coord(3/7)
- Abstract
- Natural language processing (NLP) is a powerful technology for the vital tasks of information retrieval (IR) and knowledge discovery (KD) which, in turn, feed the visualization systems of the present and future and enable knowledge workers to focus more of their time on the vital tasks of analysis and prediction
- Date
- 22. 9.1997 19:16:05
- Imprint
- Urbana-Champaign, IL : Illinois University at Urbana-Champaign, Graduate School of Library and Information Science
- Source
- Visualizing subject access for 21st century information resources: Papers presented at the 1997 Clinic on Library Applications of Data Processing, 2-4 Mar 1997, Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign. Ed.: P.A. Cochrane et al
-
Sheremet'eva, S.O.: Teoreticheskie i metodologicheskie problemy inzhenernoi lingvistiki (1998)
0.03
0.02914858 = product of:
0.068013355 = sum of:
0.012814272 = weight(_text_:information in 6316) [ClassicSimilarity], result of:
0.012814272 = score(doc=6316,freq=2.0), product of:
0.066068366 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.037635546 = queryNorm
0.19395474 = fieldWeight in 6316, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.078125 = fieldNorm(doc=6316)
0.038047805 = weight(_text_:retrieval in 6316) [ClassicSimilarity], result of:
0.038047805 = score(doc=6316,freq=2.0), product of:
0.11384433 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.037635546 = queryNorm
0.33420905 = fieldWeight in 6316, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.078125 = fieldNorm(doc=6316)
0.017151278 = product of:
0.051453833 = sum of:
0.051453833 = weight(_text_:29 in 6316) [ClassicSimilarity], result of:
0.051453833 = score(doc=6316,freq=2.0), product of:
0.13239008 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.037635546 = queryNorm
0.38865322 = fieldWeight in 6316, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.078125 = fieldNorm(doc=6316)
0.33333334 = coord(1/3)
0.42857143 = coord(3/7)
- Abstract
- Examines the major topical issues in the area of linguistic engineering: machine translation, text synthesis and information retrieval
- Date
- 6. 3.1999 13:56:29
-
Bowker, L.: Information retrieval in translation memory systems : assessment of current limitations and possibilities for future development (2002)
0.03
0.028976265 = product of:
0.067611285 = sum of:
0.017939983 = weight(_text_:information in 1854) [ClassicSimilarity], result of:
0.017939983 = score(doc=1854,freq=8.0), product of:
0.066068366 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.037635546 = queryNorm
0.27153665 = fieldWeight in 1854, product of:
2.828427 = tf(freq=8.0), with freq of:
8.0 = termFreq=8.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.0546875 = fieldNorm(doc=1854)
0.037665404 = weight(_text_:retrieval in 1854) [ClassicSimilarity], result of:
0.037665404 = score(doc=1854,freq=4.0), product of:
0.11384433 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.037635546 = queryNorm
0.33085006 = fieldWeight in 1854, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.0546875 = fieldNorm(doc=1854)
0.0120058935 = product of:
0.03601768 = sum of:
0.03601768 = weight(_text_:29 in 1854) [ClassicSimilarity], result of:
0.03601768 = score(doc=1854,freq=2.0), product of:
0.13239008 = queryWeight, product of:
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.037635546 = queryNorm
0.27205724 = fieldWeight in 1854, product of:
1.4142135 = tf(freq=2.0), with freq of:
2.0 = termFreq=2.0
3.5176873 = idf(docFreq=3565, maxDocs=44218)
0.0546875 = fieldNorm(doc=1854)
0.33333334 = coord(1/3)
0.42857143 = coord(3/7)
- Abstract
- A translation memory system is a new type of human language technology (HLT) tool that is gaining popularity among translators. Such tools allow translators to store previously translated texts in a type of aligned bilingual database, and to recycle relevant parts of these texts when producing new translations. Currently, these tools retrieve information from the database using superficial character string matching, which often results in poor precision and recall. This paper explains how translation memory systems work, and it considers some possible ways for introducing more sophisticated information retrieval techniques into such systems by taking syntactic and semantic similarity into account. Some of the suggested techniques are inspired by these used in other areas of HLT, and some by techniques used in information science.
- Source
- Knowledge organization. 29(2002) nos.3/4, S.198-203
-
Pirkola, A.; Hedlund, T.; Keskustalo, H.; Järvelin, K.: Dictionary-based cross-language information retrieval : problems, methods, and research findings (2001)
0.03
0.028771937 = product of:
0.10070177 = sum of:
0.025370965 = weight(_text_:information in 3908) [ClassicSimilarity], result of:
0.025370965 = score(doc=3908,freq=4.0), product of:
0.066068366 = queryWeight, product of:
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.037635546 = queryNorm
0.3840108 = fieldWeight in 3908, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
1.7554779 = idf(docFreq=20772, maxDocs=44218)
0.109375 = fieldNorm(doc=3908)
0.07533081 = weight(_text_:retrieval in 3908) [ClassicSimilarity], result of:
0.07533081 = score(doc=3908,freq=4.0), product of:
0.11384433 = queryWeight, product of:
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.037635546 = queryNorm
0.6617001 = fieldWeight in 3908, product of:
2.0 = tf(freq=4.0), with freq of:
4.0 = termFreq=4.0
3.024915 = idf(docFreq=5836, maxDocs=44218)
0.109375 = fieldNorm(doc=3908)
0.2857143 = coord(2/7)
- Source
- Information retrieval. 4(2001), S.209-230