Search (8 results, page 1 of 1)

Manning, C.D.; Raghavan, P.; Schütze, H.: Introduction to information retrieval (2008) 0.02
```
0.01596949 = product of:
  0.06387796 = sum of:
    0.06387796 = weight(_text_:term in 4041) [ClassicSimilarity], result of:
      0.06387796 = score(doc=4041,freq=4.0), product of:
        0.21904005 = queryWeight, product of:
          4.66603 = idf(docFreq=1130, maxDocs=44218)
          0.04694356 = queryNorm
        0.29162687 = fieldWeight in 4041, product of:
          2.0 = tf(freq=4.0), with freq of:
            4.0 = termFreq=4.0
          4.66603 = idf(docFreq=1130, maxDocs=44218)
          0.03125 = fieldNorm(doc=4041)
  0.25 = coord(1/4)
```
Content

Inhalt: Boolean retrieval - The term vocabulary & postings lists - Dictionaries and tolerant retrieval - Index construction - Index compression - Scoring, term weighting & the vector space model - Computing scores in a complete search system - Evaluation in information retrieval - Relevance feedback & query expansion - XML retrieval - Probabilistic information retrieval - Language models for information retrieval - Text classification & Naive Bayes - Vector space classification - Support vector machines & machine learning on documents - Flat clustering - Hierarchical clustering - Matrix decompositions & latent semantic indexing - Web search basics - Web crawling and indexes - Link analysis Vgl. die digitale Fassung unter: http://nlp.stanford.edu/IR-book/pdf/irbookprint.pdf.
Kantardzic, M.: Data mining : concepts, models, methods, and algorithms (2003) 0.01
```
0.011292135 = product of:
  0.04516854 = sum of:
    0.04516854 = weight(_text_:term in 2291) [ClassicSimilarity], result of:
      0.04516854 = score(doc=2291,freq=2.0), product of:
        0.21904005 = queryWeight, product of:
          4.66603 = idf(docFreq=1130, maxDocs=44218)
          0.04694356 = queryNorm
        0.20621133 = fieldWeight in 2291, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.66603 = idf(docFreq=1130, maxDocs=44218)
          0.03125 = fieldNorm(doc=2291)
  0.25 = coord(1/4)
```
Abstract

This book offers a comprehensive introduction to the exploding field of data mining. We are surrounded by data, numerical and otherwise, which must be analyzed and processed to convert it into information that informs, instructs, answers, or otherwise aids understanding and decision-making. Due to the ever-increasing complexity and size of today's data sets, a new term, data mining, was created to describe the indirect, automatic data analysis techniques that utilize more complex and sophisticated tools than those which analysts used in the past to do mere data analysis. "Data Mining: Concepts, Models, Methods, and Algorithms" discusses data mining principles and then describes representative state-of-the-art methods and algorithms originating from different disciplines such as statistics, machine learning, neural networks, fuzzy logic, and evolutionary computation. Detailed algorithms are provided with necessary explanations and illustrative examples. This text offers guidance: how and when to use a particular software tool (with their companion data sets) from among the hundreds offered when faced with a data set to mine. This allows analysts to create and perform their own data mining experiments using their knowledge of the methodologies and techniques provided. This book emphasizes the selection of appropriate methodologies and data analysis software, as well as parameter tuning. These critically important, qualitative decisions can only be made with the deeper understanding of parameter meaning and its role in the technique that is offered here. Data mining is an exploding field and this book offers much-needed guidance to selecting among the numerous analysis programs that are available.
Pomerantz, J.: Metadata (2015) 0.01
```
0.011292135 = product of:
  0.04516854 = sum of:
    0.04516854 = weight(_text_:term in 3800) [ClassicSimilarity], result of:
      0.04516854 = score(doc=3800,freq=2.0), product of:
        0.21904005 = queryWeight, product of:
          4.66603 = idf(docFreq=1130, maxDocs=44218)
          0.04694356 = queryNorm
        0.20621133 = fieldWeight in 3800, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          4.66603 = idf(docFreq=1130, maxDocs=44218)
          0.03125 = fieldNorm(doc=3800)
  0.25 = coord(1/4)
```
Abstract

When "metadata" became breaking news, appearing in stories about surveillance by the National Security Agency, many members of the public encountered this once-obscure term from information science for the first time. Should people be reassured that the NSA was "only" collecting metadata about phone calls -- information about the caller, the recipient, the time, the duration, the location -- and not recordings of the conversations themselves? Or does phone call metadata reveal more than it seems? In this book, Jeffrey Pomerantz offers an accessible and concise introduction to metadata. In the era of ubiquitous computing, metadata has become infrastructural, like the electrical grid or the highway system. We interact with it or generate it every day. It is not, Pomerantz tell us, just "data about data." It is a means by which the complexity of an object is represented in a simpler form. For example, the title, the author, and the cover art are metadata about a book. When metadata does its job well, it fades into the background; everyone (except perhaps the NSA) takes it for granted. Pomerantz explains what metadata is, and why it exists. He distinguishes among different types of metadata -- descriptive, administrative, structural, preservation, and use -- and examines different users and uses of each type. He discusses the technologies that make modern metadata possible, and he speculates about metadata's future. By the end of the book, readers will see metadata everywhere. Because, Pomerantz warns us, it's metadata's world, and we are just living in it.
Colomb, R.M.: Information spaces : the architecture of cyberspace (2002) 0.01
```
0.009880973 = product of:
  0.039523892 = sum of:
    0.039523892 = product of:
      0.079047784 = sum of:
        0.079047784 = weight(_text_:assessment in 262) [ClassicSimilarity], result of:
          0.079047784 = score(doc=262,freq=2.0), product of:
            0.25917634 = queryWeight, product of:
              5.52102 = idf(docFreq=480, maxDocs=44218)
              0.04694356 = queryNorm
            0.30499613 = fieldWeight in 262, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.52102 = idf(docFreq=480, maxDocs=44218)
              0.0390625 = fieldNorm(doc=262)
      0.5 = coord(1/2)
  0.25 = coord(1/4)
```
Abstract

The Architecture of Cyberspace is aimed at students taking information management as a minor in their course as well as those who manage document collections but who are not professional librarians. The first part of this book looks at how users find documents and the problems they have; the second part discusses how to manage the information space using various tools such as classification and controlled vocabularies. It also explores the general issues of publishing, including legal considerations, as well the main issues of creating and managing archives. Supported by exercises and discussion questions at the end of each chapter, the book includes some sample assignments suitable for use with students of this subject. A glossary is also provided to help readers understand the specialised vocabulary and the key concepts in the design and assessment of information spaces.
Ceri, S.; Bozzon, A.; Brambilla, M.; Della Valle, E.; Fraternali, P.; Quarteroni, S.: Web Information Retrieval (2013) 0.01
```
0.008714426 = product of:
  0.017428853 = sum of:
    0.0047084456 = product of:
      0.018833783 = sum of:
        0.018833783 = weight(_text_:based in 1082) [ClassicSimilarity], result of:
          0.018833783 = score(doc=1082,freq=2.0), product of:
            0.14144066 = queryWeight, product of:
              3.0129938 = idf(docFreq=5906, maxDocs=44218)
              0.04694356 = queryNorm
            0.13315678 = fieldWeight in 1082, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0129938 = idf(docFreq=5906, maxDocs=44218)
              0.03125 = fieldNorm(doc=1082)
      0.25 = coord(1/4)
    0.012720408 = product of:
      0.025440816 = sum of:
        0.025440816 = weight(_text_:22 in 1082) [ClassicSimilarity], result of:
          0.025440816 = score(doc=1082,freq=2.0), product of:
            0.16438834 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04694356 = queryNorm
            0.15476047 = fieldWeight in 1082, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.03125 = fieldNorm(doc=1082)
      0.5 = coord(1/2)
  0.5 = coord(2/4)
```
Abstract

With the proliferation of huge amounts of (heterogeneous) data on the Web, the importance of information retrieval (IR) has grown considerably over the last few years. Big players in the computer industry, such as Google, Microsoft and Yahoo!, are the primary contributors of technology for fast access to Web-based information; and searching capabilities are now integrated into most information systems, ranging from business management software and customer relationship systems to social networks and mobile phone applications. Ceri and his co-authors aim at taking their readers from the foundations of modern information retrieval to the most advanced challenges of Web IR. To this end, their book is divided into three parts. The first part addresses the principles of IR and provides a systematic and compact description of basic information retrieval techniques (including binary, vector space and probabilistic models as well as natural language search processing) before focusing on its application to the Web. Part two addresses the foundational aspects of Web IR by discussing the general architecture of search engines (with a focus on the crawling and indexing processes), describing link analysis methods (specifically Page Rank and HITS), addressing recommendation and diversification, and finally presenting advertising in search (the main source of revenues for search engines). The third and final part describes advanced aspects of Web search, each chapter providing a self-contained, up-to-date survey on current Web research directions. Topics in this part include meta-search and multi-domain search, semantic search, search in the context of multimedia data, and crowd search. The book is ideally suited to courses on information retrieval, as it covers all Web-independent foundational aspects. Its presentation is self-contained and does not require prior background knowledge. It can also be used in the context of classic courses on data management, allowing the instructor to cover both structured and unstructured data in various formats. Its classroom use is facilitated by a set of slides, which can be downloaded from www.search-computing.org.

Date

16.10.2013 19:22:44

Dominich, S.: Mathematical foundations of information retrieval (2001) 0.00

0.003975128 = product of:
  0.015900511 = sum of:
    0.015900511 = product of:
      0.031801023 = sum of:
        0.031801023 = weight(_text_:22 in 1753) [ClassicSimilarity], result of:
          0.031801023 = score(doc=1753,freq=2.0), product of:
            0.16438834 = queryWeight, product of:
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.04694356 = queryNorm
            0.19345059 = fieldWeight in 1753, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.5018296 = idf(docFreq=3622, maxDocs=44218)
              0.0390625 = fieldNorm(doc=1753)
      0.5 = coord(1/2)
  0.25 = coord(1/4)

Date: 22. 3.2008 12:26:32

Social information retrieval systems : emerging technologies and applications for searching the Web effectively (2008) 0.00
```
0.002632101 = product of:
  0.010528404 = sum of:
    0.010528404 = product of:
      0.042113617 = sum of:
        0.042113617 = weight(_text_:based in 4127) [ClassicSimilarity], result of:
          0.042113617 = score(doc=4127,freq=10.0), product of:
            0.14144066 = queryWeight, product of:
              3.0129938 = idf(docFreq=5906, maxDocs=44218)
              0.04694356 = queryNorm
            0.2977476 = fieldWeight in 4127, product of:
              3.1622777 = tf(freq=10.0), with freq of:
                10.0 = termFreq=10.0
              3.0129938 = idf(docFreq=5906, maxDocs=44218)
              0.03125 = fieldNorm(doc=4127)
      0.25 = coord(1/4)
  0.25 = coord(1/4)
```
Content

Inhalt Collaborating to search effectively in different searcher modes through cues and specialty search / Naresh Kumar Agarwal and Danny C.C. Poo -- Collaborative querying using a hybrid content and results-based approach / Chandrani Sinha Ray ... [et al.] -- Collaborative classification for group-oriented organization of search results / Keiichi Nakata and Amrish Singh -- A case study of use-centered descriptions : archival descriptions of what can be done with a collection / Richard Butterworth -- Metadata for social recommendations : storing, sharing, and reusing evaluations of learning resources / Riina Vuorikari, Nikos Manouselis, and Erik Duval -- Social network models for enhancing reference-based search engine rankings / Nikolaos Korfiatis ... [et al.] -- From PageRank to social rank : authority-based retrieval in social information spaces / Sebastian Marius Kirsch ... [et al.] -- Adaptive peer-to-peer social networks for distributed content-based Web search / Le-Shin Wu ... [et al.] -- The ethics of social information retrieval / Brendan Luyt and Chu Keong Lee -- The social context of knowledge / Daniel Memmi -- Social information seeking in digital libraries / George Buchanan and Annika Hinze -- Relevant intra-actions in networked environments / Theresa Dirndorfer Anderson -- Publication and citation analysis as a tool for information retrieval / Ronald Rousseau -- Personalized information retrieval in a semantic-based learning environment / Antonella Carbonaro and Rodolfo Ferrini -- Multi-agent tourism system (MATS) / Soe Yu Maw and Myo-Myo Naing -- Hybrid recommendation systems : a case study on the movies domain / Konstantinos Markellos ... [et al.].
Croft, W.B.; Metzler, D.; Strohman, T.: Search engines : information retrieval in practice (2010) 0.00
```
0.0017656671 = product of:
  0.0070626684 = sum of:
    0.0070626684 = product of:
      0.028250674 = sum of:
        0.028250674 = weight(_text_:based in 2605) [ClassicSimilarity], result of:
          0.028250674 = score(doc=2605,freq=2.0), product of:
            0.14144066 = queryWeight, product of:
              3.0129938 = idf(docFreq=5906, maxDocs=44218)
              0.04694356 = queryNorm
            0.19973516 = fieldWeight in 2605, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              3.0129938 = idf(docFreq=5906, maxDocs=44218)
              0.046875 = fieldNorm(doc=2605)
      0.25 = coord(1/4)
  0.25 = coord(1/4)
```
Abstract

For introductory information retrieval courses at the undergraduate and graduate level in computer science, information science and computer engineering departments. Written by a leader in the field of information retrieval, Search Engines: Information Retrieval in Practice, is designed to give undergraduate students the understanding and tools they need to evaluate, compare and modify search engines. Coverage of the underlying IR and mathematical models reinforce key concepts. The book's numerous programming exercises make extensive use of Galago, a Java-based open source search engine. SUPPLEMENTS / Extensive lecture slides (in PDF and PPT format) / Solutions to selected end of chapter problems (Instructors only) / Test collections for exercises / Galago search engine

Search (8 results, page 1 of 1)

Authors

Years

Types

Themes

Subjects

Classifications