Information Retrieval: Data Structures & Algorithms

Front Cover
William Bruce Frakes, Ricardo Baeza-Yates
Prentice Hall, 1992 - Computers - 504 pages
0 Reviews

Information retrieval is a sub-field of computer science that deals with the automated storage and retrieval of documents. Providing the latest information retrieval techniques, this guide discusses Information Retrieval data structures and algorithms, including implementations in C. Aimed at software engineers building systems with book processing components, it provides a descriptive and evaluative explanation of storage and retrieval systems, file structures, term and query operations, document operations and hardware. Contains techniques for handling inverted files, signature files, and file organizations for optical disks. Discusses such operations as lexical analysis and stoplists, stemming algorithms, thesaurus construction, and relevance feedback and other query modification techniques. Provides information on Boolean operations, hashing algorithms, ranking algorithms and clustering algorithms. In addition to being of interest to software engineering professionals, this book will be useful to information science and library science professionals who are interested in text retrieval technology.

From inside the book

What people are saying - Write a review

We haven't found any reviews in the usual places.

Contents

FILE STRUCTURES
28
Signature Files
44
TERM AND QUERY OPERATIONS
102
Copyright

11 other sections not shown

Other editions - View all

Common terms and phrases

About the author (1992)

Ricardo Baeza-Yates has been VP of Research and Chief Research Scientist at Yahoo Labs, based in Sunnyvale, California, since August 2014. Before that, he founded and led the labs in Barcelona and Santiago de Chile from 2006 2015. Between 2008 and 2012 he also oversaw the Haifa lab. In addition, he is also a part-time Professor at the Department of Information and Communication Technologies of the Universitat Pompeu Fabra, in Barcelona, Spain, where in 2005 he was an ICREA research professor. Until 2004 he was a Professor, and before that founder and Director, of the Center for Web Research at the Department of Computing Science of the University of Chile (from where he is currently on a leave of absence). In 1989, he obtained a Ph.D. in computer science from the University of Waterloo, Canada. Before that, he obtained two master degrees (M.Sc. CS & M.Eng. EE) and an electronic engineering degree from the University of Chile in Santiago. He is co-author of the best-seller Modern Information Retrieval textbook, published in 1999 by Addison-Wesley, with a second enlarged edition in 2011, that won the ASIST 2012 Book of the Year award. He is also co-author of the 2nd edition of the Handbook of Algorithms and Data Structures, Addison-Wesley, 1991 and co-editor of Information Retrieval: Algorithms and Data Structures, Prentice-Hall, 1992. In addition, he is the author or co-author of more than 500 other publications. From 2002-2004, he was elected to the board of governors of the IEEE Computer Society and in 2012 he was elected for the ACM Council. He received the Organization of American States award for young researchers in exact sciences (1993), the Graham Medal for innovation in computing given by the University of Waterloo to distinguished ex-alumni (2007), the CLEI Latin American distinction for contributions to CS in the region (2009), and the National Award of the Chilean Association of Engineers (2010), among other distinctions. In 2003 he was the first computer scientist to be elected to the Chilean Academy of Sciences and since 2010 has been a founding member of the Chilean Academy of Engineering. In 2009 he was named ACM Fellow and in 2011 IEEE Fellow.

Bibliographic information