Google's PageRank and Beyond: The Science of Search Engine Rankings

Front Cover
Princeton University Press, Jul 1, 2011 - Mathematics - 240 pages

Why doesn't your home page appear on the first page of search results, even when you query your own name? How do other web pages always appear at the top? What creates these powerful rankings? And how? The first book ever about the science of web page rankings, Google's PageRank and Beyond supplies the answers to these and other questions and more.


The book serves two very different audiences: the curious science reader and the technical computational reader. The chapters build in mathematical sophistication, so that the first five are accessible to the general academic reader. While other chapters are much more mathematical in nature, each one contains something for both audiences. For example, the authors include entertaining asides such as how search engines make money and how the Great Firewall of China influences research.


The book includes an extensive background chapter designed to help readers learn more about the mathematics of search engines, and it contains several MATLAB codes and links to sample web data sets. The philosophy throughout is to encourage readers to experiment with the ideas and algorithms in the text.


Any business seriously interested in improving its rankings in the major search engines can benefit from the clear examples, sample code, and list of resources provided.


  • Many illustrative examples and entertaining asides

  • MATLAB code

  • Accessible and informal style

  • Complete and self-contained section for mathematics review

 

Contents

Chapter 1 Introduction to Web Search Engines
1
Chapter 2 Crawling Indexing and Query Processing
15
Chapter 3 Ranking Webpages by Popularity
25
Chapter 4 The Mathematics of Googles PageRank
31
Chapter 5 Parameters in the PageRank Model
47
Chapter 6 The Sensitivity of PageRank
57
Chapter 7 The PageRank Problem as a Linear System
71
Chapter 8 Issues in LargeScale Implementation of PageRank
75
Chapter 10 Updating the PageRank Vector
99
Chapter 11 The HITS Method for Ranking Webpages
115
Chapter 12 Other Link Methods for Ranking Webpages
131
Chapter 13 The Future of Web Information Retrieval
139
Chapter 14 Resources for Web Information Retrieval
149
Chapter 15 The Mathematics Guide
153
Chapter 16 Glossary
201
Bibliography
207

Chapter 9 Accelerating the Computation of PageRank
89
Index
219

Other editions - View all

Common terms and phrases

About the author (2011)

Amy N. Langville is Assistant Professor of Mathematics at the College of Charleston in Charleston, South Carolina. She studies mathematical algorithms for information retrieval and text and data mining applications. Carl D. Meyer is Professor of Mathematics at North Carolina State University. In addition to information retrieval, his research areas include numerical analysis, linear algebra, and Markov chains. He is the author of Matrix Analysis and Applied Linear Algebra.

Bibliographic information