Data on the Web: From Relations to Semistructured Data and XML

Front Cover
Morgan Kaufmann, 2000 - Computers - 258 pages

The Web is causing a revolution in how we represent, retrieve, and process information Its growth has given us a universally accessible database-but in the form of a largely unorganized collection of documents. This is changing, thanks to the simultaneous emergence of new ways of representing data: from within the Web community, XML; and from within the database community, semistructured data. The convergence of these two approaches has rendered them nearly identical. Now, there is a concerted effort to develop effective techniques for retrieving and processing both kinds of data.

Data on the Web is the only comprehensive, up-to-date examination of these rapidly evolving retrieval and processing strategies, which are of critical importance for almost all Web- and data-intensive enterprises. This book offers detailed solutions to a wide range of practical problems while equipping you with a keen understanding of the fundamental issues-including data models, query languages, and schemas-involved in their design, implementation, and optimization. You'll find it to be compelling reading, whether your interest is that of a practitioner involved in a database-driven Web enterprise or a researcher in computer science or related field.


  • Provides an in-depth look at XML and other technologies for publishing structured documents on the Web.
  • Examines recently developed methods for querying and updating structured Web documents and semistructured data, including XML-QL and XSL.
  • Looks deeper into the convergence of Web and database approaches to semistructured data presentation and querying.
  • Details practical examples of how these techniques are already being applied-and how they will be used in the near future.
  • Teaches sound techniques for writing queries over Web data, describing loose schemas over partially structured data, and implementing and optimizing queries on semistructured data.

What people are saying - Write a review

Data on the Web : From Relations to Semistructured Data and XML (The Morgan Kaufmann Series in Data Management Systems)

User Review  - Not Available - Book Verdict

Most data on the web are not well structured, making the search and retrieval process difficult since the spiders, robots, and other search engines don't really understand the context of the data they ... Read full review


A Syntax for Data
Query Languages for XML
Interpretation and Advanced Features
Typing Semistructured Data
Query Processing
The Lore System

Common terms and phrases

About the author (2000)

Serge Abiteboul is Senior Researcher at I.N.R.I.A. and a professor at the cole Polytechnique. He received his Ph.D. in computer science from the University of Southern California in 1982 and his Th se d'Etat from the University of Paris XI in 1986. His recent research has focused on object databases, digital libraries, semistructured data, data integration, and electronic commerce.

Peter Buneman is a professor in the Computer and Information Science Department at the University of Pennsylvania. He earned his undergraduate degree from Cambridge and his Ph.D. from the University of Warwick. His research interests include databases, programming languages, cognitive science, and classification theory.

Dan Suciu is a researcher at AT&T Labs who received his Ph.D. from the University of Pennsylvania in 1995. He has devoted his recent research and publications to various aspects of semistructured data, organizing several workshops on the topic and serving on the committees of ICDT, PODS, and EDBT.

Bibliographic information