The Fourth Paradigm: Data-intensive Scientific DiscoveryAnthony J. G. Hey, Stewart Tansley, Kristin Michele Tolle This book presents the first broad look at the rapidly emerging field of data-intensive science, with the goal of influencing the worldwide scientific and computing research communities and inspiring the next generation of scientists. Increasingly, scientific breakthroughs will be powered by advanced computing capabilities that help researchers manipulate and explore massive datasets. The speed at which any given scientific discipline advances will depend on how well its researchers collaborate with one another, and with technologists, in areas of eScience such as databases, workflow management, visualization, and cloud-computing technologies. This collection of essays expands on the vision of pioneering computer scientist Jim Gray for a new, fourth paradigm of discovery based on data-intensive science and offers insights into how it can be fully realized. |
Contents
555 | 51 |
SCIENTIFIC INFRASTRUCTURE | 107 |
INTRODUCTION Daron Green | 109 |
Copyright | |
23 other sections not shown
Common terms and phrases
algorithms analyze applications approach archives automated biological capture challenges clinical cloud cloud computing collaboration complex computer science create curation cyberinfrastructure data analysis data deluge data management data sharing data-centric data-intensive science database datasets discovery distributed domain Earth Electronic Health Records emerging enable engine environment environmental eScience example exploration Figure fourth paradigm framework gene genome GEOSS global Harry Pearce healthcare human images individual infrastructure integration interactions Internet interoperability Jim Gray knowledge laboratory language Large Hadron Collider large-scale literature MapReduce metadata methods Microsoft Research models multicore National neurons next-generation nodes ocean open access parallel patient petabytes pi-calculus potential programming projects provenance PubMed Central queries real-time scale scholarly communication scientists semantic Semantic Web sensor sequence simulation structure Szalay technologies terabytes tion University VisTrails visualization workflows