Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython

"O'Reilly Media, Inc.", Sep 25, 2017 - Computers - 550 pages

Get complete instructions for manipulating, processing, cleaning, and crunching datasets in Python. Updated for Python 3.6, the second edition of this hands-on guide is packed with practical case studies that show you how to solve a broad set of data analysis problems effectively. You’ll learn the latest versions of pandas, NumPy, IPython, and Jupyter in the process.

Written by Wes McKinney, the creator of the Python pandas project, this book is a practical, modern introduction to data science tools in Python. It’s ideal for analysts new to Python and for Python programmers new to data science and scientific computing. Data files and related material are available on GitHub.

Use the IPython shell and Jupyter notebook for exploratory computing
Learn basic and advanced features in NumPy (Numerical Python)
Get started with data analysis tools in the pandas library
Use flexible tools to load, clean, transform, merge, and reshape data
Create informative visualizations with matplotlib
Apply the pandas groupby facility to slice, dice, and summarize datasets
Analyze and manipulate regular and irregular time series data
Learn how to solve real-world data analysis problems with thorough, detailed examples

Preview this book »

Selected pages

Title Page

Table of Contents

Index

Chapter 1 Preliminaries	1

Chapter 2 Python Language Basics IPython and Jupyter Notebooks	15

Chapter 3 Builtin Data Structures Functions and Files	51

Arrays and Vectorized Computation	87

Chapter 5 Getting Started with pandas	125

Chapter 6 Data Loading Storage and File Formats	169

Chapter 7 Data Cleaning and Preparation	195

Join Combine and Reshape	225

Chapter 11 Time Series	323

Chapter 12 Advanced pandas	369

Chapter 13 Introduction to Modeling Libraries in Python	389

Chapter 14 Data Analysis Examples	409

Appendix A Advanced NumPy	455

Appendix B More on the IPython System	489

Index	511

About the Author	529

Chapter 9 Plotting and Visualization	257

Chapter 10 Data Aggregation and Group Operations	293

Copyright

Other editions - View all

Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython
Wes McKinney
Limited preview - 2017

Python for Data Analysis: Data Wrangling with Pandas, Numpy, and Ipython
Wes McKinney
No preview available - 2017

About the author (2017)

Wes McKinney is a New York?based software developer and entrepreneur. After finishing his undergraduate degree in mathematics at MIT in 2007, he went on to do quantitative finance work at AQR Capital Management in Greenwich, CT. Frustrated by cumbersome data analysis tools, he learned Python and started building what would later become the pandas project. He's now an active member of the Python data community and is an advocate for the use of Python in data analysis, finance, and statistical computing applications. Wes was later the co-founder and CEO of DataPad, whose technology assets and team were acquired by Cloudera in 2014. He has since become involved in big data technology, joining the Project Management Committees for the Apache Arrow and Apache Parquet projects in the Apache Software Foundation. In 2016, he joined Two Sigma Investments in New York City, where he continues working to make data analysis faster and easier through open source software.

Bibliographic information

Title	Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython
Author	Wes McKinney
Publisher	"O'Reilly Media, Inc.", 2017
ISBN	1491957638, 9781491957639
Length	550 pages
Subjects	Computers › Data Science › General Computers / Data Science / Data Modeling & Design Computers / Data Science / General Computers / Languages / Python

Export Citation	BiBTeX EndNote RefMan

About Google Books - Privacy Policy - Terms of Service - Information for Publishers - Report an issue - Help - Google Home

Books

Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython

Selected pages

Contents

Other editions - View all

Common terms and phrases

About the author (2017)

Bibliographic information