Python for Data Analysis, 3E

About the Open Edition

The 3rd edition of Python for Data Analysis is now available as an “Open Access” HTML version on this site in addition to the usual print and e-book formats. This edition was initially published in August 2022 and will have errata fixed periodically over the coming months and years. If you encounter any errata, please report them here.

In general, the content from this website may not be copied or reproduced. The code examples are MIT-licensed and can be found on GitHub or Gitee along with the supporting datasets.

If you find the online edition of the book useful, please consider ordering a paper copy or a DRM-free e-book (in PDF and EPUB formats) to support the author.

This web version of the book was created with the Quarto publishing system.

What’s New in the 3rd Edition?

The book has been updated for pandas 2.0.0 and Python 3.10. The changes between the 2nd and 3rd editions focus on bringing the content up to date with changes in pandas since 2017.

Update History

This website is updated periodically as new early release content becomes available, and after publication to fix errata.

  • April 12, 2023: Update to pandas 2.0.0 and fix some code examples.
  • October 19, 2022: Fix a table link and add links.
  • September 20, 2022: Website update after final publication including a couple of minor errata fixes.
  • July 22, 2022: Incorporate copy-editing and other improvements for “QC1” stage of production en route to publication in print later this summer.
  • May 18, 2022: Update open access edition with all chapters. Include edits from technical review feedback (thank you!), acknowledgements for the third edition, and other preparation to make the book ready for production on its way to print later in 2022.
  • February 13, 2022: Update open access edition with chapters 7 through 10.
  • January 23, 2022: First open access edition with chapters 1 through 6.