Statistical Data Analysis in Python


Editor's note: This tutorial was originally published as course instructional material, and may contain out-of-context references to other courses therein; this takes nothing away from the validity or usefulness of the material. This tutorial will introduce the use of Python for statistical data analysis, using data stored as Pandas DataFrame objects. Much of the work involved in analyzing data resides in importing, cleaning and transforming data in preparation for analysis. Therefore, the first half of the course is comprised of a 2-part overview of basic and intermediate Pandas usage that will show how to effectively manipulate datasets in memory. This includes tasks like indexing, alignment, join/merge methods, date/time types, and handling of missing data.