M06 Pandas
DS 5100: Programming for Data Science
Twitter
Facebook
LinkedIn
Welcome
Syllabus
Schedule
Final Project
Final Project Report
M01 Getting Started
About Rivanna
Using Unix
SSH for GitHub
Git and GitHub
Activity: Using Rivanna
Activity: Using Git and GitHub
M02 Introducing Python
Data and Code
Python Object Types
Activity: Hello, World!
Activity: Jupyter Lab
NB: Data Types, Operators, and Expressions
NB: Numbers
NB: Booleans
NB: Strings
NB: Structures
M03 Control Structures
Values, Variables, Expressions, and Statements
NB: Control Structures
NB: Iterables and Iterators
NB: Comprehensions
M04 Functions
NB: Introduction to Functions
NB: Importing Functions
NB: Lambda Functions
NB: Recursion
NB: Variable Scope
NB: Functions Calling Functions
M05 NumPy
NB: NumPy First Steps
NB: NumPy Continued
NB: Python Timing Experiment
NB: Basic File I/O
M06 Pandas
NB: Introducting Pandas
NB: Introducing Pandas II
NB: Narrow vs Wide Tables
NB: Method Chaining
NB: Pandas and SQL
M07 Classes
What is OOP?
NB: Introducing Classes
NB: Classes Example
NB: More About Classes
NB: Understanding Class Attributes
M08 Testing
Testing and Verification
NB: Exception Handling
NB: Unit Testing with Unittest
M09 Modules
NB: Packages, Modules, and Imports
NB: More on Classes and Globals
NB: Building Your Own Package
NB: Demonstrations of Import and
init
.py()
Review
M10 Basic R
NB: Getting Started
NB: Control Structures and Functions
M11 Dplyr
NB: Introducing Tibbles
NB: The
dplyr
package
M12 Visualization
NB: GGPlot2
NB: R Markdown
NB: Loose Ends
NB: GGPlot in Python with Plotnine
M13 Agile
Project Management
Agile Development
User Stories
Appendices
Setting Up O’Reilly
Refreshing the Course Repo
Table of contents
Topics
Readings
Required
Edit this page
M06 Pandas
Topics
Introduce Pandas and its relationship to NumPy
Understand difference between Series and DataFrames
Understand role of indexes in Pandas
Review fundamental methods in Pandas
Readings
Required
McKinney 2017, Chapter 5: Getting Started with Pandas
Getting Started in Pandas (Official)
NB: Basic File I/O
NB: Introducting Pandas