Data Visualization Using Python and R
Data Visualization Using Python and R
Local tools
● Python: Seaborn, Matplotlib, etc.
● R: ggplot, base R, etc.
Enterprise Tools
● Looker / Tableau / Power BI / etc.
Course Materials Release Schedule
The next three weeks worth of content will be released in two 1.5-week segments
rather than three 1-week segments like we have seen previously.
Content for this 1.5 week session: Visualization for Reporting Using Python and R
Content released in 1.5 weeks: Visualization for Reporting Using Enterprise Tools
Assignment due dates, as usual, will be posted with the individual assignments. The
release schedule above relates to content availability and may not align exactly
with assignment release dates.
Data Management
Visualizations Part 1: Introduction and
Data Visualization For Reporting using Python and R
Overview
https://clauswilke.com/dataviz/index.html
Use direct data point labels instead of color coding for more than 8 categories
Use monotonic color scales (light to dark - avoid circular scales like rainbow)
- Line graphs
- Histograms
- Box plots
- Scatter plots
- Legends, titles, labels
In addition to basic build, you should know how to customize these objects, including
size, color, and placement in a Colab or JupyterLab notebook or .RMD file output
Python
Visualizations
Charts in Colaboratory Notebook Walkthrough
https://colab.research.google.com/notebooks/charts.ipynb
Altair Notebook Walkthrough
https://colab.research.google.com/notebooks/snippets/altair.ipynb
Visualizations in Python: Deep Dive Tutorials
https://www.youtube.com/watch?v=Nt84_TzRkbo