
Python Data Science Intro To ML - Course 01

This document outlines a 5-day Python data science course covering key Python libraries. The course covers IPython for interactive Python, NumPy for numerical computing, Pandas for data manipulation, Matplotlib for visualization, and Scikit-Learn and statsmodels for machine learning. Specific topics include NumPy arrays, Pandas data structures, Matplotlib plotting, linear regression, logistic regression, decision trees, and clustering. The goal is to equip students with tools for data exploration, cleaning, analysis, and modeling in Python.

Uploaded by

kPrasad8
Copyright
© All Rights Reserved

Python Data Science (5 days)

Table of Contents
Preface
Installation and Configuration: Python Anaconda Distribution
1. IPython: Beyond Normal Python
 Help and Documentation in IPython
 Keyboard Shortcuts in the IPython Shell
 IPython Magic Commands
 Input and Output History
 IPython and Shell Commands
 Errors and Debugging
 Profiling and Timing Code
 More IPython Resources
2. Introduction to NumPy
 Understanding Data Types in Python
 The Basics of NumPy Arrays
 Computation on NumPy Arrays: Universal Functions
 Aggregations: Min, Max, and Everything In Between
 Computation on Arrays: Broadcasting
 Comparisons, Masks, and Boolean Logic
 Fancy Indexing
 Sorting Arrays
 Structured Data: NumPy's Structured Arrays
3. Data Manipulation with Pandas
 Introducing Pandas Objects
 Data Indexing and Selection
 Operating on Data in Pandas
 Handling Missing Data
 Data Manipulations / Cleansing / Munging
 Hierarchical Indexing
 Combining Datasets: Concat and Append
 Combining Datasets: Merge and Join
 Aggregation and Grouping
 Pivot Tables
 Vectorized String Operations
 Working with Time Series
 High-Performance Pandas: eval() and query()
 Further Resources

4. Visualization with Matplotlib
 Simple Line Plots
 Simple Scatter Plots
 Visualizing Errors
 Density and Contour Plots
 Histograms, Binnings, and Density
 Customizing Plot Legends
 Customizing Colorbars
 Multiple Subplots
 Text and Annotation
 Customizing Ticks
 Customizing Matplotlib: Configurations and Stylesheets
 Three-Dimensional Plotting in Matplotlib
 Geographic Data with Basemap
 Visualization with Seaborn
 Further Resources
5. Machine Learning
 What Is Machine Learning?
 Introducing Scikit-Learn and statsmodels
 Hyperparameters and Model Validation
 Linear Regression
 Logistic Regression
 Decision Trees and Random Forests
 k-Means Clustering
 Kernel Density Estimation
 Supervised and Unsupervised Learning
