0% found this document useful (0 votes)
8 views

Introduction to Data Science

Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
8 views

Introduction to Data Science

Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

INTRODUCTION TO NO CODE

DATA SCIENCE

Robert Hoyt MD FACP FAMIA

David Patrishkoff MS LSSMBB

1. Free Virtual Sessions


2. Four weeks: March 8, 15, 22 and 29. Friday at 6 PM EST for one-hour
3. The free open-source software programs Orange Data Mining and JASP will be use
4. Try our Textbook Tutor GPT trained on our data science textbook for Free
5. Purchase of our textbook is optional
6. Hands-on exercises with multiple datasets
7. Sessions will be recorded
8. No pre-requisites except students are encouraged to load Orange and JASP and follow
along with the instructor as he demonstrates the use of software. No programming.
9. Enroll using the QR code or see below

Contact: [email protected]
Enroll: https://www.nocodedatascience.net/workshops
Session 1: Introduction to Data Science and Orange Data Mining
Objective: Learn data science concepts and the Orange Data Mining environment.
Outline:
Introduction to Data Science (15 minutes)
● What is Data Science?
● Importance of Data Science in all industries
● Types of Data (structured vs. unstructured)
● Overview of the Data Science Process (ask questions, wrangle data, analyze data,
interpret results)
Getting Started with Orange Data Mining (15 minutes)
● Introduction to Orange Data Mining software
● Available videos
● Modules available
● Add-ons
● Installation and setup
● Interface walkthrough (canvas, widgets, data table, etc.)
● How to save and share Orange files (OWSs)
Basic Concepts of Data Mining (15 minutes)
● Data Import and Preprocessing from File and Datasets widgets
● Basic Data Exploration and Visualization
● Introduction to Data Types and Variables (nominal, ordinal, interval, ratio)
Hands-On Exercise (15 minutes)
● Load a sample dataset (e.g., heart disease prediction)
● Basic data visualization (scatter plot, histograms, box plots)
● Familiarization with drag-and-drop interface
● Save data and save workflow
Homework Assignment: Import a dataset of your choice or use one from the datasets widget
and prepare a basic visualization to share in the next session.

Session 2: Preprocessing and Exploratory Data Analysis with Orange


Objective: Learn data cleaning data and perform exploratory data analysis.
Outline:
Review of Homework and Q&A (10 minutes)
● Discuss any issues or insights from the homework assignment
Data Preprocessing Techniques (20 minutes)
● Handling missing values
● Data normalization and transformation
Exploratory Data Analysis (EDA) (20 minutes)
● Descriptive statistics using several widgets
● Advanced visualization (e.g., heat maps, rain cloud plot, etc.)
● Detecting patterns and outliers
● Using the box plot widget to analyze numerical and categorical variables
Hands-On Exercise (10 minutes)
● Clean a provided dataset using preprocessing widgets
● Perform EDA to uncover initial insights

Contact: [email protected]
Enroll: https://www.nocodedatascience.net/workshops
Homework Assignment: Preprocess a dataset you are familiar with and perform EDA to
identify potential areas of interest for further analysis.

Session 3: Statistical Analysis and Hypothesis Testing with JASP


Objective: Introduction to statistical tests and hypothesis testing using JASP.
Outline:
Introduction to JASP (10 minutes)
● Overview of JASP software
● Differences between Orange and JASP
● Importing data into JASP
Basics of Statistical Analysis (20 minutes)
● Descriptive statistics vs. inferential statistics
● Common statistical tests (t-test, chi-square, ANOVA)
● Understanding p-values and confidence intervals
Hypothesis Testing (20 minutes)
● Formulating hypotheses i
● Choosing the appropriate statistical test
● Interpreting the results of statistical tests
Hands-On Exercise (10 minutes)
● Run basic statistical tests on a provided dataset
● Interpret the output and make inferences
Homework Assignment: Formulate a hypothesis based on your previous EDA, choose an
appropriate statistical test, and run it using JASP.

Session 4: Data Modeling and Presentation of Findings


Objective: Cover basic data modeling concepts and demonstrate how to present findings
effectively.
Outline:
Review of Homework and Q&A (10 minutes)
● Discuss the results of the hypothesis tests conducted as homework
Introduction to Data Modeling (20 minutes)
● What is a model in data science?
● Supervised vs. unsupervised learning
● Building a simple classification and regression predictive model in Orange
● Brief discussion of the algorithms available in Orange and JASP
Evaluating model performance (20 minutes)
● Taking the confusion out of the confusion matrix
● Understanding the AUC, accuracy, precision, recall, and specificity
● Comparing model results
Hands-On Exercise (10 minutes)
● Use Orange to create a simple classification model based on the dataset used in
Session 3
● Prepare a summary report or presentation of the findings
Homework Assignment: None. Continue practicing skills learned and apply them to your own
work or research.

Contact: [email protected]
Enroll: https://www.nocodedatascience.net/workshops

You might also like