0% found this document useful (0 votes)

45 views45 pages

Stata Slides

Uploaded by

Hamza Nadeem

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

45 views45 pages

Stata Slides

Uploaded by

Hamza Nadeem

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 45

Introduction to

Stata
What is Stata?
• Stata is a comprehensive statistical software used for
data analysis, data management, and graphics

• Widely used in Economics, Social Sciences, Biostatistics,

and other fields for rigorous data analysis and research
Why Use Stata?
• User-friendly interface.
• Comprehensive documentation and strong community support.
• Versatility in handling large datasets.
• Wide range of statistical tools.
• Stata has continued to evolve with a focus on expanding its
statistical methods, improving user interface and experience, and
integrating with other programming languages like Python and R.
• It remains widely used in academia, government, and industry for
a variety of purposes, including economics, biostatistics, political
science, and sociology.
Notes:
• Why Use Stata?
• User-Friendly Interface: Stata offers an intuitive and easy-to-navigate interface,
making it accessible for beginners while still powerful enough for advanced users.
• Comprehensive Toolset: It provides a wide range of statistical, graphical, and
data management tools, all in one package, supporting both simple and complex
analyses.
• Consistency Across Platforms: Whether you're using Windows, Mac, or Linux,
Stata delivers the same functionality, ensuring a consistent experience across
different operating systems.
• Strong Community Support: Stata has a large and active user community,
offering plenty of online resources, tutorials, and forums where users can seek
help and share knowledge.
• Regular Updates: The software is regularly updated to include the latest
statistical methods, ensuring that users have access to cutting-edge tools for their
research.
• Widely Accepted in Academia and Industry: Stata is widely used in academia
for teaching and research, as well as in various industries for data analysis,
making it a valuable skill in both academic and professional settings.
2.Getting Started with Stata

• Getting Started with Stata

• Stata Interface Overview
• Menu Bar: Provides access to most of Stata’s features.
• Command Window: Where commands are typed and
executed.
• Review Window: Shows the history of commands you
have entered.
• Variables Window: Displays the list of variables in the
dataset.
• Results Window: Displays the output from commands.
• Do-file Editor: Used for writing and saving sequences of
commands (scripts).
Basic
•LoadingCommands
Data:
•use command to load datasets.
and Syntax
•Example: use sysuse auto
•Viewing Data:
•browse to view the dataset in a spreadsheet format.
•list to display data in the Results window.
•Describing Data:
•describe for an overview of the dataset.
•summarize to get summary statistics like mean, standard
deviation, etc.
•Clearing Data:
•clear command to clear the dataset.
•Example: clear
•Importing and Exporting Data
•Importing from Excel, CSV, etc. (import excel, import
delimited)
•Exporting to different formats (export excel, export
delimited)
3. Data Management in Stata

•. Data Management in Stata

•Creating and Modifying Variables
•generate to create new variables.
•replace to modify existing variables.
•Example: generate age_sq = age^2
4.
•
Basic Data Analysis
Descriptive Statistics
•tabulate for frequency tables.
•summarize with detailed options.
•tabstat for customized summary statistics.
•Graphs and Plots
•histogram, scatter, and boxplot for visual data exploration.
•Example: scatter yvar xvar
•Mention the Graph Editor for customizing plots.
•Basic Regression Analysis
•regress command for running linear regression.
•Interpreting the output (coefficients, R-squared, etc.).
•Example: regress yvar xvar1 xvar2
Introduction to Do-Files

• What is a Do-File?
• Explanation of a Do-file as a script containing Stata
commands.

• Creating and Running Do-Files

• How to write, save, and run a Do-file.
• Benefits of using Do-files (reproducibility, efficiency).
6. Tips and Best Practices
•Commenting Code
•Using * or // to add comments in Do-files.
•Organizing Work
•Importance of clear file structures and naming
conventions.
•Using log files to save session outputs.
Hands on Activity
Task 1 in Stata
11

Agenda

Introduction to Stata
Introduction to the assignment
Simple step wise guidelines to carry out the assigned tasks.
Input of data in Stata
Creating a self explanatory Do file
Carrying out analysis
Interpretation of results
12

Stata Basic Commands:

 Loading Data Set:
 Sysuse auto
 Browsing Data set in Data Editor:
 Browse/br
Wiping out memory /clearing DataSet:
Clear/clr
Codebook
Sum
Input y x
4 important windows Basic interface of Stata.
Managing a do file
Keeping a log file
The first assignment
is related to
household income
and consumption
using a dataset. You
can either use a
real dataset or
create a simple
hypothetical one.
Step-by-Step
Assignment Outline:
Objective: Analyze the
relationship between household
income and consumption, run a
regression model, interpret the
coefficients, and test for
heteroskedasticity and
multicollinearity using Stata.

1. Dataset:
You can use real data from publicly available sources.
(e.g., World Bank, UCI Machine Learning
Repository).
2. For simplicity, let’s use a small hypothetical dataset
for this example.
15

Data input in Stata

Input y x
1
2
3
end
16

Hypothetical Data:
Assignment Tasks:
Task 1: Run the
Regression in Stata
Model: Y=β0+β1X+u
Where Y is household
consumption and X is
household income.

Command
regress consumption income
Task 2: Interpret
the Coefficients
•Explain what the slope (β1) means.
•For instance, if β1=0.5 it means that for every 1 unit
increase in household income, consumption increases
by 0.5 units.
•Interpret the intercept (β0) and the significance levels
(p-values).
19

Outcome of the activity:

Conclusion:
In this assignment, students will:
•Learn to run a simple linear regression.
•Understand the interpretation of
regression coefficients.
Thank You
Regards,
Fatima.
Practice:2 Hands On Activity

T W O S M A L L D ATA S E T S
Agenda:

• Entering 2 small data sets by making use of following commands:e

• clear

• . input Y X

• end
2 Data Sets given on Page no: 65
Entering
Data Set
1:
Entering 1st data Set:
• . clear
• . input Y X
• Y X
• 1. 70 80
• 2. 65 100
• 3. 90 120
• 4. 95 140
• 5. 110 160
• 6. 120 180
• 7. 130 200
• 8. 140 220
• 9. 155 240
• 10. 150 260
• 11. end

• .
• . gen sample = 1

• . save temp_sample1, replace

• file temp_sample1.dta saved
Using (gen, save) commands to generate
and save sample 1
• gen sample = 1

• save temp_sample1, replace

• br
Browsing 1st sample Data Set
Now give the command of clear and enter
sample 2
• Clear

• Input Y X

• 55 80

• 60 88

• 70 100

• 80 120

• 95 140

• 110 160

• 118 180

• 145 220

• 150 240

• 175 260

• end
Generating sample 2 and saving it:

• gen sample = 2

• . save temp_sample2, replace

Browse for sample 2:
• Y X sample
• 55 80 2
• 60 88 2
• 70 100 2
• 80 120 2
• 95 140 2
• 110 160 2
• 118 180 2
• 145 220 2
• 150 240 2
• 175 260 2
•
Now Append both samples

• use temp_sample1, clear

• append using temp_sample2

• List

• Br
Y X sample
70 80 1
65 100 1
90 120 1
95 140 1
110 160 1
120 180 1
130 200 1
140 220 1
155 240 1
150 260 1
55 80 2
60 88 2
70 100 2
80 120 2
95 140 2
110 160 2
118 180 2
145 220 2
150 240 2
175 260 2
Using List Command
• Command syntax:
• List
| Y X|
• |-----------------|
• 1. | 15000 25000 |
• 2. | 18000 30000 |
• 3. | 30000 50000 |
• 4. | 35000 60000 |
• 5. | 40000 70000 |
• |-----------------|
• 6. | 50000 80000 |
• 7. | 55000 100000 |
• 8. | 600000 110000 |
• 9. | 70000 115000 |
• 10. | 80000 125000 |
• |-----------------|
• 11. | . .
• 12. | . .
Next set of commands to be executed:
•Now running individual regression analysis on each sample to obtain estimates and predict yhat:

•use temp_sample1, clear

•reg y x

•predict yhat1

•predict res, residuals

•gen residuals_square=res^2

•scatter y x

• twoway (scatter y x) (lfit y x)

•twoway (scatter y x) (lfit yhat x)

•use temp_sample2,clear

•reg y x

•predict yhat2

•predict res, residuals

•list yhat2 res

•gen res_squares =res^2

•list res res_squares

•scatter y x

•twoway (scatter y x) (lfit y x)

•twoway (scatter y x) (lfit yhat x)

Command for making histogram

• . histogram residuals, normal

Results of sample1:
Results of sample 2:
Entering 2 blank values

• Input y x

• 1

• 2

• 3

• 4. .

• End
Using List if missing (X) Command:
. list if missing(X)

• |Y X|
• |-------|
• 11. | . . |
• 12. | . . |
• +-------+
Drop if X is missing:

• drop if missing (X)

Using Edit Command:

• edit

• Data editor will open, we will manually enter the values and then save every
individual value before entering new value or we can use drop command to drop
the missing value
• List if missing (X)

• Drop if missing (X)

Replacing missing values by mean
values
• Use following Commands to get means of X and Y:

• summarize X

• Summarize Y

• Now using the following commands:

• replace X = 70000 if missing(X)

• replace Y = 50000 if missing(Y)

• br

Intro To Stata 2022
No ratings yet
Intro To Stata 2022
36 pages
Lecture 1-2 Applied Econometrics
No ratings yet
Lecture 1-2 Applied Econometrics
68 pages
Introduction To Stata Software, MaU, 2022
No ratings yet
Introduction To Stata Software, MaU, 2022
93 pages
Stata Session 2
No ratings yet
Stata Session 2
11 pages
STATA
No ratings yet
STATA
26 pages
STATA Basics Regression and Panal Data
100% (1)
STATA Basics Regression and Panal Data
26 pages
An Introduction To Modern Econometrics Using Stata (Christopher Baum) PDF
100% (1)
An Introduction To Modern Econometrics Using Stata (Christopher Baum) PDF
349 pages
Free Numerical Reasoning Test Questions Answers
100% (3)
Free Numerical Reasoning Test Questions Answers
18 pages
Biostatistics in Public Health Using STATA-2016
100% (4)
Biostatistics in Public Health Using STATA-2016
202 pages
STATA Frain
No ratings yet
STATA Frain
68 pages
(Cameron & Trivedi 2009) Microeconometrics Using Stata
No ratings yet
(Cameron & Trivedi 2009) Microeconometrics Using Stata
733 pages
Introduction To STATA
No ratings yet
Introduction To STATA
57 pages
STATA Commands
100% (2)
STATA Commands
35 pages
Stata Application Part I
No ratings yet
Stata Application Part I
27 pages
Cameron and Trivedi STATA
100% (3)
Cameron and Trivedi STATA
732 pages
Bio624 Class1handout
No ratings yet
Bio624 Class1handout
48 pages
Stata Workshop
No ratings yet
Stata Workshop
5 pages
Introduction To Stata Data Management: Chang Y. Chung Office of Population Research Princeton University September 2013
100% (1)
Introduction To Stata Data Management: Chang Y. Chung Office of Population Research Princeton University September 2013
24 pages
Stat A Tutorial
No ratings yet
Stat A Tutorial
40 pages
Lec11-Stata Regression
No ratings yet
Lec11-Stata Regression
9 pages
Stata Guide V1
No ratings yet
Stata Guide V1
65 pages
CAMERON, C. e TRIVEDI, P.K. Microeconometrics Using Stata. Cambridge: CUP, 2010
No ratings yet
CAMERON, C. e TRIVEDI, P.K. Microeconometrics Using Stata. Cambridge: CUP, 2010
3 pages
Empirical Guidance
No ratings yet
Empirical Guidance
38 pages
An Introduction To Modern Econometrics Using Stata by Christopher F. Baum
No ratings yet
An Introduction To Modern Econometrics Using Stata by Christopher F. Baum
349 pages
Poe 5 Statatoc
No ratings yet
Poe 5 Statatoc
12 pages
Introduction To STATA With Econometrics in Mind: January 2010
No ratings yet
Introduction To STATA With Econometrics in Mind: January 2010
47 pages
Stata Review
No ratings yet
Stata Review
9 pages
ECON6067 Stata (II) 2022
No ratings yet
ECON6067 Stata (II) 2022
22 pages
Introduction Stata Slides 2
No ratings yet
Introduction Stata Slides 2
25 pages
Introduction To Stata and Data Management
No ratings yet
Introduction To Stata and Data Management
30 pages
Gravity13 Stata
No ratings yet
Gravity13 Stata
80 pages
Introduction To STATA: Introduction To STATA About STATA Basic Operations Regression Analysis Panel Data Analysis
No ratings yet
Introduction To STATA: Introduction To STATA About STATA Basic Operations Regression Analysis Panel Data Analysis
27 pages
Stata Notes
No ratings yet
Stata Notes
7 pages
Stata Reference Manual: What You Should Know About Stata After Taking The Stata Introduction Course
No ratings yet
Stata Reference Manual: What You Should Know About Stata After Taking The Stata Introduction Course
26 pages
Stata An Introduction Summer 2020
No ratings yet
Stata An Introduction Summer 2020
60 pages
Stat A Guide
No ratings yet
Stat A Guide
16 pages
Stata Tutorial 13 v2 0
No ratings yet
Stata Tutorial 13 v2 0
45 pages
A Short Guide To Stata 10 For Windows
No ratings yet
A Short Guide To Stata 10 For Windows
7 pages
Introduction To Stata: 1 Data Manipulation
No ratings yet
Introduction To Stata: 1 Data Manipulation
6 pages
Stat A Red
No ratings yet
Stat A Red
4 pages
Stata's: What To Do First?
No ratings yet
Stata's: What To Do First?
3 pages
Stata
No ratings yet
Stata
6 pages
An Introduction To Stata For Economists: Data Management
No ratings yet
An Introduction To Stata For Economists: Data Management
49 pages
Using Stata With The Fundamentals of Political: Science Research
No ratings yet
Using Stata With The Fundamentals of Political: Science Research
20 pages
A Short Introduction To STATA
No ratings yet
A Short Introduction To STATA
8 pages
Stata
No ratings yet
Stata
26 pages
Summary of Basic STATA Commands and Syntax
No ratings yet
Summary of Basic STATA Commands and Syntax
5 pages
Comandos
No ratings yet
Comandos
51 pages
Advanced Stata
No ratings yet
Advanced Stata
54 pages
UsefulStataCommands PDF
No ratings yet
UsefulStataCommands PDF
51 pages
An Introduction To Stata For Economists: Data Analysis
No ratings yet
An Introduction To Stata For Economists: Data Analysis
48 pages
Useful Stata Commands
No ratings yet
Useful Stata Commands
48 pages
Basic Tutorial Stata PDF
No ratings yet
Basic Tutorial Stata PDF
5 pages
Top 50 SAP ABAP Interview Questions and Answers PDF
No ratings yet
Top 50 SAP ABAP Interview Questions and Answers PDF
12 pages
De Vera, Crisangelyn C
No ratings yet
De Vera, Crisangelyn C
2 pages
Takeover Full
50% (2)
Takeover Full
92 pages
HRM Notes BBA
No ratings yet
HRM Notes BBA
39 pages
Background of The Study: Manual System in Generating Reports of Inventory and Check-Up
No ratings yet
Background of The Study: Manual System in Generating Reports of Inventory and Check-Up
5 pages
#9 - RA 9028 As Amended by RA 10364
100% (1)
#9 - RA 9028 As Amended by RA 10364
3 pages
Startup Data Scraping
No ratings yet
Startup Data Scraping
16 pages
Lecture 01.1
No ratings yet
Lecture 01.1
21 pages
Piercing The Fog Intelligence and Army Air Forces Operations in World War II
No ratings yet
Piercing The Fog Intelligence and Army Air Forces Operations in World War II
516 pages
How To Create COBie Using With BIM Interoperability Tool
No ratings yet
How To Create COBie Using With BIM Interoperability Tool
26 pages
Polymerization of Alkenes... Final..fizza...
No ratings yet
Polymerization of Alkenes... Final..fizza...
19 pages
Chap07 DMMvideo
No ratings yet
Chap07 DMMvideo
40 pages
Department of Educat
No ratings yet
Department of Educat
3 pages
10 Vallarta v. CA
No ratings yet
10 Vallarta v. CA
2 pages
E Chapter
No ratings yet
E Chapter
6 pages
Naskah 2 Layout
No ratings yet
Naskah 2 Layout
10 pages
2-In-1 Mbot: Line Follower and Object Avoidance: Technology Workshop Craft Home Food Play Outside Costumes
No ratings yet
2-In-1 Mbot: Line Follower and Object Avoidance: Technology Workshop Craft Home Food Play Outside Costumes
4 pages
Kumera Crane Gearboxes
No ratings yet
Kumera Crane Gearboxes
5 pages
Dessler HRM12e PPT 04
No ratings yet
Dessler HRM12e PPT 04
46 pages
Solution Case Study CCN
100% (1)
Solution Case Study CCN
7 pages
Case - Study Vietjet
No ratings yet
Case - Study Vietjet
26 pages
Installation Instruction: Single Pole Insulated Conductor Rail Programme 812
No ratings yet
Installation Instruction: Single Pole Insulated Conductor Rail Programme 812
9 pages
Handout 14b - Integration by Substitution Method
No ratings yet
Handout 14b - Integration by Substitution Method
6 pages
E+H-PROMAG W 400 - Tender Text - TTW400EN
No ratings yet
E+H-PROMAG W 400 - Tender Text - TTW400EN
2 pages
01 JRODOS Overview
No ratings yet
01 JRODOS Overview
25 pages
Econometric Slides
No ratings yet
Econometric Slides
13 pages
Phannarak CV
No ratings yet
Phannarak CV
2 pages
Lab 4.5.1 Observing TCP and UDP Using Netstat (Instructor Version)
No ratings yet
Lab 4.5.1 Observing TCP and UDP Using Netstat (Instructor Version)
7 pages
Administration: Order of Completion
No ratings yet
Administration: Order of Completion
24 pages
Exam Time Table 2024 Bulanala-1
No ratings yet
Exam Time Table 2024 Bulanala-1
2 pages
Lion Air Eticket (IQVQBS) - Diyarn Putra Maulana
No ratings yet
Lion Air Eticket (IQVQBS) - Diyarn Putra Maulana
4 pages
PC Specification List
No ratings yet
PC Specification List
12 pages
Appendix B For 29
No ratings yet
Appendix B For 29
1 page
Python Beyond Limits: Python, #3
From Everand
Python Beyond Limits: Python, #3
AnwaarX
No ratings yet
Study Guide MO-500 Certification Exam Microsoft Access Expert ( Office 2019)
From Everand
Study Guide MO-500 Certification Exam Microsoft Access Expert ( Office 2019)
Anand Vemula
No ratings yet
Data Mining Models: Techniques and Applications
From Everand
Data Mining Models: Techniques and Applications
Ravi Deshpande
No ratings yet
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
From Everand
Python Advanced Programming: The Guide to Learn Python Programming. Reference with Exercises and Samples About Dynamical Programming, Multithreading, Multiprocessing, Debugging, Testing and More
Marcus Richards
No ratings yet
Crystal Reports Introduction: Versions 2008-2016
From Everand
Crystal Reports Introduction: Versions 2008-2016
Seth Bonder
No ratings yet
Understanding Educational Statistics Using Microsoft Excel and SPSS
From Everand
Understanding Educational Statistics Using Microsoft Excel and SPSS
Martin Lee Abbott
No ratings yet
Técnicas Estadísticas para la Ciencia de Datos a través de R. Aprendizaje Supervisado: Análisis Discriminante, Árboles de Decisión, Redes Neuronales y Modelos Lineales Generalizados
From Everand
Técnicas Estadísticas para la Ciencia de Datos a través de R. Aprendizaje Supervisado: Análisis Discriminante, Árboles de Decisión, Redes Neuronales y Modelos Lineales Generalizados
César Pérez López
No ratings yet
Algorithms and Data Structures: An Easy Guide to Programming Skills
From Everand
Algorithms and Data Structures: An Easy Guide to Programming Skills
Rigdon Jonathan
No ratings yet

Stata Slides

Uploaded by

Stata Slides

Uploaded by

Introduction to

• Widely used in Economics, Social Sciences, Biostatistics,

• Getting Started with Stata

•. Data Management in Stata

• Creating and Running Do-Files

Stata Basic Commands:

Data input in Stata

Outcome of the activity:

• Entering 2 small data sets by making use of following commands:e

• . save temp_sample1, replace

• save temp_sample1, replace

• . save temp_sample2, replace

• use temp_sample1, clear

• append using temp_sample2

•use temp_sample1, clear

•predict res, residuals

• twoway (scatter y x) (lfit y x)

•twoway (scatter y x) (lfit yhat x)

•predict res, residuals

•list yhat2 res

•gen res_squares =res^2

•list res res_squares

•twoway (scatter y x) (lfit y x)

•twoway (scatter y x) (lfit yhat x)

• . histogram residuals, normal

• drop if missing (X)

• Drop if missing (X)

• Now using the following commands:

• replace X = 70000 if missing(X)

• replace Y = 50000 if missing(Y)

You might also like