Linear Regression
Linear Regression
Statistics
Abstract
nYQp/IlQrHD3i3D0OdRyi7TvSFl4Cf3VC1y0abggQZXdtwnfKZBYtws= on 07/17/2024
Linear regression is a statistical procedure for calculating the value of a dependent variable from an independent variable. Linear regression
measures the association between two variables. It is a modeling technique where a dependent variable is predicted based on one or more
independent variables. Linear regression analysis is the most widely used of all statistical techniques. This article explains the basic concepts
and explains how we can do linear regression calculations in SPSS and excel.
Keywords: Continuous variable test, excel and SPSS analysis, linear regression
DOI:
10.4103/jpcs.jpcs_8_18 How to cite this article: Kumari K, Yadav S. Linear regression analysis
study. J Pract Cardiovasc Sci 2018;4:33-6.
© 2018 Journal of the Practice of Cardiovascular Sciences | Published by Wolters Kluwer - Medknow 33
Kumari and Yadav: Linear regression
Figure 2: Starting Data Analysis ToolPak. Click the OFFICE button and
choose Excel options.
Figure 4: The regression screen. Choose Data > Data Analysis >
Figure 3: The Tool Pak. Choose Add Ins > Choose Analysis ToolPak Regression. Input y Range: A1:A8. Input X Range: B1:C8. Check Labels,
and select Go. Residuals, Output Range as A50.
If we now look at the last three columns, we can create an equation: Equation = Sales: 8533.9 − 79.9 (price) + 0.07 (TV ads). Therefore as the price goes up,
the sales go down (P = 0.006), and the addition of TV ads is less (0.07 multiplied by advertisement money with a P value of 0.01). SE: Standard error, MS:
Mean square, SS: Sum of square
regression is used in a similar way as described above. The the form of an equation. In this article, we have used simple
three genotypes, i.e., normal homozygote AA, heterozygote examples and SPSS and excel to illustrate linear regression
AB and homozygote mutant BB may be coded as 1, 2, and 3, analysis and encourage the readers to analyze their data by
respectively. The test may be preceded, and in a similar way, these techniques.
the unstandardized coefficient (β) would explain the effect on Financial support and sponsorship
the dependent variable with per allele dose increase. Nil.
Example 3 – Using Excel to see the relationship between
Conflicts of interest
sale of medicine with the price of the medicine and TV There are no conflicts of interest.
advertisements.
Table 5 contains data which can be entered into an Excel sheet. References
Follow instructions as shown in figures 2-4. 1. Schneider A, Hommel G, Blettner M. Linear regression analysis:
Part 14 of a series on evaluation of scientific publications. Dtsch Arztebl
As shown in Table 6, Multiple R is the Correlation Coefficient, Int 2010;107:776‑82.
where 1 means a perfect correlation and zero means none. 2. Freedman DA. Statistical Models: Theory and Practice. Cambridge,
R Square is the coefficient of determination which here means USA: Cambridge University Press; 2009.
3. Chan YH. Biostatistics 201: Linear regression analysis. Age (years).
that 92% of the variation can be explained by the variables. Singapore Med J 2004;45:55-61.
Adjusted R square adjusts for multiple variables and should be 4. Chan YH. Biostatistics 103: Qualitative data – Tests of independence.
used here. here. Table 7 shows how to create a linear regression Singapore Med J 2003;44:498‑503.
5. Gaddis ML, Gaddis GM. Introduction to biostatistics: Part 6, correlation
equation from the data. and regression. Ann Emerg Med 1990;19:1462‑8.
6. Mendenhall W, Sincich T. Statistics for Engineering and the Sciences.
Conclusion 7.
3rd ed. New York: Dellen Publishing Co.; 1992.
Panchenko D. 18.443 Statistics for Applications, Section 14, Simple
The techniques for testing the relationship between two Linear Regression. Massachusetts Institute of Technology: MIT
variables are correlation and linear regression. Correlation OpenCourseWare; 2006.
8. Elazar JP. Multiple Regression in Behavioral Research:
quantifies the strength of the linear relationship between a pair Explanation and Prediction. 2nd ed. New York: Holt, Rinehart and
of variables, whereas regression expresses the relationship in Winston; 1982.