MHA FPX 5017 Assessment 3 Predicting an Outcome Using Regression Models

Paper Instructions

Overview

Perform multiple regression on the relationship between hospital costs and patient age, risk factors, and patient satisfaction scores, and then generate a prediction to support this health care decision. Write a 3–4-page analysis of the results in a Word document and insert the test results into this document.

Note You are strongly encouraged to complete the assessments in this course in the order they are presented.

Regression is an important statistical technique for determining the relationship between an outcome (dependent variable) and predictors (independent variables). Multiple regression evaluates the relative predictive contribution of each independent variable on a dependent variable. The regression model can then be used for predicting an outcome at various levels of the independent variables. For this assessment, you will perform multiple regression and generate a prediction to support a health care decision.

By successfully completing this assessment, you will demonstrate your proficiency in the following course competencies and assessment criteria

Competency 2

Analyze data using computer-based programming and software.

  • Perform the appropriate multiple regression using a dataset.

Competency 3

Interpret results of data analysis for value-based health care decisions, policy, or practice.

  • Interpret the statistical significance and effect size of the regression coefficients of a data analysis.
  • Interpret the fit of the regression model for prediction of a data analysis.

Competency 4

Present results of data analysis to support a decision or recommendation.

  • Apply the statistical results of the multiple regression of a data analysis to support a health care decision.
  • Write a narrative summary of the results that includes practical, administration-related implications of the multiple regression.

Competency 5

Communicate audience-appropriate health management content in a logically structured and concise manner, writing clearly with correct use of grammar, punctuation, spelling, and APA style.

  • Write clearly and concisely, using correct grammar, mechanics, and APA formatting.

Multiple Linear Regression

  • Casson, R. J., & Farmer, L. D. M. (2014). Understanding and checking the assumptions of linear regression A primer for medical researchers. Clinical & Experimental Ophthalmology, 42(6), 590–596.
  • Frey, B. B. (Ed.). (2018). Multiple linear regression. In The SAGE encyclopedia of educational research, measurement, and evaluation (Vols. 1–4). Thousand Oaks, CA Sage.
  • Katz, M. H. (2003). Multivariable analysis A primer for readers of medical research. Annals of Internal Medicine, 138(8), 644-650.

Multiple Regression in Microsoft Excel.

  • Statsoft.com. (n.d.). How to find relationship between variables, multiple regression. Retrieved from http //www.statsoft.com/Textbook/Multiple-Regression

Regression Analysis

  • Gallo, A. (2015, November 04). A refresher on regression analysis. Harvard Business Review Digital Articles, 2-9.
  • SCSUEcon. (2011, August 20). Linear regression in Excel [Video] | Transcript. Retrieved from https //www.youtube.com/watch?v=TkiB1xBnjn4

Using Regression Analysis Every Day.

Effect Size

  • Sullivan, G. M. (2012). FAQs about effect size. Journal of Graduate Medical Education, 4(3), 283-284. Retrieved from https //www.ncbi.nlm.nih.gov/pmc/articles/PMC3444175/
  • Sullivan, G. M., & Feinn, R. (2012). Using effect size or why the P value is not enough. Journal of Graduate Medical Education, 4(3), 279-282.

Predictive Analytics

  • Davenport, T. H. (2014, September 02). A predictive analytics primer. Harvard Business Review Digital Articles, 2-4.
  • IntroToIS BYU. (2016, November 04). Creating a multiple linear regression predictive model in Excel [Video] | Transcript. Retrieved from https //www.youtube.com/watch?v=rWJKLqp3wWs

Textbook

  • Kros, J. F., & Rosenthal, D. A. (2016). Statistics for health care management and administration Working with Excel (3rd ed.). San Francisco, CA Jossey-Bass.

Available in the Capella Library.

Asssessment Instructions

Preparation

  • Download the Assessment 3 Dataset [XLSX].

The dataset contains the following variables:

  • cost (hospital cost in dollars)
  • age (patient age in years)
  • risk (count of patient risk factors).
  • satisfaction (patient satisfaction score percentile rank).

Instructions

Hospital administration needs to make a decision on the amount of reimbursement required to cover expected costs for next year. For this assessment, using information on hospital discharges from last year, perform multiple regression on the relationship between hospital costs and patient age, risk factors, and patient satisfaction scores, and then generate a prediction to support this health care decision. Write a 3–4-page analysis of the results in a Word document and insert the test results into this document (copied from the output file and pasted into a Word document). Refer to Copy From Excel to Another Office Program for instructions.

Submit both the Word document and the Excel file that shows the results.

We Work Hard So That You Don’t

We’ll write a 100% plagiarism-free paper in under 1 hour.

Introduction

Regression analysis refers to the set of statistical methods that are applied in the estimation of the dependent variable and one or more independent variables. Regression analysis can be applied to assess the strength of the correlation between variables and for modeling the future relationship that may be expected between independent and dependent variables. In regression analysis, there exist several variations such as multiple linear, linear, as well as nonlinear. Some of the most common models are multiple linear and simple linear (Kumari & Yadav, 2018). Non-linear regression analysis is usually applied for complicated data sets where the independent and dependent variables indicate a nonlinear relationship (Aggarwal & Ranganathan, 2017). There are numerous applications of regression analysis, including research processes as well as financial analysis. The purpose of this assignment is to predict an outcome using regression models through the application of the dataset given.

Before conducting regression analysis, it is necessary to understand the assumptions. One of the assumptions is that the independent variable is not always random. Some other assumptions include the value of residuals is zero, the independent and dependent variables often show a linear relationship between the intercept and the slope, the value of residual is always constant across all the observations made; finally, the values of residual are not always correlated across different observations (Montgomery et al., 2021). Besides, the residual values often follow the normal distribution.

Regression analysis

From the information given, the dependent variable is hospital costs, while the independent variables include patient age, risk factors, and patient satisfaction scores. Both the independent and dependent variables are continuous.

Table 1: Descriptive Statistics

  Mean Std. Deviation  N
Cost 14906.51  2614.346  185
Age 73.25 6.430 185
Risk 5.69  2.777 185
Satisfaction   50.02   28.919 185

Table 1 indicates the descriptive statistics for both the dependent and independent variables. The means of variables, cost, age, risk, and satisfaction include $14906.51, 73.25 years, 5.69, and 50.02. The sample size used was 185.

Table 2: Correlations

  • Cost
  • Age
  • Risk
  • Satisfaction
Pearson Correlation cost   1.000 .279  .199     -.071
Age .279 1.000  .152    .094
Risk .199  .152  1.000 .037
Satisfaction   -.071 .094 .037 1.000

 

Sig. (1-tailed) cost  . .000  .003   .169
Age .000 . .019 .101
Risk .003 .019 . .307
Satisfaction   .161 .101 .307 .

 

N cost 185 185 185 185
Age 185 185 185 185
Risk 185 185 185 185
Satisfaction   185 185 185 185

 

Table 2 shows the correlation between dependent and independent variables. The outcomes show that there is a weak positive correlation between the cost and age; the correlation coefficient is 0.279. The correlation between cost and risk is also weak and positive; the correlation coefficient is 0.199. Finally, the correlation between cost and the level of satisfaction is weak and negative; the correlation coefficient is -.071.

Table 3 Model Summary

Model R R Square Adjusted R Square Std. Error of the Estimate 

 

Change Statistics

R Square Change F Change df1 df2 Sig. F Change
1 .336a .113 .098 2482.429 .113 7.692 3 181 .000

NOTE: a. Predictors (Constant), satisfaction, risk, age

From table 3, the R-Square is 0.113 showing a “Medium” effect size; therefore, the model attempt to explain much of the variance in the dependent variable. The significant value from the analysis is 0.000 < 0.05; therefore, we reject that null hypothesis and conclude that the model is fit or significant. Given that the analysis was done at 95% level of significance, the null hypothesis is rejected when the significant values obtained are less than 0.05.

Table 4 Coefficients

a

Model Unstandardized Coefficients Standardized Coefficients t Sig. 95.0% Confidence Interval for B
B Std. Error Beta Lower Bound Upper Bound
1 (Constan) 6652.176 2096.818 3.173 .002 2514.825 10789.527
age 107.036 28.911 .263 3.702 .000 49.990 164.082
risk 153.557 66.685 .163 2.303 .022 21.978 285.136
satisfaction -9.195 6.358 -.102 -1.446 .150 -21.740 3.351

a. Dependent Variable cost

From table 4, there is the indication of different unstandardized coefficients for the independent variables used in the study. A regression equation can therefore be formulated from the information given. Using the equation of a straight line, Y= Mx +C, at the Y-intercept, x becomes 0.

Therefore, the equation becomes, Y=M (0) + C, Y=C. From the table above, Y= 6652.176. To formulate a regression equation, there is the need for the analysis to consider the constant and unstandardized coefficients of the independent variables. The equation takes the form of a line equation which is Y= Mx + c,

Therefore, we find that:

Cost = 6652.176 + 107.036 (age) + 153.557 (risk) – 9.195 (satisfaction)

The above regression equation can be used to predict the costs given each of the independent variables. While determining the cost using each of the variables, we set all other independent variables to zero. The above equation shows that the cost depends on the age of the patients, risks factors, as well as the level of satisfaction of the patients after treatments.

Conclusion

Regression analysis can be applied to assess the strength of the correlation between variables and for modeling the future relationship that may be expected between independent and dependent variables. The analysis shows that the hospital costs are dependent on patient age, risk factors, and patient satisfaction scores. Both the independent and dependent variables are continuous.

References

  • Kumari, K., & Yadav, S. (2018). Linear regression analysis study. Journal of the practice of Cardiovascular Sciences, 4(1), 33. https //www.j-pcs.org/article.asp?issn=2395-5414;year=2018;volume=4;issue=1;spage=33;epage=36;aulast=Kumari
  • Montgomery, D. C., Peck, E. A., & Vining, G. G. (2021). Introduction to linear regression analysis. John Wiley & Sons. http //sutlib2.sut.ac.th/sut_contents/H133678.pdf
  • Aggarwal, R., & Ranganathan, P. (2017). Common pitfalls in statistical analysis Linear regression analysis. Perspectives in clinical research, 8(2), 100. https //www.ncbi.nlm.nih.gov/pmc/articles/PMC5384397/

We Work Hard So That You Don’t

We’ll write a 100% plagiarism-free paper in under 1 hour