Team Case Study 1- Croq’pain

Page 1 of 5

Assignment:

Using the data for this case, provided in the spreadsheet file entitled CROQPAIN.xls, do the following:

(a) Examine the operating earnings regression model output obtained from the 60 stores, as shown in Table 6.27. Try to improve the model by eliminating certain independent variables or by making any other changes that you think make good sense. You should strive for a model that is simple (i.e., has few independent variables), and that does not violate any of the basic assumptions of multiple regression (e.g., multicollinearity, heteroscedasticity), but nevertheless has good predictive power.

Solution. We will try to improve the model by taking various steps.

Please find below the step-wise solution for improving the model-

1. Exploratory Data Analysis to check the Outliers: -

We performed the Exploratory Data Analysis, where we see the scatter plots for all the variables to see if they are containing any outliers or not.

For example, we can see the scatter plot of K below-

[pic 1]

We can see that the above 3-4 points are in the different range from most of the values. So, these can be considered the outliers.

2. Outlier Analysis through inter-quartile range-

We calculated the first and third quartile for each variable, and then calculated the inter-quartile range. From that interquartile range, we calculated the lower and upper bound for each variable, and any outlier will lie outside that range.

[pic 2]

3. Winsorization Method to perform outlier treatment-

Instead of deleting the outliers for different variables, we replaced them by the nearest upper bound or lower bound variable so that it lies within the allowed range of lower and upper bound.

4. Checking the Correlation factor between all the variables-

We checked the Pearson’s correlation value between all the variables to see how much each variable is correlated with each other. The significance value is taken as 0.05 and it is for 2-tailed distribution.

[pic 3]

We can see from here that the variable EARN is well correlated with the variable INC, and the significance value is 0.00, which lies in the acceptable region. Similarly, we can see the correlation values for rest of the combinations. It is just for a basic idea of relation of these variables.

5. Checking of the multicollinearity and performing the linear regression-

“EARN” has been considered the dependent variable, and rest of the variables excluding “STORE” have been considered as the independent variables. Now, Linear Regression has been performed on the remaining independent variables to check multicollinearity between them. VIF (Variance Inflation Factor) is obtained from this operation. It measures the impact of collinearity among the variables and their dependency on another variable in a regression model. Independent variables with VIF Values greater than 3 were removed (3 has been considered as the benchmark to ensure minimum risk).

Here is the screenshot of the output[pic 4]

[pic 5]

Here, we can see that the significance F value of ANOVA is 0.000, which shows that the model is statistically significant.

Now, we will look into the coefficient table. Variables with coefficient equal to zero, significance value greater than 0.05, and VIF values greater than 3 will be removed to remove multicollinearity and insignificant variables. So, the remaining independent variables are – SIZE, INC, NREST, and PRICE.

Now, we will again perform the linear regression with EARN as dependent variables, and SIZE, INC, NREST, and PRICE as independent variables.

[pic 6]

We can see that the overall significance value of ANOVA is 0.00, but the PRICE variable is having significance value as 0.066, which is greater than 0.05. So, we will remove this variable, and further build the model.

The new output will be –

[pic 7]

Here, every parameter lies in the allowable range. So, we can conclude this output. And the linear regression model will be as –

EARN = -352.218 + 0.748*SIZE + 11.506*INC + 1.642*NREST

(b) Michel thinks that a good way to validate the model obtained with data from the 60 stores is to see how a similar model, obtained from the 50 stores opened before 1994, would have performed in predicting the performance of the last ten stores opened. Step back one year prior to the opening of the last ten restaurants. Amend the model you have developed using only the data from the first fifty stores. Using Croq’Pain’s performance ratio target of 26%, which of the ten stores would you have opened in 1994?

Download as (for upgraded members) txt (6.7 Kb) pdf (972.3 Kb) docx (748.7 Kb)

Continue for 4 more pages »

Read full document Save

Essay Preview

prev next

By: victor policarpio

Submitted: October 23, 2018

Essay Length: 1,077 Words / 5 Pages

Paper type: Case Study

Report this essay

Related Essays

Nike Case Study

SHORT CASE SUMMARY Nike, Inc. (503-671-6453, www.nike.com) is the worlds #1 athletic shoe and apparel seller. Nike currently employs 20,700 employees, with total sales of

1,706 Words | 7 Pages
Brinkerhoff International Inc Case Study

MEMORANDUM TO: JUAN C. ARAQUE FROM: GROUP #6 SUBJECT: CASE STUDY FOR COMPANY "BRINKERHOFF INTERNATIONAL INC." DATE: 11/14/00 CC: HUMAN RESOURCE DIRECTOR OBJECTIVE: After careful

2,797 Words | 12 Pages
Learning Team Case Study

Learning Team Case Study Team A selected a situation that Shannon Payne is currently experiencing at her workplace. The problem is with the two-person accounting

1,015 Words | 5 Pages
Case Study Review - Reviving an Ancient Therapy to Manage Chronic Pain

Title: Reviving an Ancient Therapy to Manage Chronic Pain Reference: Podiatry Today, December 2003, pg. 46-53 Author: Nicholas A Grumbine, DPM Rating: 4/5 Abstract Objective:

786 Words | 4 Pages