Indirectly “yes”, but I will ultimately argue “no”.

Introduction

Lots of people in the US are killed by guns. Guns are a leading cause of death among children. Mass shootings happen nearly daily. Gun laws and controls are a hotly debated and divisive political issue in the US. (Citations needed!)

I will use data on gun deaths to illustrate Omitted Variable Bias, and to also explore the link between gun ownership and gun deaths.

Description of the data

The data is for US states in 2016, and comes from various sources.

Variable Description Source
death.per.100k The number of gun deaths per 100,000 people, in 2016. CDC
trump.vote The percentage of votes for Trump in 2016. This variable is meant to measure the percentage of Repbulicans by state. New York Times
gun.owner The estimated percentage of households in 2016 that owned guns. RAND
back.check A dummy variable indicating whether the state has background checks. RAND

Download the data

Download the data from the website using:

guns <- read.csv("https://rtgodwin.com/data/guns.csv")

Take a look at the first 6 observations for each variable:

head(guns)
       state death.per.100k gun.owner trump.vote back.check permit
1    Alabama           21.5      52.8       62.1          0      0
2     Alaska           23.3      57.2       51.3          0      0
3    Arizona           15.2      36.0       48.1          0      0
4   Arkansas           17.8      51.8       60.6          0      0
5 California            7.9      16.3       31.5          1      1
6   Colorado           14.3      37.9       43.3          1      0
  • Notice that Alabama has a lot of deaths (21.5) and Republicans (62.1%).
  • California has few deaths (7.9) and Republicans (31.5%).
  • Think about the magnitude of these numbers. With a population of 39.15 million in 2016, 7.9 deaths per 100k means 3,093 deaths.
  • Look at how many households have guns!

Plot the data:

plot(guns$trump.vote, guns$death.per.100k,
     xlab = "Trump vote percentage in 2016",
     ylab = "Gun deaths per 100k population",
     main = "Deaths and votes by state",
     pch = 16, col = "magenta")

Now we’ll estimate the model:

$death.per.100k = \beta_0 + \beta_1trump.votes + \epsilon$

In R we can use:

mod1 <- lm(death.per.100k ~ trump.vote, data = guns)
summary(mod1)
Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept) -2.95296    2.57444  -1.147    0.257    
trump.vote   0.32572    0.05131   6.348 7.36e-08 ***
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 3.67 on 48 degrees of freedom
Multiple R-squared:  0.4564,	Adjusted R-squared:  0.4451 
F-statistic:  40.3 on 1 and 48 DF,  p-value: 7.357e-08
  • The estimated effect of 0.32572 means that an additional 1% vote for Trump is associated with an extra 0.33 gun deaths per 100,000 population.
  • To try to put this in perspective, in California this would be an extra 129 gun deaths in 2016.
  • The p-value on trump.vote is small (0.0000000736). We reject the null that percentage of Republicans have no association with gun deaths. According to this model, trump.vote is statistically significant.
  • The $R^2$ is 46%.

The above estimated effect of trump.vote on gun deaths is likely biased. This is a situation of Omitted Variable Bias (OVB). What do you think the most important determinant of gun deaths is?

A model that uses the percentage of households that have guns

We’ll estimate a model that includes more variables (gun.owner and back.check). The new population model is:

$death.per.100k = \beta_0 + \beta_1trump.votes + \beta_2gun.owner + \beta_3back.check + \epsilon$

In R we can use:

mod2 <- lm(death.per.100k ~ trump.vote + gun.owner + back.check, data = guns)
summary(mod2)
Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)  1.74293    2.87344   0.607 0.547121    
trump.vote   0.08054    0.07435   1.083 0.284361    
gun.owner    0.19353    0.05075   3.814 0.000406 ***
back.check  -1.26643    1.23944  -1.022 0.312231    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 3.198 on 46 degrees of freedom
Multiple R-squared:  0.6044,	Adjusted R-squared:  0.5786 
F-statistic: 23.42 on 3 and 46 DF,  p-value: 2.37e-09

trump.vote is no longer statistically significant!

  • We fail to reject the null that percentage of Replubicans have no effect on gun deaths.
  • However, the percentage of gun owners has a statistically significant effect: we reject the null that gun.owner has no effect on deaths.
  • It is estimated that for every additional 1% of households that own guns, an extra 0.19 people died per 100k.
  • Backgorund checks have a negative, but statistically insignificant impact.

Summary

After including gun.ownership, trump.vote is no longer statistically significant. On einterpretation of the above estimated results is that the percentage of Republicans only effects gun deaths through the tendency of Republicans to have more guns. That is, after controlling for the amount of guns, it doesn’t matter whether a people are Republican or not. That is, two state with differing percentages of Republicans, but with the same percentage of guns, would have the same amount of gun deaths.