phd

New resource: Intro to econometrics in R

I've added a new resource to this site - all of my section materials from ARE 212, Max Auffhammer's first-year PhD econometrics course, which build off of notes written by Dan HammerPatrick Baylis, and Kenny Bell.

These section notes simultaneously provide a gentle introduction to econometrics and to R. I covered very basic coding, including matrix operations and functions; partitioned regression and goodness of fit; hypothesis testing; ggplot2; generalized least squares and maximum likelihood; large sample properties of OLS; non-standard standard errors (twice!); instrumental variables; power calculations; spatial data; and replication, with a bonus intro to Monte Carlo simulation. Quite a full semester! 

It's so nice when everything is well behaved! OLS is even consistent!

It's so nice when everything is well behaved! OLS is even consistent!

I hope these will be a helpful resource to others. A warning: I made many of the materials from scratch and/or expanded existing notes, so there are almost certainly errors. Please let me know if you find any. A more important warning: These notes are rife with bad jokes. Prepare yourselves.

Finally, I owe a debt of gratitude to all of the ARE-212-ers of 2016, who braved 8 AM section (not my choice) and have already dramatically improved these materials. Thanks for being a super fun class!

An end-of-semester gift from one of my students (did I mention I had a great class?) and excellent in-joke for those in the know.

An end-of-semester gift from one of my students (did I mention I had a great class?) and excellent in-joke for those in the know.

Out of the Darkness and Into the Light? Development Effects of Rural Electrification In India

At long last, Louis and I have a working paper version of our paper, "Out of the Darkness and Into the Light? Development Effects of Rural Electrification in India" (Appendix here - warning: it's a pretty big file). The paper has, as you might expect, a lot of gory details in it, and the appendix even more so, so I'll try to provide a less technical overview here. 

The basic motivation behind the paper is the following: there are still over a billion people around the world without access to electricity, most of whom are poor, rural, and live in South Asia or Sub-Saharan Africa. Developing country governments and NGOs are pouring billions of dollars into rural electrification programs with the explicit goal of bringing people out of poverty - one of the new UN Sustainable Development Goals is to "Ensure Access to Affordable, Reliable, Sustainable and Modern Energy for All."

In light of the big pushes towards universal energy access (pun only half intended), it's important that we understand what these investment dollars are going towards - even more so when we're talking about developing country dollars, which have opportunity costs like schools and health clinics and roads. It turns out, though, that we know surprisingly little about the actual effects of electrification on economic development. There is a strong positive correlation between GDP per capita and electricity consumption, with rich countries like the USA and Japan consuming much more energy per capita than poorer places like Nigeria and Bangladesh:

Data source: World Bank

Data source: World Bank

As we all know, though, correlation does not equal causation (thanks, XKCD!). Figuring out the causality here is definitely not trivial: at the country level, there are lots of things that drive income differences between Japan and Bangladesh that have nothing to do with electricity. It turns out that these omitted variable bias problems plague sub-national-level studies as well. If we just regressed incomes (or better yet, welfare!) on electricity infrastructure at the village level in a developing country (say, India), we'd likely end up with a number that was way too high. Why? Energy infrastructure projects are large and expensive, and (correctly) not randomly placed: governments think long and hard about where to put the electricity grid. In practice, this usually means that places that are already doing well economically, or places that are expected to do well in the future, are the first to get access to electricity. In more technical terms, that regression I described above is subject to a ton of omitted variable bias. 

In our new paper, Louis and I try to shed light on the following question: What does electrification do to rural economies? To do this, we take advantage of a natural experiment built into India's national rural electrification program, RGGVY, which would eventually expand electricity access in over 400,000 villages across 27 states. In order to keep costs down, in the first phase of the program, only villages with neighborhoods of 300+ people were eligible for electrification. This means that we can compare villages with 299-person neighborhoods to those with 301-person neighborhoods to look at the effect of electrification. The idea is that, in the absence of RGGVY, these villages would be virtually indistinguishable - and indeed, in the paper, we show that prior to the program, villages just above and just below the 300-person cutoff look the same. Having a cutoff of this kind built into the program is really nice from an evaluation standpoint, because it allows us to mimic a randomized experiment couched in the natural (large-scale) rollout of RGGVY. 

Having identified this cutoff, we get to bring a bunch of cool data to bear on the problem. The first thing we need to do is to demonstrate that RGGVY actually led to increases in electricity access for eligible villages. We're concerned that a simple 1/0 electrification indicator doesn't actually capture power availability and use - it would mask variation in power quality or in the number of households that have access to the grid, for example. Instead, we turn to remote sensing. We got access to boundary shapefiles (outlines) of [almost] every village in India, and superimposed them on top of NOAA's DMSP-OLS nighttime lights dataset - this is a satellite-based measure of nighttime brightness. We combine these to datasets and "cookie-cutter" out the nighttime lights values for each village, which allows us to create statistics like the average or maximum brightness in each location.

Yum, village cookies!

Yum, village cookies!

We pair nighttime brightness data from 2011, 6 years after the announcement of the program, with population information from the 2001 Census (the official population of record for RGGVY), and look at the effects of eligibility for RGGVY on night lights. Another advantage of the cutoff-based ("regression discontinuity", for those in the know) analysis? It lends itself naturally to blogging, because you should be able to see the difference between ineligible villages and eligible villages very clearly in a figure. Et voila:

Effects of eligibility for RGGVY on nighttime brightness. Each dot contains ~1,800 villages.

Effects of eligibility for RGGVY on nighttime brightness. Each dot contains ~1,800 villages.

We find that RGGVY eligibility (aka crossing the 300-person threshold) led to a sharp increase in nighttime brightness, as visible from space - remember that we're looking at villages of around 300 people, so we're pretty impressed that we can detect this. It helps that we're studying India, where there are upwards of 30,000 villages in our main estimation sample. It's a little hard to interpret this effect directly, since it's in units of brightness points, but previous papers that have gone out and groundtruthed the relationship between nighttime lights and electrification suggest that this is consistent with about a 50-percent increase in the household electrification rate in our sample. We do a bunch of work in the paper and in the appendix to show that these changes are actually attributable to RGGVY - if you're curious, check it out!

 

Okay, great: it looks like RGGVY brought electricity to Indian villages - but what about the effects of electricity access on things we care about, like the workforce, asset ownership, housing stock characteristics, village-wide outcomes, and education? We gathered a bunch of village-level data from the Indian Census of 2011 and the District Information System for Education, and look, among other things, at the effects of electrification on outcomes like men and women working in agriculture vs. the more formal sector; ownership of assets like TVs and motorcycles; whether households are classified as "dilapidated" or have mud floors, and other housing stock outcomes; the presence of mobile phone coverage, agricultural credit societies, and other village-wide services; and the number of children enrolled in primary and upper-primary school. Here's a snapshot of (some of) our results:

For all of the outcomes I've shown here, we see very little evidence of sharp discontinuities at the 300-person threshold. We do see clear evidence of men shifting from agricultural to non-agricultural employment (see p. 20 in the paper), but little else. The graphical evidence is borne out by the regression estimates as well: in almost no cases can we reject the null hypothesis of zero effect, but more importantly, we can reject even modest effects in nearly all outcomes. (The exception to this is education: we again see no visual evidence of effects on education, but our sample size is much smaller for these outcomes, since not all villages actually contain schools, so we have less precision with which to rule out effects; the schools effects are also sensitive to specification choices, bandwidths, etc in a way that the other outcomes are not.) To reiterate: in the medium term, we find that eligibility for RGGVY caused a substantial increase in electricity use, but can reject small effects on labor markets, asset ownership, housing characteristics, and village-wide outcomes; we do not find robust evidence that RGGVY led to changes in education. We were pretty surprised by these results, so we threw a bunch of checks at them (see the Appendix) - but they seem to hold up. (Turns out that our results are also consistent with new evidence from our Berkeley colleagues Ken Lee, Ted Miguel, and Catherine Wolfram in Kenya - see the abstract here).

A couple of these tests in particular are worth highlighting:

You might be concerned that we're not finding much because the program wasn't implemented well, or because we're lumping a bunch of villages where electrification did a lot in with villages where electrification didn't do anything, so things are averaging out to zero. When we cherry-pick the states that saw the largest increase in nighttime brightness as a result of the program, however, we don't see evidence of this. Among this selected sample, the nighttime lights effect approximately doubles - but the effects on the other outcomes stay the same. 

You might also be concerned that villages with around 300 people are unlikely to see big effects - they might be too poor, too credit-constrained, etc, etc, etc. A couple of responses to this: first, if we care about electrification from a poverty-reduction standpoint, then we should be worried about people being too poor to take advantage of electrification. But that's more speculative than data-driven, so we do a more formal test to think about effects of electrification for the rest of the villages in India. Rather than relying on our nice cutoff, we instead do a difference-in-differences (DD) analysis: we compare villages electrified in the first wave of the program (like a "treatment" group), before and after electrification, to villages electrified in the second wave of the program (like a "control" group). When we do this, and calculate different effects by population groups, here's what we find:

ddvsrd

There are a couple important takeaways from this figure: first, our cutoff-based estimate (navy dots) line up remarkably well with the DD point estimates for the appropriately sized villages; and second, the brightness effect is increasing in population, but the other outcomes (proportion of men working in agriculture shown here) are not, suggesting that our original estimates might actually generalize to the rest of the population. 

So what does it all mean? We present well-identified quasi-experimental evidence from the world's largest unelectrified population. Taken together, our results suggest that rural electrification may not be as beneficial as previously thought. Does our paper say that we shouldn't be implementing these kinds of electrification programs, or that electricity isn't making people better off? No - we explicitly don't make any statements about overall welfare in the paper, because we don't have the data to support these types of claims. In fact, we visited some villages in Karnataka in December, which made it pretty clear that people like having access to power (so do I!). But we can say that it's difficult to find evidence in the data that electrification is dramatically transforming rural India after 5 years or so. We think this is a case where highlighting a null result is really important - take that, file drawer! At the end of the day, in the medium term, rural electrification just doesn't appear to be a silver bullet for development. In typical Ivory Tower fashion, we think more research is needed to understand where and when power can transform economies - maybe we should be targeting electricity infrastructure upgrades to urban areas, for instance. We've got a couple projects in the works to look at some of these questions - so check back with us in a few years, and hopefully we'll have some more answers!