<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Content overview | ECON3500: Econometrics and Applications</title><link>https://econ3500s26.netlify.app/content/</link><atom:link href="https://econ3500s26.netlify.app/content/index.xml" rel="self" type="application/rss+xml"/><description>Content overview</description><generator>Source Themes Academic (https://sourcethemes.com/academic/)</generator><language>en-us</language><lastBuildDate>Thu, 01 Jan 2026 00:00:00 +0000</lastBuildDate><image><url>https://econ3500s26.netlify.app/media/social-image.png</url><title>Content overview</title><link>https://econ3500s26.netlify.app/content/</link></image><item><title>Exam 3 - Ch 8-10, 12</title><link>https://econ3500s26.netlify.app/content/13-exam3/</link><pubDate>Tue, 14 Apr 2026 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/13-exam3/</guid><description>&lt;p>&lt;strong>Quick links:&lt;/strong>
&lt;a href="#what-should-i-know">What to know&lt;/a> ·
&lt;a href="#coverage-guide">Coverage guide&lt;/a> ·
&lt;a href="#what-is-the-structure-of-the-exam">Exam structure&lt;/a> ·
&lt;a href="#how-can-i-prepare">How to prepare&lt;/a> ·
&lt;a href="#inclass-practice">In-class practice&lt;/a> ·
&lt;a href="#slides">Slides&lt;/a> ·
&lt;a href="#practice-exam">Practice exam&lt;/a>&lt;/p>
&lt;h2 id="coverage-guide">Coverage guide&lt;/h2>
&lt;p>The
&lt;a href="https://econ3500s26.netlify.app/slides/Exam3-Chapters8_9_10_12Coverage.pdf">Exam 3 Coverage Guide&lt;/a> summarizes everything you need to know for the exam — calculations, interpretations, and key concepts by chapter.&lt;/p>
&lt;h2 id="what-should-i-know">What should I know?&lt;/h2>
&lt;p>See the specific chapter guides from
&lt;a href="../08-content">Chapter 8&lt;/a>,
&lt;a href="../09-content">Chapter 9&lt;/a>,
&lt;a href="../10-content">Chapter 10&lt;/a>, and
&lt;a href="../12-content">Chapter 12&lt;/a>. You should understand all terms and definitions (be able to define and also apply them). You should also be comfortable interpreting regression output either written out in equation form or as Stata output. The level of PS4/PS5 problems is about right in terms of difficulty.&lt;/p>
&lt;p>There will not be a Stata component to the exam.&lt;/p>
&lt;h3 id="i-need-more-practice">I need more practice!&lt;/h3>
&lt;p>I recommend the following resources:&lt;/p>
&lt;ul>
&lt;li>Odd-numbered problems from Stock and Watson - solutions are available
&lt;a href="../../bonus/general-resources">here&lt;/a>&lt;/li>
&lt;li>
&lt;a href="#inclass-practice">In-class practice&lt;/a>&lt;/li>
&lt;li>
&lt;a href="#practice-exam">Practice exam&lt;/a>&lt;/li>
&lt;/ul>
&lt;h2 id="what-is-the-structure-of-the-exam">What is the structure of the exam?&lt;/h2>
&lt;p>The exam will be a lot like a math exam - solving some problems and showing your work. I would expect about 3 multi-part questions. There may be some questions that involve definitions and/or true-false.&lt;/p>
&lt;p>The exam is closed book with a calculator allowed. You are allowed to bring your own one-page formula sheet.&lt;/p>
&lt;p>What is allowed?&lt;/p>
&lt;ul>
&lt;li>Pen or pencil&lt;/li>
&lt;li>One page formula sheet (double sided)&lt;/li>
&lt;li>Calculator (basic, scientific, graphing)&lt;/li>
&lt;li>Drink and quiet snack&lt;/li>
&lt;/ul>
&lt;p>What is not allowed?&lt;/p>
&lt;ul>
&lt;li>&lt;del>Phone&lt;/del>&lt;/li>
&lt;li>&lt;del>Other notes&lt;/del>&lt;/li>
&lt;li>&lt;del>Textbook&lt;/del>&lt;/li>
&lt;li>&lt;del>Tablet/Laptop&lt;/del>&lt;/li>
&lt;li>&lt;del>Earbuds/headphones&lt;/del>&lt;/li>
&lt;/ul>
&lt;h2 id="how-can-i-prepare">How can I prepare?&lt;/h2>
&lt;p>A few ideas, depending on where you&amp;rsquo;re feeling least secure:&lt;/p>
&lt;ul>
&lt;li>Work through problem sets and prevous in-class exercises (posted in their respective
&lt;a href="../../content">content&lt;/a> section). Do them without looking at your past work!&lt;/li>
&lt;li>Make one-page chapter guides. These are handy resources for the exam and a good way to make sure you&amp;rsquo;ve thought about what&amp;rsquo;s important&lt;/li>
&lt;li>Make yourself a formula sheet. Useful to have, and also the act of creating it will help you&lt;/li>
&lt;li>Watch videos/review slides when you have gaps.&lt;/li>
&lt;/ul>
&lt;h2 id="inclass-practice" class="unnumbered">In-class practice&lt;/h2>
&lt;p>In-class practice activities:&lt;/p>
&lt;ul>
&lt;li>CH9 Activity:
&lt;a href="../practice/ch9-activity3-study-doctor-student.pdf">Activity&lt;/a> ·
&lt;a href="../practice/ch9-activity3-study-doctor-solutions.pdf">Solutions&lt;/a>&lt;/li>
&lt;li>CH10 Activity:
&lt;a href="../practice/ch10-activity2-fe-showdown.pdf">Activity&lt;/a> ·
&lt;a href="../practice/ch10-activity2-fe-showdown-answerkey.pdf">Solutions&lt;/a>&lt;/li>
&lt;li>CH12 Activity:
&lt;a href="../practice/ch12-activity2-2sls-in-action-student.pdf">Activity&lt;/a> ·
&lt;a href="../practice/ch12-activity2-2sls-in-action-solutions.pdf">Solutions&lt;/a>&lt;/li>
&lt;/ul>
&lt;h2 id="slides">Slides&lt;/h2>
&lt;div class="slides-container">
&lt;p>
&lt;a href="https://econ3500s26.netlify.app/slides/exam3-review-quarto/" target="_blank" class="btn btn-primary">📊 HTML&lt;/a>
&lt;a href="https://econ3500s26.netlify.app/slides/exam3-review.pdf" class="btn btn-outline-primary">📥 Download PDF&lt;/a>
&lt;/p>
&lt;figure class="slide-thumb">
&lt;a href="https://econ3500s26.netlify.app/slides/exam3-review-quarto/" target="_blank">
&lt;img src="https://econ3500s26.netlify.app/slides/exam3-review.png" alt="First slide" />
&lt;/a>
&lt;/figure>
&lt;/div>
&lt;!-- ## Fall 2020 in-class review -->
&lt;!-- **Start around 22-minute mark ** -->
&lt;!-- &lt;iframe src="https://streaming.uvm.edu/media/embed/ypKTzfH/" width="640" height="480" frameborder="0" allowfullscreen>&lt;/iframe> -->
&lt;h2 id="practice-exam" class="unnumbered">Practice exam&lt;/h2>
&lt;p>Use this for additional practice and to get a sense of the format and difficulty of questions. It covers the same topics as this exam (panel data/fixed effects, instrumental variables, difference-in-differences).&lt;/p>
&lt;ul>
&lt;li>
&lt;a href="../practice/exam3-practice.pdf">Practice Exam 1&lt;/a>&lt;/li>
&lt;li>
&lt;a href="../practice/exam3-practice-solutions.pdf">Practice Exam 1 Solutions&lt;/a>&lt;/li>
&lt;li>
&lt;a href="../practice/exam3-practice2.pdf">Practice Exam 2&lt;/a>&lt;/li>
&lt;li>
&lt;a href="../practice/exam3-practice2-solutions.pdf">Practice Exam 2 Solutions&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>(&lt;em>I strongly recommend completing the practice exams before reviewing the solutions&lt;/em>)&lt;/p></description></item><item><title>Week 13 - Instrumental Variables</title><link>https://econ3500s26.netlify.app/content/12-content/</link><pubDate>Sun, 12 Apr 2026 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/12-content/</guid><description>&lt;h2 id="overview">Overview&lt;/h2>
&lt;p>Let&amp;rsquo;s wrap up with a final hurrah - one more empirical tool to help us identify &lt;em>causal impacts&lt;/em>.&lt;/p>
&lt;h2 id="reading-guide">Reading Guide&lt;/h2>
&lt;h4 id="chapter-12-instrumental-variables-regression">&lt;strong>Chapter 12: Instrumental Variables Regression&lt;/strong>&lt;/h4>
&lt;h4 id="sw-121-the-iv-estimator-with-a-single-regressor-and-a-single-instrument">&lt;strong>SW 12.1&lt;/strong> The IV Estimator with a Single Regressor and a Single Instrument&lt;/h4>
&lt;p>Make sure you understand the difference between &lt;strong>endogeneous variables&lt;/strong> (correlated with error term $u$) and &lt;strong>exogenous variables&lt;/strong> (not correlated). The textbook covers two conditions for a valid instrument: &lt;strong>relevance&lt;/strong> ($corr(Z_i,X_i) \neq 0$) and &lt;strong>exogeneity&lt;/strong> ($corr(Z_i,u_i) = 0$). To make the assumptions of IV explicit, I prefer to include an &lt;strong>exclusion&lt;/strong> assumption: that the instrument $Z$ affects $Y$ only through $X$.
If you come away from this section with the right vocabulary and a good intuition of what an instrument &amp;ldquo;does,&amp;rdquo; then you&amp;rsquo;re in good shape!&lt;/p>
&lt;p>I find the derivation (in the text and video) helpful, but it&amp;rsquo;s not something I&amp;rsquo;d test you on.&lt;/p>
&lt;blockquote class="twitter-tweet">
&lt;a href="https://twitter.com/lukestein/status/1298665911101136901">&lt;/a>
&lt;/blockquote>&lt;script async src="https://platform.twitter.com/widgets.js" charset="utf-8">&lt;/script>
&lt;h4 id="sw-122-the-general-iv-regression-model">&lt;strong>SW 12.2&lt;/strong> The General IV Regression Model&lt;/h4>
&lt;p>Now we get a bit technical. The key important things here are the understanding of two-stage least squares - what it is and how it works, plus a good understanding of those assumptions.&lt;/p>
&lt;h4 id="sw-123-checking-instrument-validity">&lt;strong>SW 12.3&lt;/strong> Checking Instrument Validity&lt;/h4>
&lt;p>Make sure you know what a &lt;strong>weak instrument&lt;/strong> is, why it&amp;rsquo;s a problem, and how we identify it. Make sure you understand the logic behind whether an instrument violates it&amp;rsquo;s exogeneity assumption.&lt;/p>
&lt;blockquote>
&lt;p>nobody will remember:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>your salary&lt;/p>
&lt;/li>
&lt;li>
&lt;p>how “busy you were”&lt;/p>
&lt;/li>
&lt;li>
&lt;p>how many hours you worked&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>people will remember:&lt;/p>
&lt;ul>
&lt;li>when your instrumental variable violated the exclusion restriction&lt;/li>
&lt;/ul>
&lt;p>— @dggoldst.bsky.social&lt;br>
&lt;a href="https://bsky.app/profile/dggoldst.bsky.social/post/3kzxkv76lds2v" target="_blank" rel="noopener">https://bsky.app/profile/dggoldst.bsky.social/post/3kzxkv76lds2v&lt;/a>&lt;/p>
&lt;/blockquote>
&lt;!-- ```{r echo=FALSE} -->
&lt;!-- blogdown::shortcode('tweet', '1466634426067075077') -->
&lt;!-- ``` -->
&lt;!-- ```{r echo=FALSE} -->
&lt;!-- blogdown::shortcode('tweet', '1428455696538882048') -->
&lt;!-- ``` -->
&lt;p>&lt;strong>Skip the test of overidentifying restrictions&lt;/strong>&lt;/p>
&lt;h4 id="sw-124-126-the-other-stuff">&lt;strong>SW 12.4-12.6&lt;/strong> The Other Stuff&lt;/h4>
&lt;p>Good content and examples, but nothing new.&lt;/p>
&lt;h2 id="other-resources">Other resources&lt;/h2>
&lt;p>Getting your head around IV can be tricky! Here are a few resources if you want to get another take:&lt;/p>
&lt;ul>
&lt;li>
&lt;a href="http://www.rebeccabarter.com/blog/2018-05-23-instrumental_variables/" target="_blank" rel="noopener">Rebecca Barter: Understanding nstrumental variables&lt;/a>: Helpful diagrams!&lt;/li>
&lt;li>
&lt;a href="http://pierrelouisvezina.weebly.com/uploads/2/3/4/2/2342194/ivs.pdf" target="_blank" rel="noopener">Pierre-Louis Vézina:Instrumental Variables - an intuitive guide&lt;/a>: Overview plus a few more examples of IVs in the literature&lt;/li>
&lt;li>Nick Huntington-Klein has the best econometrics resources 🔥 🔥.
&lt;ul>
&lt;li>Check out his lecture slides on
&lt;a href="https://nickch-k.github.io/EconometricsSlides/Week_08/Week_08_Instrumental_Variables.html#1" target="_blank" rel="noopener">instrumental variables&lt;/a>, an accompanying video lecture
&lt;a href="https://www.youtube.com/watch?v=6mFNeWHPMYs" target="_blank" rel="noopener">here&lt;/a>.&lt;/li>
&lt;li>Also, some
&lt;a href="https://nickchk.com/causalgraphs.html#iv" target="_blank" rel="noopener">fancy animations&lt;/a> to show what IV actually does w/ data&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;h2 id="slides">Slides&lt;/h2>
&lt;div class="slides-container">
&lt;p>
&lt;a href="https://econ3500s26.netlify.app/slides/ch12-quarto/" target="_blank" class="btn btn-primary">📊 HTML&lt;/a>
&lt;a href="https://econ3500s26.netlify.app/slides/ch12-slides.pdf" class="btn btn-outline-primary">📥 Download PDF&lt;/a>
&lt;/p>
&lt;figure class="slide-thumb">
&lt;a href="https://econ3500s26.netlify.app/slides/ch12-quarto/" target="_blank">
&lt;img src="https://econ3500s26.netlify.app/slides/ch12-slides.png" alt="First slide" />
&lt;/a>
&lt;/figure>
&lt;/div>
&lt;!-- ## [Video: Introduction to IV](https://youtu.be/MeHHe303JuI ) -->
&lt;!--
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/MeHHe303JuI" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
--></description></item><item><title>Week 14 - Research Week</title><link>https://econ3500s26.netlify.app/content/11-content/</link><pubDate>Sun, 12 Apr 2026 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/11-content/</guid><description>&lt;h1 id="writing-papers">Writing Papers&lt;/h1>
&lt;h2 id="slides">Slides&lt;/h2>
&lt;div class="slides-container">
&lt;p>&lt;a href="https://econ3500s26.netlify.app/slides/rweek-slides.pdf" class="btn btn-primary">📥 Download this week's slides&lt;/a>&lt;/p>
&lt;figure class="slide-thumb">
&lt;a href="https://econ3500s26.netlify.app/slides/rweek-slides.pdf" target="_blank">
&lt;img src="https://econ3500s26.netlify.app/slides/rweek-slides.png" alt="First slide" />
&lt;/a>
&lt;/figure>
&lt;/div>
&lt;h2 id="presenting-results">Presenting results&lt;/h2>
&lt;h3 id="empirical-specification-template">Empirical specification template&lt;/h3>
&lt;p>&lt;em>A helpful, five-step template!&lt;/em>&lt;sup id="fnref:1">&lt;a href="#fn:1" class="footnote-ref" role="doc-noteref">1&lt;/a>&lt;/sup>&lt;/p>
&lt;p>(1) First, I state what in broad terms type of model I am using (probably OLS!) and what I am going to do with it. If I&amp;rsquo;m using a fancy speciifcation like a difference-in-differences or panel data, now is a good time to mention this.&lt;/p>
&lt;blockquote>
&lt;p>I use ordinary least squares to estimate the relationship between ice cream flavor consumed and risk aversion, using state-level fixed effects to control for time-invariant determinants of risk aversion that may vary by state.&lt;/p>
&lt;/blockquote>
&lt;p>(2) Then, I&amp;rsquo;m going to write the population model I am estimating in equation form. I&amp;rsquo;m going to use appropriate subscripts:&lt;/p>
&lt;p>$outcome_{is} = \beta_0 + \beta_1 VariableName_{is} + \beta_2 AnotherVariableName_{2,is} + f_s + u_{is}$&lt;/p>
&lt;ul>
&lt;li>Use descriptive names (these don&amp;rsquo;t actually have to be your variable names)&lt;/li>
&lt;li>If you use fixed effects, you can notate these without having to expand them:
&lt;ul>
&lt;li>adding state fixed effects could be written as $ &amp;hellip; + f_s + &amp;hellip;$, for example&lt;/li>
&lt;li>or, you could add them this way: $\sum_{s=1}^{50}D_s$ for a set of 50 state-level fixed effects (with one omitted)&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>List covariates you include. If you have a ton of these, you could include a vector of individual-specific covariates ($X'\gamma$, for example) and then list them in paragraph form. However, this isn&amp;rsquo;t likely for most papers.&lt;/li>
&lt;/ul>
&lt;p>(3) Then, I&amp;rsquo;m going to define my independent and dependent variables in a list or paragraph form. I&amp;rsquo;m going to also describe what the subscripts are&lt;/p>
&lt;blockquote>
&lt;p>Where $outcome_{is}$ is measured risk aversion for individual $i$ living in state $s$, following the scale described above. $VariableName_{is}$ is a measure of flavor intensity, standardized around Edy&amp;rsquo;s French Vanilla &amp;hellip;&lt;/p>
&lt;/blockquote>
&lt;p>I&amp;rsquo;ll want to include any controls and fixed effects.&lt;/p>
&lt;p>(4) I&amp;rsquo;ll mention any special things I do when coding (missing value flags, etc), and what types of standard errors I&amp;rsquo;m using&lt;/p>
&lt;p>(5) Now, I write how I will interpret my coefficient of interest (causal, correlational, etc.) - what will the coefficients on my key indicator variables tell me? Are there any key identifying assumptions at play?&lt;/p>
&lt;p>If you are using multiple specifications, then you have two options:&lt;/p>
&lt;ul>
&lt;li>If it&amp;rsquo;s a matter of adding additional controls, then mention that you will also add them in a second model in part (3).&lt;/li>
&lt;li>If it&amp;rsquo;s a fundamentally different population model, then include a second model, and define any terms that were not previously defined. Make sure to discuss how the interpretation of results would differ.&lt;/li>
&lt;/ul>
&lt;h3 id="working-with-outreg2">Working with &lt;code>outreg2&lt;/code>&lt;/h3>
&lt;p>Download sample
&lt;a href="../practice/EC200-12-02-example1.do">code&lt;/a>, which uses
&lt;a href="../practice/graduation.dta">&lt;code>graduation.dta&lt;/code>&lt;/a>&lt;/p>
&lt;h2 id="introductions-and-conclusions">Introductions and conclusions&lt;/h2>
&lt;h3 id="the-introduction-formula">The introduction formula&lt;/h3>
&lt;p>Listen to Keith Head and follow
&lt;a href="http://blogs.ubc.ca/khead/research/research-advice/formula" target="_blank" rel="noopener">&amp;ldquo;The Introduction Formula&amp;rdquo;&lt;/a>&lt;/p>
&lt;h3 id="links">Links&lt;/h3>
&lt;h4 id="introductions">Introductions&lt;/h4>
&lt;ul>
&lt;li>
&lt;a href="http://blogs.ubc.ca/khead/research/research-advice/formula" target="_blank" rel="noopener">The Introduction Formula&lt;/a>, Keith Head&lt;/li>
&lt;li>
&lt;a href="https://static1.squarespace.com/static/5e6033a4ea02d801f37e15bb/t/5f62a398731cc06411b39f48/1600299928487/phd_paper_writing.pdf" target="_blank" rel="noopener">Writing Tips for Ph.D. Students&lt;/a>, John Cochrane&lt;/li>
&lt;li>
&lt;a href="https://macromomblog.com/2019/09/29/we-need-to-talk-more/" target="_blank" rel="noopener">We need to talk MORE&lt;/a> Claudia Sahm (general paper writing guidance), with special attention to introductions&lt;/li>
&lt;li>
&lt;a href="https://www.cgdev.org/blog/how-write-introduction-your-development-economics-paper" target="_blank" rel="noopener">How to write the introduction of your development economics paper&lt;/a>, Dave Evans surveys the structure of published papers&lt;/li>
&lt;/ul>
&lt;h4 id="conclusions">Conclusions&lt;/h4>
&lt;ul>
&lt;li>
&lt;a href="http://marcfbellemare.com/wordpress/12060" target="_blank" rel="noopener">The Conclusion Formula&lt;/a>, Marc Bellemare&lt;/li>
&lt;/ul>
&lt;h2 id="general-guidance-for-paper-writing">General guidance for paper writing&lt;/h2>
&lt;ul>
&lt;li>
&lt;a href="https://www.brown.edu/Research/Shapiro/pdfs/foursteps.pdf" target="_blank" rel="noopener">Four steps to an applied micro paper&lt;/a>, Jesse Shapiro&lt;/li>
&lt;li>
&lt;a href="http://marcfbellemare.com/wordpress/wp-content/uploads/2020/09/BellemareHowToPaperSeptember2020.pdf" target="_blank" rel="noopener">How to write applied papers in economics&lt;/a>, Marc Bellemare&lt;/li>
&lt;li>
&lt;a href="http://www.deirdremccloskey.com/docs/pdf/Article_86.pdf" target="_blank" rel="noopener">&lt;em>Economical Writing&lt;/em>&lt;/a>, Deidre McCloskey&lt;/li>
&lt;li>
&lt;a href="https://static1.squarespace.com/static/55c143d9e4b0cb07521c6d17/t/5910bd1246c3c4fdbdf9976a/1494269202424/Kremer&amp;#43;on&amp;#43;writing&amp;#43;papers.pdf" target="_blank" rel="noopener">Writing papers: a checklist&lt;/a>, Michael Kremer&lt;/li>
&lt;/ul>
&lt;section class="footnotes" role="doc-endnotes">
&lt;hr>
&lt;ol>
&lt;li id="fn:1" role="doc-endnote">
&lt;p>Use the template, but please do not plagiarize the template.&amp;#160;&lt;a href="#fnref:1" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;/ol>
&lt;/section></description></item><item><title>Week 11/12 - Panel data</title><link>https://econ3500s26.netlify.app/content/10-content/</link><pubDate>Sun, 22 Mar 2026 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/10-content/</guid><description>&lt;h2 id="overview">Overview&lt;/h2>
&lt;p>We&amp;rsquo;ve complained a lot about challenges to obtaining causal estimation. Now, let&amp;rsquo;s &lt;em>do&lt;/em> something about it! :strong:&lt;/p>
&lt;p>&lt;strong>Tuesday&lt;/strong> covers panel data, first differencing, and fixed effects — the tools that let us eliminate time-invariant omitted variable bias.&lt;/p>
&lt;p>&lt;strong>Thursday&lt;/strong> covers difference-in-differences (DiD) — a research design that uses these tools to estimate the causal effects of policies and events. DiD isn&amp;rsquo;t actually in Chapter 10&lt;sup id="fnref:1">&lt;a href="#fn:1" class="footnote-ref" role="doc-noteref">1&lt;/a>&lt;/sup>, but it fits here nicely.&lt;/p>
&lt;h2 id="reading-guide">Reading Guide&lt;/h2>
&lt;h4 id="chapter-10-regression-with-panel-data">&lt;strong>Chapter 10: Regression with Panel Data&lt;/strong>&lt;/h4>
&lt;h4 id="sw-101-panel-data">&lt;strong>SW 10.1&lt;/strong> Panel data&lt;/h4>
&lt;p>Make sure you know the difference between &lt;strong>cross-sectional&lt;/strong>, &lt;strong>time-series&lt;/strong>, and &lt;strong>panel&lt;/strong> data. Also, what does it mean to be &lt;strong>balanced&lt;/strong>?&lt;/p>
&lt;h4 id="sw-102-panel-data-with-two-time-periods-before-and-after-comparisons">&lt;strong>SW 10.2&lt;/strong> Panel data with two time periods: &amp;ldquo;Before and after&amp;rdquo; comparisons&lt;/h4>
&lt;p>We can control for variables that are &lt;em>constant&lt;/em> over time by differencing them out. Do you understand how this works?&lt;/p>
&lt;h4 id="sw-103-fixed-effects-regression">&lt;strong>SW 10.3&lt;/strong> Fixed effects regression&lt;/h4>
&lt;p>Instead of differencing, let&amp;rsquo;s include a bunch of entity-specific intercepts. What are we doing, why does it work? When is it like the before-after comparisons from before?&lt;/p>
&lt;h4 id="sw-104-regression-with-time-fixed-effects">&lt;strong>SW 10.4&lt;/strong> Regression with time fixed effects&lt;/h4>
&lt;p>Just like we can control for unit-specific factors that remain constant over time, we can also control for factors that vary over time but are constant across units with &lt;strong>time fixed effects&lt;/strong>. Neat!&lt;/p>
&lt;p>Can we include both entity- &lt;em>and&lt;/em> time-specific fixed effetcst? You bet we can!&lt;/p>
&lt;h4 id="sw-105-the-fixed-effects-regression-assumptions-and-standard-errors-for-fixed-effects-regressions">&lt;strong>SW 10.5&lt;/strong> The fixed effects regression assumptions and standard errors for fixed effects regressions&lt;/h4>
&lt;p>How do our LS assumptions change when we move to panel data?&lt;/p>
&lt;h4 id="sw--difference-in-differences-estimation">&lt;strong>SW ??&lt;/strong> Difference-in-differences estimation&lt;/h4>
&lt;p>Set up and interpret difference-in-differences estimation. There&amp;rsquo;s a decent discussion in &lt;strong>SW13.4&lt;/strong>, but I think it&amp;rsquo;s more appropriate to cover it here, since it&amp;rsquo;s so closely linked to panel data.&lt;/p>
&lt;h2 id="slides">Slides&lt;/h2>
&lt;div class="slides-container">
&lt;h4>Tuesday 3/31: Panel Data, First Differencing &amp;amp; Fixed Effects (Rae)&lt;/h4>
&lt;p>&lt;a href="https://econ3500s26.netlify.app/slides/ch10_slides_s26.pdf" class="btn btn-primary">📥 Download this week's slides&lt;/a>&lt;/p>
&lt;figure class="slide-thumb">
&lt;a href="https://econ3500s26.netlify.app/slides/ch10_slides_s26.pdf" target="_blank">
&lt;img src="https://econ3500s26.netlify.app/slides/ch10_slides_s26.png" alt="First slide" />
&lt;/a>
&lt;/figure>
&lt;/div>
&lt;div class="slides-container">
&lt;h4>Thursday 4/4: Difference-in-Differences&lt;/h4>
&lt;p>
&lt;a href="https://econ3500s26.netlify.app/slides/ch10c-quarto/" target="_blank" class="btn btn-primary">📊 HTML&lt;/a>
&lt;a href="https://econ3500s26.netlify.app/slides/ch10c-slides.pdf" class="btn btn-outline-primary">📥 Download PDF&lt;/a>
&lt;/p>
&lt;figure class="slide-thumb">
&lt;a href="https://econ3500s26.netlify.app/slides/ch10c-quarto/" target="_blank">
&lt;img src="https://econ3500s26.netlify.app/slides/ch10c-slides.png" alt="First slide" />
&lt;/a>
&lt;/figure>
&lt;/div>
&lt;div class="slides-container">
&lt;h4>Tuesday 4/8: Fixed Effects Extensions &amp;amp; Inference&lt;/h4>
&lt;p>
&lt;a href="https://econ3500s26.netlify.app/slides/ch10d-quarto/" target="_blank" class="btn btn-primary">📊 HTML&lt;/a>
&lt;a href="https://econ3500s26.netlify.app/slides/ch10d-slides.pdf" class="btn btn-outline-primary">📥 Download PDF&lt;/a>
&lt;/p>
&lt;figure class="slide-thumb">
&lt;a href="https://econ3500s26.netlify.app/slides/ch10d-quarto/" target="_blank">
&lt;img src="https://econ3500s26.netlify.app/slides/ch10d-slides.png" alt="First slide" />
&lt;/a>
&lt;/figure>
&lt;/div>
&lt;h2 id="in-class-activity">In-Class Activity&lt;/h2>
&lt;p>From &lt;strong>Tuesday 4/7&lt;/strong>, here are the fixed effects handout and solution set:&lt;/p>
&lt;ul>
&lt;li>
&lt;a href="../practice/ch10-activity2-fe-showdown.pdf">Fixed Effects Showdown activity (PDF)&lt;/a>&lt;/li>
&lt;li>
&lt;a href="../practice/ch10-activity2-fe-showdown-answerkey.pdf">Fixed Effects Showdown answer key (PDF)&lt;/a> &lt;em>(try the activity first if you want the full practice effect)&lt;/em>&lt;/li>
&lt;/ul>
&lt;!-- ## Videos -->
&lt;!-- ### [Video: Data types and difference-in-differences](https://youtu.be/hLEmPwd1R4E ) -->
&lt;!--
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/hLEmPwd1R4E" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
-->
&lt;!-- ### [Class Video](https://streaming.uvm.edu/media/embed/iM9eQzi/) -->
&lt;!-- *You need to use your netid to log in - let me know if you have issues* -->
&lt;!-- &lt;iframe src="https://streaming.uvm.edu/media/embed/iM9eQzi/" width="640" height="480" frameborder="0" allowfullscreen>&lt;/iframe> -->
&lt;section class="footnotes" role="doc-endnotes">
&lt;hr>
&lt;ol>
&lt;li id="fn:1" role="doc-endnote">
&lt;p>Check out SW13.4 if you want to be precise.&amp;#160;&lt;a href="#fnref:1" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;/ol>
&lt;/section></description></item><item><title>Week 10 - Causal Diagrams &amp; Assessing Studies</title><link>https://econ3500s26.netlify.app/content/09-content/</link><pubDate>Sun, 15 Mar 2026 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/09-content/</guid><description>&lt;h2 id="overview">Overview&lt;/h2>
&lt;p>This week has two connected parts. &lt;strong>Tuesday&lt;/strong>: we introduce &lt;strong>causal diagrams (DAGs)&lt;/strong> — a visual framework for thinking about causality, confounding, and what to control for. &lt;strong>Thursday&lt;/strong>: we apply that framework to &lt;strong>assessing regression validity&lt;/strong> (internal and external validity, omitted variable bias, measurement error, simultaneity).&lt;/p>
&lt;h2 id="reading-guide">Reading Guide&lt;/h2>
&lt;h3 id="tuesday-causal-diagrams-dags">Tuesday: Causal Diagrams (DAGs)&lt;/h3>
&lt;p>Read the following chapters from Huntington-Klein,
&lt;a href="https://theeffectbook.net" target="_blank" rel="noopener">&lt;em>The Effect&lt;/em>&lt;/a> (free online):&lt;/p>
&lt;h4 id="the-effect-chapter-6-causal-diagrams">&lt;strong>The Effect, Chapter 6:&lt;/strong> Causal Diagrams&lt;/h4>
&lt;p>What are DAGs? Nodes, arrows, and how to represent a data generating process visually.&lt;/p>
&lt;h4 id="the-effect-chapter-7-drawing-causal-diagrams">&lt;strong>The Effect, Chapter 7:&lt;/strong> Drawing Causal Diagrams&lt;/h4>
&lt;p>How to build a DAG for your own research question. Practical guidance on simplifying and avoiding common mistakes.&lt;/p>
&lt;h4 id="the-effect-chapter-8-causal-paths-and-closing-back-doors">&lt;strong>The Effect, Chapter 8:&lt;/strong> Causal Paths and Closing Back Doors&lt;/h4>
&lt;p>The key chapter: front door vs. back door paths, open vs. closed paths, confounders, colliders, and the backdoor criterion.&lt;/p>
&lt;h3 id="thursday-assessing-regression-validity-sw-chapter-9">Thursday: Assessing Regression Validity (SW Chapter 9)&lt;/h3>
&lt;h4 id="sw-91-internal-and-external-validity">&lt;strong>SW 9.1&lt;/strong> Internal and External Validity&lt;/h4>
&lt;p>Get those definitions, and threats to external validity&lt;/p>
&lt;h4 id="sw-92-threats-to-internal-validity-of-multiple-regression-analysis">&lt;strong>SW 9.2&lt;/strong> Threats to Internal Validity of Multiple Regression Analysis&lt;/h4>
&lt;p>Five big threats. And how to handle them.&lt;/p>
&lt;h4 id="bonus-sw-111-linear-probability-models">&lt;strong>Bonus:&lt;/strong> SW 11.1 Linear Probability Models&lt;/h4>
&lt;p>Optional background if you want a bit more on [linear probability models]{.kw}. This is not part of Chapter 9, but it connects directly to Thursday&amp;rsquo;s class and Lab 6.&lt;/p>
&lt;h4 id="sw-94-example-test-score-and-class-size">&lt;strong>SW 9.4&lt;/strong> Example: Test Score and Class Size&lt;/h4>
&lt;p>No new content, but this is a really good walk-through of the 9.1 and 9.2 content.&lt;/p>
&lt;h2 id="slides">Slides&lt;/h2>
&lt;div class="slides-container">
&lt;h4>Tuesday: Causal Diagrams (DAGs)&lt;/h4>
&lt;p>
&lt;a href="https://econ3500s26.netlify.app/slides/ch8b-quarto/" target="_blank" class="btn btn-primary">📊 HTML&lt;/a>
&lt;a href="https://econ3500s26.netlify.app/slides/ch8b-slides.pdf" class="btn btn-outline-primary">📥 Download PDF&lt;/a>
&lt;/p>
&lt;figure class="slide-thumb">
&lt;a href="https://econ3500s26.netlify.app/slides/ch8b-quarto/" target="_blank">
&lt;img src="https://econ3500s26.netlify.app/slides/ch8b-slides.png" alt="First slide" />
&lt;/a>
&lt;/figure>
&lt;/div>
&lt;div class="slides-container">
&lt;h4>Thursday: Assessing Studies (SW Ch. 9)&lt;/h4>
&lt;p>
&lt;a href="https://econ3500s26.netlify.app/slides/ch9-quarto/" target="_blank" class="btn btn-primary">📊 HTML&lt;/a>
&lt;a href="https://econ3500s26.netlify.app/slides/ch9-slides.pdf" class="btn btn-outline-primary">📥 Download PDF&lt;/a>
&lt;/p>
&lt;figure class="slide-thumb">
&lt;a href="https://econ3500s26.netlify.app/slides/ch9-quarto/" target="_blank">
&lt;img src="https://econ3500s26.netlify.app/slides/ch9-slides.png" alt="First slide" />
&lt;/a>
&lt;/figure>
&lt;/div>
&lt;!-- ## Videos -->
&lt;!-- ### [Video: Assessing Studies](https://youtu.be/zIx8kyPZSC0 ) -->
&lt;!--
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/zIx8kyPZSC0" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
-->
&lt;!-- https://youtu.be/ --></description></item><item><title>Week 9 - Nonlinear Regression</title><link>https://econ3500s26.netlify.app/content/08-content/</link><pubDate>Sun, 08 Mar 2026 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/08-content/</guid><description>&lt;h2 id="overview">Overview&lt;/h2>
&lt;p>Let&amp;rsquo;s make regressions faaaancy, with three new friends: &lt;strong>polynomials&lt;/strong>, &lt;strong>logarithms&lt;/strong>, and &lt;strong>interactions&lt;/strong>!&lt;/p>
&lt;h2 id="reading-guide">Reading Guide&lt;/h2>
&lt;h4 id="chapter-8-nonlinear-regression-functions">&lt;strong>Chapter 8: Nonlinear Regression Functions&lt;/strong>&lt;/h4>
&lt;h4 id="sw-81-a-general-strategy-for-modeling-nonlinear-regression-functons">&lt;strong>SW 8.1&lt;/strong> A General Strategy for Modeling Nonlinear Regression Functons&lt;/h4>
&lt;p>Key takeaway here - think about the economic theory and its application to determine the best approach to modeling!&lt;/p>
&lt;h4 id="sw-82-nonlinear-functions-of-a-single-independent-variable">&lt;strong>SW 8.2&lt;/strong> Nonlinear Functions of a Single Independent Variable&lt;/h4>
&lt;p>Polynomials and logs. Note that this section includes only 3 ways of setting up equations: level-level, log-log, log-level. We&amp;rsquo;ll also cover level-log, because, why not?&lt;/p>
&lt;h4 id="sw-83-interactions-between-independent-variables">&lt;strong>SW 8.3&lt;/strong> Interactions Between Independent Variables&lt;/h4>
&lt;p>Three types of interactions you should know:&lt;/p>
&lt;ol>
&lt;li>Binary-binary&lt;/li>
&lt;li>Continuous-binary&lt;/li>
&lt;li>Contnuous-continuous&lt;/li>
&lt;/ol>
&lt;h2 id="slides">Slides&lt;/h2>
&lt;div class="slides-container">
&lt;p>
&lt;a href="https://econ3500s26.netlify.app/slides/ch8-quarto/" target="_blank" class="btn btn-primary">📊 HTML&lt;/a>
&lt;a href="https://econ3500s26.netlify.app/slides/ch8-slides.pdf" class="btn btn-outline-primary">📥 Download PDF&lt;/a>
&lt;/p>
&lt;figure class="slide-thumb">
&lt;a href="https://econ3500s26.netlify.app/slides/ch8-quarto/" target="_blank">
&lt;img src="https://econ3500s26.netlify.app/slides/ch8-slides.png" alt="First slide" />
&lt;/a>
&lt;/figure>
&lt;/div>
&lt;!-- ## Videos -->
&lt;!-- This video talks us through polynomial and logarithmic regressions. We will do interactions (SW 8.3) entirely in class. Don't forget to complete [VQ8 on Blackboard!](https://blackboard.uvm.edu) -->
&lt;!-- ### [Video: Nonlinear Reression Functions](https://youtu.be/LAPKh9ib2Tg ) -->
&lt;!--
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/LAPKh9ib2Tg" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
--></description></item><item><title>Exam 2 - CH 4-7</title><link>https://econ3500s26.netlify.app/content/11-exam2/</link><pubDate>Fri, 20 Feb 2026 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/11-exam2/</guid><description>&lt;h2 id="what-should-i-know">What should I know?&lt;/h2>
&lt;p>See the specific chapter guides from
&lt;a href="../04-content">Week 4&lt;/a>,
&lt;a href="../05-content">Week 5&lt;/a>,
&lt;a href="../06-content">Week 6&lt;/a>, and
&lt;a href="../07-content">Week 7&lt;/a>. You should understand all terms and definitions (be able to define and also apply them). You should also be comfortable interpreting regression output either written out in equation form or as Stata output. The level of PS2/PS3 problems is about right in terms of difficulty.&lt;/p>
&lt;p>There will not be a Stata component to the exam.&lt;/p>
&lt;p>You can find a more detailed set of guidelines
&lt;a href="https://econ3500s26.netlify.app/slides/Exam2-Chapters4_7Coverage.pdf">here&lt;/a>.&lt;/p>
&lt;h3 id="i-need-more-practice">I need more practice!&lt;/h3>
&lt;p>I recommend the following resources:&lt;/p>
&lt;ul>
&lt;li>Odd-numbered problems from Stock and Watson - solutions are available
&lt;a href="../../bonus/general-resources">here&lt;/a>&lt;/li>
&lt;li>
&lt;a href="#practice-exam">Practice exam&lt;/a>&lt;/li>
&lt;li>
&lt;a href="#inclass-practice">In-class practice&lt;/a>&lt;/li>
&lt;/ul>
&lt;h2 id="what-is-the-structure-of-the-exam">What is the structure of the exam?&lt;/h2>
&lt;p>The exam will be a lot like a math exam - solving some problems and showing your work. I would expect about 3 multi-part questions. There may be some questions that involve definitions and/or true-false.&lt;/p>
&lt;p>The exam is closed book with a calculator allowed. You are allowed a formula sheet that you created!&lt;/p>
&lt;p>&lt;strong>I will provide the following F-distribution table (recreated from S&amp;amp;W):
&lt;a href="../PracticeExams/f_distribution.pdf">F-table&lt;/a>.&lt;/strong>&lt;/p>
&lt;p>You will not need a normal distribution table. However, you will want to know common critical values for t-tests.&lt;/p>
&lt;p>What is allowed?&lt;/p>
&lt;ul>
&lt;li>Pen or pencil&lt;/li>
&lt;li>One page formula sheet (double sided)&lt;/li>
&lt;li>Calculator (basic, scientific, graphing)&lt;/li>
&lt;li>Drink and quiet snack&lt;/li>
&lt;/ul>
&lt;p>What is not allowed?&lt;/p>
&lt;ul>
&lt;li>&lt;del>Phone&lt;/del>&lt;/li>
&lt;li>&lt;del>Other notes&lt;/del>&lt;/li>
&lt;li>&lt;del>Textbook&lt;/del>&lt;/li>
&lt;li>&lt;del>Tablet/Laptop&lt;/del>&lt;/li>
&lt;li>&lt;del>Earbuds/headphones&lt;/del>&lt;/li>
&lt;/ul>
&lt;h2 id="inclass-practice" class="unnumbered">In-class practice&lt;/h2>
&lt;p>We have several sets of in-class practice activities:&lt;/p>
&lt;ul>
&lt;li>
&lt;a href="../practice/W4-02-in-class-exercise.pdf">CH4 Practice (Week 4) - Single Linear Regression&lt;/a>&lt;/li>
&lt;li>
&lt;a href="../practice/W5-05-03-in-class_inference.pdf">CH5 Practice (Week 5) - Inference&lt;/a>&lt;/li>
&lt;li>
&lt;a href="../practice/W6-01-in-class-exercise.pdf">CH6 Practice (Week 6) - Multiple Linear Regression&lt;/a>&lt;/li>
&lt;li>
&lt;a href="../practice/W7-02-in-class-exercise.pdf">CH7 Practice (Week 7) - Inference with Multiple Linear Regression&lt;/a>&lt;/li>
&lt;/ul>
&lt;!--
## Helpful materials {#helpful-materials}
- [Formula chart](../Formula-sheet_f20.pdf): Is this all the knowledge you need to know? Of course not! But, this should cover most of the formulas you might need.
-->
&lt;h2 id="slides">Slides&lt;/h2>
&lt;div class="slides-container">
&lt;p>&lt;a href="https://econ3500s26.netlify.app/slides/Exam1_Review.pdf" class="btn btn-primary">📥 Download this week's slides&lt;/a>&lt;/p>
&lt;figure class="slide-thumb">
&lt;a href="https://econ3500s26.netlify.app/slides/Exam1_Review.pdf" target="_blank">
&lt;img src="https://econ3500s26.netlify.app/slides/Exam1_Review.png" alt="First slide" />
&lt;/a>
&lt;/figure>
&lt;/div>
&lt;h2 id="practice-exam" class="unnumbered">Past exam &lt;/h2>
&lt;p>
&lt;a href="../PracticeExams/Unit1Quiz_spring2018.pdf">Spring 2018 Practice Exam&lt;/a>&lt;/p>
&lt;p>
&lt;a href="../PracticeExams/Unit1Quiz_spring2018_soln.pdf">Spring 2018 Practice Exam Solutions&lt;/a>&lt;/p>
&lt;p>
&lt;a href="../PracticeExams/Unit1Quiz_spring2017_v1.pdf">Spring 2017 Practice Exam&lt;/a> - note that this exam asks you to interpret log coefficients, which &lt;em>we have not yet covered!&lt;/em>&lt;/p>
&lt;p>
&lt;a href="../PracticeExams/Unit1Quiz_spring2017_v1solution.pdf">Spring 2017 Practice Exam Solutions&lt;/a> (&lt;em>I strongly recommend doing the practice exam in its entirety before reviewing the solutions&lt;/em>)&lt;/p></description></item><item><title>Week 7 - Hypothesis Tests with Multiple Regressions</title><link>https://econ3500s26.netlify.app/content/07-content/</link><pubDate>Mon, 16 Feb 2026 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/07-content/</guid><description>&lt;h2 id="overview">Overview&lt;/h2>
&lt;p>And now, we&amp;rsquo;re back to hypothesis testing. The big thing we&amp;rsquo;re going to learn about is testing more complicated hypotheses, like whether two coefficients are equal, and whether a whole bunch of coefficients equal zero.&lt;/p>
&lt;h2 id="reading-guide">Reading Guide&lt;/h2>
&lt;h4 id="chapter-7-hypothesis-tests-and-confidence-intervals-with-multiple-regressions">&lt;strong>Chapter 7: Hypothesis Tests and Confidence Intervals with Multiple Regressions&lt;/strong>&lt;/h4>
&lt;h4 id="sw-71-hypothesis-tests-and-confidence-intervals-for-a-single-coefficient">&lt;strong>SW 7.1&lt;/strong> Hypothesis Tests and Confidence Intervals for a Single Coefficient&lt;/h4>
&lt;p>Here, we want to test hypothesis of this form: $H_0: \beta_j = \beta_{j,0}$ vs. $H_a: \beta_j \neq \beta_{j,0}$&lt;/p>
&lt;p>If you want another take on hypothesis testing with regression coefficients, go ahead and read this. I&amp;rsquo;m not going to cover this in class, because we&amp;rsquo;ve hit this in Chapter 5.&lt;/p>
&lt;h4 id="sw-72-tests-of-joint-hypotheses">&lt;strong>SW 7.2&lt;/strong> Tests of Joint Hypotheses&lt;/h4>
&lt;p>Here, we test hypothesis with lots of coefficients, of this form: $H_0: \beta_j = \beta_{j,0}, \beta_m = \beta_{m,0}, &amp;hellip;$, for $q$ restrictions, vs $H_a$: any one of those $q$ restrictions does not hold.&lt;/p>
&lt;h4 id="sw-73-testing-single-restrictions-involving-multiple-coefficients">&lt;strong>SW 7.3&lt;/strong> Testing Single Restrictions Involving Multiple Coefficients&lt;/h4>
&lt;p>Here, we test one restriction, but with multiple coefficients, like this: $H_0: \beta_j = \beta_m$ vs. $H_a: \beta_j \neq \beta_m$&lt;/p>
&lt;h4 id="thats-it">That&amp;rsquo;s it!&lt;/h4>
&lt;p>7.5 is a good review of things we&amp;rsquo;ve already discussed, and 7.6 walk us through an example. I encourage you to read them, but not mandatory. And we&amp;rsquo;re skipping 7.4&lt;/p>
&lt;h2 id="slides">Slides&lt;/h2>
&lt;div class="slides-container">
&lt;p>
&lt;a href="https://econ3500s26.netlify.app/slides/ch7-quarto/" target="_blank" class="btn btn-primary">📊 HTML&lt;/a>
&lt;a href="https://econ3500s26.netlify.app/slides/ch7-slides.pdf" class="btn btn-outline-primary">📥 Download PDF&lt;/a>
&lt;/p>
&lt;figure class="slide-thumb">
&lt;a href="https://econ3500s26.netlify.app/slides/ch7-quarto/" target="_blank">
&lt;img src="https://econ3500s26.netlify.app/slides/ch7-slides.png" alt="First slide" />
&lt;/a>
&lt;/figure>
&lt;/div>
&lt;!-- ## Videos -->
&lt;!-- ### [Video: Hypothesis Tests with Multiple Linear Regression](https://youtu.be/qjgyS9u2EWM ) -->
&lt;!--
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/qjgyS9u2EWM" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
-->
&lt;!-- *Consider this week's video as a supplement to our class practice - we will go through this content in detail during class!* --></description></item><item><title>Week 6 - Multiple Linear Regression</title><link>https://econ3500s26.netlify.app/content/06-content/</link><pubDate>Tue, 10 Feb 2026 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/06-content/</guid><description>&lt;h2 id="overview">Overview&lt;/h2>
&lt;p>Let&amp;rsquo;s model!&lt;/p>
&lt;p>Now, we can build powerful models with heaps of dependent variables. Want to predict wages? Let&amp;rsquo;s control for education, for experience, for gender, for age, for age &lt;em>squared&lt;/em> (yes!). YES. Only our degrees of freedom can hold us back.&lt;/p>
&lt;h2 id="reading-guide">Reading Guide&lt;/h2>
&lt;h4 id="chapter-6-linear-regression-with-multiple-regressors">&lt;strong>Chapter 6: Linear Regression with Multiple Regressors&lt;/strong>&lt;/h4>
&lt;h4 id="sw-61-omitted-variable-bias">&lt;strong>SW 6.1&lt;/strong> Omitted Variable Bias&lt;/h4>
&lt;p>A discussion that connects nicely with our previous discussion of the zero conditional mean discussion and causal inference.&lt;/p>
&lt;h4 id="sw-62-the-multiple-regression-model">&lt;strong>SW 6.2&lt;/strong> The Multiple Regression Model&lt;/h4>
&lt;p>Hooray!&lt;/p>
&lt;h4 id="sw-63-the-ols-estimator-in-multiple-regression">&lt;strong>SW 6.3&lt;/strong> The OLS Estimator in Multiple Regression&lt;/h4>
&lt;p>This section doesn&amp;rsquo;t get into derivation, and neither do we!&lt;/p>
&lt;h4 id="sw-64-measures-of-fit-in-multiple-regression">&lt;strong>SW 6.4&lt;/strong> Measures of Fit in Multiple Regression&lt;/h4>
&lt;p>The only &lt;em>new&lt;/em> thing here is a revised $SER$ forumla and the introduction of the Adjusted $R^2$. Note that the lecture video also discusses the root mean standard error, $RMSE$, which is a lot like the $SER$ except that it uses $n$ rather than degrees of freedom as a denominator.&lt;/p>
&lt;h4 id="sw-65-the-least-squares-assumptions-in-multiple-regression">&lt;strong>SW 6.5&lt;/strong> The Least Squares Assumptions in Multiple Regression&lt;/h4>
&lt;p>Take the three from univariate regression and add &amp;hellip; no multicollinearity. Sorted.&lt;/p>
&lt;h4 id="sw-66-distribution-of-the-ols-estimators-in-multiple-regression">&lt;strong>SW 6.6&lt;/strong> Distribution of the OLS Estimators in Multiple Regression&lt;/h4>
&lt;p>Just the intuition, don&amp;rsquo;t worry about the appendix.&lt;/p>
&lt;h4 id="sw-67-multicollinearity">&lt;strong>SW 6.7&lt;/strong> Multicollinearity&lt;/h4>
&lt;p>Make sure you understand the examples, but remember that in practice, any statistical package will fix perfect multicollinearity on its own. Imperfect multicollinearity, on the other hand, is something to think about when crafting your models.&lt;/p>
&lt;h4 id="sw-68conclusion">&lt;strong>SW 6.8&lt;/strong> Conclusion&lt;/h4>
&lt;p>Treat yourself.&lt;/p>
&lt;h2 id="slides">Slides&lt;/h2>
&lt;div class="slides-container">
&lt;p>
&lt;a href="https://econ3500s26.netlify.app/slides/ch6-quarto/" target="_blank" class="btn btn-primary">📊 HTML&lt;/a>
&lt;a href="https://econ3500s26.netlify.app/slides/ch6-slides.pdf" class="btn btn-outline-primary">📥 Download PDF&lt;/a>
&lt;/p>
&lt;figure class="slide-thumb">
&lt;a href="https://econ3500s26.netlify.app/slides/ch6-quarto/" target="_blank">
&lt;img src="https://econ3500s26.netlify.app/slides/ch6-slides.png" alt="First slide" />
&lt;/a>
&lt;/figure>
&lt;/div>
&lt;!-- ## Videos -->
&lt;!-- ### [Video: Multiple Linear Regression](https://youtu.be/XIrD0qD0TCs) -->
&lt;!--
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/XIrD0qD0TCs" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
-->
&lt;!-- ## In-class exercises -->
&lt;!-- Download PDF [here](../practice/W6-01-in-class-exercise.pdf), which contains regression output for questions (2) - (4) -->
&lt;!-- ### Exercises -->
&lt;!-- Consider a dataset on earnings in the United States. We are interested -->
&lt;!-- in the returns to education - how much an extra year of schooling "buys" -->
&lt;!-- you in terms of weekly wages (\...as of 1980). You're also worried about -->
&lt;!-- whether one's education suffers from omitted variable bias. -->
&lt;!-- 1. You estimate two equations: $$\begin{aligned} -->
&lt;!-- \widehat{wage} &amp;= 146.95 + 60.21educ\\ -->
&lt;!-- \widehat{educ} &amp; = 5.84 + 0.075IQ\end{aligned}$$ -->
&lt;!-- Based on these results, is 60.21 an overestimate or underestimate of -->
&lt;!-- the returns to education? How do you know? -->
&lt;!-- 2. You estimate another equation: -->
&lt;!-- $\widehat{wage} = -128.89 +42.06 educ + 5.14 IQ$ -->
&lt;!-- What is the interpretation of the coefficient on $educ$? What is the -->
&lt;!-- interpretation of the constant? -->
&lt;!-- 3. Now, you control for experience and age and estimate the following -->
&lt;!-- population regression model: -->
&lt;!-- $$wage_i = \beta_0 + \beta_1 educ_i + \beta_2 IQ_i + \beta_3 exper_i + \beta_4 age_i + \beta_5 age_i^2 + u_i$$ -->
&lt;!-- A one-year increase in age is associated with what change in wages? -->
&lt;!-- (mind the squared term) -->
&lt;!-- &lt;img src="../practice/w6-q1.png"> -->
&lt;!-- 4. Finally, because you are worried about omitted variable bias, you -->
&lt;!-- include father's and mother's education. -->
&lt;!-- &lt;img src="../practice/w6-q2.png"> -->
&lt;!-- 1. Why might parent's education might directly affect wages? -->
&lt;!-- 2. Which other independent variables do you think parent's -->
&lt;!-- education might affect? Explain. -->
&lt;!-- 3. How did controlling for parent's education affect the returns to -->
&lt;!-- education? The returns to IQ? -->
&lt;h2 id="other-resources">Other resources&lt;/h2>
&lt;blockquote class="twitter-tweet">&lt;p lang="en" dir="ltr">As requested, slower graphs! Also added a graph on collider bias, the webpage explanation helps there.&lt;br>&lt;br>These graphs are intended to show what standard causal inference methods actually *do* to data, and how they work.&lt;br>&lt;br>This is what controlling for a binary variable looks like: &lt;a href="https://t.co/dTZxqY5JxA">pic.twitter.com/dTZxqY5JxA&lt;/a>&lt;/p>&amp;mdash; Nick HK (@nickchk) &lt;a href="https://twitter.com/nickchk/status/1068215492458905600?ref_src=twsrc%5Etfw">November 29, 2018&lt;/a>&lt;/blockquote> &lt;script async src="https://platform.twitter.com/widgets.js" charset="utf-8">&lt;/script></description></item><item><title>Week 5 - Inference with One Regressor</title><link>https://econ3500s26.netlify.app/content/05-content/</link><pubDate>Sun, 01 Feb 2026 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/05-content/</guid><description>&lt;h2 id="overview">Overview&lt;/h2>
&lt;p>As we carry on into the wonderful world of statistical inference, expect flashbacks of our statistics review. That&amp;rsquo;s what the review was for! It&amp;rsquo;s all coming together 😌.&lt;/p>
&lt;h2 id="reading-guide">Reading Guide&lt;/h2>
&lt;p>&lt;strong>Chapter 5: Hypothesis Tests and Confidence Intervals&lt;/strong>&lt;/p>
&lt;p>&lt;strong>SW 5.1&lt;/strong> Testing Hypotheses About One of the Regression Coefficients&lt;/p>
&lt;p>This is very important stuff. Again, don’t worry about calculating the variance of a beta coefficient by hand. However, note how similar this is to our hypothesis testing in Chapter 3!&lt;/p>
&lt;p>&lt;strong>SW 5.2&lt;/strong> Confidence intervals for a regression coefficient&lt;/p>
&lt;p>&lt;strong>SW 5.3&lt;/strong> Regression when X is a binary variable&lt;/p>
&lt;p>We are going to use this ALL THE TIME.&lt;/p>
&lt;p>&lt;strong>SW 5.4&lt;/strong> Heteroskedasticity and homoskedasticity&lt;/p>
&lt;blockquote class="twitter-tweet">
&lt;a href="https://twitter.com/ChelseaParlett/status/1316528345224343552">&lt;/a>
&lt;/blockquote>&lt;script async src="https://platform.twitter.com/widgets.js" charset="utf-8">&lt;/script>
&lt;p>Drop &amp;ldquo;heteroskedasticity&amp;rdquo; into any conversation and you&amp;rsquo;re sure to delight. Just one more benefit of EC200.
The examples and implications of heteroskedasticity are important.&lt;/p>
&lt;p>And, now you&amp;rsquo;ll know to add &lt;code>, robust&lt;/code> to all your Stata regressions.&lt;/p>
&lt;!-- (*Quick, someone tell Nate Silver!*) -->
&lt;!-- ```{r echo=FALSE} -->
&lt;!-- blogdown::shortcode('tweet', '1273493646969647105') -->
&lt;!-- ``` -->
&lt;!-- ```{r echo=FALSE} -->
&lt;!-- blogdown::shortcode('tweet', '1274901353563934726') -->
&lt;!-- ``` -->
&lt;p>&lt;strong>SW 5.5&lt;/strong> Theoretical foundations of OLS&lt;/p>
&lt;p>Know the Gauss-Markov theorem and related assumptions. Skip &amp;ldquo;regression estimators other than OLS.&amp;rdquo; The appendix contains a proof of the Gauss-Markov theorm, but we will not cover that.&lt;/p>
&lt;p>&lt;del>&lt;strong>SW 5.6&lt;/strong>&lt;/del> Skip this section!&lt;/p>
&lt;p>&lt;strong>SW 5.7&lt;/strong> Conclusion&lt;/p>
&lt;p>For the good times.&lt;/p>
&lt;h2 id="slides">Slides&lt;/h2>
&lt;div class="slides-container">
&lt;p>
&lt;a href="https://econ3500s26.netlify.app/slides/ch5-quarto/" target="_blank" class="btn btn-primary">📊 HTML&lt;/a>
&lt;a href="https://econ3500s26.netlify.app/slides/ch5-slides.pdf" class="btn btn-outline-primary">📥 Download PDF&lt;/a>
&lt;/p>
&lt;figure class="slide-thumb">
&lt;a href="https://econ3500s26.netlify.app/slides/ch5-quarto/" target="_blank">
&lt;img src="https://econ3500s26.netlify.app/slides/ch5-slides.png" alt="First slide" />
&lt;/a>
&lt;/figure>
&lt;/div>
&lt;!-- ## Videos -->
&lt;!-- ### [Video: Inference with One-Variable Linear Regression](https://youtu.be/DZo5m5q3bmA) -->
&lt;!--
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/DZo5m5q3bmA" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
-->
&lt;h2 id="other-resources">Other resources&lt;/h2>
&lt;ul>
&lt;li>
&lt;a href="https://egap.org/resource/10-things-to-know-about-hypothesis-testing/" target="_blank" rel="noopener">EGAP: 10 things to know about hypothesis testing&lt;/a>&lt;/li>
&lt;li>
&lt;a href="https://egap.org/resource/10-things-to-know-about-statistical-power/" target="_blank" rel="noopener">EGAP: 10 things to know about statistical power&lt;/a> - gets a bit deeper than we go, but accessible and handy!&lt;/li>
&lt;/ul></description></item><item><title>Week 4 - Linear Regression with One Regressor</title><link>https://econ3500s26.netlify.app/content/04-content/</link><pubDate>Mon, 26 Jan 2026 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/04-content/</guid><description>&lt;h2 id="overview">Overview&lt;/h2>
&lt;p>Welcome to Week 4! We are proceeding boldly into the world of &lt;strong>linear regression.&lt;/strong> We&amp;rsquo;re starting by looking into the &lt;em>linear&lt;/em> relationship between two variables.&lt;/p>
&lt;p>What we &lt;em>won&amp;rsquo;t&lt;/em> be doing is controlling for other factors, nor conducting statistical inference. We won&amp;rsquo;t be looking at non-linear relationships yet either! Bah.&lt;/p>
&lt;p>Rather, we&amp;rsquo;re going to dive deep into what it means to look at how we can find a good estimate - nay, the best estimate! - of the relatonship between some $X$ and some $Y$.&lt;/p>
&lt;p>This is where &lt;em>Stock and Watson&lt;/em> really start to shine. I highly recommend basking in their expertise and conversational style.&lt;/p>
&lt;h2 id="reading-guide">Reading Guide&lt;/h2>
&lt;p>&lt;strong>Chapter 4: Linear Regression with One Regressor&lt;/strong>&lt;/p>
&lt;p>&lt;strong>SW 4.1&lt;/strong> - The Linear Regression Model&lt;/p>
&lt;p>This section is packed w/ good intuition and &lt;strong>bolded&lt;/strong> vocabulary. Make sure you know it!&lt;/p>
&lt;p>&lt;strong>SW 4.2&lt;/strong> - Estimating the Coefficient of the Linear Regression Model?&lt;/p>
&lt;p>You should be able to estimate linear regression coefficients by hand 😪.&lt;/p>
&lt;p>&lt;strong>SW 4.3&lt;/strong> Measures of Fit&lt;/p>
&lt;p>Know how to use and interpret $R^2$, $ESS$, $TSS$, $SSR$ and $SER$. You will also need to know how to find these from raw Stata output as well.&lt;/p>
&lt;p>&lt;strong>SW 4.4&lt;/strong> Least Squares Assumptions&lt;/p>
&lt;p>Known and understand the three least squares assumptions&lt;/p>
&lt;p>&lt;strong>SW 4.5&lt;/strong> Sampling Distribution of the OLS&lt;/p>
&lt;p>Discuss unbiasedness of estimators and effects of larger vs. smaller sample sizes on standard errors. We won&amp;rsquo;t calculate standard errors by hand.&lt;/p>
&lt;h2 id="note-on-causality">Note on causality&lt;/h2>
&lt;p>At this stage, we are thinking about &lt;em>making good model&lt;/em> of data, but not necessarily the &lt;strong>data generating process&lt;/strong> behind that data. When we use the framing of an &lt;strong>independent&lt;/strong> and &lt;strong>dependent&lt;/strong> variables, it&amp;rsquo;s tempting to think that we&amp;rsquo;re examining whether the &lt;strong>independent&lt;/strong> variable &lt;em>causes&lt;/em> the &lt;strong>dependent&lt;/strong> variable.&lt;/p>
&lt;p>At this point in the course, we&amp;rsquo;re looking at associations which &lt;em>could&lt;/em> be causal &amp;hellip; or they could not be!&lt;/p>
&lt;p>If you want to dig deeper, check out this great guide from EGAP:
&lt;a href="https://egap.org/resource/10-things-to-know-about-causal-inference/" target="_blank" rel="noopener">10 things to know about causal inference&lt;/a>.&lt;/p>
&lt;blockquote class="twitter-tweet">
&lt;a href="https://twitter.com/WhitesPhD/status/1308094245669081089">&lt;/a>
&lt;/blockquote>&lt;script async src="https://platform.twitter.com/widgets.js" charset="utf-8">&lt;/script>
&lt;h2 id="slides">Slides&lt;/h2>
&lt;div class="slides-container">
&lt;p>
&lt;a href="https://econ3500s26.netlify.app/slides/ch4-quarto/" target="_blank" class="btn btn-primary">📊 HTML&lt;/a>
&lt;a href="https://econ3500s26.netlify.app/slides/ch4-slides.pdf" class="btn btn-outline-primary">📥 Download PDF&lt;/a>
&lt;/p>
&lt;figure class="slide-thumb">
&lt;a href="https://econ3500s26.netlify.app/slides/ch4-quarto/" target="_blank">
&lt;img src="https://econ3500s26.netlify.app/slides/ch4-slides.png" alt="First slide" />
&lt;/a>
&lt;/figure>
&lt;/div>
&lt;!-- ## Videos -->
&lt;!-- ### [Video: Linear Regression w/ One Variable](https://youtu.be/tB8ayLyMOgU) -->
&lt;!--
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/tB8ayLyMOgU" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
-->
&lt;!-- ### [Video: Derivation of $\hat{\beta_0}$ and $\hat{\beta_1}$](https://youtu.be/0jetwjxb-6w) -->
&lt;!-- *Here, I work through how we derive estimates of $\hat{\beta_0}$ and $\hat{\beta_1}$ using our good friend, calculus.* -->
&lt;!--
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/0jetwjxb-6w" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
-->
&lt;h3 id="video-building-intuition-around-the-ols-modelhttpsyoutuberlraevlqusw">
&lt;a href="https://youtu.be/rlrAeVlQusw" target="_blank" rel="noopener">Video: Building intuition around the OLS model&lt;/a>&lt;/h3>
&lt;p>&lt;em>You can play along with the
&lt;a href="https://phet.colorado.edu/sims/html/least-squares-regression/latest/least-squares-regression_en.html" target="_blank" rel="noopener">same simulator!&lt;/a>&lt;/em>
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/rlrAeVlQusw" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
&lt;/p>
&lt;h2 id="in-class-exercise">In-class exercise&lt;/h2>
&lt;p>Link to pdf
&lt;a href="../practice/W4-02-in-class-exercise.pdf">here&lt;/a>&lt;/p>
&lt;p>Consider a dataset on births to women in the United States. Two
variables of interest are infant birth weight in ounces (&lt;code>bwght&lt;/code>), and
the average number of cigarettes the mother smoked per day during
pregnancy (&lt;code>cigs&lt;/code>). The following simple regression was estimated using
data on 1,388 births.&lt;/p>
&lt;pre>&lt;code> Source | SS df MS Number of obs = 1,388
-------------+---------------------------------- F(1, 1386) = 32.24
Model | 13060.4194 1 13060.4194 Prob &amp;gt; F = 0.0000
Residual | 561551.3 1,386 405.159668 R-squared = 0.0227
-------------+---------------------------------- Adj R-squared = 0.0220
Total | 574611.72 1,387 414.283864 Root MSE = 20.129
------------------------------------------------------------------------------
bwght | Coef. Std. Err. t P&amp;gt;|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
cigs | -.5137721 .0904909 -5.68 0.000 -.6912861 -.3362581
_cons | 119.7719 .5723407 209.27 0.000 118.6492 120.8946
------------------------------------------------------------------------------
&lt;/code>&lt;/pre>
&lt;p>These results can also be written in the following way:&lt;/p>
&lt;p>$$\widehat{bwght} = 119.77 - 0.514 cigs$$&lt;/p>
&lt;ol>
&lt;li>
&lt;p>What is the dependent variable? What is the independent variable?&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Write, in words, what the interpretation of $0.514$ is.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>What is the predicted birth weight among mothers who do not smoke?
What about when $cigs=20$ (one pack per day)? Comment on the
difference.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Consider Prof. Beam, whose mother &amp;ldquo;cut back&amp;rdquo; to 10 cigarettes per
day (it was the 80s) and was born weighing 9lb, 15 oz. What is her
residual?&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Find $R^2$ in the raw regression output. What does it tell us?&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Are any least squares assumptions likely to be violated? Explain.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Does this simple regression necessarily capture a causal
relationship between the child&amp;rsquo;s birth weight and the mother&amp;rsquo;s
smoking habits? Explain.&lt;/p>
&lt;/li>
&lt;/ol></description></item><item><title>Week 3 - Stats Exam</title><link>https://econ3500s26.netlify.app/content/03-statsquiz/</link><pubDate>Mon, 12 Jan 2026 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/03-statsquiz/</guid><description>&lt;h2 id="what-should-i-know">What should I know?&lt;/h2>
&lt;p>Make sure you are comfortable with the material chapter guides from
&lt;a href="../01-content">Week 1&lt;/a> and
&lt;a href="../02-content">Week 2&lt;/a>. What does comfortable mean? That you&amp;rsquo;re familiar with definitions and terminology, and that you can work through problems at the level of PS1 (solutions on
&lt;a href="http://brightspace.uvm.edu" target="_blank" rel="noopener">Brightspace&lt;/a>).&lt;/p>
&lt;p>There will &lt;em>not&lt;/em> be a Stata component to the exam.&lt;/p>
&lt;h2 id="i-need-more-practice">I need more practice!&lt;/h2>
&lt;p>I recommend the following resources:&lt;/p>
&lt;ul>
&lt;li>Odd-numbered problems from Stock and Watson - solutions are available
&lt;a href="../../bonus/general-resources">here&lt;/a>&lt;/li>
&lt;li>
&lt;a href="#practice-exam">Practice exam&lt;/a>&lt;/li>
&lt;li>
&lt;a href="#inclass-practice">In-class practice&lt;/a>&lt;/li>
&lt;/ul>
&lt;h2 id="what-is-the-structure-of-the-exam">What is the structure of the exam?&lt;/h2>
&lt;p>The exam will be a lot like a math exam - solving some problems and showing your work. I would expect about 3 multi-part questions. There may be some questions that involve definitions and/or true-false.&lt;/p>
&lt;p>The exam is closed book with a calculator allowed. You are allowed to bring a one-page formula sheet.&lt;/p>
&lt;p>What is allowed?&lt;/p>
&lt;ul>
&lt;li>Pen or pencil&lt;/li>
&lt;li>One page formula sheet (double sided)&lt;/li>
&lt;li>Calculator (basic, scientific, graphing)&lt;/li>
&lt;li>Drink and quiet snack&lt;/li>
&lt;/ul>
&lt;p>What is not allowed?&lt;/p>
&lt;ul>
&lt;li>&lt;del>Phone&lt;/del>&lt;/li>
&lt;li>&lt;del>Other notes&lt;/del>&lt;/li>
&lt;li>&lt;del>Textbook&lt;/del>&lt;/li>
&lt;li>&lt;del>Tablet/Laptop&lt;/del>&lt;/li>
&lt;li>&lt;del>Earbuds/headphones&lt;/del>&lt;/li>
&lt;/ul>
&lt;!-- ### In-class practice -->
&lt;!-- [Here are the problems](../practice/EC3500-03-01practice.pdf) we covered in class. -->
&lt;!-- And a video with solutions! I recommend working through the problems on your own **first**. -->
&lt;!--
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/q_NbBSDWcpI" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
-->
&lt;!-- Or, download a [pdf of the solutions](../practice/EC200-03-01practice_solutions.pdf) -->
&lt;h2 id="helpful-materials">Helpful materials&lt;/h2>
&lt;p>Two things that may be useful to you:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>
&lt;a href="../AppendixTable1_rev.pdf">Standard Normal Distribution Table&lt;/a>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>
&lt;a href="../Formula-sheet_f20.pdf">Formula chart&lt;/a>: Is this all the knowledge you need to know? Of course not! But, this should cover most of the formulas you might need.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h2 id="practice-exam" class="unnumbered">Past exam &lt;/h2>
&lt;p>Yes, this exam is very old! Fortunately, the statistics don&amp;rsquo;t change!&lt;/p>
&lt;p>
&lt;a href="../PracticeExams/StatsQuiz_Spring2018_print.pdf">Fall 2018 Practice Exam&lt;/a>&lt;/p>
&lt;p>
&lt;a href="../PracticeExams/StatsQuiz_Spring2018_sol.pdf">Fall 2018 Practice Exam Solutions&lt;/a> (&lt;em>I strongly recommend doing the practice exam in its entirety before reviewing the solutions&lt;/em>)&lt;/p></description></item><item><title>Week 1 - Probability Review</title><link>https://econ3500s26.netlify.app/content/01-content/</link><pubDate>Tue, 06 Jan 2026 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/01-content/</guid><description>&lt;h2 id="overview">Overview&lt;/h2>
&lt;p>Welcome to Week 1! Our goal this week is to (1) help us sort out our varying technologies and workflows and (2) start reviewing statistics and probability. Basically, this the start of a three-week mini-bootcamp to get us ready for the glorious world of regressions!^[Aside: This unit will pull from lots of resources. Things will be simpler once we get to chapter 4 and onward]&lt;/p>
&lt;p>What you should do for this three-week unit depends on your background:&lt;/p>
&lt;h3 id="i-took-stat1410-or-another-course-recently-and-remember-some-stuff">I took STAT1410 or another course recently and remember some stuff&lt;/h3>
&lt;ol>
&lt;li>Read through SW, and dig deeper into anything that doesn&amp;rsquo;t look familiar.&lt;/li>
&lt;li>Watch the class videos and complete video quizzes&lt;/li>
&lt;li>Use the Khan Academy to clarify anything you feel a bit fuzzy on.&lt;/li>
&lt;li>Make sure you can do the practice problems.&lt;/li>
&lt;/ol>
&lt;h3 id="everyone-else">Everyone else&lt;/h3>
&lt;ol>
&lt;li>Start by reviewing the Khan Academy videos&lt;/li>
&lt;li>Work through &lt;em>all&lt;/em> the Khan Academy practice problems.&lt;/li>
&lt;li>Watch the class videos and complete video quizzes&lt;/li>
&lt;li>Head into the practice problems, and review concepts where you are still stuck.&lt;/li>
&lt;/ol>
&lt;h2 id="reading-guide-for-week-1">Reading Guide for Week 1&lt;/h2>
&lt;p>&lt;strong>Chapter 2.1-2.4: Review of Probability&lt;/strong>&lt;/p>
&lt;p>This is the material you should already know, along with supports from
&lt;a href="https://www.khanacademy.org/math/ap-statistics/" target="_blank" rel="noopener">Khan Academy (&lt;i class="fas fa-tv"> &lt;/i>)&lt;/a>. Remember that you don&amp;rsquo;t need to memorize formulas! &lt;em>Note that most links take you a direct video, but there are also relevant videos in the accompanying playlist.&lt;/em>&lt;/p>
&lt;h3 id="sw-21-random-variables-and-probability-distributions">SW 2.1 Random Variables and Probability Distributions&lt;/h3>
&lt;ul>
&lt;li>
&lt;a href="https://www.khanacademy.org/math/ap-statistics/random-variables-ap/discrete-random-variables/v/discrete-probability-distribution" target="_blank" rel="noopener">&lt;i class="fas fa-tv"> &lt;/i> Introduction to random variables and probability distributions variables &lt;/a> &lt;em>(entire playlist, starting with &amp;ldquo;Constructing a probability distribution for a random varible&amp;rdquo;)&lt;/em>&lt;/li>
&lt;li>
&lt;a href="https://www.khanacademy.org/math/ap-statistics/random-variables-ap/xfb5d8e68:mean-standard-deviation-random-variables/v/expected-value-of-a-discrete-random-variable" target="_blank" rel="noopener">&lt;i class="fas fa-tv"> &lt;/i> Mean and standard deviation of random variables &lt;/a> &lt;em>(entire playlist)&lt;/em>&lt;/li>
&lt;li>
&lt;a href="https://www.khanacademy.org/math/ap-statistics/probability-ap/stats-conditional-probability/v/testing-independence-from-experimental-data" target="_blank" rel="noopener">&lt;i class="fas fa-tv"> &lt;/i> Conditional probability&lt;/a> &lt;em>(entire playlist)&lt;/em>&lt;/li>
&lt;/ul>
&lt;h3 id="sw-22-expected-values-mean-and-variance">SW 2.2 Expected Values, Mean, and Variance&lt;/h3>
&lt;ul>
&lt;li>You only need a general knowledge of kurtosis and skew.&lt;/li>
&lt;li>
&lt;a href="https://www.khanacademy.org/math/ap-statistics/random-variables-ap/transforming-random-variables/v/impact-of-scaling-and-shifting-random-variables" target="_blank" rel="noopener">&lt;i class="fas fa-tv"> &lt;/i> Transforming random variables&lt;/a> &lt;em>(entire playlist, just 3 videos)&lt;/em>&lt;/li>
&lt;/ul>
&lt;h3 id="sw-23-two-random-variables">SW 2.3 Two Random Variables&lt;/h3>
&lt;ul>
&lt;li>Law of iterated expectations can be skimmed
&lt;ul>
&lt;li>Make sure that you&amp;rsquo;re good with key concept box 2.3!&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>
&lt;a href="https://www.khanacademy.org/math/ap-statistics/random-variables-ap/combining-random-variables/v/mean-of-sum-and-difference-of-random-variables" target="_blank" rel="noopener">&lt;i class="fas fa-tv"> &lt;/i> Combining random variables&lt;/a> &lt;em>(entire playlist)&lt;/em>&lt;/li>
&lt;/ul>
&lt;h3 id="sw-24-the-normal-chi-squared-student-t-and-f-distributions">SW 2.4 The Normal, Chi-Squared, Student t, and F distributions&lt;/h3>
&lt;ul>
&lt;li>&lt;strong>Normal distribution only&lt;/strong>, &lt;del>but we&amp;rsquo;ll discuss t-distributions a little bit&lt;/del>. We won’t use chi-squared, and we’ll come back to F-distributions later. For any work we&amp;rsquo;ll be doing, our sample sizes will be greater than 100, so $t \rightarrow z$.&lt;/li>
&lt;li>
&lt;a href="https://www.khanacademy.org/math/ap-statistics/density-curves-normal-distribution-ap/measuring-position/v/z-score-introduction" target="_blank" rel="noopener">&lt;i class="fas fa-tv"> &lt;/i> Z-scores&lt;/a> &lt;em>(entire playlist)&lt;/em>&lt;/li>
&lt;li>
&lt;a href="https://www.khanacademy.org/math/ap-statistics/density-curves-normal-distribution-ap/stats-normal-distributions/v/ck12-org-normal-distribution-problems-qualitative-sense-of-normal-distributions" target="_blank" rel="noopener">&lt;i class="fas fa-tv"> &lt;/i> Normal distributions and the empirical rule&lt;/a> &lt;em>not explicitly covered, but extremely useful review for intuition&lt;/em>&lt;/li>
&lt;li>
&lt;a href="https://www.khanacademy.org/math/ap-statistics/density-curves-normal-distribution-ap/normal-distributions-calculations/v/z-table-for-proportion-below" target="_blank" rel="noopener"> &lt;i class="fas fa-tv"> &lt;/i> Normal distribution calculations&lt;/a> &lt;em>(entire playlist, walks through all the possible ways of working w/ normal tables)&lt;/em>&lt;/li>
&lt;/ul>
&lt;h2 id="slides">Slides&lt;/h2>
&lt;div class="slides-container">
&lt;p>&lt;a href="https://econ3500s26.netlify.app/slides/ch2-slides.pdf" class="btn btn-primary">📥 Download this week's slides&lt;/a>&lt;/p>
&lt;figure class="slide-thumb">
&lt;a href="https://econ3500s26.netlify.app/slides/ch2-slides.pdf" target="_blank">
&lt;img src="https://econ3500s26.netlify.app/slides/ch2-slides.png" alt="First slide" />
&lt;/a>
&lt;/figure>
&lt;/div></description></item><item><title>Week 2 - Statistics Review</title><link>https://econ3500s26.netlify.app/content/02-content/</link><pubDate>Tue, 06 Jan 2026 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/02-content/</guid><description>&lt;h2 id="overview">Overview&lt;/h2>
&lt;p>Welcome to Week 2! Statistical review continues apace.&lt;/p>
&lt;p>As before, what you should do this final week before the exam depends on your background:&lt;/p>
&lt;h3 id="i-took-stat1410-recently-and-remember-some-stuff">I took STAT1410 recently and remember some stuff&lt;/h3>
&lt;ol>
&lt;li>Read through SW, and dig deeper into anything that doesn&amp;rsquo;t look familiar.&lt;/li>
&lt;li>Watch the class videos and complete video quiz.&lt;/li>
&lt;li>Use the Khan Academy to clarify anything you feel a bit fuzzy on.&lt;/li>
&lt;li>Make sure you can do the practice problems.&lt;/li>
&lt;/ol>
&lt;h3 id="everyone-else">Everyone else&lt;/h3>
&lt;ol>
&lt;li>Start by reviewing the Khan Academy videos&lt;/li>
&lt;li>Work through &lt;em>all&lt;/em> the Khan Academy practice problems.&lt;/li>
&lt;li>Watch the class videos and complete video quiz.&lt;/li>
&lt;li>Head into the practice problems, and review concepts where you are still stuck.&lt;/li>
&lt;/ol>
&lt;h2 id="reading-guide-chapter-2">Reading Guide (Chapter 2)&lt;/h2>
&lt;h3 id="sw-25-random-sampling-and-the-distribution-of-the-sample-average">SW 2.5 Random Sampling and the Distribution of the Sample Average&lt;/h3>
&lt;ul>
&lt;li>
&lt;a href="https://www.khanacademy.org/math/ap-statistics/sampling-distribution-ap/what-is-sampling-distribution/v/introduction-to-sampling-distributions" target="_blank" rel="noopener"> &lt;i class="fas fa-tv"> &lt;/i> What is a sampling distribution?&lt;/a> &lt;em>(just the one video)&lt;/em>&lt;/li>
&lt;/ul>
&lt;h3 id="sw-26-large-sample-approximations-to-sampling-distributions">SW 2.6 Large-sample approximations to sampling distributions&lt;/h3>
&lt;ul>
&lt;li>&lt;em>Get that central limit theorem!&lt;/em>&lt;/li>
&lt;li>
&lt;a href="https://www.khanacademy.org/math/ap-statistics/sampling-distribution-ap/what-is-sampling-distribution/v/central-limit-theorem" target="_blank" rel="noopener"> &lt;i class="fas fa-tv"> &lt;/i> Central Limit Theorem&lt;/a> &lt;em>(just the one video)&lt;/em>&lt;/li>
&lt;li>
&lt;a href="https://www.khanacademy.org/math/ap-statistics/sampling-distribution-ap/what-is-sampling-distribution/v/sampling-distribution-of-the-sample-mean" target="_blank" rel="noopener"> &lt;i class="fas fa-tv"> &lt;/i> Sampling distribution of a sample, part I mean&lt;/a> and
&lt;a href="https://www.khanacademy.org/math/ap-statistics/sampling-distribution-ap/what-is-sampling-distribution/v/sampling-distribution-of-the-sample-mean-2" target="_blank" rel="noopener">part II&lt;/a> &lt;em>(just the two videos, with a practice after)&lt;/em>&lt;/li>
&lt;/ul>
&lt;h2 id="reading-guide-chapter-3">Reading Guide (Chapter 3)&lt;/h2>
&lt;p>This is the material you should know, along with supports from
&lt;a href="https://www.khanacademy.org/math/ap-statistics/" target="_blank" rel="noopener">Khan Academy (&lt;i class="fas fa-tv"> &lt;/i>)&lt;/a>. Remember that you don&amp;rsquo;t need to memorize formulas!&lt;/p>
&lt;h3 id="sw-31estimation-of-the-population-mean">SW 3.1 Estimation of the population mean&lt;/h3>
&lt;ul>
&lt;li>Don’t worry about BLUE, we’ll come back to it. But make sure you get &lt;strong>bias&lt;/strong>, &lt;strong>consistency&lt;/strong>, and &lt;strong>efficiency&lt;/strong>.&lt;/li>
&lt;/ul>
&lt;h3 id="sw-32hypothesis-tests-concerning-the-population-mean">SW 3.2 Hypothesis tests concerning the population mean&lt;/h3>
&lt;ul>
&lt;li>&lt;em>KA presents confidence intervals before hypothesis tests, so if following their curruculum, start w/ confidence intervals first&lt;/em>&lt;/li>
&lt;li>
&lt;a href="https://www.khanacademy.org/math/ap-statistics/xfb5d8e68:inference-categorical-proportions/idea-significance-tests/v/idea-behind-hypothesis-testing" target="_blank" rel="noopener">&lt;i class="fas fa-tv"> &lt;/i> The idea behind significance tests&lt;/a>&lt;/li>
&lt;li>
&lt;a href="https://www.khanacademy.org/math/ap-statistics/xfb5d8e68:inference-quantitative-means/one-sample-t-interval-mean/v/introduction-to-t-statistics" target="_blank" rel="noopener">&lt;i class="fas fa-tv"> &lt;/i> Introduction to confidence intervals&lt;/a>&lt;/li>
&lt;li>
&lt;a href="https://www.khanacademy.org/math/ap-statistics/xfb5d8e68:inference-quantitative-means/one-sample-t-test-mean/v/writing-hypotheses-for-significance-test-about-means" target="_blank" rel="noopener">&lt;i class="fas fa-tv"> &lt;/i> Setting up a test for a population mean&lt;/a> &lt;em>(entire playlist)&lt;/em>&lt;/li>
&lt;li>
&lt;a href="https://www.khanacademy.org/math/ap-statistics/xfb5d8e68:inference-quantitative-means/xfb5d8e68:carrying-out-test-mean/v/when-to-use-z-or-t-statistics-in-significance-tests" target="_blank" rel="noopener">&lt;i class="fas fa-tv"> &lt;/i> Carrying out a test for a population mean&lt;/a> &lt;em>(entire playlist, except part about TI calculator)&lt;/em>&lt;/li>
&lt;/ul>
&lt;h3 id="sw-33confidence-intervals-for-the-population-mean">SW 3.3 Confidence intervals for the population mean&lt;/h3>
&lt;ul>
&lt;li>
&lt;a href="https://www.khanacademy.org/math/ap-statistics/xfb5d8e68:inference-quantitative-means/one-sample-t-interval-mean/v/introduction-to-t-statistics" target="_blank" rel="noopener">&lt;i class="fas fa-tv"> &lt;/i> Constructing a confidence interval for a population mean&lt;/a>&lt;/li>
&lt;/ul>
&lt;h3 id="sw-34-comparing-means-from-different-populations">&lt;del>SW 3.4 Comparing means from different populations&lt;/del>&lt;/h3>
&lt;p>&lt;strong>SKIP&lt;/strong>&lt;/p>
&lt;p>&lt;del>Only the type of two-sample test we&amp;rsquo;ve been practicing - look at difference between two means where the standard deviation is unknown and you do not assume that they come from the same underlying population distribution. &lt;strong>Make sure you can use formulas 3.19 and 3.20&lt;/strong>&lt;/del>
&lt;del>
&lt;a href="https://www.khanacademy.org/math/ap-statistics/xfb5d8e68:inference-categorical-proportions/idea-significance-tests/v/idea-behind-hypothesis-testing" target="_blank" rel="noopener">&lt;i class="fas fa-tv"> &lt;/i> Testing for the differences of two population means&lt;/a>&lt;/del>&lt;/p>
&lt;h3 id="skip-sw-3536">Skip SW 3.5/3.6&lt;/h3>
&lt;h3 id="sw-37scatterplots-the-sample-covariance-and-the-sample-correlation">SW 3.7 Scatterplots, the sample covariance, and the sample correlation&lt;/h3>
&lt;ul>
&lt;li>
&lt;a href="https://www.khanacademy.org/math/ap-statistics/bivariate-data-ap/correlation-coefficient-r/v/calculating-correlation-coefficient-r" target="_blank" rel="noopener">&lt;i class="fas fa-tv"> &lt;/i> Correlation coefficients&lt;/a>&lt;/li>
&lt;/ul>
&lt;h2 id="slides">Slides&lt;/h2>
&lt;div class="slides-container">
&lt;p>&lt;a href="https://econ3500s26.netlify.app/slides/ch3-slides.pdf" class="btn btn-primary">📥 Download this week's slides&lt;/a>&lt;/p>
&lt;figure class="slide-thumb">
&lt;a href="https://econ3500s26.netlify.app/slides/ch3-slides.pdf" target="_blank">
&lt;img src="https://econ3500s26.netlify.app/slides/ch3-slides.png" alt="First slide" />
&lt;/a>
&lt;/figure>
&lt;/div>
&lt;!-- ## Videos -->
&lt;!--
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe src="https://www.youtube.com/embed/dMFQw0DcOgU" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video">&lt;/iframe>
&lt;/div>
--></description></item><item><title/><link>https://econ3500s26.netlify.app/content/practice/ch10-activity1-did-by-hand/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/practice/ch10-activity1-did-by-hand/</guid><description>&lt;p>ECON 3500 Econometrics and Applications&lt;br>
Spring 2026&lt;/p>
&lt;h1 id="in-class-activity-did-by-hand">In-Class Activity: DiD by Hand&lt;/h1>
&lt;p>&lt;strong>Chapter 10 — Difference-in-Differences&lt;/strong>&lt;br>
Time: ~20 minutes&lt;/p>
&lt;hr>
&lt;h2 id="setup">Setup&lt;/h2>
&lt;p>In January 2020, New Jersey increased its state minimum wage from $10.00 to $12.00 per hour. Neighboring Pennsylvania kept its minimum wage at $7.25. A researcher collected data on average employment at fast-food restaurants (measured as full-time equivalent workers per restaurant) in both states, before and after the policy change.&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>&lt;/th>
&lt;th>&lt;strong>Before&lt;/strong> (Nov 2019)&lt;/th>
&lt;th>&lt;strong>After&lt;/strong> (Mar 2020)&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>&lt;strong>New Jersey&lt;/strong> (treatment)&lt;/td>
&lt;td>20.4&lt;/td>
&lt;td>21.0&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;strong>Pennsylvania&lt;/strong> (control)&lt;/td>
&lt;td>23.3&lt;/td>
&lt;td>21.2&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;hr>
&lt;h2 id="questions">Questions&lt;/h2>
&lt;p>&lt;strong>1.&lt;/strong> Calculate the simple before-and-after change in employment for New Jersey (the treatment group).&lt;/p>
&lt;p>\vspace{1.5cm}&lt;/p>
&lt;p>&lt;strong>2.&lt;/strong> A state legislator sees your answer to Question 1 and says, &amp;ldquo;See! Raising the minimum wage &lt;em>increased&lt;/em> employment!&amp;rdquo; Explain why the simple before-after comparison is not a valid estimate of the causal effect of the minimum wage increase.&lt;/p>
&lt;p>\vspace{2.5cm}&lt;/p>
&lt;p>&lt;strong>3.&lt;/strong> Calculate the difference-in-differences (DiD) estimate of the effect of the minimum wage increase on fast-food employment.&lt;/p>
&lt;p>&lt;em>Show your work. You can calculate this either as (a) the difference in before-after changes between the two groups, or (b) the difference in cross-state gaps between the two time periods.&lt;/em>&lt;/p>
&lt;p>\vspace{3cm}&lt;/p>
&lt;p>&lt;strong>4.&lt;/strong> Interpret your DiD estimate from Question 3 in a complete sentence. Be specific about units and direction.&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>&lt;strong>5.&lt;/strong> State the parallel trends assumption in the context of this study. Be precise — what, specifically, must be true for the DiD estimate to be valid?&lt;/p>
&lt;p>\vspace{2.5cm}&lt;/p>
&lt;p>&lt;strong>6.&lt;/strong> Evaluate whether parallel trends is plausible here. Identify at least one reason it might hold and one reason it might fail.&lt;/p>
&lt;p>\vspace{2.5cm}&lt;/p>
&lt;p>&lt;strong>6b.&lt;/strong> A researcher suggests adding restaurant-level characteristics — franchise type, seating capacity, and years in operation — as control variables to the DiD regression. Explain &lt;em>two&lt;/em> reasons why this might be useful: one about &lt;strong>precision&lt;/strong> and one about the &lt;strong>validity of the parallel trends assumption&lt;/strong>.&lt;/p>
&lt;p>\vspace{2.5cm}&lt;/p>
&lt;p>&lt;strong>7. (Bonus)&lt;/strong> Write the regression equation that would produce the DiD estimate from Question 3. Define your variables clearly and identify which coefficient gives the DiD estimate.&lt;/p>
&lt;p>\vspace{3cm}&lt;/p>
&lt;hr>
&lt;p>\pagebreak&lt;/p>
&lt;h2 id="instructor-notes--do-not-distribute">INSTRUCTOR NOTES — DO NOT DISTRIBUTE&lt;/h2>
&lt;h3 id="answers">Answers&lt;/h3>
&lt;p>&lt;strong>1.&lt;/strong> Simple before-after change for NJ:&lt;/p>
&lt;p>$$\Delta_{NJ} = 21.0 - 20.4 = +0.6 \text{ FTE workers}$$&lt;/p>
&lt;p>&lt;strong>2.&lt;/strong> The before-after comparison confounds the effect of the minimum wage increase with any other changes happening over the same time period. PA employment fell by 2.1 FTE — if NJ would have followed the same trend absent the policy, the simple comparison overstates the true effect. March 2020 also coincides with the onset of COVID-19, making this a particularly sharp example of why common time trends must be accounted for.&lt;/p>
&lt;p>&lt;strong>3.&lt;/strong> DiD estimate (either approach):&lt;/p>
&lt;p>&lt;em>Approach (a): Difference in before-after changes&lt;/em>&lt;/p>
&lt;p>$$\text{DiD} = \Delta_{NJ} - \Delta_{PA} = (21.0 - 20.4) - (21.2 - 23.3) = 0.6 - (-2.1) = +2.7$$&lt;/p>
&lt;p>&lt;em>Approach (b): Difference in cross-state gaps&lt;/em>&lt;/p>
&lt;p>$$\text{DiD} = (21.0 - 21.2) - (20.4 - 23.3) = (-0.2) - (-2.9) = +2.7$$&lt;/p>
&lt;p>Both give DiD = &lt;strong>+2.7 FTE workers per restaurant&lt;/strong>.&lt;/p>
&lt;p>&lt;strong>4.&lt;/strong> &amp;ldquo;The minimum wage increase in New Jersey is estimated to have increased average fast-food restaurant employment by 2.7 full-time equivalent workers per restaurant, relative to what would have occurred absent the policy change.&amp;rdquo;&lt;/p>
&lt;p>&lt;strong>5.&lt;/strong> The parallel trends assumption requires that, &lt;em>in the absence of the minimum wage increase&lt;/em>, average fast-food employment in New Jersey would have changed by the same amount as in Pennsylvania between November 2019 and March 2020. Any time trends affecting fast-food employment would have been the same in both states had NJ not raised its minimum wage.&lt;/p>
&lt;p>&lt;strong>6.&lt;/strong>&lt;/p>
&lt;p>&lt;em>Reasons it might hold:&lt;/em>&lt;/p>
&lt;ul>
&lt;li>NJ and PA are neighboring states with similar economies, labor markets, and demographics&lt;/li>
&lt;li>Fast-food chains operate similarly across state lines (same companies, similar consumer bases)&lt;/li>
&lt;li>Pre-treatment trends in fast-food employment might be similar&lt;/li>
&lt;/ul>
&lt;p>&lt;em>Reasons it might fail:&lt;/em>&lt;/p>
&lt;ul>
&lt;li>March 2020 is the start of the COVID-19 pandemic — differential pandemic impacts across states could violate parallel trends (this is the key one to discuss!)&lt;/li>
&lt;li>Anticipation effects: NJ employers might have adjusted employment before formal implementation&lt;/li>
&lt;li>Other state-level policies may have changed at the same time&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>6b.&lt;/strong>&lt;/p>
&lt;p>&lt;em>Precision:&lt;/em> Adding restaurant-level controls (franchise type, seating capacity, years in operation) reduces residual variance → tighter standard errors on the DiD estimate, even if parallel trends holds unconditionally.&lt;/p>
&lt;p>&lt;em>Credibility:&lt;/em> Parallel trends may only hold &lt;em>conditional&lt;/em> on these covariates. If, for example, newer franchises were disproportionately opening in NJ during this period (anticipating higher wages), or if NJ and PA had systematically different franchise compositions, then observable restaurant characteristics could predict differential outcome trends. Controlling for them makes parallel trends more plausible. Omitting relevant covariates when they predict differential trends is OVB — the parallel trends assumption may only hold after conditioning.&lt;/p>
&lt;p>&lt;strong>7.&lt;/strong> Regression equation:&lt;/p>
&lt;p>$$Employment_{st} = \beta_0 + \beta_1 \cdot NJ_s + \beta_2 \cdot After_t + \beta_3 \cdot (NJ_s \times After_t) + u_{st}$$&lt;/p>
&lt;p>Where:&lt;/p>
&lt;ul>
&lt;li>$NJ_s = 1$ if New Jersey, 0 if Pennsylvania&lt;/li>
&lt;li>$After_t = 1$ if March 2020 (post-treatment), 0 if November 2019 (pre-treatment)&lt;/li>
&lt;li>$\beta_3$ is the DiD estimate (= 2.7)&lt;/li>
&lt;/ul>
&lt;p>Interpretation of all coefficients:&lt;/p>
&lt;ul>
&lt;li>$\beta_0 = 23.3$ (PA employment, before)&lt;/li>
&lt;li>$\beta_1 = 20.4 - 23.3 = -2.9$ (NJ-PA gap, before)&lt;/li>
&lt;li>$\beta_2 = 21.2 - 23.3 = -2.1$ (PA change over time)&lt;/li>
&lt;li>$\beta_3 = 2.7$ (DiD: the additional change in NJ beyond the PA trend)&lt;/li>
&lt;/ul>
&lt;h3 id="teaching-notes">Teaching Notes&lt;/h3>
&lt;ul>
&lt;li>This activity is inspired by Card and Krueger (1994). Numbers here are illustrative and close to (but not identical to) the original study.&lt;/li>
&lt;li>The COVID timing issue in Q6 is intentional — it creates a natural discussion point about threats to identification and how real-world events can compromise research designs.&lt;/li>
&lt;li>Q6b connects to the &amp;ldquo;Adding Control Variables&amp;rdquo; section of the slides: stress the distinction between controls for precision vs. controls for identification (conditional parallel trends). This is different from RCTs where controls are only about precision.&lt;/li>
&lt;li>For Q7, emphasize that the interaction term is the key — students often struggle to see how a 2×2 table maps to a regression with an interaction.&lt;/li>
&lt;li>Common student error: confusing &amp;ldquo;parallel trends&amp;rdquo; with &amp;ldquo;equal levels.&amp;rdquo; Stress that the assumption is about &lt;em>changes&lt;/em> (trends), not &lt;em>levels&lt;/em>.&lt;/li>
&lt;/ul></description></item><item><title/><link>https://econ3500s26.netlify.app/content/practice/ch10-activity2-fe-showdown/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/practice/ch10-activity2-fe-showdown/</guid><description>&lt;p>ECON 3500 Econometrics and Applications&lt;br>
Spring 2026&lt;/p>
&lt;h1 id="in-class-activity-fixed-effects-showdown">In-Class Activity: Fixed Effects Showdown&lt;/h1>
&lt;p>&lt;strong>Chapter 10 — Panel Data and Fixed Effects&lt;/strong>&lt;br>
Time: ~20-25 minutes&lt;/p>
&lt;hr>
&lt;h2 id="setup">Setup&lt;/h2>
&lt;p>A researcher wants to estimate the effect of beer taxes on traffic fatality rates. She has a balanced panel dataset of all 48 contiguous U.S. states observed annually from 2000 to 2009 (T = 10, N = 48, so 480 observations total).&lt;/p>
&lt;p>The dependent variable is the &lt;strong>traffic fatality rate&lt;/strong> (deaths per 10,000 people). The key independent variable is the &lt;strong>real beer tax&lt;/strong> (dollars per case, adjusted for inflation). She also observes &lt;strong>per capita income&lt;/strong> (in thousands of dollars).&lt;/p>
&lt;p>She estimates four specifications. Study the output below carefully.&lt;/p>
&lt;hr>
&lt;h3 id="regression-output">Regression Output&lt;/h3>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>&lt;/th>
&lt;th>&lt;strong>(1) Pooled OLS&lt;/strong>&lt;/th>
&lt;th>&lt;strong>(2) Entity FE&lt;/strong>&lt;/th>
&lt;th>&lt;strong>(3) Entity + Time FE&lt;/strong>&lt;/th>
&lt;th>&lt;strong>(4) First Difference&lt;/strong>&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>&lt;strong>Beer Tax&lt;/strong>&lt;/td>
&lt;td>-0.655***&lt;/td>
&lt;td>-0.640**&lt;/td>
&lt;td>-0.485*&lt;/td>
&lt;td>-0.072&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>(0.188)&lt;/td>
&lt;td>(0.254)&lt;/td>
&lt;td>(0.261)&lt;/td>
&lt;td>(0.117)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;strong>Income&lt;/strong>&lt;/td>
&lt;td>0.062***&lt;/td>
&lt;td>-0.063*&lt;/td>
&lt;td>-0.071**&lt;/td>
&lt;td>-0.018&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>(0.015)&lt;/td>
&lt;td>(0.032)&lt;/td>
&lt;td>(0.031)&lt;/td>
&lt;td>(0.022)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>State FE&lt;/td>
&lt;td>No&lt;/td>
&lt;td>Yes&lt;/td>
&lt;td>Yes&lt;/td>
&lt;td>&amp;ndash;&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Year FE&lt;/td>
&lt;td>No&lt;/td>
&lt;td>No&lt;/td>
&lt;td>Yes&lt;/td>
&lt;td>&amp;ndash;&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Differenced&lt;/td>
&lt;td>No&lt;/td>
&lt;td>No&lt;/td>
&lt;td>No&lt;/td>
&lt;td>Yes&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>SE type&lt;/td>
&lt;td>Robust&lt;/td>
&lt;td>Clustered (state)&lt;/td>
&lt;td>Clustered (state)&lt;/td>
&lt;td>Clustered (state)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>N&lt;/td>
&lt;td>480&lt;/td>
&lt;td>480&lt;/td>
&lt;td>480&lt;/td>
&lt;td>432&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>R-squared&lt;/td>
&lt;td>0.091&lt;/td>
&lt;td>0.905&lt;/td>
&lt;td>0.918&lt;/td>
&lt;td>0.003&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>Significance: *** p&amp;lt;0.01, ** p&amp;lt;0.05, * p&amp;lt;0.1&lt;/p>
&lt;hr>
&lt;h2 id="questions">Questions&lt;/h2>
&lt;p>&lt;strong>1.&lt;/strong> For each specification, briefly state what types of omitted variables it controls for and what it does &lt;em>not&lt;/em> control for.&lt;/p>
&lt;ul>
&lt;li>&lt;strong>(1) Pooled OLS:&lt;/strong>&lt;/li>
&lt;/ul>
&lt;p>\vspace{1.5cm}&lt;/p>
&lt;ul>
&lt;li>&lt;strong>(2) Entity FE:&lt;/strong>&lt;/li>
&lt;/ul>
&lt;p>\vspace{1.5cm}&lt;/p>
&lt;ul>
&lt;li>&lt;strong>(3) Entity + Time FE:&lt;/strong>&lt;/li>
&lt;/ul>
&lt;p>\vspace{1.5cm}&lt;/p>
&lt;ul>
&lt;li>&lt;strong>(4) First Difference:&lt;/strong>&lt;/li>
&lt;/ul>
&lt;p>\vspace{1.5cm}&lt;/p>
&lt;p>&lt;strong>1b.&lt;/strong> What is one specific example of a &lt;em>time shock&lt;/em> in this context — something that would affect all states' fatality rates in a given year equally? How does adding year fixed effects in specification (3) address it?&lt;/p>
&lt;p>\vspace{2.5cm}&lt;/p>
&lt;p>&lt;strong>2.&lt;/strong> Look at the coefficient on &lt;strong>Income&lt;/strong>. It flips sign between specification (1) and specification (2). Explain why this happens. What does this tell us about the pooled OLS estimate?&lt;/p>
&lt;p>\vspace{3cm}&lt;/p>
&lt;p>&lt;strong>3.&lt;/strong> The coefficient on &lt;strong>Beer Tax&lt;/strong> shrinks substantially from specification (1) to specification (4). Does this mean beer taxes have no effect on fatality rates? What should we conclude?&lt;/p>
&lt;p>\vspace{3cm}&lt;/p>
&lt;p>&lt;strong>4.&lt;/strong> Why does specification (2) use clustered standard errors (clustered by state) rather than the heteroskedasticity-robust standard errors used in specification (1)?&lt;/p>
&lt;p>(a) Which panel data least squares assumption is being addressed by clustering?&lt;/p>
&lt;p>\vspace{1.5cm}&lt;/p>
&lt;p>(b) What goes wrong with standard (unclustered) SEs when that assumption is violated?&lt;/p>
&lt;p>\vspace{1.5cm}&lt;/p>
&lt;p>&lt;strong>5.&lt;/strong> Even with two-way fixed effects (specification 3), what threats to a causal interpretation remain? Identify at least two specific concerns.&lt;/p>
&lt;p>\vspace{3cm}&lt;/p>
&lt;p>&lt;strong>6. (Optional/Discussion)&lt;/strong> Suppose states adopted beer tax increases at different times — some in 2002, some in 2005, some in 2008. A researcher uses the two-way FE approach from specification (3) with a single indicator $treated_{it} = 1$ after the state&amp;rsquo;s tax increase. Why might this estimate be misleading, even if two-way FE was appropriate in specification (3)?&lt;/p>
&lt;p>\vspace{3cm}&lt;/p>
&lt;hr>
&lt;p>\pagebreak&lt;/p>
&lt;h2 id="instructor-notes--do-not-distribute">INSTRUCTOR NOTES — DO NOT DISTRIBUTE&lt;/h2>
&lt;h3 id="answers">Answers&lt;/h3>
&lt;p>&lt;strong>1.&lt;/strong> What each specification controls for:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;strong>(1) Pooled OLS:&lt;/strong> Controls for nothing beyond the included regressors (beer tax, income). Does not account for any unobserved differences across states or over time. Treats all 480 observations as independent cross-sectional data.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>(2) Entity FE:&lt;/strong> Controls for all time-invariant state characteristics (drinking culture, geography, road infrastructure, population density, state-level attitudes toward drunk driving). Does NOT control for factors that change over time and affect all states (national trends in vehicle safety, federal highway policy, changes in social norms around drunk driving).&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>(3) Entity + Time FE:&lt;/strong> Controls for both time-invariant state characteristics AND common time trends that affect all states equally (national economic cycles, improvements in vehicle safety technology, federal policies, nationwide public health campaigns). Does NOT control for time-varying, state-specific factors (state-level policy changes other than beer tax, state-specific economic shocks).&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>(4) First Difference:&lt;/strong> Like entity FE, removes time-invariant state characteristics by looking at year-over-year changes within each state. Does NOT control for common time trends. Note N = 432 because first-differencing loses one year per state (480 − 48 = 432).&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>1b.&lt;/strong> Example: the 2008 financial crisis reduced driving, which could lower fatality rates in every state. Or: improvements in airbag/safety technology affecting all cars nationally in a given year. Year fixed effects address this by giving each year its own intercept — the common level in year $t$ is partialled out for all states, leaving only within-year, across-state variation in fatalities and beer taxes. Without year FEs, a national decline in fatalities would be partially attributed to beer taxes if beer taxes happened to be higher in that period.&lt;/p>
&lt;p>&lt;strong>2.&lt;/strong> The sign flip on Income:&lt;/p>
&lt;p>In pooled OLS, Income has a &lt;em>positive&lt;/em> coefficient (+0.062): richer states appear to have higher fatality rates. This is driven by omitted variable bias — states with higher income may also be larger, more rural, have more driving, etc. These time-invariant state characteristics are confounded with income in pooled regression.&lt;/p>
&lt;p>Once we add entity FE, we are looking at changes in income &lt;em>within&lt;/em> a state over time. Within a given state, when income rises, fatality rates fall slightly (−0.063). Higher within-state income may lead to better vehicles, more safety investment, or less risky behavior.&lt;/p>
&lt;p>The sign flip is strong evidence that pooled OLS suffers from OVB due to unobserved state characteristics.&lt;/p>
&lt;p>&lt;strong>3.&lt;/strong> The shrinking Beer Tax coefficient:&lt;/p>
&lt;p>The decline from −0.655 to −0.072 does not mean beer taxes have no effect. It means:&lt;/p>
&lt;ul>
&lt;li>Much of the pooled OLS association was driven by cross-state differences correlated with both beer taxes and fatality rates (time-invariant OVB)&lt;/li>
&lt;li>After controlling for state and year FE, the remaining within-state, within-year variation in beer taxes may be too small to precisely estimate the effect&lt;/li>
&lt;li>The FD estimate is small and imprecisely estimated, suggesting year-to-year changes in beer taxes are not strongly associated with year-to-year changes in fatality rates&lt;/li>
&lt;li>Possible interpretations: (a) the true causal effect is small, (b) not enough within-state variation in beer taxes to detect it, or (c) effects may materialize with a lag&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>Key teaching point:&lt;/strong> The shrinking coefficient is the &lt;em>point&lt;/em> of fixed effects — we strip away confounding variation to isolate the causal estimate, which may be smaller than the biased OLS estimate.&lt;/p>
&lt;p>&lt;strong>4.&lt;/strong> Why clustered standard errors:&lt;/p>
&lt;p>(a) &lt;strong>Assumption 2&lt;/strong> requires observations to be i.i.d. draws &lt;em>across entities&lt;/em>. The assumption only requires independence across entities, not within. However, standard OLS/robust SEs assume errors are independent across &lt;em>all&lt;/em> observations. Within a state, fatality rates in year $t$ are correlated with year $t-1$ (serial correlation). Clustering at the state level allows arbitrary within-state, within-cluster correlation — correcting for this autocorrelation in the error term.&lt;/p>
&lt;p>(b) With positive serial correlation (typical in panel data), unclustered SEs are usually &lt;strong>too small&lt;/strong>. This means confidence intervals are too narrow and $t$-statistics are too large → we over-reject the null (false positives). The point estimates are still consistent — only inference is wrong. Note: in some designs with negative within-cluster correlation, clustered SEs can be larger than unclustered, but this is less common.&lt;/p>
&lt;p>&lt;strong>5.&lt;/strong> Remaining threats even with two-way FE:&lt;/p>
&lt;ol>
&lt;li>
&lt;p>&lt;strong>Time-varying, state-specific confounders:&lt;/strong> Other state policies changing at the same time as beer taxes (DUI enforcement, speed limits, seatbelt laws). Two-way FE only handles fixed state traits and common time shocks — not policies varying across both states and time.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>Reverse causality:&lt;/strong> States with rising fatality rates might respond by raising beer taxes. The policy change may be endogenous to the outcome.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>Measurement error:&lt;/strong> Beer taxes are a poor proxy for the actual price of alcohol (substitution to untaxed beverages, cross-border purchases).&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>Spillovers:&lt;/strong> A tax increase in one state may push drinking across state lines (SUTVA violation).&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>Lagged effects:&lt;/strong> Tax changes may not affect behavior immediately; contemporaneous comparison may miss the effect.&lt;/p>
&lt;/li>
&lt;/ol>
&lt;p>&lt;strong>6.&lt;/strong> With staggered adoption, TWFE implicitly uses &lt;em>already-treated&lt;/em> states as &amp;ldquo;controls&amp;rdquo; for later-adopting states. If treatment effects change over time (dynamic effects), the already-treated state&amp;rsquo;s continuing effect looks like a &amp;ldquo;trend&amp;rdquo; — contaminating the comparison. TWFE is a weighted average of all possible 2×2 DiD comparisons, and when treatment effects are heterogeneous, some comparisons receive &lt;em>negative&lt;/em> weights, potentially flipping the sign of the aggregate estimate even when all individual true effects are positive. Modern estimators (Callaway-Sant&amp;rsquo;Anna, Sun-Abraham) avoid this by only using not-yet-treated or never-treated units as controls.&lt;/p>
&lt;h3 id="teaching-notes">Teaching Notes&lt;/h3>
&lt;ul>
&lt;li>This scenario is adapted from the Stock &amp;amp; Watson textbook&amp;rsquo;s running example of U.S. traffic fatalities (originally from Ruhm, 1996). Numbers are constructed for pedagogical clarity.&lt;/li>
&lt;li>The R² jump from 0.091 (pooled) to 0.905 (entity FE) is a great discussion point — most variation in fatality rates is &lt;em>between&lt;/em> states, not &lt;em>within&lt;/em> states over time.&lt;/li>
&lt;li>The income sign-flip is one of the most memorable examples of OVB in panel data — exactly what FE is designed to fix.&lt;/li>
&lt;li>For Q4, connect explicitly to the &amp;ldquo;Problem of Serial Correlation&amp;rdquo; slide: stress that coefficients are still consistent, but inference (SEs, CIs, p-values) is wrong without clustering.&lt;/li>
&lt;li>Q6 is optional — use if students are comfortable with basic DiD. It connects to the staggered adoption section of Tuesday&amp;rsquo;s slides.&lt;/li>
&lt;/ul></description></item><item><title/><link>https://econ3500s26.netlify.app/content/practice/ch12-activity1-good-instrument/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/practice/ch12-activity1-good-instrument/</guid><description>&lt;p>ECON3500 Econometrics and Applications &lt;br>
Spring 2026&lt;/p>
&lt;h1 id="in-class-activity-is-this-a-good-instrument">In-Class Activity: Is This a Good Instrument?&lt;/h1>
&lt;p>&lt;strong>Chapter 12 — Instrumental Variables Regression&lt;/strong>&lt;/p>
&lt;p>For each scenario below, a researcher wants to estimate the causal effect of $X$ on $Y$ and proposes using $Z$ as an instrumental variable. Your job: evaluate whether $Z$ is a valid instrument.&lt;/p>
&lt;p>For each scenario, assess:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Relevance&lt;/strong>: Is $Z$ correlated with $X$? Why or why not?&lt;/li>
&lt;li>&lt;strong>Exogeneity&lt;/strong>: Is $Z$ uncorrelated with the error term $u$? What could go wrong?&lt;/li>
&lt;li>&lt;strong>Exclusion&lt;/strong>: Does $Z$ affect $Y$ &lt;em>only&lt;/em> through $X$? What are the threats?&lt;/li>
&lt;li>&lt;strong>Overall verdict&lt;/strong>: Good instrument, bad instrument, or &amp;ldquo;it depends&amp;rdquo;?&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h2 id="scenario-1-returns-to-education">Scenario 1: Returns to Education&lt;/h2>
&lt;p>A researcher wants to estimate the effect of &lt;strong>years of education&lt;/strong> on &lt;strong>log weekly earnings&lt;/strong>. She is worried that ability is unobserved and correlated with both education and earnings (classic OVB). She proposes using &lt;strong>distance from the student&amp;rsquo;s childhood home to the nearest four-year college&lt;/strong> as an instrument for years of education.&lt;/p>
&lt;p>&lt;em>(Based on Card, 1993)&lt;/em>&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>&lt;/th>
&lt;th>Your assessment&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Relevance&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Exogeneity&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Exclusion&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Verdict&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;hr>
&lt;h2 id="scenario-2-police-and-crime">Scenario 2: Police and Crime&lt;/h2>
&lt;p>A city government wants to know whether &lt;strong>hiring more police officers&lt;/strong> reduces &lt;strong>violent crime&lt;/strong>. The problem: cities that experience crime waves hire more police, creating simultaneous causality. A researcher proposes using &lt;strong>the number of firefighters hired in the same year&lt;/strong> as an instrument for police hiring.&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>&lt;/th>
&lt;th>Your assessment&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Relevance&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Exogeneity&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Exclusion&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Verdict&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;hr>
&lt;h2 id="scenario-3-class-size-and-test-scores">Scenario 3: Class Size and Test Scores&lt;/h2>
&lt;p>A school district wants to estimate the effect of &lt;strong>class size&lt;/strong> on &lt;strong>standardized test scores&lt;/strong>. Parents with more resources may sort into schools with smaller classes (selection bias). A researcher proposes using &lt;strong>random fluctuations in cohort size due to enrollment cutoffs&lt;/strong> (e.g., Maimonides' Rule: when enrollment crosses a multiple of 40, an additional class section is created) as an instrument for class size.&lt;/p>
&lt;p>&lt;em>(Based on Angrist and Lavy, 1999)&lt;/em>&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>&lt;/th>
&lt;th>Your assessment&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Relevance&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Exogeneity&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Exclusion&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Verdict&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;hr>
&lt;h2 id="scenario-4-smoking-and-health">Scenario 4: Smoking and Health&lt;/h2>
&lt;p>A health economist wants to estimate the effect of &lt;strong>cigarette consumption&lt;/strong> (packs per day) on &lt;strong>infant birth weight&lt;/strong>. Smoking behavior is likely correlated with other health behaviors, income, and stress, all of which also affect birth weight. She proposes using &lt;strong>state cigarette excise taxes&lt;/strong> as an instrument for cigarette consumption.&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>&lt;/th>
&lt;th>Your assessment&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Relevance&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Exogeneity&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Exclusion&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Verdict&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;hr>
&lt;h2 id="scenario-5-income-and-health">Scenario 5: Income and Health&lt;/h2>
&lt;p>A researcher wants to estimate the causal effect of &lt;strong>income&lt;/strong> on &lt;strong>self-reported health status&lt;/strong>. Income is endogenous because health affects the ability to work (reverse causality) and unobserved factors like motivation affect both. He proposes using &lt;strong>lottery winnings&lt;/strong> (among people who play the lottery) as an instrument for income.&lt;/p>
&lt;p>&lt;em>(Based on Lindahl, 2005)&lt;/em>&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>&lt;/th>
&lt;th>Your assessment&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Relevance&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Exogeneity&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Exclusion&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Verdict&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;hr>
&lt;p>\newpage&lt;/p>
&lt;h2 id="instructor-notes--do-not-distribute">INSTRUCTOR NOTES — DO NOT DISTRIBUTE&lt;/h2>
&lt;h3 id="scenario-1-distance-to-college--education--earnings">Scenario 1: Distance to College → Education → Earnings&lt;/h3>
&lt;ul>
&lt;li>&lt;strong>Relevance&lt;/strong>: Yes. Students who grow up closer to a college face lower costs of attending (commuting, living at home) and are more likely to attend. Empirically strong first stage.&lt;/li>
&lt;li>&lt;strong>Exogeneity&lt;/strong>: Potential concern. Families who live near colleges may differ systematically — they may live in more urban areas, have higher-income parents, or have better access to other amenities. If these factors also affect earnings directly, exogeneity fails. Card (1993) argues that after controlling for region, urban/rural status, and family background, proximity is plausibly exogenous.&lt;/li>
&lt;li>&lt;strong>Exclusion&lt;/strong>: The worry is that proximity to a college captures something about the local labor market or community characteristics that directly affect earnings. If living near a college means living near a city with better job opportunities, the exclusion restriction is violated. Controls help, but the assumption is not testable.&lt;/li>
&lt;li>&lt;strong>Verdict&lt;/strong>: Reasonable but imperfect. The key question is whether controls adequately capture the ways proximity might directly affect earnings. This is a classic &amp;ldquo;plausible but debatable&amp;rdquo; instrument — good for discussion.&lt;/li>
&lt;/ul>
&lt;h3 id="scenario-2-firefighters--police--crime">Scenario 2: Firefighters → Police → Crime&lt;/h3>
&lt;ul>
&lt;li>&lt;strong>Relevance&lt;/strong>: Moderate. Cities that expand public safety budgets may hire both more police and more firefighters. Budget shocks could move both.&lt;/li>
&lt;li>&lt;strong>Exogeneity&lt;/strong>: Problematic. The same budget pressures, political dynamics, or public safety concerns that drive firefighter hiring could also directly affect crime or crime reporting. A crime wave could trigger expanded hiring across all public safety departments.&lt;/li>
&lt;li>&lt;strong>Exclusion&lt;/strong>: Very dubious. Firefighter hiring likely reflects the same municipal budget and public safety environment that directly affects crime through many channels (social services, economic conditions, etc.).&lt;/li>
&lt;li>&lt;strong>Verdict&lt;/strong>: Bad instrument. The conditions that drive firefighter hiring are likely correlated with unobserved determinants of crime. This violates both exogeneity and exclusion. (Compare with Levitt&amp;rsquo;s (1997) use of electoral cycles as an instrument, which is more plausible.)&lt;/li>
&lt;/ul>
&lt;h3 id="scenario-3-enrollment-cutoffs--class-size--test-scores">Scenario 3: Enrollment Cutoffs → Class Size → Test Scores&lt;/h3>
&lt;ul>
&lt;li>&lt;strong>Relevance&lt;/strong>: Yes. Maimonides' Rule mechanically determines class size. When enrollment crosses a multiple of 40, an additional section is created, producing a sharp drop in average class size. This generates strong variation.&lt;/li>
&lt;li>&lt;strong>Exogeneity&lt;/strong>: Strong. The enrollment cutoffs are determined by administrative rules and population fluctuations that are plausibly unrelated to the characteristics of individual students. Parents would need to precisely manipulate enrollment counts to game the system, which is difficult.&lt;/li>
&lt;li>&lt;strong>Exclusion&lt;/strong>: Mostly satisfied. The enrollment cutoff affects students primarily through class size. One concern: crossing a threshold could also affect other school resources (need for additional classrooms, teacher quality of marginal hires). Angrist and Lavy argue these channels are minor.&lt;/li>
&lt;li>&lt;strong>Verdict&lt;/strong>: Good instrument — one of the textbook examples of a well-designed IV strategy. The mechanical rule creates quasi-random variation in class size.&lt;/li>
&lt;/ul>
&lt;h3 id="scenario-4-cigarette-taxes--smoking--birth-weight">Scenario 4: Cigarette Taxes → Smoking → Birth Weight&lt;/h3>
&lt;ul>
&lt;li>&lt;strong>Relevance&lt;/strong>: Yes. Higher taxes raise the price of cigarettes, which reduces consumption (basic demand theory). Empirically well-documented.&lt;/li>
&lt;li>&lt;strong>Exogeneity&lt;/strong>: Moderate concern. State tax rates could be correlated with state-level characteristics that also affect birth weight — for example, states with higher cigarette taxes may also have better public health infrastructure, higher incomes, or different demographics. Including state-level controls helps.&lt;/li>
&lt;li>&lt;strong>Exclusion&lt;/strong>: The main worry: cigarette taxes generate state revenue that could fund health programs, or high-tax states may have other anti-smoking policies (clean air laws, health education) that affect birth weight through channels other than the mother&amp;rsquo;s smoking. If taxes proxy for a broader public health environment, exclusion is violated.&lt;/li>
&lt;li>&lt;strong>Verdict&lt;/strong>: Decent but requires careful controls. The instrument is relevant and widely used (Evans and Ringel, 1999), but exclusion requires that tax variation is not simply proxying for a state&amp;rsquo;s overall health policy environment. Cross-state and over-time variation in tax changes can help.&lt;/li>
&lt;/ul>
&lt;h3 id="scenario-5-lottery-winnings--income--health">Scenario 5: Lottery Winnings → Income → Health&lt;/h3>
&lt;ul>
&lt;li>&lt;strong>Relevance&lt;/strong>: Yes. Lottery winnings directly increase income. The magnitude depends on the size of winnings — small prizes may not generate enough variation (potential weak instrument concern for small winners).&lt;/li>
&lt;li>&lt;strong>Exogeneity&lt;/strong>: Strong, conditional on playing the lottery. Among lottery players, the amount won is random. The key conditioning assumption is that the &lt;em>decision to play&lt;/em> is not itself endogenous — the sample is restricted to lottery players, so we need exogeneity within that group, not in the general population.&lt;/li>
&lt;li>&lt;strong>Exclusion&lt;/strong>: Mostly satisfied. The main channel from lottery winnings to health is through increased income/wealth. Possible concerns: large winnings could cause stress, lifestyle changes, or social disruption that affect health independent of the &amp;ldquo;income&amp;rdquo; channel. But for moderate winnings, income is the dominant pathway.&lt;/li>
&lt;li>&lt;strong>Verdict&lt;/strong>: Good instrument (within the lottery-playing population). The randomization of winnings is a powerful source of exogenous variation. The main limitation is &lt;strong>external validity&lt;/strong>: the LATE applies to lottery players, who may not be representative of the general population. Also, the LATE captures the effect of &lt;em>windfall&lt;/em> income, which may differ from the effect of &lt;em>earned&lt;/em> income.&lt;/li>
&lt;/ul>
&lt;h3 id="general-teaching-notes">General Teaching Notes&lt;/h3>
&lt;ul>
&lt;li>Emphasize that &lt;strong>exogeneity and exclusion are not testable&lt;/strong> — they require economic reasoning and domain knowledge.&lt;/li>
&lt;li>Relevance &lt;em>is&lt;/em> testable via the first-stage F-statistic ($F &amp;gt; 10$ rule of thumb).&lt;/li>
&lt;li>Use these scenarios to highlight that most IV debates are about exclusion and exogeneity, not relevance.&lt;/li>
&lt;li>Scenario 2 (firefighters) is intentionally a clear &amp;ldquo;bad instrument&amp;rdquo; — students should feel confident rejecting it. Scenarios 1 and 4 are &amp;ldquo;good but debatable,&amp;rdquo; which models the real-world messiness of IV.&lt;/li>
&lt;li>Connect to &lt;strong>LATE&lt;/strong>: In each scenario, ask students &lt;em>who are the compliers?&lt;/em> For example, in Scenario 1, compliers are students whose education decision is actually affected by distance to college — not those who would attend regardless or never attend regardless.&lt;/li>
&lt;/ul></description></item><item><title/><link>https://econ3500s26.netlify.app/content/practice/ch12-activity2-2sls-in-action/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/practice/ch12-activity2-2sls-in-action/</guid><description>&lt;p>ECON3500 Econometrics and Applications &lt;br>
Spring 2026&lt;/p>
&lt;h1 id="in-class-activity-2sls-in-action">In-Class Activity: 2SLS in Action&lt;/h1>
&lt;p>&lt;strong>Chapter 12 — Instrumental Variables Regression&lt;/strong>&lt;/p>
&lt;p>A labor economist wants to estimate the &lt;strong>causal effect of years of education on log hourly wages&lt;/strong>. She uses data on 3,010 U.S. men born between 1930 and 1939 from the 1980 Census. She is worried that OLS estimates are biased because &lt;strong>ability&lt;/strong> is unobserved: more able individuals get more education &lt;em>and&lt;/em> earn higher wages, even holding education constant.&lt;/p>
&lt;p>Her proposed instrument: &lt;strong>quarter of birth&lt;/strong>. The idea is that compulsory schooling laws require students to stay in school until age 16. Students born earlier in the year reach age 16 earlier in the school year, so they can legally drop out with less total education. Quarter of birth thus affects years of education completed but should not directly affect wages.&lt;/p>
&lt;p>&lt;em>(Based on Angrist and Krueger, 1991)&lt;/em>&lt;/p>
&lt;hr>
&lt;h2 id="regression-output">Regression Output&lt;/h2>
&lt;h3 id="table-1-ols-regression">Table 1: OLS Regression&lt;/h3>
&lt;p>Dependent variable: Log hourly wage&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Variable&lt;/th>
&lt;th>Coefficient&lt;/th>
&lt;th>Std. Error&lt;/th>
&lt;th>t-statistic&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Years of education&lt;/td>
&lt;td>0.0700&lt;/td>
&lt;td>0.0035&lt;/td>
&lt;td>20.00&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Black&lt;/td>
&lt;td>-0.236&lt;/td>
&lt;td>0.018&lt;/td>
&lt;td>-13.11&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Married&lt;/td>
&lt;td>0.121&lt;/td>
&lt;td>0.016&lt;/td>
&lt;td>7.56&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Region (South)&lt;/td>
&lt;td>-0.098&lt;/td>
&lt;td>0.015&lt;/td>
&lt;td>-6.53&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Constant&lt;/td>
&lt;td>5.02&lt;/td>
&lt;td>0.052&lt;/td>
&lt;td>96.54&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>$R^2 = 0.132$, $n = 3{,}010$&lt;/p>
&lt;hr>
&lt;h3 id="table-2-first-stage-regression">Table 2: First-Stage Regression&lt;/h3>
&lt;p>Dependent variable: Years of education&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Variable&lt;/th>
&lt;th>Coefficient&lt;/th>
&lt;th>Std. Error&lt;/th>
&lt;th>t-statistic&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Born Q1 (Jan–Mar)&lt;/td>
&lt;td>-0.152&lt;/td>
&lt;td>0.067&lt;/td>
&lt;td>-2.27&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Black&lt;/td>
&lt;td>-1.47&lt;/td>
&lt;td>0.12&lt;/td>
&lt;td>-12.25&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Married&lt;/td>
&lt;td>0.38&lt;/td>
&lt;td>0.11&lt;/td>
&lt;td>3.45&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Region (South)&lt;/td>
&lt;td>-0.54&lt;/td>
&lt;td>0.10&lt;/td>
&lt;td>-5.40&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Constant&lt;/td>
&lt;td>12.84&lt;/td>
&lt;td>0.078&lt;/td>
&lt;td>164.62&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>$R^2 = 0.087$, $n = 3{,}010$&lt;/p>
&lt;p>&lt;strong>First-stage F-statistic on excluded instrument (Born Q1):&lt;/strong> $F = 5.15$&lt;/p>
&lt;hr>
&lt;h3 id="table-3-iv2sls-regression">Table 3: IV/2SLS Regression&lt;/h3>
&lt;p>Dependent variable: Log hourly wage &lt;br>
Endogenous variable: Years of education &lt;br>
Instrument: Born Q1 (= 1 if born January–March)&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Variable&lt;/th>
&lt;th>Coefficient&lt;/th>
&lt;th>Std. Error&lt;/th>
&lt;th>t-statistic&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Years of education&lt;/td>
&lt;td>0.142&lt;/td>
&lt;td>0.061&lt;/td>
&lt;td>2.33&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Black&lt;/td>
&lt;td>-0.133&lt;/td>
&lt;td>0.075&lt;/td>
&lt;td>-1.77&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Married&lt;/td>
&lt;td>0.075&lt;/td>
&lt;td>0.043&lt;/td>
&lt;td>1.74&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Region (South)&lt;/td>
&lt;td>-0.060&lt;/td>
&lt;td>0.038&lt;/td>
&lt;td>-1.58&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Constant&lt;/td>
&lt;td>4.11&lt;/td>
&lt;td>0.78&lt;/td>
&lt;td>5.27&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>$n = 3{,}010$&lt;/p>
&lt;hr>
&lt;h2 id="questions">Questions&lt;/h2>
&lt;h3 id="question-1-ols-interpretation">Question 1: OLS Interpretation&lt;/h3>
&lt;p>(a) Interpret the OLS coefficient on years of education. Be precise about units and magnitude.&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>(b) Why might this coefficient be a &lt;em>biased&lt;/em> estimate of the causal effect of education on wages? What is the likely direction of the bias? Explain using the omitted variable bias formula.&lt;/p>
&lt;p>\vspace{3cm}&lt;/p>
&lt;hr>
&lt;h3 id="question-2-first-stage">Question 2: First Stage&lt;/h3>
&lt;p>(a) Interpret the coefficient on &amp;ldquo;Born Q1&amp;rdquo; in the first-stage regression. What does it tell us about the relationship between quarter of birth and years of education?&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>(b) The first-stage F-statistic on the excluded instrument is $F = 5.15$. What does this tell us? Should we be concerned? Why or why not?&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;hr>
&lt;h3 id="question-3-iv2sls-interpretation">Question 3: IV/2SLS Interpretation&lt;/h3>
&lt;p>(a) Interpret the 2SLS coefficient on years of education. How does it compare to the OLS estimate?&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>(b) Using the simple IV formula $\hat{\beta}_1^{IV} = \frac{Cov(\text{log wage}, \text{Born Q1})}{Cov(\text{educ}, \text{Born Q1})}$, explain intuitively why the IV estimate is larger than the OLS estimate.&lt;/p>
&lt;p>\vspace{3cm}&lt;/p>
&lt;hr>
&lt;h3 id="question-4-bias-direction">Question 4: Bias Direction&lt;/h3>
&lt;p>Given that the OLS estimate (0.070) is &lt;em>smaller&lt;/em> than the 2SLS estimate (0.142), what does this imply about the direction of bias in OLS? Is this surprising? Why or why not?&lt;/p>
&lt;p>&lt;em>Hint: Think about what you would expect if ability bias drives OLS upward. What else could be going on?&lt;/em>&lt;/p>
&lt;p>\vspace{3cm}&lt;/p>
&lt;hr>
&lt;h3 id="question-5-late-and-validity">Question 5: LATE and Validity&lt;/h3>
&lt;p>(a) The IV estimate is a &lt;strong>Local Average Treatment Effect (LATE)&lt;/strong>. In this context, who are the &lt;em>compliers&lt;/em> — the group whose behavior is actually affected by the instrument?&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>(b) Name one threat to the validity of quarter of birth as an instrument for education. That is, provide a reason why the exogeneity or exclusion restriction might fail.&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;hr>
&lt;p>\newpage&lt;/p>
&lt;h2 id="instructor-notes--do-not-distribute">INSTRUCTOR NOTES — DO NOT DISTRIBUTE&lt;/h2>
&lt;h3 id="question-1-ols-interpretation-1">Question 1: OLS Interpretation&lt;/h3>
&lt;p>&lt;strong>(a)&lt;/strong> One additional year of education is associated with approximately 7.0% higher hourly wages, controlling for race, marital status, and region. (Technically: a one-year increase in education is associated with a 0.070 increase in log wages, or about 7.0% higher wages.)&lt;/p>
&lt;p>&lt;strong>(b)&lt;/strong> If ability is omitted:&lt;/p>
&lt;ul>
&lt;li>$\text{Bias} = \beta_{\text{ability}} \times \frac{Cov(\text{educ}, \text{ability})}{Var(\text{educ})}$&lt;/li>
&lt;li>$\beta_{\text{ability}} &amp;gt; 0$ (ability raises wages)&lt;/li>
&lt;li>$Cov(\text{educ}, \text{ability}) &amp;gt; 0$ (more able people get more education)&lt;/li>
&lt;li>Therefore bias is &lt;strong>positive&lt;/strong>: OLS &lt;em>overestimates&lt;/em> the causal effect of education.&lt;/li>
&lt;li>This is the standard prediction. We expect OLS &amp;gt; true causal effect if ability bias dominates.&lt;/li>
&lt;/ul>
&lt;h3 id="question-2-first-stage-1">Question 2: First Stage&lt;/h3>
&lt;p>&lt;strong>(a)&lt;/strong> Being born in Q1 (January–March) is associated with &lt;strong>0.152 fewer years of education&lt;/strong> compared to being born in Q2–Q4, controlling for race, marital status, and region. This is consistent with compulsory schooling laws: Q1 students reach the legal dropout age earlier in the school year and can exit with less total schooling.&lt;/p>
&lt;p>&lt;strong>(b)&lt;/strong> The F-statistic of 5.15 is &lt;strong>below the conventional threshold of 10&lt;/strong> for strong instruments. This is a &lt;strong>weak instrument&lt;/strong> concern.&lt;/p>
&lt;p>Consequences of weak instruments:&lt;/p>
&lt;ul>
&lt;li>2SLS estimates are biased toward OLS in finite samples&lt;/li>
&lt;li>Standard errors are unreliable (confidence intervals have incorrect coverage)&lt;/li>
&lt;li>The 2SLS estimate may not be trustworthy&lt;/li>
&lt;/ul>
&lt;p>This is a well-known limitation of the Angrist-Krueger design with a single quarter-of-birth indicator. The original paper used multiple QOB indicators interacted with year of birth to generate more variation — but even then, weak instrument concerns have been raised (Bound, Jaeger, and Baker, 1995).&lt;/p>
&lt;h3 id="question-3-iv2sls-interpretation-1">Question 3: IV/2SLS Interpretation&lt;/h3>
&lt;p>&lt;strong>(a)&lt;/strong> The 2SLS estimate implies that an additional year of education &lt;em>causes&lt;/em> approximately a 14.2% increase in hourly wages. This is &lt;strong>twice as large&lt;/strong> as the OLS estimate of 7.0%.&lt;/p>
&lt;p>The standard errors are also much larger (0.061 vs. 0.0035), which is typical of IV — we are using less variation (only the part driven by the instrument), so estimates are noisier.&lt;/p>
&lt;p>&lt;strong>(b)&lt;/strong> Intuition for the IV formula: The numerator captures how much wages differ by quarter of birth. The denominator captures how much education differs by quarter of birth. Since the education difference (denominator) is small (only 0.15 years), even a modest wage difference gets &amp;ldquo;scaled up&amp;rdquo; substantially when we divide. This is partly why the IV estimate is large — and why weak instruments are dangerous (a small, noisy denominator amplifies noise).&lt;/p>
&lt;h3 id="question-4-bias-direction-1">Question 4: Bias Direction&lt;/h3>
&lt;p>The fact that OLS (0.070) &amp;lt; IV (0.142) is &lt;strong>surprising&lt;/strong> if we believe ability bias is the main problem, because ability bias should push OLS &lt;em>up&lt;/em>, not down.&lt;/p>
&lt;p>Possible explanations:&lt;/p>
&lt;ol>
&lt;li>&lt;strong>Measurement error in education&lt;/strong>: If education is measured with error (self-reported years of schooling), OLS is attenuated toward zero. IV corrects for measurement error because the instrument is correlated with &lt;em>true&lt;/em> education, not the measurement error. This attenuation could be larger than the upward ability bias.&lt;/li>
&lt;li>&lt;strong>LATE vs. ATE&lt;/strong>: The IV estimate applies to &lt;em>compliers&lt;/em> — people whose education was affected by compulsory schooling laws. These are individuals at the margin of dropping out, who likely have lower baseline education. The return to education may be &lt;em>higher&lt;/em> for this group than for the average person in the sample (heterogeneous treatment effects). So IV &amp;gt; OLS could reflect a higher return for compliers, not necessarily that OLS is biased downward for the whole population.&lt;/li>
&lt;li>&lt;strong>Weak instrument bias&lt;/strong>: With $F = 5.15$, the IV estimate is biased toward OLS, meaning the true IV (if we had a strong instrument) might be even larger — or the current estimate could be unreliable.&lt;/li>
&lt;/ol>
&lt;p>The measurement error explanation is particularly compelling and is emphasized in the literature.&lt;/p>
&lt;h3 id="question-5-late-and-validity-1">Question 5: LATE and Validity&lt;/h3>
&lt;p>&lt;strong>(a)&lt;/strong> The &lt;strong>compliers&lt;/strong> are individuals whose total years of education were affected by compulsory schooling laws — specifically, those who would have dropped out of school earlier if they could have (i.e., if they had reached age 16 before the end of the school year). These are people on the margin of the dropout decision.&lt;/p>
&lt;p>The IV estimate does &lt;em>not&lt;/em> apply to:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Always-takers&lt;/strong>: People who would have stayed in school regardless of when they turned 16 (e.g., college-bound students)&lt;/li>
&lt;li>&lt;strong>Never-takers&lt;/strong>: People who dropped out before age 16 regardless&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>(b)&lt;/strong> Threats to validity:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Season-of-birth and family background&lt;/strong>: If families with different characteristics have children at different times of year (e.g., higher-SES families are more likely to have spring babies), then quarter of birth is correlated with ability/background, violating exogeneity. There is some evidence of this (Buckles and Hungerman, 2013).&lt;/li>
&lt;li>&lt;strong>Age-at-test effects&lt;/strong>: Students born earlier in the year are older when they take any given test or enter the labor market at a given calendar date. If age itself affects productivity, quarter of birth has a direct effect on wages (violating exclusion).&lt;/li>
&lt;li>&lt;strong>School entry age effects&lt;/strong>: Quarter of birth affects the age at which a child starts school. If starting school younger/older affects learning or development, this is a channel from Z to Y that does not operate through &lt;em>total years&lt;/em> of education.&lt;/li>
&lt;/ul>
&lt;h3 id="general-teaching-notes">General Teaching Notes&lt;/h3>
&lt;ul>
&lt;li>Use this activity to reinforce that &lt;strong>IV estimates are noisy&lt;/strong> — much larger standard errors than OLS.&lt;/li>
&lt;li>The weak instrument finding ($F = 5.15$) is a great discussion point. Ask students: &amp;ldquo;If you were the referee, would you trust this estimate?&amp;rdquo;&lt;/li>
&lt;li>The &amp;ldquo;wrong sign&amp;rdquo; on bias direction (OLS &amp;lt; IV) is one of the most important findings in the returns-to-education literature. It highlights that multiple sources of bias can operate simultaneously (ability bias up, measurement error down), and that LATE may differ from ATE.&lt;/li>
&lt;li>Time permitting, discuss how Angrist and Krueger&amp;rsquo;s original paper used 3 quarter-of-birth dummies interacted with 10 year-of-birth dummies (30 instruments!) — which created its own problems (many weak instruments, overfitting in the first stage).&lt;/li>
&lt;/ul></description></item><item><title/><link>https://econ3500s26.netlify.app/content/practice/ch9-activity1-threat-detective/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/practice/ch9-activity1-threat-detective/</guid><description>&lt;p>ECON3500: Econometrics and Applications&lt;/p>
&lt;h1 id="in-class-activity-threat-detective">In-Class Activity: Threat Detective&lt;/h1>
&lt;p>&lt;strong>Chapter 9 — Threats to Internal Validity&lt;/strong>&lt;/p>
&lt;p>For each scenario below, a researcher estimates a regression and interprets the coefficient on the key independent variable as a causal effect. Your job is to play &lt;strong>threat detective&lt;/strong>: identify what could go wrong.&lt;/p>
&lt;p>For each scenario:&lt;/p>
&lt;ul>
&lt;li>(a) Identify which threat(s) to internal validity are most likely present. Choose from:
&lt;ol>
&lt;li>Omitted variable bias&lt;/li>
&lt;li>Wrong functional form&lt;/li>
&lt;li>Errors-in-variables bias (measurement error)&lt;/li>
&lt;li>Sample selection bias&lt;/li>
&lt;li>Simultaneous causality bias&lt;/li>
&lt;/ol>
&lt;/li>
&lt;li>(b) Explain the likely &lt;strong>direction of bias&lt;/strong> on the coefficient of interest (upward or downward). Be specific about your reasoning.&lt;/li>
&lt;li>(c) Propose &lt;strong>one concrete solution&lt;/strong> the researcher could implement.&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h2 id="scenario-1-education-and-earnings">Scenario 1: Education and Earnings&lt;/h2>
&lt;p>A researcher estimates the following model using data from employed adults aged 25&amp;ndash;65:&lt;/p>
&lt;p>$$\widehat{earnings}_i = 15{,}200 + 4{,}800 \cdot educ_i$$&lt;/p>
&lt;p>where $earnings_i$ is annual earnings in dollars and $educ_i$ is years of schooling. The researcher concludes that each additional year of education causes earnings to increase by $4,800.&lt;/p>
&lt;p>&lt;strong>(a) Threat(s):&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>&lt;strong>(b) Direction of bias and reasoning:&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>&lt;strong>(c) Proposed solution:&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;hr>
&lt;h2 id="scenario-2-police-and-crime">Scenario 2: Police and Crime&lt;/h2>
&lt;p>A researcher collects data on 300 U.S. cities and estimates:&lt;/p>
&lt;p>$$\widehat{crimerate}_i = 12.4 + 0.83 \cdot policepc_i + controls$$&lt;/p>
&lt;p>where $crimerate_i$ is violent crimes per 1,000 residents and $policepc_i$ is police officers per 1,000 residents. The researcher is puzzled: &amp;ldquo;More police appears to &lt;em>increase&lt;/em> crime.&amp;rdquo;&lt;/p>
&lt;p>&lt;strong>(a) Threat(s):&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>&lt;strong>(b) Direction of bias and reasoning:&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>&lt;strong>(c) Proposed solution:&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;hr>
&lt;h2 id="scenario-3-job-training-and-wages">Scenario 3: Job Training and Wages&lt;/h2>
&lt;p>A large firm offers a voluntary job training program. A researcher compares the wages of workers who enrolled in the program to those who did not:&lt;/p>
&lt;p>$$\widehat{wage}_i = 22.50 + 3.10 \cdot training_i$$&lt;/p>
&lt;p>where $wage_i$ is the hourly wage one year after the program was offered and $training_i = 1$ if the worker enrolled. The researcher concludes the program raised wages by $3.10/hour.&lt;/p>
&lt;p>&lt;strong>(a) Threat(s):&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>&lt;strong>(b) Direction of bias and reasoning:&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>&lt;strong>(c) Proposed solution:&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;hr>
&lt;h2 id="scenario-4-health-insurance-and-health">Scenario 4: Health Insurance and Health&lt;/h2>
&lt;p>A researcher uses survey data in which respondents self-report both their health insurance coverage and their health status on a 1&amp;ndash;10 scale. They estimate:&lt;/p>
&lt;p>$$\widehat{health}_i = 5.2 + 0.9 \cdot insured_i + controls$$&lt;/p>
&lt;p>where $insured_i = 1$ if the respondent reports having health insurance. However, it is known that roughly 10% of respondents misreport their insurance status (some insured people say they are uninsured, and vice versa).&lt;/p>
&lt;p>&lt;strong>(a) Threat(s):&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>&lt;strong>(b) Direction of bias and reasoning:&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>&lt;strong>(c) Proposed solution:&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;hr>
&lt;h2 id="scenario-5-advertising-and-sales">Scenario 5: Advertising and Sales&lt;/h2>
&lt;p>A national retail chain estimates the effect of advertising spending on store revenue using quarterly data from its 150 stores:&lt;/p>
&lt;p>$$\widehat{revenue}_i = 320{,}000 + 5.2 \cdot adspend_i$$&lt;/p>
&lt;p>where $revenue_i$ is quarterly store revenue in dollars and $adspend_i$ is quarterly advertising spending in dollars. The company allocates more advertising budget to stores that had strong sales the previous quarter.&lt;/p>
&lt;p>&lt;strong>(a) Threat(s):&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>&lt;strong>(b) Direction of bias and reasoning:&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>&lt;strong>(c) Proposed solution:&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;hr>
&lt;p>\newpage&lt;/p>
&lt;h2 id="instructor-notes--do-not-distribute">INSTRUCTOR NOTES — DO NOT DISTRIBUTE&lt;/h2>
&lt;h3 id="scenario-1-education-and-earnings-1">Scenario 1: Education and Earnings&lt;/h3>
&lt;p>&lt;strong>(a) Threats:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Omitted variable bias&lt;/strong> is the primary threat. Ability, family background, and motivation are correlated with both education and earnings but are omitted from the regression.&lt;/li>
&lt;li>&lt;strong>Sample selection bias&lt;/strong> is also present: the sample is restricted to employed adults. People with very low education may be disproportionately unemployed and excluded from the sample, which could bias the estimated return to education.&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>(b) Direction of bias:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>OVB: Upward bias. Ability is positively correlated with education (higher-ability people get more schooling) and positively correlated with earnings (higher-ability people earn more). Since both correlations are positive, the omitted variable bias formula gives a positive bias, so 4,800 likely overstates the true causal effect.&lt;/li>
&lt;li>Sample selection: Also likely upward. Among people with low education, only those with favorable unobserved characteristics (e.g., motivation, connections) remain employed, which compresses the apparent earnings gap between low- and high-education groups less than it should — but the net effect is ambiguous.&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>(c) Solutions:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Add control variables for ability (e.g., test scores), family background (parental education, income).&lt;/li>
&lt;li>Use an instrumental variable (e.g., proximity to a college, compulsory schooling laws, quarter of birth).&lt;/li>
&lt;li>Include the full adult population (employed and unemployed) to address selection.&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h3 id="scenario-2-police-and-crime-1">Scenario 2: Police and Crime&lt;/h3>
&lt;p>&lt;strong>(a) Threats:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Simultaneous causality bias&lt;/strong> is the primary threat. Crime rates affect police hiring decisions (cities with more crime hire more police), and police presence may also affect crime rates. Causality runs in both directions.&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>(b) Direction of bias:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Upward bias on the police coefficient. High crime causes cities to hire more police, creating a positive correlation between police and crime that is not the causal effect of police on crime. The true causal effect of police on crime is likely negative (more police reduces crime), but simultaneous causality pushes the estimated coefficient upward — potentially making it positive, as we see here.&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>(c) Solutions:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Use an instrumental variable that affects police staffing but does not directly affect crime (e.g., electoral cycles, firefighter staffing as in Levitt 1997, or federal grants for police hiring).&lt;/li>
&lt;li>Exploit natural experiments (e.g., terror alert levels that exogenously increase police presence, as in Klick and Tabarrok 2005).&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h3 id="scenario-3-job-training-and-wages-1">Scenario 3: Job Training and Wages&lt;/h3>
&lt;p>&lt;strong>(a) Threats:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Sample selection bias&lt;/strong> (self-selection into treatment). Workers who voluntarily enroll in training are likely systematically different from those who do not — they may be more motivated, more career-oriented, or already on an upward trajectory.&lt;/li>
&lt;li>&lt;strong>Omitted variable bias&lt;/strong> — motivation and ambition are omitted and correlated with both training enrollment and wages.&lt;/li>
&lt;li>Note: Students may identify either or both. Accept OVB as an answer, but emphasize that when the selection is into the treatment itself, &amp;ldquo;sample selection bias&amp;rdquo; is the more precise label per Stock &amp;amp; Watson Ch. 9.&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>(b) Direction of bias:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Upward bias. Workers who self-select into training are likely more motivated and ambitious, traits that independently lead to higher wages. The coefficient of 3.10 likely overstates the true causal effect of the program.&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>(c) Solutions:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Randomize access to the training program (RCT).&lt;/li>
&lt;li>Use the initial offer of training as an instrument (intent-to-treat / IV approach) if the program was offered to a random subset.&lt;/li>
&lt;li>Compare wages before and after training for participants vs. non-participants (difference-in-differences), though this still requires a parallel trends assumption.&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h3 id="scenario-4-health-insurance-and-health-1">Scenario 4: Health Insurance and Health&lt;/h3>
&lt;p>&lt;strong>(a) Threats:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Errors-in-variables bias (measurement error)&lt;/strong> in the independent variable. If 10% of respondents misreport their insurance status, $insured_i$ is measured with classical measurement error.&lt;/li>
&lt;li>Students may also identify &lt;strong>simultaneous causality&lt;/strong> (healthier people may be more likely to have jobs that provide insurance, and insurance may improve health) and &lt;strong>omitted variable bias&lt;/strong> (income, education, and health behaviors are correlated with both insurance and health outcomes).&lt;/li>
&lt;li>All three are defensible, but the scenario is written to highlight measurement error.&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>(b) Direction of bias:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Measurement error in a binary independent variable causes &lt;strong>attenuation bias&lt;/strong> — the coefficient is biased toward zero. The true effect of insurance on health is likely larger in magnitude than 0.9.&lt;/li>
&lt;li>Key point for students: Classical measurement error in X always biases the coefficient toward zero, regardless of the sign of the true effect.&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>(c) Solutions:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Use administrative records on insurance coverage (e.g., insurer enrollment data) instead of self-reports to eliminate measurement error.&lt;/li>
&lt;li>Use an instrumental variable (e.g., Medicaid eligibility cutoffs, employer mandate thresholds).&lt;/li>
&lt;li>Reference for discussion: The Oregon Health Insurance Experiment randomly assigned Medicaid access.&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h3 id="scenario-5-advertising-and-sales-1">Scenario 5: Advertising and Sales&lt;/h3>
&lt;p>&lt;strong>(a) Threats:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Simultaneous causality bias&lt;/strong> is the primary threat. The company explicitly allocates more advertising to stores with strong prior sales. Revenue drives advertising spending, not just the reverse.&lt;/li>
&lt;li>&lt;strong>Omitted variable bias&lt;/strong> may also be present: store location quality, local economic conditions, and management quality affect both revenue and the advertising budget allocated.&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>(b) Direction of bias:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Upward bias. Stores with high revenue receive more advertising budget, creating a positive feedback loop. The coefficient of 5.2 overstates the true causal return to an additional dollar of advertising.&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>(c) Solutions:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Randomly assign advertising budgets across stores (an A/B test or field experiment).&lt;/li>
&lt;li>Use an instrumental variable for advertising spending that is unrelated to store performance (e.g., random variation in local media costs).&lt;/li>
&lt;li>Use lagged advertising spending (from two or more quarters ago) as the independent variable to break the contemporaneous simultaneity, though this does not fully resolve the problem if the allocation rule is persistent.&lt;/li>
&lt;/ul></description></item><item><title/><link>https://econ3500s26.netlify.app/content/practice/ch9-activity2-regression-audit/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/practice/ch9-activity2-regression-audit/</guid><description>&lt;p>ECON3500: Econometrics and Applications&lt;/p>
&lt;h1 id="in-class-activity-regression-audit">In-Class Activity: Regression Audit&lt;/h1>
&lt;p>&lt;strong>Chapter 9 — Threats to Internal Validity&lt;/strong>&lt;/p>
&lt;h2 id="setup">Setup&lt;/h2>
&lt;p>A school district superintendent wants to know: &lt;strong>does reducing class size improve student test scores?&lt;/strong> She hires an economist to study the question using data from 420 elementary schools across California.&lt;/p>
&lt;p>The key variables are:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>testscr&lt;/strong> — district average test score (combined math and reading, scale 600&amp;ndash;720)&lt;/li>
&lt;li>&lt;strong>str&lt;/strong> — student-teacher ratio (average class size proxy)&lt;/li>
&lt;li>&lt;strong>el_pct&lt;/strong> — percent of students who are English learners&lt;/li>
&lt;li>&lt;strong>avginc&lt;/strong> — district average household income (in $1,000s)&lt;/li>
&lt;li>&lt;strong>meal_pct&lt;/strong> — percent of students qualifying for free/reduced-price meals (a proxy for poverty)&lt;/li>
&lt;li>&lt;strong>calworks&lt;/strong> — percent of students in public assistance programs&lt;/li>
&lt;/ul>
&lt;p>The economist estimates three specifications. Review the output below and answer the questions that follow.&lt;/p>
&lt;hr>
&lt;h3 id="specification-1-bivariate-regression">Specification (1): Bivariate regression&lt;/h3>
&lt;pre tabindex="0">&lt;code>. regress testscr str
Source | SS df MS Number of obs = 420
-------------+---------------------------------- F(1, 418) = 22.58
Model | 7794.11004 1 7794.11004 Prob &amp;gt; F = 0.0000
Residual | 144315.484 418 345.252354 R-squared = 0.0512
-------------+---------------------------------- Adj R-squared = 0.0490
Total | 152109.594 419 363.029819 Root MSE = 18.580
------------------------------------------------------------------------------
testscr | Coef. Std. Err. t P&amp;gt;|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
str | -2.279808 .4798256 -4.75 0.000 -3.223068 -1.336549
_cons | 698.9330 9.467491 73.83 0.000 680.3231 717.5428
------------------------------------------------------------------------------
&lt;/code>&lt;/pre>&lt;h3 id="specification-2-adding-demographic-controls">Specification (2): Adding demographic controls&lt;/h3>
&lt;pre tabindex="0">&lt;code>. regress testscr str el_pct meal_pct
Source | SS df MS Number of obs = 420
-------------+---------------------------------- F(3, 416) = 361.68
Model | 124045.959 3 41348.6529 Prob &amp;gt; F = 0.0000
Residual | 28063.6355 416 67.4606612 R-squared = 0.8156
-------------+---------------------------------- Adj R-squared = 0.8143
Total | 152109.594 419 363.029819 Root MSE = 8.2135
------------------------------------------------------------------------------
testscr | Coef. Std. Err. t P&amp;gt;|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
str | -0.998309 .2704117 -3.69 0.000 -1.530032 -0.466587
el_pct | -0.121573 .0332578 -3.66 0.000 -0.186919 -0.056228
meal_pct | -0.547235 .0240462 -22.75 0.000 -0.594531 -0.499940
_cons | 700.3918 5.537407 126.48 0.000 689.5001 711.2835
------------------------------------------------------------------------------
&lt;/code>&lt;/pre>&lt;h3 id="specification-3-adding-income-and-income-squared">Specification (3): Adding income and income-squared&lt;/h3>
&lt;pre tabindex="0">&lt;code>. regress testscr str el_pct meal_pct avginc avginc_sq
Source | SS df MS Number of obs = 420
-------------+---------------------------------- F(5, 414) = 241.17
Model | 126306.099 5 25261.2198 Prob &amp;gt; F = 0.0000
Residual | 25803.4953 414 62.3272831 R-squared = 0.8305
-------------+---------------------------------- Adj R-squared = 0.8284
Total | 152109.594 419 363.029819 Root MSE = 7.8948
------------------------------------------------------------------------------
testscr | Coef. Std. Err. t P&amp;gt;|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
str | -0.734206 .2637618 -2.78 0.006 -1.252840 -0.215571
el_pct | -0.176017 .0341101 -5.16 0.000 -0.243040 -0.108993
meal_pct | -0.420495 .0295224 -14.24 0.000 -0.478520 -0.362470
avginc | 3.850891 0.582873 6.61 0.000 2.704831 4.996951
avginc_sq | -0.043100 0.009935 -4.34 0.000 -0.062620 -0.023580
_cons | 663.7046 7.745928 85.69 0.000 648.4748 678.9345
------------------------------------------------------------------------------
&lt;/code>&lt;/pre>&lt;hr>
&lt;h2 id="questions">Questions&lt;/h2>
&lt;p>&lt;strong>1. Interpret the coefficient on &lt;code>str&lt;/code> in Specification (1). Is it statistically significant?&lt;/strong>&lt;/p>
&lt;p>\vspace{3cm}&lt;/p>
&lt;p>&lt;strong>2. Compare the coefficient on &lt;code>str&lt;/code> across the three specifications. What happens to it as controls are added? What does this pattern suggest about the direction of omitted variable bias in Specification (1)?&lt;/strong>&lt;/p>
&lt;p>\vspace{4cm}&lt;/p>
&lt;p>&lt;strong>3. Specification (1) omits &lt;code>el_pct&lt;/code> and &lt;code>meal_pct&lt;/code>. Using the omitted variable bias formula, explain why omitting these variables likely biases the coefficient on &lt;code>str&lt;/code> in Specification (1). Be specific about:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>&lt;strong>The likely correlation between &lt;code>str&lt;/code> and the omitted variable&lt;/strong>&lt;/li>
&lt;li>&lt;strong>The likely sign of the omitted variable&amp;rsquo;s coefficient&lt;/strong>&lt;/li>
&lt;li>&lt;strong>The resulting direction of bias&lt;/strong>&lt;/li>
&lt;/ul>
&lt;p>\vspace{4cm}&lt;/p>
&lt;p>&lt;strong>4. Specification (3) adds &lt;code>avginc&lt;/code> and &lt;code>avginc_sq&lt;/code> (income squared). What threat to internal validity does including &lt;code>avginc_sq&lt;/code> address? Why might the relationship between income and test scores be nonlinear?&lt;/strong>&lt;/p>
&lt;p>\vspace{3cm}&lt;/p>
&lt;p>&lt;strong>5. Even after adding all these controls, what threats to internal validity might remain? Identify at least two specific threats and, for each, explain:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>&lt;strong>What the specific concern is&lt;/strong>&lt;/li>
&lt;li>&lt;strong>What additional data or method could help address it&lt;/strong>&lt;/li>
&lt;/ul>
&lt;p>\vspace{5cm}&lt;/p>
&lt;p>&lt;strong>6. The superintendent wants to use these results to argue that the state legislature should fund smaller class sizes across California. Briefly assess:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>&lt;strong>(a) Does this study have internal validity for estimating the causal effect of class size on test scores in California? Why or why not?&lt;/strong>&lt;/li>
&lt;li>&lt;strong>(b) Would these results have external validity if applied to schools in a developing country? Why or why not?&lt;/strong>&lt;/li>
&lt;/ul>
&lt;p>\vspace{5cm}&lt;/p>
&lt;hr>
&lt;p>\newpage&lt;/p>
&lt;h2 id="instructor-notes--do-not-distribute">INSTRUCTOR NOTES — DO NOT DISTRIBUTE&lt;/h2>
&lt;h3 id="question-1">Question 1&lt;/h3>
&lt;p>The coefficient on &lt;code>str&lt;/code> in Specification (1) is -2.28. Interpretation: A one-unit increase in the student-teacher ratio (i.e., one more student per teacher) is associated with a 2.28-point decrease in district average test scores, holding nothing else constant.&lt;/p>
&lt;p>It is statistically significant at the 1% level: t = -4.75, p &amp;lt; 0.001. The 95% confidence interval [-3.22, -1.34] does not include zero.&lt;/p>
&lt;h3 id="question-2">Question 2&lt;/h3>
&lt;p>The coefficient on &lt;code>str&lt;/code> changes across specifications:&lt;/p>
&lt;ul>
&lt;li>Spec (1): -2.28&lt;/li>
&lt;li>Spec (2): -1.00&lt;/li>
&lt;li>Spec (3): -0.73&lt;/li>
&lt;/ul>
&lt;p>The coefficient shrinks in magnitude (toward zero) as controls are added. This pattern suggests that &lt;strong>Specification (1) overstates the negative effect&lt;/strong> — i.e., the omitted variable bias in Spec (1) is &lt;strong>negative&lt;/strong> (biasing the coefficient away from zero, making it more negative than the true causal effect).&lt;/p>
&lt;p>This makes sense: districts with high student-teacher ratios tend to be poorer districts that also have worse test scores for other reasons. Once we control for those other reasons (poverty, English learners, income), the remaining effect attributable to class size is smaller.&lt;/p>
&lt;h3 id="question-3">Question 3&lt;/h3>
&lt;p>Using the OVB formula: $bias = \delta_1 \times \gamma_{omitted}$&lt;/p>
&lt;p>&lt;strong>For &lt;code>meal_pct&lt;/code> (poverty):&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Correlation between &lt;code>str&lt;/code> and &lt;code>meal_pct&lt;/code>: Likely &lt;strong>positive&lt;/strong>. Poorer districts have less funding and therefore larger class sizes (higher student-teacher ratios).&lt;/li>
&lt;li>Coefficient on &lt;code>meal_pct&lt;/code> in the regression: &lt;strong>Negative&lt;/strong> (-0.55). Poverty is associated with lower test scores.&lt;/li>
&lt;li>Direction of bias: positive $\times$ negative = &lt;strong>negative bias&lt;/strong>. Omitting poverty makes the coefficient on &lt;code>str&lt;/code> more negative than its true value. This is consistent with what we observe.&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>For &lt;code>el_pct&lt;/code> (English learners):&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Correlation between &lt;code>str&lt;/code> and &lt;code>el_pct&lt;/code>: Likely &lt;strong>positive&lt;/strong>. Districts with more English learners tend to be urban or under-resourced and may have higher student-teacher ratios.&lt;/li>
&lt;li>Coefficient on &lt;code>el_pct&lt;/code>: &lt;strong>Negative&lt;/strong> (-0.12). More English learners is associated with lower average test scores.&lt;/li>
&lt;li>Direction of bias: positive $\times$ negative = &lt;strong>negative bias&lt;/strong>. Same direction.&lt;/li>
&lt;/ul>
&lt;p>Both omitted variables bias the &lt;code>str&lt;/code> coefficient in the same direction (more negative), which is why the coefficient shrinks when they are added.&lt;/p>
&lt;h3 id="question-4">Question 4&lt;/h3>
&lt;p>Including &lt;code>avginc_sq&lt;/code> addresses the threat of &lt;strong>wrong functional form&lt;/strong>. The relationship between income and test scores is likely nonlinear — specifically, concave (diminishing returns). An additional $1,000 in average income probably improves test scores more in poor districts than in wealthy districts.&lt;/p>
&lt;p>Evidence: The coefficient on &lt;code>avginc_sq&lt;/code> is negative (-0.043) and statistically significant (t = -4.34), confirming the concave relationship. Including the squared term improves R-squared from 0.8156 to 0.8305.&lt;/p>
&lt;p>If we only included &lt;code>avginc&lt;/code> linearly, we would be imposing a constant marginal effect of income across all income levels, which is a misspecification.&lt;/p>
&lt;h3 id="question-5">Question 5&lt;/h3>
&lt;p>Remaining threats (students should identify at least two):&lt;/p>
&lt;ol>
&lt;li>
&lt;p>&lt;strong>Omitted variable bias&lt;/strong>: Even with these controls, there may be unobserved factors correlated with both &lt;code>str&lt;/code> and test scores. Examples:&lt;/p>
&lt;ul>
&lt;li>Teacher quality: Districts that can afford smaller classes may also attract better teachers. We cannot separate the class-size effect from the teacher-quality effect.&lt;/li>
&lt;li>Parental involvement: Wealthier districts may have more engaged parents, which improves test scores independent of class size.&lt;/li>
&lt;li>School resources (beyond class size): Libraries, technology, facilities.&lt;/li>
&lt;li>&lt;em>Solution&lt;/em>: Add controls for teacher qualifications, school spending per pupil, or parental education. Alternatively, use an instrumental variable or a natural experiment (e.g., class size rules that create discontinuities, as in Angrist and Lavy 1999).&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>Simultaneous causality&lt;/strong>: Districts that observe low test scores may respond by reducing class sizes (hiring more teachers). If so, low test scores cause small class sizes, creating reverse causality.&lt;/p>
&lt;ul>
&lt;li>&lt;em>Solution&lt;/em>: Use an instrument for class size, or exploit policy rules that exogenously assign class sizes (e.g., maximum class size rules that create sharp cutoffs).&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>Errors-in-variables bias&lt;/strong>: The student-teacher ratio is a proxy for class size, not actual class size. Districts may have non-teaching staff counted in the ratio, or some teachers may serve as specialists rather than classroom instructors. This measurement error would attenuate the coefficient toward zero.&lt;/p>
&lt;ul>
&lt;li>&lt;em>Solution&lt;/em>: Use actual class size data rather than student-teacher ratios.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>Sample selection bias&lt;/strong>: The sample includes only California districts. If certain types of districts are systematically excluded (e.g., very small rural districts, charter schools), results may not reflect the true relationship even within California.&lt;/p>
&lt;ul>
&lt;li>&lt;em>Solution&lt;/em>: Verify sample coverage and compare included vs. excluded districts on observables.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ol>
&lt;h3 id="question-6">Question 6&lt;/h3>
&lt;p>&lt;strong>(a) Internal validity:&lt;/strong>
The study has &lt;strong>limited internal validity&lt;/strong>. While the addition of controls substantially reduces omitted variable bias (as evidenced by the coefficient changing from -2.28 to -0.73), the remaining threats identified in Question 5 — particularly unobserved teacher quality, simultaneous causality, and measurement error — mean we cannot be confident the estimate of -0.73 represents the true causal effect. The study is observational, not experimental, so there is no way to fully rule out confounding.&lt;/p>
&lt;p>That said, the study does a reasonable job of addressing the most obvious sources of bias, and the consistent negative sign across specifications is suggestive. The coefficient is &amp;ldquo;better&amp;rdquo; than the naive estimate but still potentially biased.&lt;/p>
&lt;p>&lt;strong>(b) External validity:&lt;/strong>
External validity to a developing country is &lt;strong>weak&lt;/strong>. Several factors differ:&lt;/p>
&lt;ul>
&lt;li>The relationship between class size and learning may differ due to different pedagogical approaches, curriculum, teacher training, and student background.&lt;/li>
&lt;li>Class sizes in developing countries are often much larger (40&amp;ndash;80 students), well outside the range observed in California (~15&amp;ndash;25). Extrapolating beyond the data range is unreliable.&lt;/li>
&lt;li>School infrastructure, teacher quality, and resource constraints differ fundamentally.&lt;/li>
&lt;li>The outcome measure (standardized test scores) may not be comparable.&lt;/li>
&lt;/ul>
&lt;p>The results may generalize to other U.S. states with similar demographics and school systems, but even that requires caution about institutional differences.&lt;/p></description></item><item><title/><link>https://econ3500s26.netlify.app/content/practice/ch9-activity3-study-doctor-solutions/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/practice/ch9-activity3-study-doctor-solutions/</guid><description>&lt;p>ECON 3500 Econometrics and Applications&lt;br>
Spring 2026&lt;/p>
&lt;h1 id="in-class-activity-regression-validity--solutions">In-Class Activity: Regression Validity — Solutions&lt;/h1>
&lt;p>&lt;strong>Chapter 9 — Assessing Studies Based on Multiple Regression&lt;/strong>&lt;br>
Time: ~15-20 minutes&lt;/p>
&lt;hr>
&lt;h2 id="your-job">Your Job&lt;/h2>
&lt;p>Each example below is a research study.
All six are based on real papers, but the descriptions below are simplified for class use.&lt;/p>
&lt;p>For each study:&lt;/p>
&lt;ol>
&lt;li>What is the &lt;strong>goal&lt;/strong>?
&lt;ul>
&lt;li>Causal inference&lt;/li>
&lt;li>Forecasting&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>What is the &lt;strong>main problem&lt;/strong>?
&lt;ul>
&lt;li>Omitted variable bias&lt;/li>
&lt;li>Wrong functional form&lt;/li>
&lt;li>Errors-in-variables bias&lt;/li>
&lt;li>Sample selection bias&lt;/li>
&lt;li>Simultaneous causality bias&lt;/li>
&lt;li>External validity only / not mainly an internal-validity problem&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>Why is that the right diagnosis?&lt;/li>
&lt;li>What is one concrete fix or improvement?&lt;/li>
&lt;/ol>
&lt;hr>
&lt;h2 id="quick-diagnosis-guide">Quick Diagnosis Guide&lt;/h2>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>If the problem is&amp;hellip;&lt;/th>
&lt;th>Ask yourself&amp;hellip;&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>&lt;strong>OVB&lt;/strong>&lt;/td>
&lt;td>Is there some omitted factor that affects $Y$ and is correlated with $X$?&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;strong>Wrong functional form&lt;/strong>&lt;/td>
&lt;td>Did we force a straight-line relationship when the true relationship is curved or interactive?&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;strong>Measurement error&lt;/strong>&lt;/td>
&lt;td>Is $X$ or $Y$ measured noisily, inaccurately, or systematically wrong?&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;strong>Sample selection&lt;/strong>&lt;/td>
&lt;td>Are some observations missing because of the outcome or some unobserved factor tied to it?&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;strong>Simultaneous causality&lt;/strong>&lt;/td>
&lt;td>Does $Y$ also affect $X$?&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;strong>External validity&lt;/strong>&lt;/td>
&lt;td>Even if the study is internally valid, would the result generalize to a different setting?&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;hr>
&lt;h2 id="example-1-catholic-schooling-and-educational-attainment">Example 1: Catholic Schooling and Educational Attainment&lt;/h2>
&lt;p>A researcher studies whether attending a Catholic high school increases graduation and college attendance. Students who attend Catholic schools may also come from families that are more motivated, more religious, or more education-focused to begin with.&lt;/p>
&lt;ul>
&lt;li>Goal: &lt;strong>Causal inference&lt;/strong>&lt;/li>
&lt;li>Diagnosis: &lt;strong>Omitted variable bias&lt;/strong>&lt;/li>
&lt;li>Why: Students who attend Catholic schools are selected. Family motivation, religiosity, discipline, and neighborhood context may affect both school choice and later attainment.&lt;/li>
&lt;li>Fixes:
&lt;ul>
&lt;li>Add better controls&lt;/li>
&lt;li>Use a credible IV or lottery-style design&lt;/li>
&lt;li>Compare similar students more carefully&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h2 id="example-2-oregon-medicaid-lottery">Example 2: Oregon Medicaid Lottery&lt;/h2>
&lt;p>The Oregon Health Insurance Experiment used a lottery to study the effects of Medicaid for low-income uninsured adults in Oregon. A policymaker wants to use those estimates to predict what the effects would be in a very different state with different hospitals, demographics, and eligibility rules.&lt;/p>
&lt;ul>
&lt;li>Goal: Usually &lt;strong>causal inference&lt;/strong> in the original study, but the policymaker&amp;rsquo;s question is about &lt;strong>external validity&lt;/strong>&lt;/li>
&lt;li>Diagnosis: &lt;strong>External validity only / not mainly an internal-validity problem&lt;/strong>&lt;/li>
&lt;li>Why: The question is whether Oregon lottery estimates transport to a very different setting. Students should talk about hospitals, baseline uninsured rates, take-up, and the local policy environment.&lt;/li>
&lt;li>Fixes:
&lt;ul>
&lt;li>Replicate in more settings&lt;/li>
&lt;li>Compare institutional context&lt;/li>
&lt;li>Ask whether the treated and target settings are genuinely comparable&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h2 id="example-3-survey-earnings-vs-administrative-records">Example 3: Survey Earnings vs. Administrative Records&lt;/h2>
&lt;p>Bound and Krueger compare workers' self-reported earnings in surveys to administrative earnings records. Suppose a researcher estimates the effect of earnings on some outcome using only the self-reported survey measure.&lt;/p>
&lt;ul>
&lt;li>Goal: &lt;strong>Causal inference&lt;/strong> or prediction; either answer is acceptable if justified&lt;/li>
&lt;li>Diagnosis: &lt;strong>Errors-in-variables bias&lt;/strong>&lt;/li>
&lt;li>Why: Self-reported earnings differ from administrative records. The observed regressor may contain measurement error, and Bound-Krueger show that it is not purely classical.&lt;/li>
&lt;li>Fixes:
&lt;ul>
&lt;li>Use administrative records&lt;/li>
&lt;li>Validate survey responses&lt;/li>
&lt;li>Be cautious about assuming classical attenuation only&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h2 id="example-4-wages-of-married-women">Example 4: Wages of Married Women&lt;/h2>
&lt;p>In Heckman&amp;rsquo;s classic sample-selection setup, wages are only observed for married women who choose to work. A researcher regresses wages on education using only women with observed wages.&lt;/p>
&lt;ul>
&lt;li>Goal: &lt;strong>Causal inference&lt;/strong>&lt;/li>
&lt;li>Diagnosis: &lt;strong>Sample selection bias&lt;/strong>&lt;/li>
&lt;li>Why: Wages are only observed for women who work. Selection into employment depends on unobservables that may also affect wages.&lt;/li>
&lt;li>Fixes:
&lt;ul>
&lt;li>Model the selection process&lt;/li>
&lt;li>Use Heckman-style correction methods&lt;/li>
&lt;li>Gather information on nonworkers if possible&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h2 id="example-5-children-and-mothers-labor-supply">Example 5: Children and Mothers' Labor Supply&lt;/h2>
&lt;p>A researcher regresses a mother&amp;rsquo;s labor supply on the number of children she has and finds that women with more children work less. He concludes that having another child reduces labor supply by exactly that amount.&lt;/p>
&lt;ul>
&lt;li>Goal: &lt;strong>Causal inference&lt;/strong>&lt;/li>
&lt;li>Diagnosis: &lt;strong>Simultaneous causality bias&lt;/strong>&lt;/li>
&lt;li>Why: Fertility affects labor supply, but labor supply choices may also affect fertility decisions. Family preferences and timing decisions tie the two together.&lt;/li>
&lt;li>Fixes:
&lt;ul>
&lt;li>IV&lt;/li>
&lt;li>Natural experiment&lt;/li>
&lt;li>Exogenous variation in family size&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h2 id="example-6-earnings-and-experience">Example 6: Earnings and Experience&lt;/h2>
&lt;p>Following the classic earnings literature, a researcher regresses log earnings on years of schooling and years of labor-market experience. She includes experience only as a linear term, even though the earnings profile appears to rise early in the career and then flatten.&lt;/p>
&lt;ul>
&lt;li>Goal: &lt;strong>Prediction&lt;/strong> or description, though students may argue causal inference if they justify it carefully&lt;/li>
&lt;li>Diagnosis: &lt;strong>Wrong functional form&lt;/strong>&lt;/li>
&lt;li>Why: A linear term imposes a constant marginal effect of experience, but the classic earnings profile is concave.&lt;/li>
&lt;li>Fixes:
&lt;ul>
&lt;li>Add experience squared&lt;/li>
&lt;li>Use logs&lt;/li>
&lt;li>Plot the data first&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;hr>
&lt;h2 id="final-checkup">Final Checkup&lt;/h2>
&lt;p>Choose &lt;strong>one&lt;/strong> of the six studies above and answer:&lt;/p>
&lt;p>&lt;strong>If you were the journal referee, would you trust the causal claim? Why or why not?&lt;/strong>&lt;/p>
&lt;p>&lt;em>Answers will vary — the key is applying the correct diagnosis and explaining how it undermines or qualifies the study&amp;rsquo;s conclusions.&lt;/em>&lt;/p></description></item><item><title/><link>https://econ3500s26.netlify.app/content/practice/ch9-activity3-study-doctor-student/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/practice/ch9-activity3-study-doctor-student/</guid><description>&lt;p>ECON 3500 Econometrics and Applications&lt;br>
Spring 2026&lt;/p>
&lt;h1 id="in-class-activity-regression-validity">In-Class Activity: Regression Validity&lt;/h1>
&lt;hr>
&lt;h2 id="example-1-catholic-schooling-and-educational-attainment">Example 1: Catholic Schooling and Educational Attainment&lt;/h2>
&lt;p>A researcher studies whether attending a Catholic high school increases graduation and college attendance. Students who attend Catholic schools may also come from families that are more motivated, more religious, or more education-focused to begin with.&lt;/p>
&lt;p>&lt;strong>Goal:&lt;/strong> __________________________________________&lt;/p>
&lt;p>&lt;strong>Main diagnosis:&lt;/strong> __________________________________&lt;/p>
&lt;p>&lt;strong>Why?&lt;/strong>&lt;/p>
&lt;p>\vspace{2.5cm}&lt;/p>
&lt;p>&lt;strong>One fix:&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>\newpage&lt;/p>
&lt;p>ECON 3500 Econometrics and Applications&lt;br>
Spring 2026&lt;/p>
&lt;h1 id="in-class-activity-regression-validity-1">In-Class Activity: Regression Validity&lt;/h1>
&lt;h2 id="example-2-oregon-medicaid-lottery">Example 2: Oregon Medicaid Lottery&lt;/h2>
&lt;p>The Oregon Health Insurance Experiment used a lottery to study the effects of Medicaid for low-income uninsured adults in Oregon. A policymaker wants to use those estimates to predict what the effects would be in a very different state with different hospitals, demographics, and eligibility rules.&lt;/p>
&lt;p>&lt;strong>Goal:&lt;/strong> __________________________________________&lt;/p>
&lt;p>&lt;strong>Main diagnosis:&lt;/strong> __________________________________&lt;/p>
&lt;p>&lt;strong>Why?&lt;/strong>&lt;/p>
&lt;p>\vspace{2.5cm}&lt;/p>
&lt;p>&lt;strong>One fix or follow-up question:&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>\newpage&lt;/p>
&lt;p>ECON 3500 Econometrics and Applications&lt;br>
Spring 2026&lt;/p>
&lt;h1 id="in-class-activity-regression-validity-2">In-Class Activity: Regression Validity&lt;/h1>
&lt;h2 id="example-3-survey-earnings-vs-administrative-records">Example 3: Survey Earnings vs. Administrative Records&lt;/h2>
&lt;p>Bound and Krueger compare workers' self-reported earnings in surveys to administrative earnings records. Suppose a researcher estimates the effect of earnings on some outcome using only the self-reported survey measure.&lt;/p>
&lt;p>&lt;strong>Goal:&lt;/strong> __________________________________________&lt;/p>
&lt;p>&lt;strong>Main diagnosis:&lt;/strong> __________________________________&lt;/p>
&lt;p>&lt;strong>Why?&lt;/strong>&lt;/p>
&lt;p>\vspace{2.5cm}&lt;/p>
&lt;p>&lt;strong>One fix:&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>\newpage&lt;/p>
&lt;p>ECON 3500 Econometrics and Applications&lt;br>
Spring 2026&lt;/p>
&lt;h1 id="in-class-activity-regression-validity-3">In-Class Activity: Regression Validity&lt;/h1>
&lt;h2 id="example-4-wages-of-married-women">Example 4: Wages of Married Women&lt;/h2>
&lt;p>In Heckman&amp;rsquo;s classic sample-selection setup, wages are only observed for married women who choose to work. A researcher regresses wages on education using only women with observed wages.&lt;/p>
&lt;p>&lt;strong>Goal:&lt;/strong> __________________________________________&lt;/p>
&lt;p>&lt;strong>Main diagnosis:&lt;/strong> __________________________________&lt;/p>
&lt;p>&lt;strong>Why?&lt;/strong>&lt;/p>
&lt;p>\vspace{2.5cm}&lt;/p>
&lt;p>&lt;strong>One fix:&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>\newpage&lt;/p>
&lt;p>ECON 3500 Econometrics and Applications&lt;br>
Spring 2026&lt;/p>
&lt;h1 id="in-class-activity-regression-validity-4">In-Class Activity: Regression Validity&lt;/h1>
&lt;h2 id="example-5-children-and-mothers-labor-supply">Example 5: Children and Mothers' Labor Supply&lt;/h2>
&lt;p>A researcher regresses a mother&amp;rsquo;s labor supply on the number of children she has and finds that women with more children work less. He concludes that having another child reduces labor supply by exactly that amount.&lt;/p>
&lt;p>&lt;strong>Goal:&lt;/strong> __________________________________________&lt;/p>
&lt;p>&lt;strong>Main diagnosis:&lt;/strong> __________________________________&lt;/p>
&lt;p>&lt;strong>Why?&lt;/strong>&lt;/p>
&lt;p>\vspace{2.5cm}&lt;/p>
&lt;p>&lt;strong>One fix:&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>\newpage&lt;/p>
&lt;p>ECON 3500 Econometrics and Applications&lt;br>
Spring 2026&lt;/p>
&lt;h1 id="in-class-activity-regression-validity-5">In-Class Activity: Regression Validity&lt;/h1>
&lt;h2 id="example-6-earnings-and-experience">Example 6: Earnings and Experience&lt;/h2>
&lt;p>Following the classic earnings literature, a researcher regresses log earnings on years of schooling and years of labor-market experience. She includes experience only as a linear term, even though the earnings profile appears to rise early in the career and then flatten.&lt;/p>
&lt;p>&lt;strong>Goal:&lt;/strong> __________________________________________&lt;/p>
&lt;p>&lt;strong>Main diagnosis:&lt;/strong> __________________________________&lt;/p>
&lt;p>&lt;strong>Why?&lt;/strong>&lt;/p>
&lt;p>\vspace{2.5cm}&lt;/p>
&lt;p>&lt;strong>One fix:&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p></description></item><item><title/><link>https://econ3500s26.netlify.app/content/practice/ch9-activity3-study-doctor/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/practice/ch9-activity3-study-doctor/</guid><description>&lt;p>ECON 3500 Econometrics and Applications&lt;br>
Spring 2026&lt;/p>
&lt;h1 id="in-class-activity-regression-validity">In-Class Activity: Regression Validity&lt;/h1>
&lt;p>&lt;strong>Chapter 9 — Assessing Studies Based on Multiple Regression&lt;/strong>&lt;br>
Time: ~15-20 minutes&lt;/p>
&lt;hr>
&lt;h2 id="your-job">Your Job&lt;/h2>
&lt;p>Each example below is a research study.
All six are based on real papers, but the descriptions below are simplified for class use.&lt;/p>
&lt;p>For each study:&lt;/p>
&lt;ol>
&lt;li>What is the &lt;strong>goal&lt;/strong>?
&lt;ul>
&lt;li>Causal inference&lt;/li>
&lt;li>Forecasting&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>What is the &lt;strong>main problem&lt;/strong>?
&lt;ul>
&lt;li>Omitted variable bias&lt;/li>
&lt;li>Wrong functional form&lt;/li>
&lt;li>Errors-in-variables bias&lt;/li>
&lt;li>Sample selection bias&lt;/li>
&lt;li>Simultaneous causality bias&lt;/li>
&lt;li>External validity only / not mainly an internal-validity problem&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>Why is that the right diagnosis?&lt;/li>
&lt;li>What is one concrete fix or improvement?&lt;/li>
&lt;/ol>
&lt;hr>
&lt;h2 id="quick-diagnosis-guide">Quick Diagnosis Guide&lt;/h2>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>If the problem is&amp;hellip;&lt;/th>
&lt;th>Ask yourself&amp;hellip;&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>&lt;strong>OVB&lt;/strong>&lt;/td>
&lt;td>Is there some omitted factor that affects $Y$ and is correlated with $X$?&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;strong>Wrong functional form&lt;/strong>&lt;/td>
&lt;td>Did we force a straight-line relationship when the true relationship is curved or interactive?&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;strong>Measurement error&lt;/strong>&lt;/td>
&lt;td>Is $X$ or $Y$ measured noisily, inaccurately, or systematically wrong?&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;strong>Sample selection&lt;/strong>&lt;/td>
&lt;td>Are some observations missing because of the outcome or some unobserved factor tied to it?&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;strong>Simultaneous causality&lt;/strong>&lt;/td>
&lt;td>Does $Y$ also affect $X$?&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;strong>External validity&lt;/strong>&lt;/td>
&lt;td>Even if the study is internally valid, would the result generalize to a different setting?&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;hr>
&lt;h2 id="example-1-catholic-schooling-and-educational-attainment">Example 1: Catholic Schooling and Educational Attainment&lt;/h2>
&lt;p>A researcher studies whether attending a Catholic high school increases graduation and college attendance. Students who attend Catholic schools may also come from families that are more motivated, more religious, or more education-focused to begin with.&lt;/p>
&lt;p>&lt;strong>Goal:&lt;/strong> __________________________________________&lt;/p>
&lt;p>&lt;strong>Main diagnosis:&lt;/strong> __________________________________&lt;/p>
&lt;p>&lt;strong>Why?&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>&lt;strong>One fix:&lt;/strong>&lt;/p>
&lt;p>\vspace{1.5cm}&lt;/p>
&lt;hr>
&lt;h2 id="example-2-oregon-medicaid-lottery">Example 2: Oregon Medicaid Lottery&lt;/h2>
&lt;p>The Oregon Health Insurance Experiment used a lottery to study the effects of Medicaid for low-income uninsured adults in Oregon. A policymaker wants to use those estimates to predict what the effects would be in a very different state with different hospitals, demographics, and eligibility rules.&lt;/p>
&lt;p>&lt;strong>Goal:&lt;/strong> __________________________________________&lt;/p>
&lt;p>&lt;strong>Main diagnosis:&lt;/strong> __________________________________&lt;/p>
&lt;p>&lt;strong>Why?&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>&lt;strong>One fix or follow-up question:&lt;/strong>&lt;/p>
&lt;p>\vspace{1.5cm}&lt;/p>
&lt;hr>
&lt;h2 id="example-3-survey-earnings-vs-administrative-records">Example 3: Survey Earnings vs. Administrative Records&lt;/h2>
&lt;p>Bound and Krueger compare workers' self-reported earnings in surveys to administrative earnings records. Suppose a researcher estimates the effect of earnings on some outcome using only the self-reported survey measure.&lt;/p>
&lt;p>&lt;strong>Goal:&lt;/strong> __________________________________________&lt;/p>
&lt;p>&lt;strong>Main diagnosis:&lt;/strong> __________________________________&lt;/p>
&lt;p>&lt;strong>Why?&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>&lt;strong>One fix:&lt;/strong>&lt;/p>
&lt;p>\vspace{1.5cm}&lt;/p>
&lt;hr>
&lt;h2 id="example-4-wages-of-married-women">Example 4: Wages of Married Women&lt;/h2>
&lt;p>In Heckman&amp;rsquo;s classic sample-selection setup, wages are only observed for married women who choose to work. A researcher regresses wages on education using only women with observed wages.&lt;/p>
&lt;p>&lt;strong>Goal:&lt;/strong> __________________________________________&lt;/p>
&lt;p>&lt;strong>Main diagnosis:&lt;/strong> __________________________________&lt;/p>
&lt;p>&lt;strong>Why?&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>&lt;strong>One fix:&lt;/strong>&lt;/p>
&lt;p>\vspace{1.5cm}&lt;/p>
&lt;hr>
&lt;p>\newpage&lt;/p>
&lt;h2 id="example-5-children-and-mothers-labor-supply">Example 5: Children and Mothers' Labor Supply&lt;/h2>
&lt;p>A researcher regresses a mother&amp;rsquo;s labor supply on the number of children she has and finds that women with more children work less. He concludes that having another child reduces labor supply by exactly that amount.&lt;/p>
&lt;p>&lt;strong>Goal:&lt;/strong> __________________________________________&lt;/p>
&lt;p>&lt;strong>Main diagnosis:&lt;/strong> __________________________________&lt;/p>
&lt;p>&lt;strong>Why?&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>&lt;strong>One fix:&lt;/strong>&lt;/p>
&lt;p>\vspace{1.5cm}&lt;/p>
&lt;hr>
&lt;h2 id="example-6-earnings-and-experience">Example 6: Earnings and Experience&lt;/h2>
&lt;p>Following the classic earnings literature, a researcher regresses log earnings on years of schooling and years of labor-market experience. She includes experience only as a linear term, even though the earnings profile appears to rise early in the career and then flatten.&lt;/p>
&lt;p>&lt;strong>Goal:&lt;/strong> __________________________________________&lt;/p>
&lt;p>&lt;strong>Main diagnosis:&lt;/strong> __________________________________&lt;/p>
&lt;p>&lt;strong>Why?&lt;/strong>&lt;/p>
&lt;p>\vspace{2cm}&lt;/p>
&lt;p>&lt;strong>One fix:&lt;/strong>&lt;/p>
&lt;p>\vspace{1.5cm}&lt;/p>
&lt;hr>
&lt;h2 id="final-checkup">Final Checkup&lt;/h2>
&lt;p>Choose &lt;strong>one&lt;/strong> of the six studies above and answer:&lt;/p>
&lt;p>&lt;strong>If you were the journal referee, would you trust the causal claim? Why or why not?&lt;/strong>&lt;/p>
&lt;p>\vspace{4cm}&lt;/p>
&lt;hr>
&lt;p>\newpage&lt;/p>
&lt;h2 id="instructor-notes--do-not-distribute">INSTRUCTOR NOTES — DO NOT DISTRIBUTE&lt;/h2>
&lt;h3 id="preferred-diagnoses">Preferred diagnoses&lt;/h3>
&lt;p>&lt;strong>Example 1: Catholic Schooling and Educational Attainment&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Goal: &lt;strong>Causal inference&lt;/strong>&lt;/li>
&lt;li>Diagnosis: &lt;strong>Omitted variable bias&lt;/strong>&lt;/li>
&lt;li>Why: Students who attend Catholic schools are selected. Family motivation, religiosity, discipline, and neighborhood context may affect both school choice and later attainment.&lt;/li>
&lt;li>Fixes:
&lt;ul>
&lt;li>Add better controls&lt;/li>
&lt;li>Use a credible IV or lottery-style design&lt;/li>
&lt;li>Compare similar students more carefully&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>Example 2: Oregon Medicaid Lottery&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Goal: Usually &lt;strong>causal inference&lt;/strong> in the original study, but the policymaker&amp;rsquo;s question is about &lt;strong>external validity&lt;/strong>&lt;/li>
&lt;li>Diagnosis: &lt;strong>External validity only / not mainly an internal-validity problem&lt;/strong>&lt;/li>
&lt;li>Why: The question is whether Oregon lottery estimates transport to a very different setting. Students should talk about hospitals, baseline uninsured rates, take-up, and the local policy environment.&lt;/li>
&lt;li>Fixes:
&lt;ul>
&lt;li>Replicate in more settings&lt;/li>
&lt;li>Compare institutional context&lt;/li>
&lt;li>Ask whether the treated and target settings are genuinely comparable&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>Example 3: Survey Earnings vs. Administrative Records&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Goal: &lt;strong>Causal inference&lt;/strong> or prediction; either answer is acceptable if justified&lt;/li>
&lt;li>Diagnosis: &lt;strong>Errors-in-variables bias&lt;/strong>&lt;/li>
&lt;li>Why: Self-reported earnings differ from administrative records. The observed regressor may contain measurement error, and Bound-Krueger show that it is not purely classical.&lt;/li>
&lt;li>Fixes:
&lt;ul>
&lt;li>Use administrative records&lt;/li>
&lt;li>Validate survey responses&lt;/li>
&lt;li>Be cautious about assuming classical attenuation only&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>Example 4: Wages of Married Women&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Goal: &lt;strong>Causal inference&lt;/strong>&lt;/li>
&lt;li>Diagnosis: &lt;strong>Sample selection bias&lt;/strong>&lt;/li>
&lt;li>Why: Wages are only observed for women who work. Selection into employment depends on unobservables that may also affect wages.&lt;/li>
&lt;li>Fixes:
&lt;ul>
&lt;li>Model the selection process&lt;/li>
&lt;li>Use Heckman-style correction methods&lt;/li>
&lt;li>Gather information on nonworkers if possible&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>Example 5: Children and Mothers' Labor Supply&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Goal: &lt;strong>Causal inference&lt;/strong>&lt;/li>
&lt;li>Diagnosis: &lt;strong>Simultaneous causality bias&lt;/strong>&lt;/li>
&lt;li>Why: Fertility affects labor supply, but labor supply choices may also affect fertility decisions. Family preferences and timing decisions tie the two together.&lt;/li>
&lt;li>Fixes:
&lt;ul>
&lt;li>IV&lt;/li>
&lt;li>Natural experiment&lt;/li>
&lt;li>Exogenous variation in family size&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>Example 6: Earnings and Experience&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Goal: &lt;strong>Prediction&lt;/strong> or description, though students may argue causal inference if they justify it carefully&lt;/li>
&lt;li>Diagnosis: &lt;strong>Wrong functional form&lt;/strong>&lt;/li>
&lt;li>Why: A linear term imposes a constant marginal effect of experience, but the classic earnings profile is concave.&lt;/li>
&lt;li>Fixes:
&lt;ul>
&lt;li>Add experience squared&lt;/li>
&lt;li>Use logs&lt;/li>
&lt;li>Plot the data first&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;h3 id="real-paper-anchors">Real-paper anchors&lt;/h3>
&lt;ul>
&lt;li>Example 1: Altonji, Elder, and Taber (2005), &lt;em>Journal of Political Economy&lt;/em>, &amp;ldquo;Selection on Observed and Unobserved Variables: Assessing the Effectiveness of Catholic Schools&amp;rdquo;&lt;/li>
&lt;li>Example 2: Finkelstein et al. (2012), &lt;em>Quarterly Journal of Economics&lt;/em>, &amp;ldquo;The Oregon Health Insurance Experiment: Evidence from the First Year&amp;rdquo;&lt;/li>
&lt;li>Example 3: Bound and Krueger (1991), &lt;em>Journal of Labor Economics&lt;/em>, &amp;ldquo;The Extent of Measurement Error in Longitudinal Earnings Data&amp;rdquo;&lt;/li>
&lt;li>Example 4: Heckman (1979), &lt;em>Econometrica&lt;/em>, &amp;ldquo;Sample Selection Bias as a Specification Error&amp;rdquo;&lt;/li>
&lt;li>Example 5: Angrist and Evans (1998), &lt;em>American Economic Review&lt;/em>, &amp;ldquo;Children and Their Parents' Labor Supply: Evidence from Exogenous Variation in Family Size&amp;rdquo;&lt;/li>
&lt;li>Example 6: Mincer (1974), &lt;em>Schooling, Experience, and Earnings&lt;/em>&lt;/li>
&lt;/ul>
&lt;h3 id="teaching-notes">Teaching notes&lt;/h3>
&lt;ul>
&lt;li>
&lt;p>The cleanest way to run this is:&lt;/p>
&lt;ol>
&lt;li>Give groups 8 minutes to diagnose all six studies.&lt;/li>
&lt;li>Cold-call one group per example.&lt;/li>
&lt;li>For Example 2, push students to distinguish &lt;strong>internal&lt;/strong> from &lt;strong>external&lt;/strong> validity.&lt;/li>
&lt;li>For Example 3, ask whether the measurement error is likely classical or non-classical.&lt;/li>
&lt;li>For Example 6, ask whether wrong functional form threatens causal interpretation, prediction, or both.&lt;/li>
&lt;/ol>
&lt;/li>
&lt;li>
&lt;p>If you want a faster version, assign only Examples 1, 3, 4, and 5.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>If you want a harder version, require students to say something about the &lt;strong>direction&lt;/strong> of bias for Examples 1, 3, 4, and 5.&lt;/p>
&lt;/li>
&lt;/ul></description></item><item><title/><link>https://econ3500s26.netlify.app/content/w6-01-in-class-exercise/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://econ3500s26.netlify.app/content/w6-01-in-class-exercise/</guid><description>&lt;p>Version: Fall 2020&lt;br>
EC200 Econometrics and Applications&lt;/p>
&lt;p>&lt;strong>In-Class Exercise - Multiple Linear Regression&lt;/strong> \&lt;/p>
&lt;p>Consider a dataset on earnings in the United States. We are interested
in the returns to education - how much an extra year of schooling &amp;ldquo;buys&amp;rdquo;
you in terms of weekly wages (...as of 1980). You&amp;rsquo;re also worried about
whether one&amp;rsquo;s education suffers from omitted variable bias.&lt;/p>
&lt;ol>
&lt;li>
&lt;p>You estimate two equations: $$\begin{aligned}
\widehat{wage} &amp;amp;= 146.95 + 60.21educ\
\widehat{educ} &amp;amp; = 5.84 + 0.075IQ\end{aligned}$$&lt;/p>
&lt;p>Based on these results, is 60.21 an overestimate or underestimate of
the returns to education? How do you know?&lt;/p>
&lt;/li>
&lt;li>
&lt;p>You estimate another equation:
$\widehat{education} = -128.89 +42.06 educ + 5.14 IQ$&lt;/p>
&lt;p>What is the interpretation of the coefficient on $educ$? What is the
interpretation of the constant?&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Now, you control for experience and age and estimate the following
population regression model:&lt;/p>
&lt;p>$$wage_i = \beta_0 + \beta_1 educ_i + \beta_2 IQ_i + \beta_3 exper_i + \beta_4 age_i + \beta_5 age_i^2 + u_i$$&lt;/p>
&lt;p>A one-year increase in age is associated with what change in wages?
(mind the squared term)&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Finally, because you are worried about omitted variable bias, you
include father&amp;rsquo;s and mother&amp;rsquo;s education.&lt;/p>
&lt;ol>
&lt;li>
&lt;p>Why might parent&amp;rsquo;s education might directly affect wage?&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Which other independent variables do you think parent&amp;rsquo;s
education might affect? Explain.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>How did controlling for parent&amp;rsquo;s education affect the returns to
education? The returns to IQ?&lt;/p>
&lt;/li>
&lt;/ol>
&lt;/li>
&lt;/ol></description></item></channel></rss>