THE EFFECTS OF NEWS SHOCKS AND BOUNDED RATIONALITY ON MACROECONOMIC VOLATILITY by BRIAN JOSEPH DOMBECK A DISSERTATION Presented to the Department of Economics and the Graduate School of the University of Oregon in partial fulfillment of the requirements for the degree of Doctor of Philosophy June 2017 DISSERTATION APPROVAL PAGE Student: Brian Joseph Dombeck Title: The Effects of News Shocks and Bounded Rationality on Macroeconomic Volatility This dissertation has been accepted and approved in partial fulfillment of the requirements for the Doctor of Philosophy degree in the Department of Economics by: Bruce McGough Co-Chair George Evans Co-chair Jeremy Piger Core Member Arafaat Valiani Institutional Representative and Scott L. Pratt Dean of the Graduate School Original approval signatures are on file with the University of Oregon Graduate School. Degree awarded June 2017 ii c© 2017 Brian Joseph Dombeck iii DISSERTATION ABSTRACT Brian Joseph Dombeck Doctor of Philosophy Department of Economics June 2017 Title: The Effects of News Shocks and Bounded Rationality on Macroeconomic Volatility This dissertation studies the impact embedding boundedly rational agents in real business cycle-type news-shock models may have on a variety of model predictions, from simulated moments to structural parameter estimates. In particular, I analyze the qualitative and quantitative effects of assuming agents are boundedly rational in a class of DSGE models which attempt to explain the observed volatility and comovements in key aggregate measures of U.S. economic performance as the result of endogenous responses to information in the form of “news shocks”. The first chapter explores the theoretical feasibility of relaxing the rational expectations hypothesis in a three-sector real business cycle (RBC) model which generates boom-bust cycles as a result of periods of optimism and pessimism on the part of households. The second chapter determines whether agents forming linear forecasts of shadow prices in a nonlinear framework can lead to behavior approximately consistent with fully informed individuals in a one-sector real business cycle model. The third chapter analyzes whether empirical estimates of the relative importance of anticipated shocks may be biased by assuming rational expectations. By merging the two hitherto separate but complementary strands of literature related to bounded rationality and news shocks I am able to conduct in-depth analysis of the importance of both the information agents have and what they choose to do with it. At its core, the study of news in macroeconomics is a study of the specific role alternative iv information sets play in generating macroeconomic volatility. Adaptive learning on the other hand is concerned with the behavior of agents given an information set. Taken together, these fields jointly describe the input and the “black box” which produce model predictions from DSGE models. While previous research has been conducted on the effects of bounded rationality or news shocks in isolation, this dissertation marks the first set of research explicitly focused on the interaction of these two model features. v CURRICULUM VITAE NAME OF AUTHOR: Brian Joseph Dombeck GRADUATE AND UNDERGRADUATE SCHOOLS ATTENDED: University of Oregon, Eugene, OR Louisiana State University, Baton Rouge, LA DEGREES AWARDED: Doctor of Philosophy, Economics, 2017, University of Oregon Master of Science, Economics, 2013, University of Oregon Bachelor of Science, Economics, 20012, Louisiana State University AREAS OF SPECIAL INTEREST: Applied Structural Macroeconomics Bounded Rationality Information Choice GRANTS, AWARDS AND HONORS: Kleinsorge Summer Research Fellowship, University of Oregon, 2016 Kleinsorge Summer Research Fellowship, University of Oregon, 2014 Kleinsorge First Year Fellowship, University of Oregon, 2012 Graduate Teaching Fellowship, University of Oregon, 2012-2017 vi ACKNOWLEDGEMENTS The completion of this dissertation would not have been possible without the support of many groups of people. I thank Professors Bruce McGough, George Evans, Jeremy Piger, Van Kolpin, Anne van den Nouweland , Kaj Gittings, Bulent Unel, Naci Mocan, Carter Hill, and Charles Roussel for all of their guidance and mentorship, and for teaching me to think like an economist. I thank Savannah Dombeck, Kellie Geldreich, and the rest of the University of Oregon Department of Economics staff for their assistance in navigating the myriad of requirements necessary for completing the Doctoral program. Finally, I thank my family for their love, patience, and understanding over the years. vii For Savannah viii TABLE OF CONTENTS Chapter Page I. INTRODUCTION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 II. EXPECTATIONAL STABILITY AND THE “COMOVEMENT PROBLEM” . . 4 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 Incorporating News Shocks into DSGE Models . . . . . . . . . . . . . . . . 9 Adaptive Learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 Application to Beaudry and Portier (2004) . . . . . . . . . . . . . . . . . . 17 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 III. SHADOW PRICE LEARNING IN A NEWS-SHOCK MODEL . . . . . . . . . . 28 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28 The Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32 Equilibrium . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40 News Shocks and Expectations Formation . . . . . . . . . . . . . . . . . . . 41 Simulation and Model Performance . . . . . . . . . . . . . . . . . . . . . . . 50 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55 IV. LEARNING VS NEWS: WHAT DRIVES BUSINESS CYCLES? . . . . . . . . . 57 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57 The Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61 Adaptive Learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72 Estimation Methodology . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75 Main Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88 ix Chapter Page APPENDICES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90 A. APPENDIX FOR CHAPTER II . . . . . . . . . . . . . . . . . . . . . 90 B. APPENDIX FOR CHAPTER III . . . . . . . . . . . . . . . . . . . . . 100 C. APPENDIX FOR CHAPTER IV . . . . . . . . . . . . . . . . . . . . . 105 REFERENCES CITED . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123 x LIST OF FIGURES Figure Page 1. IRF for Accurate News . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49 2. IRF for Inaccurate News . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50 3. Model Simulation Under SP-learning . . . . . . . . . . . . . . . . . . . . . . . . . 51 4. Relative Importance of News: Key Macroeconomic Variables, RE vs NRE . . . . . 86 5. Relative Importance of News: Government Spending, RE vs NRE . . . . . . . . . 87 xi LIST OF TABLES Table Page 1. Calibrated Parameters for Three-sector Model . . . . . . . . . . . . . . . . . . . . 23 2. Alternate Parameterizations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 3. Calibrated Parameters for One-sector Model . . . . . . . . . . . . . . . . . . . . . 47 4. t-tests for Data Generating Process, News, 230 Periods . . . . . . . . . . . . . . . 52 5. t-tests for Data Generating Process, No News, 230 Periods . . . . . . . . . . . . . 52 6. t-tests for Data Generating Process, News, 1000 Periods . . . . . . . . . . . . . . 54 7. t-tests for Data Generating Process, No News, 1000 Periods . . . . . . . . . . . . 54 8. Predicted Business Cycle Statistics . . . . . . . . . . . . . . . . . . . . . . . . . . 55 9. Calibrated Parameters for Estimated One-sector Model . . . . . . . . . . . . . . 81 10. Estimated Parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83 11. Model Predictions: Volatility and Persistence, Data vs NRE vs RE . . . . . . . . 88 xii CHAPTER I INTRODUCTION A primary objective of applied structural macroeconomics is to identify the determinants of business cycles and create empirically relevant dynamic stochastic general equilibrium (DSGE) models. While most modern DSGE models assume agents have rational expectations and restrict attention to the effect of unanticipated shocks as drivers of the business cycle, my dissertation studies the theoretic and empirical effects of embedding boundedly rational agents in real business cycle-type news-shock models to determine whether and when these assumptions influence the predictions generated by our models. News-shock models seek to generate positive comovement between consumption, investment, total labor supply, and output - which is consistent with postwar U.S. macroeconomic data - in response to news about the future state of the economy. In Chapter II I examine whether such models are compatible with a relaxation of the RE assumption in the context of the three-sector RBC model introduced in Beaudry and Portier (2004) by endowing agents with a perceived law of motion (PLM) for the evolution of the economy. Given their beliefs about the economy’s transitional path, agents take actions which result in an actual law of motion (ALM) for the economy. Forecasting errors as captured by differences between agents’ PLM and the ALM are then used to update their forecasting model via recursive least squares. This process of adaptive learning about the laws of motion for the economy plays out each period. If agents acting in this way eventually learn to behave in the same way as fully rational agents, the rational expectations equilibrium (REE) is said to be expectationally stable (E-stable). I find that 1 the RE solution for this specific news shock model is E-Stable, and that this finding is robust to a variety of alternative parameterizations and timing assumptions. Furthermore I show that under certain assumptions regarding agents’ information sets, if the REE is E-stable in any model without news shocks it will remain E-stable when news shocks are included. This implies the effects of news shocks on model predictions can be safely considered in a wide variety of existing models whose REE have been shown to be E- stable. In Chapter III I explore the qualitative and quantitative effects of allowing agents to be boundedly optimal in the sense of Evans and McGough (2015) in the well-known news-shock model of Jaimovich and Rebelo (2009). Boundedly optimal agents are similar to their boundedly rational counterparts in that they are not endowed with knowledge of the conditional distribution of all variables in the model. However, instead of merely learning about the laws of motion for endogenous variables they are also endowed with forecasting models for the shadow prices of endogenous state variables. Forecasts of shadow prices are updated recursively which gives rise to a rich set of behavior as agents incorporate news shocks into their forecasts. By embedding this behavioral process in a nonlinear DSGE model, I am able to show through simulations that agents’ behavior will converge close to that of fully rational agents, a remarkable feat considering how simplistic the process is. Furthermore I compare actual US business cycle statistics with those obtained by calibrating and simulating the model with shadow-price learning agents and find that bounded optimality has a significant effect on the properties of simulated data as compared to that generated under RE. Finally, in Chapter IV I utilize Bayesian estimation techniques to determine whether the estimated relative importance of news in the news-shock model of Schmitt-Grohe and Uribe (2012) is robust to the inclusion of adaptive learning. News shocks and 2 adaptive learning both act as expectations-based sources of business cycle activity in macroeconomic dynamic stochastic general equilibrium (DSGE) models. This chapter explores whether these sources act as complements or substitutes to each other by comparing estimates of the relative importance of news shocks obtained across expectation formation mechanisms. I find the relative importance of news shocks is amplified by up to 35% for key macroeconomic variables under learning as compared to rational expectations, implying that news shocks may be more important for driving business cycles than existing estimates would otherwise suggest. The chapters primary contributions are to clarify which assumptions the importance of news is robust to and to produce a new estimate of the constant-gain learning parameter from a novel joint news-and-learning estimation routine. 3 CHAPTER II EXPECTATIONAL STABILITY AND THE “COMOVEMENT PROBLEM” Introduction Starting with Kydland and Prescott (1982) the economic literature seeking to explain stylized facts and comovements observed in postwar macroeconomic U.S. data through supply-side innovations has flourished. So-called real business cycle (RBC) theory maintains innovations to productivity are responsible for the booms and busts which comprise the business cycle. This implies a peculiar interpretation of cyclical activity: while expansions are driven by technological progress, recessions must be caused by technological regress.1 In recent years a subset of RBC literature has begun exploring alternate mechanisms through which the economy might experience business cycles, focusing in particular on the potential role of information and expectations. For example in Beaudry and Portier (2004), the model of which is the focus for this chapter, news about the future generates expectationally driven business cycles of the sort described in Pigou (1927) in which the economy experiences boom-bust periods resulting from the actions of forward-looking agents responding to noisy signals of future technological innovation. As noted in Krusell and McKay (2010), business cycles are characterized by positive comovement in aggregate consumption, investment, employment, and output. While standard RBC models generate positive pro-cyclical comovements in key variables in 1 Stiglitz (2014) points out that “... by implication, the Great Depression was marked by an episode of acute amnesia, where in large parts of the world, people got less productive!” 4 response to contemporaneous productivity shocks, they are unable to generate these positive comovements in response to anticipated productivity shocks. Such models typically imply news about future total factor productivity (TFP) will cause consumption to move opposite investment, employment, and output. For example, the baseline calibration of Kydland and Prescott’s original model suggests households will substitute investment for consumption in anticipation of higher future marginal factor productivity stemming from anticipated future technological growth. Because leisure is a normal good, households also reducing employment, causing a drop in the level of output. Thus, good news about the future state of the economy causes a recession today. Several dynamic stochastic general equilibrium (DSGE) models have arisen in an effort to fix this “comovement problem”.2Beaudry and Portier (2004) show that standard one and two sector RBC models are incapable of generating qualitatively realistic expectationally driven business cycles, that is, business cycles featuring positive comovement amongst key macroeconomic variables in response to news about the future. They propose a three-sector RBC model which utilizes a CES production function for consumption and short-term substitutability constraints on consumption and investment. Jaimovich and Rebelo (2009) augment the canonical RBC model to include variable capital utilization, investment adjustment costs, and a novel preference specification which allows the modeler to manipulate the strength of the wealth effect on labor supply. Lorenzoni (2009) studies a version of the Phelps-Lucas “island model” capable of generating empirically realistic comovements in response to anticipated demand shocks where firms receive a noisy public signal of future aggregate productivity and a private signal of their own productivity. Krusell and McKay (2010) modify a version of 2DSGE models are fully microfounded models of agent-level behavior in an economy. In addition to describing the decision making and expectation formation processes for households, firms, governments, central banks, and other economic agents. 5 the Diamond-Mortensen-Pissarides search-and-matching model to allow the number of firms, which is endogenous and interpreted as investment, to react to news about future productivity. Guo et al. (2015) show variable capital utilization and increasing returns to scale production are sufficient modifications to allow a standard RBC model to produce qualitatively and quantitatively realistic expectationally driven business cycles in response to news about future aggregate demand shocks. Each of these approaches to fixing the comovement problem do so by changing the structure of the economy considered. But these departures from canon have not been accompanied by an analysis of whether the resulting rational expectations equilibrium (REE) are expectationally stable (E-stable); that is, whether the REE is “learnable”. E-stability of an REE can be seen as a measure of the model’s reliance on the rational expectations hypothesis. This paper proceeds by analyzing the expectational stability properties of the (unique) RE solution to the three-sector RBC model presented in Beaudry and Portier (2004). The goal is to discover whether news shock models can deliever REE which are simultaneously E-stable and capable of generating qualitatively realistic expectationally driven business cycles. My main results suggest that E-stability properties are robust to the inclusion of news shocks. To this point the news-shock literature has exclusively relied on the assumption championed by Muth (1961) that agents have model consistent or rational expectations (RE). In other words, agents are perfectly informed as to the precise laws of motion governing the evolution of the economy. Critically, this implies their forecasts will be correct on average. This assumption elegantly addresses the critique of Lucas (1976) that changes in policy may naturally result in changes to the behavior of agents. While Lucas and Muth both carefully pointed out RE is not meant to be taken as a realistic description of the way in which individual agents behave, it is natural to ask whether weaker 6 assumptions about the capabilities of agents could result in similar model predictions.3 In particular, it is worth asking if an alternative expectation formation mechanism can be both a behavioral description and a useful tool for analyzing economic models. One attractive alternative proposed in Marcet and Sargent (1989a,b,c) is to assume agents are engaged in “adaptive learning”. The literature on boundedly rational agents4 supposes agents are endowed with a perceived law of motion(PLM) about the true structure of the economy, but are unsure of the precise coefficients governing the laws of motion. Regarding their PLM as true the agents make optimal choices given their information set. Since the actions of agents today influence the state of the economy tomorrow, this gives rise to the actual law of motion (ALM) for the economy. Agents behave as econometricians and update their estimates of the laws of motion using a learning algorithm based on the observed forecast errors each period. A REE is said to be E-Stable if the PLM converges (asymptotically) to the REE, that is, if agents can eventually learn the true dynamics of the economy. Learning thus provides one way of checking the sensitivity of models to the standard rational expectations assumption: if a 3Specifically, Muth (1961) states “[Rational Expectations] does not assert that the scratch work of entrepreneurs resembles the system of equations in any way; nor does it state that predictions of entrepreneurs are perfect or that their expectations are all the same” while Lucas (1978) states “...[Rational Expectations] does not describe the way agents think about their environment, how they learn, process information, and so forth. It is rather a property likely to be (approximately) possessed by the outcome of this unspecified process of learning and adapting.” 4See Evans and Honkapohja (2001) for a comprehensive guide to the theory. 7 particular REE is not E-stable, then the predictions of the model may be perceived as less robust than a similar model featuring a learnable REE. Given the rich set of behavior observed in models which contemplate changes to information sets and/or expectations formation mechanisms, it is natural to check whether the innovations necessary to generate expectationally driven business cycles are consistent with relaxing the assumption of rational expectations. This is particularly important in the context of new-shock models, as stability results have been shown to be quite sensitive to specific assumptions regarding the information sets of agents. For example, Bullard and Eusepi (2014) show that excluding the value of contemporaneous endogenous variables from agents’ information sets breaks the link between determinacy and E-stability found in McCallum (2007). Since by construction news-shock models consider novel assumptions regarding agents’ information sets while simultaneously altering modifying well-known DSGE models, an exploration of the E-stability properties in this class of models appears called for. I begin by describing news shocks and how they are implemented into macroeconomic DSGE models. Next I introduce adaptive learning and demonstrate the procedure for determining whether a given REE is E-stable in the presence of news shocks. Finally, I apply the learning analysis to the 3-sector RBC model of Beaudry and Portier (2004). The chapter concludes with a brief discussion of the main results. 8 Incorporating News Shocks into DSGE Models Consider an economy with temporary equilibria described by the stationary system of non-linear expectational difference equations Eˆt [f (Xt+1, Xt, Xt−1, νt)] = 0 where X a vector of endogenous and exogenous variables and ν a vector of anticipated and unanticipated exogenous white-noise shocks. While non-linear solutions to these types of problems are increasingly being computed numerically, typically the system is linearized via a first-order Taylor-series expansion about the steady state X¯. The model now takes the form xt = A+BEˆtxt+1 + Cxt−1 +Dνt where x a vector of endogenous variables in terms of deviation from steady-state. Partitioning xt into endogenous variables yt and auxiliary state variables wt, the model can be rewritten yt = α + βyt−1 + χwt + δEˆtyt+1 (2.1) wt = ϕwt−1 +Mνt (2.2) News shocks are exogenous stochastic shocks which are realized today but do not impact economic fundamentals until some time in the future. Unanticipated or “surprise” shocks thus can be thought of as a particular form of news shocks in which the realization and impact occur contemporaneously. In either case, the shocks are incorporated easily into 9 the model by adding new auxiliary variables to the vector wt, and appropriately defining the matrices ϕ and M .5 Adaptive Learning E-Stability: Contemporaneous Expectations I focus on the minimum state variable (MSV) solutions to the system given by (2.1) and (2.2), which corresponds to a solution with the same set of variables obtained by solving the model under rational expectations. Restricting attention to the MSV solutions helps facilitate a direct comparison of solutions obtained under different expectation formation assumptions. Before continuing it is worth noting that under rational expectations it is irrelevant whether agents time t information sets include the values of the contemporaneous endogenous variables yt or if they observe only lagged endogenous variables yt−1 and must instead forecast them: because the expectations are model consistent, the forecast will not be systematically incorrect. When considering bounded rationality, however, this is no longer the case; differences between the PLM and the ALM will drive a wedge between expected and realized values and these residuals may well be serially correlated. This 5For example, suppose agents receives news zero and one periods in advance. Then we can write equation (2.2) as ( ε0t ε1t ) = ( 0 1 0 0 )( ε0t−1 ε1t−1 ) + ( 1 0 0 1 )( ν0t ν1t ) where wt = ( ε0t , ε 1 t ) and νt = ( ν0t , ν 1 t ) . In this case the total time t innovation to the exogenous variables in equation (2.1) is given by ε0t = ν 0 t + ε 1 t−1 = ν 0 t + ν 1 t−1. Agents receive the information about the partial innovation ν1t−1 in period t−1, but it does not impact the economy until period t; the partial innovation ν0t is a surprise shock learned about in period t which affects fundamentals contemporaneously. 10 wedge can significantly impact the E-stability property of REE, and thus I consider both cases in what follows. Regardless of the specific assumptions on the information set of agents, the MSV solutions of the system above have the form yt = a+ byt−1 + cwt (2.3) wt = ϕwt−1 +Mνt (2.4) Suppose agents are endowed with the form of the solution given in equation (2.3) as their PLM. Then given the information assumptions above, expectations are given by Eˆtyt = yt Eˆtyt+1 = a+ bEˆtyt + cϕwt = a+ byt + cϕwt which can be substituted into equation (2.1) to yield the ALM (I − δb) yt = (α + δa) + βyt−1 + (χ+ δcϕ)wt (2.5) This provides a mapping from beliefs captured by the PLM to the ALM. This “T-map” is given by T (a, b, c) = { (I − δb)−1 (α + δa) , (I − δb)−1 β, (I − δb)−1 (χ+ δcϕ)} 11 E-stability of a solution is determined by the matrix differential equation d dτ (a, b, c) = T (a, b, c)− (a, b, c) (2.6) It is worth noting that a REE ( a¯, b¯, c¯ ) can be interpreted as the solution to the fixed point problem 0 = T ( a¯, b¯, c¯ )− (a¯, b¯, c¯) i.e. a set of beliefs which are self-fulfilling and coincide on average with realizations such that ( I − δ (I + b¯)) a¯ = α (2.7a) δb¯2 − b¯+ β = 0 (2.7b)( I − δb¯) c¯− δc¯ϕ = χ (2.7c) Inspection of equation (2.7b) shows the coefficient matrix b is independent of the assumed structure for anticipated and unanticipated shocks; that is, the coefficient matrix b is independent of ϕ and M from (2.2). Furthermore, for a given b equations (2.7a) and (2.7c) uniquely determine the coefficient matrices a and c. However, b is the solution of a matrix-quadratic which is not in general unique, and hence solution techniques such as the well-known methods of Blanchard and Kahn, Uhlig, Klein, or Sims, must be employed to obtain the REE. 12 The Jacobian of the vectorized matrix differential equation (2.6) evaluated at the REE a¯, b¯, c¯ can be shown to be comprised of three blocks6 DTa(a¯, b¯) = I ⊗ ( I − δb¯)−1 δ (2.8a) DTb(b) = [( I − δb¯)−1 β]′ ⊗ [(I − δb¯)−1 δ] (2.8b) DTc(b¯, c¯) = ϕ ′ ⊗ (I − δb¯)−1 δ (2.8c) Proposition 10.3 of Evans and Honkapohja (2001) suggests the REE is E-Stable if all eigenvalues of the matrices DTa(a¯, b¯), DTb(b¯), and DTC(b¯, c¯) have real parts less than 1. This can be easily established given a particular model, but the objective of this paper is to determine what effect - if any - the inclusion of anticipated shocks has on the E- stability of a given REE. This is established by Proposition 2.3.1 Proposition 2.3.1. If agents’ information sets include the value of contemporaneous endogenous variables and the REE corresponding to the core of a model featuring news shocks, defined as the system of equations which exists when news shocks are shut down, is E-stable, then the REE corresponding to the model with news shocks included is also E-stable. 6For example, since the equation governing the evolution of b is independent of a and c we may start with the equation Tb(b) = (I − δb)−1 β the matrix differential of this equation with respect to b is obtained by applying the rule dF−1 = −F−1(dF )F−1, and hence dTb(b) = (I − δb)−1 δ(db) (I − δb)−1 β Since dvecx = vecdx and vec(ABC) = (C ′ ⊗ A)vecB the vectorized Jacobian DTb = ∂vecTb/∂(vecb)′- which determines local stability of this equation evaluated at a particular b- is given by DTb(b) = [ (I − δb)−1 β ]′ ⊗ [ (I − δb)−1 δ ] Similar operations can be employed to obtain DTa and DTc. 13 Proof. Suppose the REE is E-stable without anticipated shocks included in the model. Then since the value of b¯ is independent of ϕ and M (which define the structure of exogenous shocks), and since a is uniquely determined for given b, it follows the structural modifications necessary to incorporate news shocks will have no effect on the eigenvalues of the matrices DTa and DTb. Furthermore, these modifications can always be done such that ϕ a block-upper triangular matrix with upper-left block corresponding to the no- news model and lower-right block a nilpotent lower-shift matrix. Since the eigenvalues of DTc are the set of pairwise-products of eigenvalues of ϕ ′ and ( I − δb¯)−1 δ, the inclusion of the nilpotent sub-matrix implies DTc will acquire a set of zero eigenvalues in addition to exactly the same eigenvalues as in the no-news model, thereby preserving E-stability of the REE. Proposition 2.3.1 suggests that to check whether an REE for a particular model is E-stable it will be sufficient to consider the model without news shocks. If it is E-stable without news then it will continue to be E-stable in the presence of news. One immediate implication of this is that any models with well known unique and E-stable REE can be modified to include news. However, the conditions which imply a REE is E-stable depend critically on the information assumed to be known to agents at the time of making their forecast. In the next section I consider the effect of introducing an informational friction to the decision making process of agents. E-Stability: Delayed Information Assumption In contrast to the preceding subsection, I now assume the values of contemporaneous endogenous variables are unknown to agents at the time they must make their decisions. Instead, agents must form forecasts of these values using their PLM. This 14 approach emphasizes the (possible) distinction between individual and aggregate-level endogenous variables in macroeconomic models; for example, individual households consumption/savings decisions are typically functions of future real interest rates, which are themselves determined by the aggregate consumption/savings decision of the economy as a whole. Thus each household must form a forecast of the future aggregate action. I again consider the model given by equations (2.1) and (2.2). The PLM is again assumed to take the form of the MSV solution, which is still yt = a+ byt−1 + cwt (2.9) wt = ϕwt−1 +Mνt (2.10) However, the information assumptions now imply that expectations are given by Eˆtyt = a+ byt−1 + cwt and Eˆtyt+1 = a+ bEˆtyt + cϕwt = (I + b)a+ b2yt−1 + (bc+ cϕ)wt As before these can be inserted into equation (2.1) to obtain the ALM yt = [α + δ (I + b) a] + [ β + δb2 ] yt−1 + [χ+ δ (bc+ cϕ)]wt (2.11) Thus the mapping of beliefs from the PLM to the ALM is given by T (a, b, c) = { α + δ (I + b) a, β + δb2, χ+ δ (bc+ cϕ) } 15 and expectational stability of a solution is determined by the matrix differential equation d dτ (a, b, c) = T (a, b, c)− (a, b, c) (2.12) where again an REE can be seen as the solution to the fixed point problem 0 = T ( a¯, b¯, c¯ )− (a¯, b¯, c¯) It is simple to verify the solution to this problem is identical to that implied by equations (2.7a), (2.7b), and (2.7c), establishing that the specific informational assumptions are equivalent under RE: when agents are perfectly informed to the true ALM for the economy it matters not whether they know the values of contemporaneous endogenous variables or forecast them, as their forecasts will by assumption be correct on average. However, the additional complication for boundedly rational households can have non-trivial effects on the conditions necessary to ensure asymptotic convergence to the REE. In particular, Proposition 10.1 in Evans and Honkapohja (2001) states the RE solution (a¯, b¯, c¯) is E-Stable if all eigenvalues of the Jacobian of the vectorized T- map, which again is comprised of three blocks, have real parts less than 1. Under this 16 alternative timing regime, the blocks are given by7 DTa(a¯, b¯) = I ⊗ δ + I ⊗ δb¯ DTb(b¯) = b¯ ′ ⊗ δ + I ⊗ δb¯ DTc(b¯, c¯) = ϕ ′ ⊗ δ + I ⊗ δb¯ As with the contemporaneous timing assumption, the matrices DTa and DTb are unaffected by the modifications to ϕ and M necessary to include news shocks, and hence their eigenvalues are similarly unaffected. However, DTc is clearly affected; furthermore, this matrix is the sum of two matrices. Very little can be said of the eigenvalues of the sum of matrices in general, and thus analysis of E-stability under this delayed information set must rely on numerical exercises. In what follows I apply the analytic E-stability results from this section to the model of Beaudry and Portier (2004), and find that the REE are E-stable under both informational assumptions for a wide range of parameter constellations and informational structures Application to Beaudry and Portier (2004) Any solution to the comovement problem must address two issues. First, good news about the future must lead to increased demand for investment when the information is obtained as opposed to when the shock actually materialize. And second, the increased investment must be financed by increasing employment as opposed to decreasing 7Since the Kronecker product is a bilinear map, we have DTa ( a¯, b¯ ) = I ⊗ δ (I + b¯) = I ⊗ δ + I ⊗ δb¯ 17 consumption. There are a variety of methods available for achieving these desired results. In what follows I will apply the E-stability analysis of the previous section to the news-shock model of Beaudry and Portier (2004), in which good news about the future generates a boom in the current period: all major macroeconomic variables - consumption, investment, labor supply, and output - rise upon receipt of the positive information regarding future economic fundamentals. The economy of Beaudry and Portier (2004) is comprised of three sectors: investment or “durable” goods, intermediate “nondurable” goods, and final composite consumption goods. Households make a consumption/savings decision using income earned by supplying labor to the durable and nondurable good markets and renting capital to the consumption good market; this decision is partially informed the arrival of information about a future technological innovation i.e. a news shock. All markets are competitive. The durable and nondurable goods are produced using CRTS technology from household labor and a fixed factor, and production is augmented by sector-specific technology. The consumption good is produced from the capital stock and the nondurable good using CES technology featuring complementarity between the inputs. The keys to generating positive comovement in this model are twofold. Because the nondurable good and capital are complements in production of the consumption good, news about the technology used in producing the nondurable good causes a contemporaneous change to the demand for investment. And because the consumption and investment decisions are essentially decoupled from each other - the value of investment is directly related only to the loss of leisure by increasing labor in the durable good sector, as opposed to the loss of utility from reducing consumption - the increased investment is purchased by households working more rather than less. Thus, the model 18 produces qualitatively realistic expectationally driven business cycles in which good news about the future generates a boom in all key macroeconomic variables today. The Model Formally, the composite consumption good Ct is produced from the nondurable good Xt and the (predetermined) capital stock Kt−1 according to Ct = ( aXνt + (1− a)Kνt−1 ) 1 ν (2.13) where ν ≤ 0 to ensure the inputs are complements in production. The nondurable good is produced from household labor lx,t, a fixed factor Fx, and technology θx,t according to the constant returns to scale (CRTS) production function Xt = θx,tl αx x,tF 1−αx x (2.14) where 0 ≤ αx ≤ 1 captures labor’s share of nondurable-good production. The law of motion for the aggregate capital stock is Kt = (1− δ)Kt−1 + It (2.15) where 0 ≤ δ ≤ 1 the depreciation rate and It gross private investment produced by the durable goods sector from household labor lk,t, a fixed factor Fk, and technology θk,t according to the CRTS production function It = θk,tl αk k,tF 1−αk k (2.16) 19 where 0 ≤ αk ≤ 1 captures labor’s share of durable-good production. The fixed factors are inelastically supplied by households and have the effect of introducing diminishing returns in labor supply while maintaining CRTS in overall production. These fixed factors can be thought of as any scarce resource that constrains production such as privately held land or managerial capital. Households are infinitely lived and receive utility from consumption and disutility from supplying labor. The lifetime utility function is assumed separable in consumption and labor, and is given by U = Eˆ0 [ ∞∑ t=0 βt { log(Ct)− v0 ( l¯ − lx,t − lk,t )}] (2.17) where 0 < β < 1 the discount factor, v0 > 0 scales the disutility of supplying labor, and l¯ total disposable time. Eˆt denotes the subjective expectations of the household given their time t information set. The flow budget constraint is Ct + PtIt ≤ Wx,tlx,t +Wk,tlk,t +RtKt−1 + Πx,t + Πk,t (2.18) where Pt the price of the investment good in terms of the consumption good and Πx,t and Πk,t the returns to renting the fixed factor in the nondurable and durable goods sectors, respectively. The aggregate resource constraint is defined to be Yt = Ct + PtIt (2.19) 20 where Yt the total output of the economy. Technology of the durable good exhibits a deterministic trend such that θk,t = g0,ke g1t (2.20) while the technology of the nondurable grows stochastically according to θx,t = g0,xe g1tθˆx,t (2.21) θˆx,t = θˆ λ x,t−1e ε0t (2.22) where 0 < λ < 1. One may interpret innovations to nondurable good-specific technology as capturing the process of product differentiation e.g. the creation of higher quality or entirely new products. New goods will require a higher stock of infrastructure, and this complementarity between nondurable TFP and the capital stock is key to the model’s ability to generate qualitatively realistic expectationally driven business cycles. News itself is assumed to take the form of an anticipated shock to the growth of nondurable technology. In particular, ε0t = ν 0 t + ν n t−n where n is the horizon for which the shock is anticipated. This is in contrast to the “noisy news” view, which models news shocks as noisy signals of future innovations. Both formulations in some sense support the notion of revisions and agents being surprised by realizations which differ from their expectations. But the anticipated shocks version has the advantage of being straightforward to incorporate into the systems of equations comprising modern macroeconomic models; furthermore, it is simple to consider alternative assumptions for the information flows e.g. the length and/or density of the anticipated shock structure. 21 E-Stability Solving this model for a particular parameterization involves detrending the non- stationary variables, calculating the non-stochastic steady state, and then log-linearizing the system around this non-stochastic steady state; see Appendix A for details. Abusing notation, the (linear) system of expectational difference equations can be put in standard form yt = α + βyt−1 + χwt + δEˆtyt+1 (2.23) wt = ϕwt−1 +Mνt (2.24) where yt = ( C˜t, l˜x,t, l˜k,t, I˜t, K˜t, X˜t, Y˜t, P˜t, R˜t, W˜x,t, W˜k,t, Λ˜t, Q˜t, θ˜k,t, θ˜x,t, ˜ˆ θx,t )′ the endogenous variables, wt = (ε 0 t , ε 1 t , · · · , εnt )′ a vector of auxiliary state variables designed to pass news shocks through time, and νt = (ν 0 t , ν 1 t , · · · , νnt )′ a vector of anticipated and unanticipated exogenous stochastic shocks.8 The elements of the matrices α, β, χ, and δ correspond to the log-linearized equations which describe a temporary equilibrium and are functions of the structural parameters; the non-zero elements of ϕ, and M are selected to imply the desired structure for the arrival of anticipated and unanticipated shocks. Beaudry and Portier (2004) calibrate the model parameters using results from previous studies or to achieve specific steady-state values. My analysis of the E-stability of their news-shock model begins with their baseline calibrations, which are summarized in Table 1. 8Technology for the durable good is assumed to be deterministic. Incorporating a stochastic element would simply require modifying wt and νt to include the appropriate auxiliary and exogenous variables and making the corresponding revisions to the filtering matrices ϕ and M . 22 Baseline Parameterization Parameter Values β 0.98 δ 0.05 αx 0.60 αk 0.97 l¯ 2 v0 1 Fx 1 Fk 1 λ 0.999 ν -3.78 TABLE 1. Calibrated Parameters for Three-sector Model Inclusion of News Shocks Putting the model in standard form allows an immediate application of the E- stability results from the section which introduced adaptive learning. In particular, it is simple for any given parameterization to calculate the eigenvalues of the T-map’s vectorized Jacobian which indicate whether or not the system is E-stable. I begin by setting n = 0, that is, the case where there is no news and all shocks are completely unanticipated. The largest real roots of DTa(a¯, b¯), DTb(b¯), and DTc(b¯, c¯) under the contemporaneous information assumption are 0.5199, 0.51947, and 0 respectively, while the corresponding numbers are 0.1547, 0.1538, and 0 under the delayed information assumption. Thus the RE solution for this model is E-Stable under both informational assumptions without news. Proposition 2.3.1 implies that the eigenvalues should be unaffected by the inclusion of news shocks under the contemporaneous timing assumption. Indeed, setting n = 1 so that households receive information about the technological innovation 1-period ahead and keeping all other parameters fixed at their baseline levels results in no change to the 23 eigenvalues for DTa(a¯, b¯), DTb(b¯), or DTc(b¯, c¯), as expected. Furthermore, while there is no corollary for the delayed timing assumption, numerical results suggest the E-stability of the REE is similarly unaffected by the inclusion of news under the delayed timing assumption. The main contribution of this chapter is the finding that news shocks do not alter the E-stability properties for REE. This is important because E-stability results are, in general, sensitive to the assumptions regarding the timing and flow of information. While news shocks are essentially a modification of the exogenous driving forces for the economy which would not in general be expected to alter E-stability results, their impact is felt through the revision of expectations and subsequent behavioral changes on the part of households from the arrival of new information. This informational feedback effect is responsible for the fact that some REE which are E-stable under the contemporaneous timing assumption may fail to be E-stable under the delayed timing assumption. The main results of this chapter affirm the robustness of REE to news shocks and suggest modelers wishing to incorporate anticipated shocks into their models may do so without sacrificing on the robustness of their REE to alternate expectations formations assumptions. Robustness In what follows I seek to characterize what role, if any, the specific calibration choices for structural parameters and the new shock structure play in determining whether the REE is E-stable. I proceed by first considering alternate calibrations for the structural parameters and then exploring the effect of lengthier forecasting horizons and denser informational structures. I find that the E-stability of the REE is robust to 24 all constellations consistent with the model being able to generate qualitatively realistic expectationally driven business cycles, with and without news shocks.9 To analyze whether alternative parameterizations may impact the E-stability results, I adopt the following strategy: I change the value of structural parameters one at a time while holding all others at their baseline calibration, searching only within the parameter space consistent with the model generating qualitatively realistic expectationally driven business cycles. The REE is E-stable for all parameterizations considered and under both informational timing assumptions. Table 2 displays the smallest and largest considered for each parameter. Robustness Checks Parameter Smallest Value Largest Value β 0.001 1 δ 0 1 αx 0.001 0.999 αk 0.001 0.999 ν -100 -0.001 a 0.001 0.999 TABLE 2. Alternate Parameterizations In general, the largest real parts of eigenvalues are larger when households discount the future less (smaller β), when the depreciation rate of capital is smaller (smaller δ), when the decreasing returns to producing the nondurable good are smaller (larger αx), when the decreasing returns to producing the durable good are larger (smaller αk), when the complementarity between capital and nondurable goods in producing the consumption good is weaker (larger ν), and as the relative importance of the nondurable good to capital in producing consumption decreases (smaller a). Even so, setting these parameters jointly 9The ability of the model to solve the comovement problem is dependent only the value of ν, which governs the substitutability between capital and the nondurable good in the production of the consumption good. As long as these factors are complements in production of the consumption good i.e. ν ≤ 0, the model will be capable of generating qualitatively realistic expectationally driven business cycles. 25 to values which on their own would tend to imply relatively large eigenvalues does not cause the REE to become expectationally unstable. One might also be concerned about assumed structure for news shocks themselves, that is, the length and density of the forecasting horizon. To analyze what role this structure plays in the E-stability of the REE I proceed as follow. First, I extend the forecasting horizon while keeping the structural parameters at their baseline levels by considering n = 1, 2, ..., 10. Second, I allow for multiple anticipated shocks, that is, households receive multiple pieces of information for the eventual innovation. For example, households may receive news about the innovation four and eight periods ahead. Neither changes to the length of the forecasting horizon nor its density have any impact on the largest real parts of eigenvalues of DTa(a¯, b¯), DTb(b¯), or DTc(b¯, c¯) under either timing assumption. Conclusion That the three-sector news-shock model of Beaudry and Portier (2004) can generate qualitatively realistic expectationally driven business cycles while the RE solution is E- stable suggest there is nothing fundamental about news shocks per se to imply models incorporating them will be expectationally unstable. In fact, Proposition 2.3.1 shows that the REE of any model be E-stable when news shocks are included in the time t information set as long as it is E-stable when news shocks are not included. While no analytic result exists for alternative timing specifications, numerical experimentation suggests this may be true more generally, as alternate informational structures did not impact the largest eigenvalues of the associated T-map’s vectorized Jacobian. 26 This is a particularly important finding given the well-known sensitivity of E- stability of REE to specific assumptions regarding agents information sets. Since news- shock models rely critically on novel changes to agent’s information sets, and since no other studies have examined E-stability in the context of a news-shock model, this study suggests a potential justification for RE on the grounds that boundedly rational reduced- form learning agents can learn the RE coefficients over time. More work will need to be done to examine whether the REE of such news-shock models demonstrate similar learnability properties under alternative forms of bounded rationality, such as Euler- equation learning, infinite-horizon learning, or shadow-price learning. 27 CHAPTER III SHADOW PRICE LEARNING IN A NEWS-SHOCK MODEL Introduction The relatively recent interest in using news to explain macroeconomic fluctuations began largely from the asset-price bubbles experienced in the late 1990s with the tech- boom. The “irrational exuberance” exhibited by the market was difficult to generate in most standard macroeconomic DSGE models, and it is this gap which the original news- shock model of Beaudry and Portier (2004) attempted to fill. As shown in Chapter II, by modifying the structure of a RBC model in a particular way the authors were able to generate boom-bust cycles fueled by households experiencing periods of optimism and pessimism which created liquidation cycles in the capital stock. Yet this is just one way of solving the “comovement problem” found in RBC models, and it comes at the expense of making significant changes to the standard RBC model. In particular, it requires three sectors and very strong complementarity of inputs to final goods production, both of which are somewhat unusual in the broader context of typical workhorse macroeconomic models. Jaimovich and Rebelo (2009) on the other hand provide a news-shock model capable of solving the comovement problem while remaining approachable to researchers familiar with standard RBC models. The core of the model is essentially a discrete-time Ramsey model with elastic labor supply. Savings are converted to investment according to some technological process, and investment is used to create a depreciating capital stock. This textbook structure is augmented by assuming variable capacity utilization of capital, a 28 cost to adjusting investment from its previous level, and a novel preference specification nesting the well known specifications of Greenwood et al. (1988) and King et al. (1988). While these features taken together represent a significant departure from standard RBC models, the individual elements are fairly standard. The relative simplicity of the model makes it an excellent candidate for exploring the qualitative and quantitative effects of relaxing the RE assumption in favor of a weaker assumption. Furthermore, since it is quite similar to well-known RBC models, the results of such an experiment exist in an environment in which some context has already been provided. While many alternatives to RE exist, I ultimately choose to consider bounded optimality of the sort described in Evans and McGough (2015). To place this decision in a broader context, I begin by describing the two benchmark theories of bounded rationality: Euler-equation learning and infinite-horizon learning. Under Euler-equation learning, found e.g. in Evans and Honkapohja (2006) and Honkapohja et al. (2012), agents behave in accordance with their Euler equations which relate control decisions today with the forecasted values of decisions tomorrow. This approach eliminates the reliance on RE in deriving the equilibrium dynamics of the economy found in the reduced-form learning approach of Chapter II. Rather than assuming agents are rational, solving their dynamic programming problem, and then relaxing the RE assumption, Euler-equation learning studies the evolution of agent’s beliefs and the path of the economy when their expectations are boundedly rational even when solving their dynamic programming problem. One of the benefits of Euler-equation learning is its simplicity: agents need only make one-period ahead forecasts to determine their optimal control decisions. 29 However, a possible shortcoming is this approach requires agents to forecast the future value of control variables, a somewhat strange exercise in a representative agent model where these are completely under the control of the agents. The infinite-horizon approach to learning of Preston (2005) avoids this by formally expressing decisions today as being determined by agent’s expectations of their future lifetime budget constraint and transversality conditions. Behavior is thus based on the relationship between future wealth and the implied control decisions. One of the main benefits of the infinite-horizon approach is the strict adherence to microfoundations which implies behavior is truly optimal given beliefs. In Branch et al. (2012a) a “hybrid” of the infinite-horizon and Euler-equation approaches called N-step optimal learning is developed. N-step optimal learning assumes agents explicitly take current and future expected values of wealth over a finite range into account when making their control decisions, which are themselves anchored to the Euler equation as a behavioral primitive. This learning mechanism has the infinite- horizon approach as a limiting case, and may be viewed as its finite-horizon version. N- step optimal learning captures both the simplicity of Euler-equation learning and the rigorous adherence to fully optimal behavior imparted by infinite-horizon learning, but it still requires considerable sophistication on the part of agents in the model. A recent alternative similar in spirit but requiring less agent-level sophistication is the shadow-price learning (SP-learning) approach of Evans and McGough (2015). Rather than taking the behavioral primitive to be the Euler equations, which often embody complicated nonlinear relations between contemporaneous control decisions and the evolution of the determinants of future income, SP-learning assumes agents base their decisions on the standard first-order necessary conditions (FONCs) derived from the intertemporal Lagrangian. These FONCs describe a set of conditions which must be 30 satisfied by the agents control decisions in order to be considered optimal, and they are functions of the expected present value of shadow prices for endogenous state variables. SP-learners form forecasts of these shadow prices using a linear PLM and update their beliefs over time using recursive least squares; thus the behavior is by definition optimal given beliefs. SP-learning is an attractive alternative to RE for many reasons. First, the informational assumptions are quite natural: agents need understand little more than their preferences and budget constraint and take as given many of the same variables that real households do e.g. wages and interest rates. Second, the behavior of agents is quite intuitive: they simply contemplate how changing their behavior today would impact key variables tomorrow based on their beliefs and take action accordingly. Third, the updating of beliefs occurs via recursive least squares, an exercise which any student of introductory econometrics could conduct. Finally, it lends itself to considering heterogeneity of agents along some dimension such as information or initial wealth, as these differences will cause different transition paths under learning for different households. While much attention in the bounded rationality literature has focused on determining whether agents endowed with boundedly rational expectations may learn to behave rationally, there is a somewhat smaller literature which examines the effect of relaxing RE on the quantitative predictions generated by a given DSGE model. For example, Williams (2003a) compares data generated by simulations of an RBC and a NK model under RE and reduced-form learning. He finds that reduced-form learning has an extremely small effect on generated moments. He then considers an alternative learning approach where agents’ decisions incorporate uncertainty about the structure of the economy into their decision rules. This “structural learning” approach is shown to substantially increase the volatility and persistence of key macroeconomic variables 31 generated by an otherwise standard RBC model. Given the discussion above this is not too surprising: the “structural learning” approach embodies to varying degrees the same core idea behind Euler-equation and infinite-horizon learning, namely that agents should try to act optimally given their beliefs, and that these beliefs may be quite different from those prescribed by RE. This chapter proceeds in the spirit of Williams (2003a) by trying to determine the quantitative effects on business cycle statistics generated by the Jaimovich and Rebelo (2009) news-shock model when agents are assumed to be SP-learners. The central goal of this chapter is to address what effect relaxing the assumption of RE in favor of SP- learning has on the empirical relevance of this particular news-shock model, and whether the behavior of SP-learning households will come to approximate that of their fully rational counterparts. I first describe the economic environment, including the calibration for the model, the equations governing a temporary equilibrium, and the way in which news shocks are modeled. I then present the results of simulation exercises to generate and compare data on business cycles within the model to those found in US data. The chapter concludes by reviewing the main results and discussing paths for future research. The Model The representative household chooses consumption Ct and hours worked ht to maximize the lifetime utility function Eˆ0 ∞∑ t=0 βtU(Vt) (3.1) 32 where 0 < β < 1 the household’s discount factor and U the period utility function which takes the CRRA form U(Vt) = V 1−σt − 1 1− σ (3.2) where σ > 0 the inverse intertemporal elasticity of substitution and the argument Vt is given by Vt = Ct − ψh 1+ 1 θ t St 1 + 1 θ (3.3) where ψ > 0 scales the disutility of labor supply and θ > 0 governs the Frisch elasticity of labor supply. St is a geometric average of current and past habit-adjusted consumption and takes the form St = C γ t S 1−γ t−1 (3.4) where 0 ≤ γ ≤ 1 governs the magnitude of the wealth elasticity of labor supply. This preference specification, often referred to as “JR preferences”, allows the modeler to calibrate the wealth effect of labor supply to be “small” while permitting a balanced growth path. The comovement problem in typical RBC models is partially caused by the wealth effect of labor supply dominating the substitution effect, thereby causing consumption and hours worked to move in opposite directions upon receipt of good news, and hence this utility function makes it simple to develop qualitatively realistic expectationally driven business cycles. Note this specification nests two well-known and important preference specifications. γ = 0 corresponds to the preferences of Greenwood et al. (1988) in which labor supply 33 depends only on current real wages and is independent of the marginal utility of income, while γ = 1 corresponds to the preferences of King et al. (1988) which are compatible with a balanced growth path at the optimal steady state of the economy. Small values for γ allow the economy to be consistent with a balanced growth path while also implying a very weak wealth effect of labor supply, both of which are important features for any dynamic stochastic general equilibrium (DSGE) model hoping to explain movement in key macroeconomic variables through news shocks.1 Households are assumed to own physical capital Kt and rent it to firms in a competitive market. Each household’s stock of capital evolves according to the law of motion Kt = (1− δ (ut))Kt−1 + It [ 1− Φ ( It It−1 )] (3.5) where It gross private investment. Note the timing convention: subscripts correspond to the time period in which the variable is decided, hence in time t the household chooses Kt and takes as given the predetermined capital stock Kt−1, which was chosen in period t− 1. Φ ( It It−1 ) imposes a cost to adjusting investment from its previous level while δ (ut) implies capital depreciation is a function of its utilization rate ut. Both functions are convex in their arguments, and I follow SGU in assuming the quadratic functional forms δ(ut) = δ0 + δ1(ut − 1) + δ2 2 (ut − 1)2 (3.6) 1Note that the case of γ = 0 is not consistent with a balanced growth path: along a steady state the real interest rate must be growing at a constant rate, and thus the marginal utility of income will also be growing. A balanced growth path requires the wealth and substitution effects from the constantly growing real wage to cancel, but γ = 0 ensures that only the substitution effect exists, and hence labor supply will be changing at a rate inconsistent with the rest of the economy. 34 where δ0 > 0 the steady-state depreciation rate, δ1 > 0 determines the steady-state value of ut, and δ2 > 0 captures the rental rate elasticity of capacity utilization and Φ ( µIt ) = κ 2 ( µIt − µI )2 (3.7) where µIt ≡ ItIt−1 the growth rate of investment, κ > 0 a scaling parameter, and µI the steady-state growth rate of gross private investment. This specification implies Φ ( µI ) = Φ′ ( µI ) = 0 and Φ′′ ( µI ) > 0, i.e. there are no investment-adjustment costs on a balanced growth path. Each period the household receives labor income from working ht hours at rate Wt, rental income from from renting utKt−1 units of effective capital at gross rental rate Rt, and lump sum firm-profits of Πt. The household uses this income to purchase consumption and investment goods. The flow budget constraint is given by Ct + AtIt ≤ Wtht +Rt (utKt−1) + Πt (3.8) where At is an exogenous process representing the current state of technology for producing investment goods from consumption goods with stationary growth rate µAt ≡ At At−1 and steady-state value µA. In a decentralized equilibrium At may be interpreted as the relative price of investment goods in terms of consumption goods, that is a unit of the investment good may be exchanged for At units of the consumption good. Households are assumed to make their choices to maximize their expected lifetime utility, and hence each period they solve a constrained optimization problem to maximize expected discounted utility. Formally, households choose a set of stochastic processes 35 {Ct, ht, ut, It, Kt, St}∞t=0 to maximize 3.1 subject to the constraints given by equations 3.2-3.8 and initial conditions for the endogenous state variables I−1, K−1, and S−1. This can be written as a standard dynamic constrained maximization problem: max Ct,ut,ht,It,Kt,St Eˆt ∞∑ t=0 βtU (Vt) subject to U (Vt) = V 1−σt − 1 1− σ Vt = Ct − ψh 1+ 1 θ t St 1 + 1 θ St = C γ t S 1−γ t−1 Ct + AtIt ≤ Wtht +Rt (utKt−1) + Πt Kt = (1− δ(ut))Kt−1 + It [ 1− Φ ( It It−1 )] The household’s time t control decisions uˆt = (Ct, ut, ht, It, Kt, St) ′ will depend on their expectations of the future shadow price of the endogenous state variables x1t = (It−1, Kt−1, St−1) ′, which are variables the household can directly affect in the future through their actions today.2 Under RE all agents within the economy know the conditional distributions of all variables, and hence the laws of motion for the endogenous and exogenous state variables are known and the expected future value of these shadow prices is straightforward to compute. Standard solution techniques under RE involve deriving Euler equations relating today’s control decisions to those of tomorrow to describe the behavior of households and combining these and other optimality conditions, resource constraints, and laws of motion to arrive a system of expectational difference equations. 2While agents will also use their expectations of future realizations of the exogenous state variables in their decision making process, their actions do not affect the marginal values of these variables; hence, they need not concern themselves with the expected future shadow prices of these exogenous states. 36 Typically this system is nonlinear, and hence the solution to the first-order approximation is considered. In what follows I will replace RE with a version of bounded rationality based on shadow prices called SP-learning. Under SP-learning, households do not know the conditional distribution of all variables, and in particular they do not know the precise way in which shadow prices depend on their behavior. Instead they estimate the value of these shadow prices using a linear forecasting rule which is a function of observables. The solution technique under SP-learning stands in stark contrast to that of RE. A major difference is that while agents are endowed with linear forecasting rules, their behavior is embedded within the nonlinear specification of the model. This implies except for special cases, the relevant equilibrium notion is that of a restricted-perceptions equilibrium (RPE) as opposed to a REE. A RPE can be thought of as an equilibrium arising from agents’ optimally misspecified beliefs which are consistent with the stochastic processes realized in the economy. Given their forecasting model agents are unable to detect a misspecification. The first task of this chapter is determining how similar this RPE is to the implied REE. Denote the shadow prices of investment, capital, and habit-adjustment by λIt , λ K t and λSt , respectively. These have the simple interpretation as the time t value of an additional unit of their corresponding endogenous state in time t. For example, λKt is the marginal value of an additional unit of preinstalled capital Kt−1 in time t. With this interpretation the FONCs describing optimal household choices of Ct, ht, and ut given beliefs about the future values of these shadow prices can be obtained via simple variational arguments.3 3Note that given optimal decisions for consumption, labor supply, and capacity utilization the choices for the geometric average of habit-adjusted consumption St, gross investment It, and hence next period’s capital stock Kt+1 are pinned down by equations (3.4), (3.8), and (3.5) respectively. 37 The details of this derivation are presented in Appendix B. The end result is the following three FONCs in the controls UCt(Vt) + ∂St ∂Ct βEˆtλ S t+1 = ∂It ∂Ct ( βEˆtλ I t+1 + ∂Kt ∂It βEˆtλ K t+1 ) −Uht(Vt) = ∂It ∂ht ( βEˆtλ I t+1 + ∂Kt ∂It βEˆtλ K t+1 ) ∂Kt ∂ut βEˆtλ K t+1 = ∂It ∂ut ( βEˆtλ I t+1 + ∂Kt ∂It βEˆtλ K t+1 ) and the following three FONCs in the endogenous state variables λIt = ∂Kt ∂It−1 βEˆtλ K t+1 λKt = ∂It ∂Kt−1 ( βEˆtλ I t+1 + ∂Kt ∂It βEˆtλ K t+1 ) + ∂Kt ∂Kt−1 βEˆtλ K t+1 λSt = USt−1(Vt) + ∂St ∂St−1 βEˆtλ S t+1 It can be shown that the endogenous shadow prices can be expressed in terms of the contemporaneous controls and states, and hence their actual value is determined as a function of the contemporaneous decisions of the household. However, the household makes these decisions based on their expectations of the value of future shadow prices without knowing the nonlinear way in which the actual values are determined. Given a set of beliefs and a linear PLM for the evolution of these shadow prices, SP-learners take optimal actions given their forecasts of future shadow prices. This behavior results in the ALM for the shadow prices. Beliefs are updated as weighted averages of current beliefs and forecast errors via the recursive least squares algorithm as is standard in the adaptive learning literature. A crucial question addressed by studying household behavior under 38 this less stringent specification for expectation formation is whether the household’s beliefs will (approximately) converge to those of a rational agent. The model is closed by describing the production side of the economy. It is important to keep in mind that the only agents operating as SP-learners are the households; firms behave in a manner consistent with rational expectations. One possible justification for this discrepancy in assumptions about expectations formation is that these types of agents may be more likely to have the means to act in a highly sophisticated manner, whereas the average household likely does not. The representative firm pays for ht worker-hours and rents utKt−1 units of effective capital to produce output Yt using CRT technology according to the production function Yt = zt (utKt−1) 1−α hαt (3.9) where α ∈ (0, 1) governs the labor share of output in steady state. As is typical in RBC- type models, the supply side of the economy is subjected to exogenous stochastic shocks, given here by a transitory shock zt. Factor markets are competitive and hence the gross rental rate equals the value of the marginal product of effective capital Rt = (1− α) Yt utKt−1 (3.10) and the wage paid is equal to the value of the marginal product of labor Wt = α Yt ht (3.11) 39 Output is fungible and may be used for private consumption or gross investment. Total demand is thus given by Yt = Ct + AtIt (3.12) Equilibrium A temporary equilibrium for all periods t ≥ 0 is described collectively by the optimal behavior of households given beliefs: UCt(Vt) + ∂St ∂Ct βEˆtλ S t+1 = ∂It ∂Ct ( βEˆtλ I t+1 + ∂Kt ∂It βEˆtλ K t+1 ) (3.13a) −Uht(Vt) = ∂It ∂ht ( βEˆtλ I t+1 + ∂Kt ∂It βEˆtλ K t+1 ) (3.13b) ∂Kt ∂ut βEˆtλ K t+1 = ∂It ∂ut ( βEˆtλ I t+1 + ∂Kt ∂It βEˆtλ K t+1 ) (3.13c) Kt = (1− δ (ut))Kt−1 + It [ 1− Φ ( It It−1 )] (3.13d) Ct + AtIt ≤ Wtht +Rt (utKt−1) (3.13e) St = C γ t S 1−γ t−1 (3.13f) I−1,K−1, S−1 given (3.13g) where the utility function, its argument, and the functional forms for δ(ut) and Φ ( It It−1 ) are given by equations (3.2), (3.3), (3.6), and (3.7), respectively. Appendix B expresses these equations in terms of the underlying variables as opposed to derivatives, which I present here for the sake of expositional clarity. The market clearing conditions, aggregate 40 resource constraint, production function, and factor-market prices are given by Yt = Ct + AtIt (3.14a) Yt = zt (utKt−1) 1−α (ht) α (3.14b) Wt = α Yt ht (3.14c) Rt = (1− α) Yt utKt−1 (3.14d) The laws of motion for the exogenous stochastic processes At and zt along with the precise way in which news shocks are included in the model are described below. News Shocks and Expectations Formation It remains to describe the particular way in which shadow price learning agents form their expectations. The variables relevant to household decision making but outside of their control in time t - that is, the predetermined control variables, factor prices, and the exogenous state variables (plus a constant) - are collected into xt = (1, It−1, Kt−1, St−1,Wt, Rt, At, zt) ′, and households are assumed to have a PLM for the endogenous-state shadow prices given by λt = Htxt, where λt = ( λIt , λ K t , λ S t )′ and Ht denotes the household’s time t estimates of the coefficient-matrix relating the state variables to the value of the shadow price. There is one issue that must be addressed: Wt and Rt are exact functions of ht, utKt−1, and the exogenous processes zt and At, and hence including all of these variables in the regression will lead to perfect multicollinearity. To avoid this I drop Wt and Rt from 41 the set of regressors utilized by the household and denote this subset of state variables as x˜t = (1, It−1, Kt−1, St−1, At, zt) ′.4 The household’s PLM for the shadow prices is thus given by the linear model λt = H ′ tx˜t (3.15) where Ht is updated via recursive least squares according to the dynamic system RH,t = R H t−1 + gt ( x˜t−1x˜′t−1 −RHt−1 ) (3.16) Ht = Ht−1 + gtR−1H,tx˜t−1 ( λt−1 −H ′t−1x˜t−1 )′ (3.17) RH,t is the household’s time t estimate of the second-moment matrix for the regressors while the “gain” parameter gt controls how much weight households put on new information. Much of the theoretical work exploring whether agents can learn the coefficient-matrix of an economy’s REE, such as Evans and Honkapohja (2001) assumes a decreasing gain such as gt = t −1 , which implies the household response to forecast errors vanishes asymptotically. Alternatively, studies focused on simulation or estimation of DSGE models under learning, such as Williams (2003b) and Milani (2007), employ a constant gain where gt = g¯, which implies the household is a lifelong learner and continually revises its coefficient estimates placing the most weight on the most recent observations. I will simulate the model under both assumptions. Under constant-gain 4One could also consider assuming the household does not observe some of the exogenous processes and instead uses wages and the gross rental rate in their forecasts. This would require specifying a PLM and dynamic system for updating beliefs for these price variables. It can also lead to serious issue with asymptotic multicollinearity as household’s beliefs converge to those of rational agents, because as the residuals in the dynamic system for updating beliefs go to zero the actions of the household become perfectly collinear with market prices. 42 learning I calibrate the gain to be 0.0152, which is in line with recent estimates from the empirical adaptive learning literature. The value of shadow prices in time t is not known ex-ante to the household, and is determined as a result of their control choices. Hence, time t beliefs are updated using information available through time t − 1. This clarifies the central idea of shadow price learning: households make optimal decisions given their (misspecified) beliefs and the information available to them in the moment. That beliefs may be misspecified highlights a particularly salient feature of SP-learning in particular and bounded rationality in general by embracing the notion that agents in the economy may certainly fail to fully understand the dynamics governing its evolution and yet an equilibrium may still exist. Taking expectations of equation (3.15) we have Eˆtλt+1 = HtEˆtx˜t+1, and hence households must forecast the future values of regressors.5. The values of the time t + 1 endogenous state variables It, Kt, and St are pinned down by the time t flow budget constraint, capital accumulation equation, and geometric-average identity respectively, and it is natural to assume the household knows these values. In addition the transition equations for the exogenous processes zt and At are assumed to be known to the household.6 5I have assumed certainty equivalence on the parts of households; that is, they believe their time t estimate of the coefficient matrix is the correct estimate now and in the future, and hence behave accordingly. Thus household’s do not need to attempt to forecast the way in which their estimates will change in the future. 6Since there is no feedback between the household’s decisions and the evolution of these processes this is a very simple estimation exercise from an econometric standpoint if the households actually observe all of this data. An alternative approach would carefully consider the variables a household seems likely to observe. Exogenous variables that directly influence the household, such as the technology available for converting consumption goods into investment goods At in the flow budget constraint seem quite natural; however it is less obvious that households would directly observe productivity zt. As mentioned previously, one could drop these as regressors in favor of e.g. wages and/or the rental rate of capital, but this may cause the coefficient estimation routine to suffer from asymptotic multicollinearity and/or bias from sampling error present in using estimated data as an explanatory variable. 43 These expectations are augmented by the inclusion of news shocks to the information set. A news shock can be thought of as information which arrives exogenously to the household about the value of some future innovation to an exogenous process, but (importantly) does not impact any economic fundamentals contemporaneously. The action generated by news shocks is therefore entirely in the response of agents to this information about the future. News is modeled as an anticipated shock to the exogenous processes and hence has the interpretation of imparting incomplete (but accurate) information to the household about future economic fundamentals. In particular let the law of motion for the exogenous processes, indexed by w = (A, z), take the form ln(wt) = ρw ln(wt−1) + ε0w,t ε0w,t = ε 1 w,t−1 + ν 0 w,t [ ν0w,t ∈ It ] ε1w,t = ε 2 w,t−1 + ν 1 w,t [ ν1w,t−1 ∈ It ] ... = ... εnw,t = ε n−1 w,t−1 + ν n w,t [ νnw,t−n ∈ It ] where It the time t information set of the representative household, νkw,t−k the k-period ahead anticipated shock, and [ νkw,t−k ∈ It ] = 1 if νkw,t−k ∈ It and 0 otherwise. This specification allows the modeler to easily and parsimoniously consider a variety of assumptions regarding the precise details of informational acquisition by households.7 7In fact, this requires the creation of only as many auxiliary state variables as the lengthiest forecasting horizon. For example, if a household receives information four and eight periods in advance the longest forecasting horizon is eight periods, and thus eight auxiliary state variables must be generated. An alternative but equivalent specification would generate a set of k state variables for each signal vkw,t−k so that e.g. if a household received information four and eight periods in advance one would need to generate 12 additional state variables. 44 The system above makes clear the sense in which news shocks here are being modeled as partial information about the total value of the innovation. This permits a compact autoregressive representation ln(wt) = ρw ln(wt−1) + ε0w,t (3.18) εw,t = ϕwεw,t−1 +Mwνw,t (3.19) where νw,t = ( ν0w,t, ν 1 w,t, ..., ν n w,t )′ distributes i.i.d normal with mean zero and variance- covariance matrix equal to the identity matrix. The matrix ϕw and vectors εw,t, νw,t, and Mw are given in Appendix B. To see how introducing news as anticipated shocks can have real effects, suppose households were to receive information four and eight periods in advance about the time t + 1 shock, and that the exogenous process is subjected to an unanticipated shock which arrives in time t+1. Then the time t expectations about the evolution of process w in time t+ 1 would forecast its value to be Eˆt ln(wt+1) = ρw ln(wt) + ν 4 w,t+1−4 + ν 8 w,t+1−8 while the actual value realized in time t+ 1 would be ln(wt+1) = ρw ln(wt) + ν 0 w,t+1 + ν 4 w,t+1−4 + ν 8 w,t+1−8 This enables the consideration of a variety of interesting behavior as households revise their expectations in response to new information. For example, the expected value for the innovation prior to time t + 1 could fail to materialize, that is, ν0w,t+1 = − (ν4w,t+1−4 + ν8w,t+1−8), or could materialize exactly as expected i.e. ν0w,t+1 = 0. More 45 generally one may think of the household continually adjusting expectations throughout time in response to the receipt of new information, and this information being descriptive of the innovation’s final value to a varying degree in any given period. Recall the household’s PLM for shadow prices is a linear function of the time t + 1 endogenous state variables and exogenous processes. Since households are assumed to know the law of motion for all exogenous processes they will incorporate the anticipated shocks directly into their decision rules for time t controls via their forecast of future shadow prices. That is, the time t expectation of future shadow prices based on the PLM λt = H ′ tx˜t is Eˆtλt+1 = H ′ tEˆtx˜t+1 where Eˆtx˜t+1 =  1 EˆtIt EˆtKt EˆtSt EˆtAt+1 Eˆtzt+1  =  1 It Kt St ρAAt + Eˆtε 0 A,t+1 ρzzt + Eˆtε 0 z,t+1  where the household’s expectation of the future shock conditional on all information received up to time t, Eˆtε 0 w,t+1, is described above. Calibration The model is calibrated using a combination of commonly used values in the literature, estimates obtained from Schmitt-Grohe and Uribe (2012), and steady-state targets for some endogenous variables. σ = 1 which corresponds to logarithmic utility, θ = 46 Parameter Value Description σ 1 Intertemporal Elasticity of Substitution θ 1.4 Frisch-labor Supply Elasticity γ 0.001 Wealth Elasticity of Labor Supply β 0.985 Subjective Discount Factor α 0.64 Steady-state Labor Share δ0 0.025 Steady-state Depreciation Rate u 1 Steady-state Capacity Utilization Rate h 0.2 Steady-state Labor Supply κ 1.3 Adjustment Cost Acceleration ρz 0.5 Persistence of Investment-specific TFP Growth ρA 0.9 Persistence of TFP Growth TABLE 3. Calibrated Parameters for One-sector Model 1.4 so that the wage elasticity of labor supply is 2.5 when the wealth effect of labor supply is shut off, and γ is set to 0.001 which is simultaneously consistent with an extremely small wealth effect of labor supply and a balanced growth path. β is assumed to be 0.985 so that the steady-state gross real interest rate is 1.5 percent, and α is set to 0.64 so that labor’s share of output in steady-state is 64 percent. Steady state quarterly depreciation δ0 is set to 2.5 percent and δ2 is chosen so that the elasticity of δ(ut) is 0.15. δ1 is calibrated to ensure steady-state capacity utilization equals 1, while the disutility-scale parameter ψ is chosen so that household’s spent 20% of their time working. The second derivative of the investment adjustment cost function κ is set to 1.3, though this is subjected to robustness checks since the literature has little to say about this parameter. The autoregressive parameters for growth rates of zt and At are set to 0.5 and 0.9, respectively, which are consistent with the estimated values obtained in Schmitt-Grohe and Uribe (2012). This is summarized in Table ?? The relative importance of anticipated vs surprise shocks for each exogenous process is set consistent with estimates from Schmitt-Grohe and Uribe (2012). In particular, the standard deviations for the surprise components of zt and At are set to 0.21 and 0.65, respectively, while the corresponding standard deviations 47 for the (cumulative) anticipated components are 0.32 and 0.2. This implies the majority (60%) of variation in the growth rate of investment-specific technology is anticipated, while just under 25% of variation in TFP is anticipated. To develop a benchmark against which to compare the results from SP-learning, I linearize the temporary equilibrium around the non-stochastic steady state. The resulting system of first order expectational difference equations can be easily solved under rational expectations, and the resulting equilibrium is the REE. The Response to News I turn now to an exploration of the response to news shocks by key macroeconomic variables in the model. Figure 1 shows the responses by consumption, investment, hours worked, and output when a fully rational household learns at time t = 0 that there will be a 1 unit increase in the value of investment-specific or total factor productivity in time t = 3, and the expected innovation arrives as expected. In both cases the model generates positive comovement amongst all variables at the time the news is received: good news about the future causes all aggregate variables to rise. The total response to news about a shock to investment-specific technology zt is more subdued than that regarding a shock to TFP At, because the price of investment affects output only through its effect on capital accumulation while an increase in TFP directly increases output and factor prices everything else held constant. Interestingly this gives rise to a discrepancy in the relative impact of news on the overall response: most of the movement in key variables stemming from news about zt occurs in the period the news is received, while most of the movement from news about At occurs in the period the shock occurs. Clearly including additional exogenous processes will lead to a richer set 48 FIGURE 1. IRF for Accurate News of possible reactions to news of each; indeed Schmitt-Grohe and Uribe (2012) include five additional exogenous disturbances and allow agents to receive information of their future values, in which case one should no longer speak generically of “news”. Now suppose the news received by households turns out to be completely false. That is, at t = 0 households come to expect a 1 unit increase in the exogenous processes will occur in t = 3 which does not materialize. Figure 2 plots the IRF resulting from this thought experiment on rational households. Again, the receipt of news generates positive comovement in the key macroeconomic variables. However, once the news is shown to have been erroneously optimistic all variables tend back towards their initial steady state values. Contrary to the prediction of Beaudry and Portier (2004), in this model all variables remain above their steady-state levels for an extended period of time. Thus even in the face of incorrect news the positive comovement amongst variables remains is preserved. 49 FIGURE 2. IRF for Inaccurate News Simulation and Model Performance Having demonstrated the ability of the model to generate qualitatively realistic expectationally driven business cycles in response to news about the future I now turn to an evaluation of the model’s ability to generate quantitatively realistic empirical moments under SP-learning. I begin by determining whether the behavior of SP-learners will cause the economy to converge (approximately) to the REE. Figure 3 shows a simulation in which the initial beliefs of agents are perturbations of the linearized RE solution, the economy begins in steady state, and agents update their beliefs with a constant gain. For each variable the solid red line is the RE steady-state value, while the dashed line is the average of the actual variable realizations. 50 FIGURE 3. Model Simulation Under SP-learning Since households employ a linear forecasting model in a nonlinear environment, convergence to the rational expectations equilibrium is not to be expected. Indeed, Figure 3 provides strong evidence that the economy converges to some RPE - the key macroeconomic variables all fluctuate around some stationary value - but the implied RPE seems quite far from the REE as judged by the respective central tendencies of the variables. Tables 4 and 5 perform tests for equality of means for key model variables from simulating the model 1000 times each for 230 periods under varying assumptions regarding households expectations formation mechanism and their access to news. Table 4 displays the cross-sectional means of consumption, investment, hours worked, output, and the endogenous shadow prices under RE and SP-learning when agents receive news. Generally the cross-sectional means are higher under SP-learning than under RE, and the difference 51 RE Mean SPL Mean pval 95% Low 95% High Consumption 0.533 0.568 0.000 -0.036 -0.032 Labor Supply 0.200 0.212 0.000 -0.013 -0.012 Investment 0.154 0.172 0.000 -0.019 -0.018 Output 0.687 0.736 0.000 -0.052 -0.046 Investment SP 0.000 0.060 0.000 -0.061 -0.059 Capital SP 4.250 4.043 0.000 0.193 0.219 Habit-adjustment SP -154.107 -155.120 0.000 0.704 1.321 TABLE 4. t-tests for Data Generating Process, News, 230 Periods RE Mean SPL Mean pval 95% Low 95% High Consumption 0.534 0.563 0.000 -0.031 -0.027 Labor Supply 0.200 0.210 0.000 -0.011 -0.010 Investment 0.154 0.169 0.000 -0.016 -0.014 Output 0.688 0.728 0.000 -0.043 -0.038 Investment SP -0.000 0.034 0.000 -0.035 -0.034 Capital SP 4.248 4.068 0.000 0.167 0.192 Habit-adjustment SP -154.085 -154.949 0.000 0.558 1.170 TABLE 5. t-tests for Data Generating Process, No News, 230 Periods is highly statistically significant: the p-value for the t-test of mean equality across expectations formation mechanisms imply rejection of the null hypothesis that the means are equal at any reasonable level of significance. Put another way, I am able to reject the null hypothesis that the data comes from the same data generating process, which of course is true. Similar findings are found for the case where households do not receive news in Table 5. Again the null hypothesis that the simulated data came from the same data generating process can be rejected at any meaningful level of significance. While the discussion above suggests the rejection of mean equality should not be surprising, it is is worth noting each simulation is run for a relatively short amount of time. In Tables 6 and 7 I increase the simulation length to 1000 periods in order to allow the learning algorithm more time to converge. No significant changes arise: the RPE is statistically quite distinct from the REE. 52 A deeper inspection of the exact mechanisms driving this wedge between the RPE and REE may proceed from several points of observation. First, the data-generating process under SP-learning is inherently nonlinear. This contrasts with the REE which is obtained via a first-order approximation to the equilibrium dynamics of the model. Exploring higher-order approximations may shed light on precisely how important the nonlinearities are to the data-generating processes. Second, comparing tables 4 and 5 suggest that including anticipated shocks in agents’ information sets increases the size of the wedge between the RPE and REE. While the cross-sectional mean values under REE are the same whether news is included or not, those obtained under SP-learning are closer to that of REE when there is no news. This suggest something about the news pushes SP-learning household behavior away from that of their ration counterparts, which is especially interesting given the results of the previous chapter showing news shocks should not matter to the learnability of an REE. Finally, it appears that SP-learning agents systematically behave in such a way as to cause the shadow price of investment to be positive. This will occur if household behavior is such that the gross level of investment is increasing (It > It−1) and capital is being utilized at a rate above the rational expectations steady state (ut > 1), or if the opposite cases are true. It is telling that this is exactly the behavior prescribed by the model in the event that the household receives news about future productivity, and likely explains why news drives a wedge between RE and NRE: under NRE households may go through periods in which they respond too strongly (relative to RE) to news of the future. In contrast to Williams (2003a), where reduced-form learning converged to the REE and simulated data did not differ meaningfully whether agents were rational or boundedly rational, these results suggest that the convergence to the RPE should cause simulated business cycle statistics to differ from those obtained under RE. To explore this further, 53 RE Mean SPL Mean pval 95% Low 95% High Consumption 0.533 0.635 0.000 -0.103 -0.101 Labor Supply 0.200 0.238 0.000 -0.038 -0.037 Investment 0.154 0.192 0.000 -0.039 -0.039 Output 0.687 0.824 0.000 -0.138 -0.136 Investment SP -0.000 0.117 0.000 -0.118 -0.116 Capital SP 4.251 3.620 0.000 0.625 0.638 Habit-adjustment SP -154.125 -162.404 0.000 8.017 8.542 TABLE 6. t-tests for Data Generating Process, News, 1000 Periods RE Mean SPL Mean pval 95% Low 95% High Consumption 0.533 0.600 0.000 -0.068 -0.066 Labor Supply 0.200 0.223 0.000 -0.024 -0.023 Investment 0.153 0.177 0.000 -0.023 -0.023 Output 0.686 0.774 0.000 -0.088 -0.087 Investment SP -0.000 0.038 0.000 -0.038 -0.038 Capital SP 4.251 3.797 0.000 0.449 0.460 Habit-adjustment SP -154.070 -158.385 0.000 4.056 4.573 TABLE 7. t-tests for Data Generating Process, No News, 1000 Periods Figure 8 compares simulated data from the model with and without news under the assumption of RE and SP-learning against that of actual quarterly U.S. data. The data is from Jaimovich and Rebelo (2009) and covers the range 1944:Q1-2004:Q4. The logarithm of the simulated data is detrended by the Hodrick-Prescott (HP) filter using a smoothing parameter of 1600, consistent with quarterly data. I conducted 1000 simulations of 230 periods each, which is the length of the US data sample. I report the statistics for the cross-sectional average standard deviations of hours worked, investment, and consumption relative to that of output across all simulations along with the correlations between output and hours worked, investment, and consumption. US data suggests investment is the most volatile, while hours worked tends to be roughly as volatile as output and consumption is much smoother. Significant differences exist in the volatility of simulated data between RE and SP-learning, whether news 54 US Data RE (News) SPL (News) RE (No News) SPL (No News) σh/σY 0.968 0.714 0.716 0.714 0.715 σI/σY 3.103 2.386 2.167 2.282 2.023 σC/σY 0.712 0.737 0.706 0.746 0.725 corr(Y, h) 0.860 1.000 1.000 1.000 1.000 corr(Y, I) 0.890 0.850 0.879 0.922 0.912 corr(Y,C) 0.770 0.969 0.969 0.977 0.986 TABLE 8. Predicted Business Cycle Statistics shocks are included or not. All model simulations except SP-learning with news shocks suggest hours worked are much too smooth, and relative volatility of consumption under SP-learning is extremely close to the data. Only the specification with SP-learning and news shocks matches the stylized facts pertaining to the ordering of relative volatilities of investment, hours worked, and consumption. The simulated correlation between hours worked and output is unit, which does not match the data, but is expected: output and hours worked are perfectly correlated by assumption in the RBC model. The correlation between output and investment under SP-learning is closer to that of the data than under RE regardless of whether news is included or not, while both expectations formations mechanisms do a poor job capturing the correlation between output and consumption. Conclusion The analysis of this paper has shown the behavior implied by SP-learning households is quite distinct from that of their rational counterparts. While agents in this news-shock model can not converge to the exact REE, the process of making decisions based on linear forecasting rules of shadow prices and updating these forecasts via adaptive least squares leads agents to converge to an RPE. The difference between the implied RPE and the REE seem to be exacerbated by the inclusion of anticipated shocks in agents’ information sets. Furthermore, this behavior causes the model to generate simulated moments for key 55 business cycle statistics which are quite distinct from those obtained under RE. In some cases the model under SP-learning appears to better approximate US data, while in others the RE approach is more accurate. One interesting extension would be to consider informational heterogeneity in this environment. Angeletos and La’O (2010) considers the qualitative and quantitative effects of informational heterogeneity in a Lucas-Phelps “island” model where households on a continuum of islands receive public and private signals about the productivity of aggregate and island-specific productivity. They are somewhat surprised to find that information heterogeneity does not have any serious effect on welfare or quantitative features of the model. But Hellwig (2010) observes this goes back to an insight going back to Hayek (1945) suggesting that markets parsimoniously convey all relevant information through prices, and thus heterogeneity by itself is not likely to represent a significant source of amplification or persistence in simulated data. However, SP-learning introduces a behavioral friction in the way agents utilize information which is likely to be exacerbated by informational frictions stemming from heterogeneity. While it is purely speculative, I suspect SP-learning (and potentially other forms of bounded rationality) may provide a simple means of making informational heterogeneity matter in DSGE models. 56 CHAPTER IV LEARNING VS NEWS: WHAT DRIVES BUSINESS CYCLES? Introduction Having established in Chapter II that news shocks do not necessarily impinge upon the E-stability properties of a REE, and having conducted a calibrate and simulate exercise in Chapter III to understand the interaction between information flows, bounded rationality, and restricted perceptions equilibrium, I turn now to an empirical exploration of the relationship between news shocks and expectation formation. The central goal of this chapter is to determine whether the estimated importance of news - measured as the relative contribution of anticipated shocks versus surprise shocks in generating macroeconomic volatility - is affected by the way in which agents are assumed to form their expectations. In what is widely regarded to be the seminal work in the modern empirical news shock literature, Beaudry and Portier (2006) explores the relationship between stock prices, the growth rate of technology, and business cycles using several specifications of VARs. They find evidence that innovations to total factor productivity (TFP) are often anticipated, and further that responses to this anticipation are responsible for a large fraction of business cycle fluctuations. This contrasts with the typical RBC view of technological innovation by surprise which had begun to be questioned as early as Basu et al. (2004) in which improvements to technology are shown to typically have no contemporaneous impact on output. 57 Beaudry and Lucke (2010) allows news shocks to compete with more traditional sources of volatility in a VAR identified using short-run and long-run timing assumptions and find that surprise shocks play little role in economic volatility compared to news shocks. Barsky and Sims (2012) allow traditional news shocks to compete with “animal spirit shocks” a la Lorenzoni (2009). The authors conclude that much of the innovation in measured consumer confidence is due to traditional news shocks and that most of the relation between changes in consumer confidence and economic activity are due to news. In Beaudry et al. (2011) the focus is on what the authors refer to as “optimism shocks” which are essentially sunspots that exogenously shift expectations. These shocks are identified using sign restrictions and are shown to permanently affect current and future economic activity in a way strongly resembling news shocks. The narrative approaches of Alexopoulos (2011) and Ramey (2011) have similarly found support for the notion that news is an important determinant of volatility in aggregate economic variables. Schmitt-Grohe and Uribe (2012) modifies the well-known RBC-type news-shock model of Jaimovich and Rebelo (2009) to include seven exogenous variables which are subjected to both anticipated and unanticipated shocks. Using likelihood-based classical and Bayesian estimation techniques they find anticipated shocks account for about half of observed volatility in output and consumption. Conversely, Khan and Tsoukalas (2012) allow news shocks to compete against traditional sources of macroeconomic volatility in a small-scale monetary RBC model similar to that considered in Smets and Wouters (2007). Their results imply anticipated shocks are responsible for only 15% of volatility in output. These strongly contrasting results highlight the preeminent role of model specification in DSGE estimation. While much of the empirical news shock literature has approached the topic from a reduced form or structural VAR perspective, Blanchard et al. (2013) and Beaudry and 58 Portier (2014) point out estimation via VARs in news-rich environments may suffer from a nonfundamentalness problem. Alternatively, one may estimate the relative importance of anticipated shocks in a news-rich dynamic stochastic general equilibrium (DSGE) model. However, the construction of such a model requires the modeler to make a number of assumptions. One of the most important is the way in which agents’ expectations are formed, and all previous authors on the subject have utilized the rational expectations hypothesis. While much early work on adaptive learning focuses on whether particular rational expectations equilibrium (REE) are expectationally stable (E-stable), Williams (2003a) and Adam (2005) each explores the impact of adaptive learning on macroeconomic volatility and persistence in calibrated and simulated DSGE models.1 Their results suggest that relaxing rational expectations (RE) in favor of an adaptive learning approach not only increases persistence and volatility in key macroeconomic variables, but may also provide a better fit to the data than models based on RE. More recently, adaptive learning has been used in estimated DSGE models. Milani (2007) estimates a monetary-NK DSGE model in which agents are assumed to have what he refers to as “near-rational expectations” (NRE), that is, they form expectations using a correctly specified forecasting model but are unsure of the precise coefficients governing the laws of motion of the economy.2 In stark contrast to the results obtained under RE, the model under NRE simultaneously fits the data better and ascribes little importance to mechanical sources of volatility and persistence such as habit formation 1A REE is said to be E-stable if the beliefs of boundedly rational agents converge (asymptotically) to those of fully rational agents. Besides reflecting the sensitivity of models to the specific assumptions placed on agents’ expectation formation mechanisms and serving as an asymptotic justification for the use of RE, E-stability has been proposed as a selection criterion for indeterminate models. 2This is also referred to as “Euler equation learning” because agents are learning about endogenous variables which appear in the Euler equations of otherwise rational agents. 59 and inflation indexation. Similar results obtain in e.g. Milani and Rajbhandari (2012) and Slobodyan and Wouters (2012) in various DSGE models and under various forms of bounded rationality. Relatedly, Eusepi and Preston (2011) show that periods of optimism and pessimism arising from the erroneous forecasts (and subsequent updating of beliefs) of boundedly rational agents in an otherwise standard stochastic growth model generate the same type of expectationally driven business cycles as those found in news-shock models. This observation presents a natural question: to what extent are estimates of the importance of anticipated shocks impacted by the specific assumptions governing agents’ expectation formation mechanism? Do news shocks serve as proxies for an underlying learning process in the data? Or could it be that the combination of news shocks and adaptive learning could actually enhance the relative importance of anticipated shocks by exacerbating the waves of optimism and pessimism? My main results from this chapter suggest the importance of news is robust to the inclusion of adaptive learning. In particular, the relative importance of news is estimated to be up to thirty-five percent greater for key macroeconomic variables such as output, consumption, investment, and hours worked under NRE than under RE. However, these differences arise from movement in endogenous variables caused by the arrival of the anticipated event as opposed to movement caused by the anticipation itself, what Sims (2016) refers to as the impact of “realized news” and “pure news”, respectively. These findings speak to current differences in structural estimates of the importance of news, which appear to be sensitive to the structure of the model considered. Since parameter estimates obtained from DSGE models under learning often attenuate the importance of specific mechanical sources of volatility and persistence relative to RE, this 60 study will clarify whether the importance of news is also sensitive to specific assumptions governing expectation formations. Furthermore, it provides additional evidence regarding the merits of using adaptive learning in empirical applications. Despite a preponderance of evidence suggesting its implausibility as a description of individual or aggregate behavior, RE is still the paradigm for describing expectations in dynamic macroeconomic models.3 It is noteworthy that while the present study suggests the model under learning and RE produce similar model predictions for volatility and persistence, the models produce dissimilar estimates regarding the relative importance of news, implying forecasts generated from the underlying models may be quite different. The chapter proceeds as follows. I first describe the news-shock RBC-type model which is to be estimated. Next, I describe the specific assumptions made for agent- level expectation formation. This is followed by a detailed description of the estimation methodology and the presentation of the main results for the chapter. The chapter concludes with a discussion of the main results which attempts to place them in the context of recent empirical macroeconomic literature. The Model The economy considered is that of Schmitt-Grohe and Uribe (2012) which itself augments the news-shock RBC model of Jaimovich and Rebelo (2009) considered in the previous chapter. Aggregate demand consists of a government which levies lump- sum taxes to finance expenditure and a representative household making the usual 3See e.g. Assenza et al. (2014) for a summary of the state of learning-to-forecast and learning-to- optimize laboratory experiments, which have largely failed to find support for RE as a description of aggregate behavior. 61 consumption/savings and labor supply decisions while also choosing the rate at which to utilize their existing capital stock. Adjustments to gross investment are assumed to be costly. In addition, household preferences incorporate direct and indirect internal habit- formation. Aggregate supply consists of competitive firms which produce a fungible final good from labor supplied through monopolistically-competitive labor unions and physical capital rented from the household. The model features seven exogenous variables which have been shown in the literature to be empirically important mechanical sources of volatility and persistence. Finally, each of the exogenous variables are subjected to disturbances in the form of anticipated and unanticipated exogenous stochastic white-noise shocks, that is, news and surprise shocks. Households The representative household maximizes lifetime utility which is given by E0 ∞∑ t=0 βtζtU(Vt) (4.1) where β ∈ (0, 1) the household’s discount factor, ζt a stationary exogenous stochastic time- preference shock, and U the period utility function which takes the CRRA form U(Vt) = V 1−σt − 1 1− σ (4.2) 62 where σ > 0 the inverse intertemporal elasticity of substitution. The argument of the utility function is given by Vt = Ct − bCt−1 − ψhθtSt (4.3) where Ct and ht are private consumption and hours worked, respectively. b ∈ [0, 1) controls the degree of internal habit formation, ψ > 0 scales the disutility of labor supply, and θ > 1 governs the Frisch elasticity of labor supply.4 St is a geometric average of current and past habit-adjusted consumption and evolves according to St = (Ct − bCt−1)γ S1−γt−1 (4.4) where γ ∈ (0, 1] governs the magnitude of the wealth elasticity of labor supply. This preference specification was first used in Jaimovich and Rebelo (2009). The comovement problem in RBC models is partially caused by the wealth effect of labor supply dominating the substitution effect, thereby causing consumption and hours worked to move in opposite directions upon receipt of good news about the future. The JR specification for utility allows the modeler to change the strength of the wealth effect of labor supply by calibrating γ. Households are assumed to own and rent physical capital to firms in a competitive market. The capital stock accumulates according to Kt = (1− δ (ut))Kt−1 + zIt It [ 1− Φ ( It It−1 )] (4.5) 4In the special case where there is no internal or external habit formation i.e. b = γ = 0, the Frisch labor supply is ηλ = 1θ−1 , hence the restriction that θ > 1 63 where Kt−1 the predetermined capital stock determined in period t − 1, It gross private investment, and zIt a stationary exogenous stochastic shock to the technology for converting investment into installed capital. The function Φ (·) imposes an investment-adjustment cost on households while δ (·) implies capital depreciation is a function of the current-period capital utilization rate ut. Both functions are increasing, convex and take quadratic functional forms. In particular we have δ(ut) = δ0 + δ1(ut − 1) + δ2 2 (ut − 1)2 (4.6) where δ0 > 0 the steady-state depreciation rate, δ1 > 0 determines the steady-state value of ut, and δ2 > 0 captures the rental rate elasticity of capacity utilization. The time- varying investment-adjustment cost function is given by Φ ( µIt ) = κ 2 ( µIt − µI )2 (4.7) consistent with Christiano et al. (2005), where µIt ≡ ItIt−1 the growth rate of investment with steady state µI and κ > 0 a scaling parameter which implies the growth rates of investment and output are autocorrelated. This specification implies Φ ( µI ) = Φ′ ( µI ) = 0, i.e. there are no investment-adjustment costs on a balanced growth path, and κ = Φ′′ ( µI ) . The household receives labor income from supplying ht hours at rate W ∗ t , rental income from from renting utKt−1 units of effective capital at gross rental rate Rt, and lump-sum dividends from labor-union membership and firm-profits of Πt. The household uses this income to pay lump-sum taxes Tt and purchase private consumption and gross 64 private investment. The flow budget constraint is thus given by Ct + AtIt + Tt ≤ W ∗t ht +Rt (utKt−1) + Πt (4.8) where At a difference-stationary exogenous stochastic process representing the current state of technology for producing investment goods from consumption goods with growth rate µAt ≡ AtAt−1 . In a decentralized equilibrium At may be interpreted as the relative price of investment goods in terms of consumption goods; that is, a unit of the investment good may be exchanged for At units of the consumption good. Labor Market Households are assumed to have market power, albeit in an indirect way. There is a continuum of monopolistically competitive labor unions which households supply labor to, and these unions negotiate wages with final goods producers. The market power of these unions is captured by a stationary exogenous stochastic process ωt, which can be interpreted simply as a time-varying markup in wages. The solutions to the final-good producers’ cost-minimization and unions’ profit-maximization problems together imply all unions charge the same price for labor services and pay the same wage to member- households, and hence the wage received by households is given by W ∗t = Wt/ωt where Wt the price charged by unions to firms for labor services and µ W t ≥ 1 the exogenously time-varying wage markup which controls the level of market power the labor unions have with steady-state value µW . This friction drives a wedge between the wage paid by firms and that received by workers. Profits of the labor unions are remitted to all member households as a lump-sum dividend. Details of the solution to the optimization problems of firms and unions are presented in Appendix C. 65 Firms, Government, and Market Clearing The representative firm contracts ht worker-hours and rents utKt−1 units of effective capital to produce a final good Y outt using CRTS technology according to Y outt = zt (utKt−1) αk (Xtht) αh (XtF ) 1−αk−αh (4.9) where the parameters αk, αh ∈ (0, 1) govern the steady-state output shares of effective capital and effective labor, respectively, and satisfy αk + αh ≤ 1. F is a fixed factor (e.g land or managerial capital) which introduces production exhibits diminishing returns to scale in effective capital and labor while ensuring production exhibits overall CRTS. Production is augmented by a stationary exogenous stochastic Hicks-neutral productivity shock zt and a difference-stationary exogenous stochastic Harrod-neutral (i.e. labor- augmenting) productivity shock Xt with growth rate µ X t = Xt Xt−1 and steady-state growth µX . The market for effective capital is competitive and hence the gross rental rate equals the value of the marginal product of effective capital Rt = αk Y outt utKt−1 (4.10) and the wage paid by firms to labor unions is similarly equal to the value of the marginal product of labor Wt = αh Y outt ht (4.11) 66 The final good is fungible and may be used for private consumption, private investment, or government spending Gt, and hence the aggregate resource constraint is Y outt = Ct + AtIt +Gt (4.12) where Y outt is total output. Because At and Xt are non-stationary, output displays a stochastic trend which can be shown to be XYt = XtA αk/(αk−1) t . The government maintains a balanced budget such that Gt = Tt. Furthermore, government spending exhibits a stochastic trend XGt = ( XGt−1 )ρxg ( XYt−1 )1−ρxg which co-integrated with the trend in output so that the share of government spending in output is stationary; however ρxg ∈ [0, 1) allows the trend to be smoother than that of output. Detrended government spending gt ≡ Gt/XGt is a stationary exogenous stochastic process with steady state g. Since XGt is predetermined, and since detrended government spending is stationary, the current level of government spending is independent of contemporaneous changes to exogenous variables; however, these may affect government spending with a lag via corresponding changes to the trend path of output. Temporary Equilibrium The temporary equilibrium for this economy obtains where all households maximize utility given expectations, all firms maximize profits, and all markets clear. This can be 67 described the nonlinear system of expectational difference equations Kt = (1− δ (ut))Kt−1 + zIt It [ 1− Φ ( It It−1 )] (4.13a) Y outt = Ct + AtIt +Gt (4.13b) Y outt = zt (utKt−1) αk (Xtht) αh (XtF ) 1−αk−αh (4.13c) St = (Ct − bCt−1)γ S1−γt−1 (4.13d) Vt = Ct − bCt−1 − ψhθtSt (4.13e) Λt = ( ζtV −σ t − γSt Ct − bCt−1 Πt ) − bβEˆt [( ζt+1V −σ t+1 − γSt+1 Ct+1 − bCtΠt+1 )] (4.13f) Λt = θζtV −σ t ψh θ−1 t St Wt/ωt (4.13g) Πt = ζtV −σ t ψh θ t + βEˆt [ (1− γ)St+1 St Πt+1 ] (4.13h) QtΛt = βEˆtΛt+1 [(Rt+1ut+1 +Qt+1(1− δ(ut+1)))] (4.13i) Rt = δ ′(ut)Qt (4.13j) AtΛt = QtΛtz I t ( 1− Φ ( It It−1 ) − Φ′ ( It It−1 )( It It−1 )) (4.13k) + βEˆt [ Qt+1Λt+1z I t+1Φ ′ ( It+1 It )( It+1 It )2] (4.13l) Wt = αh Y outt ht (4.13m) Rt = αh Y outt utKt−1 (4.13n) where Λt the marginal value of income, Πt the marginal value of household’s stock of current and past habit-adjusted consumption (i.e. the Lagrange multiplier from relaxing the constraint on St), and ΛtQt the marginal value of pre-installed capital. Qt can thus be interpreted as marginal Tobin’s Q. Eˆt denotes the (possibly non-rational) expectations of the household given their time t beliefs and information set. The temporary equilibrium 68 can be concisely described by Eˆt [f (Yt+1, Yt, Yt−1, νt)] = 0 (4.14) where Yt and νt are collections of endogenous and exogenous variables and exogenous stochastic white-noise shocks, respectively. Stationarity, Information Flows, and the Log-linearized System Because At and Xt are non-stationary the system (4.14) must be transformed to be in terms of stationary variables. The details of this transformation and the resulting stationary nonlinear system of expectational difference are presented in Appendix C. Letting Yˆt be a collection of (detrended) endogenous and exogenous variables, the resulting stationary system can be concisely describe as Eˆt [ f ( Yˆt+1, Yˆt, Yˆt−1, νt )] = 0 (4.15) Yˆt includes the seven stationary exogenous variables x = { z, ζ, zI , g, ω, µA, µX } which are assumed to follow AR(1) processes such that lnxt = ρx lnxt−1 + 0x,t where ρx ∈ (0, 1). In addition to an unanticipated or “surprise” shock, households receive information about each of the exogenous variables 4 and 8 periods in advance. These news shocks can be parsimoniously modeled by introducing auxiliary variables εnx,t for 69 n = 0, 1, ..., q where q the longest forecasting horizon and ε0x,t = ε 1 x,t−1 + σ 0 xν 0 x,t ε1x,t = ε 2 x,t−1 ε2x,t = ε 3 x,t−1 ε3x,t = ε 4 x,t−1 ε4x,t = ε 5 x,t−1 + σ 4 xν 4 x,t ε5x,t = ε 6 x,t−1 ε6x,t = ε 7 x,t−1 ε7x,t = ε 8 x,t−1 ε8x,t = σ 8 xν 8 x,t where νkx,t for k = 0, 4, 8 are assumed to be Gaussian white-noise shocks with unit variance and effective standard deviation σkx. Repeated substitution of the auxiliary variables implies ε0x,t = σ 0 xν 0 x,t + σ 4 xν 4 x,t−4 + σ 8 xν 8 x,t−8, and hence household expectations are given by Eˆtε 0 x,t+n =  0 if n > 8 σ8xν 8 x,t+n−8 if 8 ≥ n > 4 σ4xν 4 x,t+n−4 + σ 8 xν 8 x,t+n−8 if 4 ≥ n > 1 i.e. household’s expectations of future exogenous variables are constantly updated as news shocks flow into their information sets. 70 While the anticipated components do not affect contemporaneous economic fundamentals, the information reveals something about the future state of the economy. Households incorporate this information into their decision rules, and it is precisely the fact that news shocks affect various endogenous variables in different ways at different times which allows the econometrician to identify the effect of a particular news shock.5 Note that the inclusion of such news shocks implies no departure from rational expectations: The anticipated shocks are fundamental to the model, and hence rational households must incorporate them into their forecasts of the future. The system describing the informational flow structure can be represented by collecting auxiliary variables into vectors εxt = ( ε0x,t, ε 1 x,t, ..., ε 8 x,t )′ , the Gaussian white-noise shocks into vectors νxt = ( ν0x,t, ν 1 x,t, ..., ν 8 x,t )′ , and writing wt = ϕwt−1 +Mνt (4.16) where wt = ( εzt , ε ζ t , ε zI t , ε g t , ε ω t , ε µA t , ε µX t )′ and νt = ( νzt , ν ζ t , ν zI t , ν g t , ν ω t , ν µA t , ν µX t )′ . Given the assumed structure of information flows, ϕ is an upper-shift matrix with 1’s on the super- diagonal and zeros elsewhere while M is simply a sparse matrix containing the effective standard deviations of the anticipated and unanticipated shocks.6 Following the majority of applied structural macroeconomic literature, I log-linearize the stationary system (4.15) around its non-stochastic steady state. Denote by y˜t the collection of endogenous and exogenous variables in terms of their percent-deviation from 5See Schmitt-Grohe and Uribe (2012) or Beaudry and Portier (2014) for more on the issue of identification in news-shock DSGE models. 6In what follows I assume the exogenous stochastic shocks are pure white-noise and hence are uncorrelated across forecasting horizons and across exogenous variables. It would be simply to modify these assumptions, however, and would amount to appropriately changing ϕ and M to capture the new relationships. 71 non-stochastic steady state. Then the log-linearized system of equations, including those defining the information flow structure, can be written compactly as y˜t = α + βy˜t−1 + χwt + δEˆty˜t+1 (4.17a) wt = ϕwt−1 +Mνt (4.17b) A variety of well-known solution techniques exist for obtaining the solution to a linear system of first-order expectational difference equations like that defined by equations (4.17a) and (4.17b) under the assumption of rational expectations e.g. Blanchard and Kahn (1980), Uhlig et al. (1995), Klein (2000), or Sims (2002). However, the objective of this chapter is to compare and contrast the estimated importance of anticipated shocks when agents are assumed to be rational versus when the rational expectations hypothesis is slightly relaxed. Thus, the next section describes the assumed expectation formation mechanism and adaptive learning process for the boundedly rational agents. Adaptive Learning As is clear from the temporary equilibrium described by (4.17a), household actions are conditional on their expectations of the future state of the economy. I depart from RE by assuming households behave as econometricians equipped with a forecasting model. Their beliefs, referred to as the perceived law of motion (PLM), take the form of the minimum-state variable (MSV) solution obtained under rational expectations; that is, agents form forecasts according to y˜t = at−1 + bt−1y˜t−1 + ct−1wt wt = ϕwt−1 +Mνt (4.18) 72 This model is “correct” in the sense that its structure is consistent with the REE; there are no omitted or included irrelevant explanatory variables.7 However, the exact values may be different from those implied by the REE. The model is closed by assuming households update their beliefs over time according to the constant-gain least squares (CGLS) algorithm. Denoting household beliefs as ξ′t = (at, bt, ct) we have ξt = ξt−1 + g¯Rˆ−1t Zt−1 ( y˜t − ξ′t−1Zt−1 )′ (4.19) Rˆt = Rˆt−1 + g¯ ( Zt−1Z ′t−1 − Rˆt−1 ) (4.20) where the data used to forecast is captured by Z ′t−1 = ( 1, y˜′t−1, w ′ t ) and g¯ the “constant- gain” parameter which describes the relative importance of recent forecast errors. The system (4.19)-(4.20) defines a household’s time t estimate of the coefficients ξt and the matrix of second-moments Rˆt as a weighted average of their previous estimates and the new information contained in the forecast error t = y˜t − ξ′t−1Zt−1. Note that setting g¯ = t−1 would result in agents using the well-known recursive least squares (RLS) algorithm for updating beliefs; furthermore, if the REE is expectationally stable (E-stable) then it obtains as a special limiting case where t→∞ and g¯→ 0. 7I am implicitly assuming household’s understand the informational flow structure; that is, they understand which news is relevant for which exogenous variables and that they receive news 0, 4, and 8 periods in advance. This is an intuitively appealing assumption, and also a weak one, given that the lack of a feedback-loop implies agents would very quickly learn the true laws of motion governing the evolution of this exogenous system. 73 Given the PLM described by equation (4.18) and assuming agents do not observe the contemporaneous value of endogenous variables i.e. Eˆty˜t 6= y˜t8, expectations are given by Eˆty˜t = at−1 + bt−1y˜t−1 + ct−1wt (4.21) Eˆty˜t+1 = at−1 + bt−1Eˆty˜t + ct−1ϕwt = [(I + bt−1) at−1] + b2t−1y˜t−1 + [bt−1ct−1 + ct−1ϕ]wt (4.22) which can be substituted into the system (4.17a) to yield the actual law of motion (ALM) for the economy y˜t = [α + δ (I + bt−1) at−1] + [ β + δb2t−1 ] y˜t−1 + [χ+ δ (bt−1ct−1 + ct−1ϕ)]wt wt = ϕwt−1 +Mνt (4.23) Equation (4.23) emphasizes the feedback mechanism implied by adaptive learning: household perceptions of the economy - captured by at−1, bt−1, and ct−1 - directly impact the actual state of the economy through their influence over household decision making. Collecting the endogenous, exogenous, and auxiliary variables into a vector St = (y˜t, wt)′, the ALM for the economy can be written St = At + FtSt−1 +Gtνt (4.24) where νt i.i.d∼ N (0, I) and At, Ft, and Gt are time-varying coefficient matrices formed from deep structural parameters of the economy and household beliefs. The central goal of this chapter is to compare estimates of the deep parameters obtained under RE against those 8This distinction does not matter under RE as all agents are endowed with model-consistent expectations, and at first blush may appear strange in the context of a representative agent model. However, it is a natural assumption to make in general equilibrium settings where individual agents must forecast the aggregate behavior of all other agents to determine their individual behavior. 74 obtained under near-rational expectations, specifically those corresponding to the effective standard deviation of the anticipated and unanticipated shocks. Estimation Methodology Following the recent literature in estimating DSGE models with learning e.g. Milani (2007), Slobodyan and Wouters (2012), and Milani and Rajbhandari (2012), I choose to estimate the parameters of (4.27) using Bayesian Markov Chain Monte Carlo (MCMC) methods. In the present context there are three main advantages to Bayesian versus classical maximum likelihood estimation. First, as described in An and Schorfheide (2007), maximum likelihood procedures do not allow an econometrician to avoid estimates which are known to be unlikely given previously acquired institutional knowledge. Bayesian estimation allows for the augmentation of the data through a joint prior distribution which re-weights the likelihood. In this way the econometrician can easily and transparently allow information which is not contained in the data to have an impact on the estimated results. Second, Bayesian methods result in an entire posterior distribution, as opposed to point estimates obtained under maximum likelihood. This emphasizes the fact that our models merely approximate the economy by allowing the econometrician to report results in terms of probabilities rather than a single “true” parameter value. This facilitates an honest communication of results in line with the norms in a variety of disciplines ranging from political science to meteorology. Finally, the likelihood functions associated with DSGE models can present major problems for optimization routines. Because these models are approximations of a data 75 generating process, the corresponding likelihood functions often feature many peaks and cliffs. MCMC approaches such as the Random-Walk Metropolis Hastings (RWMH) algorithm are robust to these topographic challenges as their success derives precisely because they explore the entire parameter space, thereby eliminating the well-known problem of Newtonian optimizers getting stuck in inappropriate spaces due to initial starting values. The object of interest is the posterior distribution p(Θ|Y T ) which describes the probability of a particular parameter constellation Θ given data Y T . The posterior distribution is obtained through Bayes’ Law as p(Θ|Y T ) ∝ p(Y T |Θ)p(Θ) (4.25) where the likelihood function p(Y T |Θ) captures the probability of observing the data in Y T given Θ and the prior distribution p(Θ) summarizes the econometrician’s a priori knowledge of the parameters Θ. In general the posterior distribution does not take any known form, and hence I utilize the RWMH algorithm to sample from the target distribution. Because the resulting sequence of random samples is a Markov chain, the proposal distribution should be chosen to ensure proposals are accepted neither too often nor too infrequently. If samples are accepted too frequently then the sequence will be highly serially correlated; if samples are accepted too infrequently than the algorithm will fail to explore the entirety of parameter space. But choosing a proposal distribution to yield an acceptance rate which satisfies this criteria can be extremely difficult, especially in high- dimensional settings, because the various elements may behave in strikingly different ways. 76 To overcome this challenge I utilize the Log Adaptive Metropolis (LAP) procedure of Shaby and Wells (2010). In addition to recursively updating the variance-covariance matrix of the proposal distribution vis-a-vis the adaptive Metropolis algorithm of Haario et al. (2001), LAP multiplicatively updates the tuning parameter proportionally to the deviation of the actual acceptance rate from its target. This results in a proposal distribution which quickly establishes an acceptance rate in close proximity to the desired one, while also respecting the relationships between estimated parameters revealed by the history of draws. The Observables To estimate the DSGE model given by (4.24) I must specify a link between the unobserved state-variables of the model and their observed empirical counterparts. I utilize quarterly U.S. data provided by Schmitt-Grohe and Uribe (2012) on the demeaned growth rates of real per-capita output, consumption, investment, government expenditure, and hours worked along with the growth rates of total factor productivity (TFP) and the relative price of investment denoted gYt , g C t , g I t , g G t , g h t , g TFP t , and g PI t , respectively. I use this data to facilitate comparisons between my results and previous estimates of the relative importance of anticipated shocks. The data runs from 1955:Q2 to 2006:Q4 which corresponds to 207 observations.9 The growth rate of real per-capita output is assumed to be measured with error, which is required by the fact that the RBC model implies a linear restriction between these seven 9Because the model does not feature structural change I am implicitly assuming the nature of “news” has remained the same over the entire period. This may not be an entirely satisfying assumption, and future work will examine the impact of splitting the sample a la Beaudry et al. (2011) in which optimism shocks are found to be much more important in their post-1983 subsample than in their pre- 1983 subsample. 77 observables which is not satisfied by the data.10 Formally, the observable variables are linked to the non-stationary variables from (4.13a)-(4.13n) by  gYt gCt gIt ggt ght gTFPt gPIt  =  ∆ log (Yt) ∆ log (Ct) ∆ log (AtIt) ∆ log (Gt) ∆ log (ht) ∆ log ( ztX 1−αk t ) ∆ log (At)  +  σYme 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0   eYt eCt eIt egt eht eTFPt ePIt  or concisely as Yt = HSt +Det (4.26) where et i.i.d∼ N (0, I) a vector of white-noise measurement error shocks and σYme the effective standard deviation of measurement error in output growth. H is a selection matrix of which defines the observed variables in terms of the unobserved states. The system of transition equations (4.24) together with the system of measurement equations (4.26) imply estimation can be conducted by analyzing the (potentially time-varying) state-space model St = At + FtSt−1 +Gtνt Yt = HSt +Det (4.27) 10This is the well-known stochastic singularity problem in estimating linearized DSGE models is called. See e.g. Ruge-Murcia (2007) for more on this and its impact on estimation. 78 which is a linear-Gaussian system in the econometric-state variables St and variables with directly observed counterparts Yt. The Likelihood Function Given the state-space model in (4.27), the value of the likelihood function evaluated at a given parameter constellation Θ can be easily calculated using the Kalman Filter. The introduction of adaptive learning requires the coefficient-matrices of the transition equations to be updated using estimates of the state variables based on the CGLS algorithm. I follow Slobodyan and Wouters (2008) and use the simple filtered estimates for the states rather than the smoothed estimates. The additional updating step presents two potential computational difficulties. First, agents updated estimates may imply the system is non-stationary; thus I incorporate a projection facility as in Marcet and Sargent (1989a) which causes agents to skip their updating step when they realize their new estimates are non-nonsensical. Second, MSV learning in linearized-DSGE models can produce estimated second-moment matrices with very small minimum eigenvalues due to perfect-multicollinearity from the models’ implied linear restrictions; hence, I incorporate a ridge correction mechanism as in Slobodyan and Wouters (2012) which conditionally adds an arbitrarily small constant to the diagonal of the estimated second-moment matrix.11 Provided initial values for the states S0|0, the estimated mean square error (MSE) matrix P0|0, beliefs for the second moment matrix Rˆ0, and beliefs for the coefficient matrices x0 = (a0, b0, c0), the Kalman Filtering routine proceeds by repeating 5 steps 11I set the value of this small ridge to be 1e− 6 times a conformable identity matrix. 79 1. Calculate the a priori estimates of the time t states St|t−1 and MSE matrix Pt|t−1 2. Use these estimates to produce a priori estimates of the time t observables Yt|t−1 and corresponding variance-covariance matrix Ωt|t−1 3. Calculate the a posteriori time t value of the (log) likelihood function 4. Update the estimated coefficient matrices At, Ft, and Gt implied by agents updating their beliefs Rˆt and ξt according to the CGLS algorithm 5. Calculate the a posteriori estimates of the time t states St|t and MSE matrix Pt|t for t = 1, 2, ...T . The projection facility and ridge-correction mechanism are conditionally applied during step 4, which does not exist under RE. Initial values for the states and estimated MSE matrix are set to the unconditional mean and variance-covariance matrix of the states, respectively, while initial beliefs are assumed to be equal to the REE. Since the REE is E-stable, this implies agents’ estimates will fluctuate around the REE, thereby facilitating a simple comparison between the estimated posterior densities. The Prior The parameter vector Θ contains all of the parameters of the model, some of which are calibrated. In particular, the discount factor β is set to 0.99 implying a steady-state quarterly real-interest rate of 1% if δ1 = δ2 = 0. σ is set to 1 to yield logarithmic utility while αk and αh are set to 0.225 and 0.675, respectively, implying mild decreasing returns to scale. δ0 is the steady-state depreciation rate and is set to 2.5 percent. The fixed factor F and steady-state values for the Hicks-neutral technology shock z, preference shock ζ, and investment-specific technology shock zI are normalized to 1. The steady-state value 80 Calibrated Parameters Parameter Values Description β 0.99 Discount factor σ 1 Logarithmic Utility αk 0.225 Effective Capital Share of Output αh 0.675 Effective Labor Share of Output δ0 0.025 Steady-state Depreciation Rate F 1 Fixed Factor z 1 Hicks-neutral technology shock ζ 1 Preference shock zI 1 Investment-specific technology shock ω 1.15 Wage-markup shock µA 0.9972 Growth rate, Relative Price of Investment shock µX 1.0032 Growth rate, Harrod-neutral technology shock h 0.2 Steady-state Hours Worked u 1 Steady-state Capacity Utilization Rate TABLE 9. Calibrated Parameters for Estimated One-sector Model for the wage-markup shock ω is set to 1.15 and g is set such that government’s share of output is 0.20 in steady state. The steady-state values of the growth of Harrod-neutral technology µX and the relative price of investment µA are set to 1.0032 and 0.9957, respectively, to match the long-run averages implied by the data. Finally, δ1 and ψ are calibrated to ensure capacity utilization and hours worked in steady state are 1 and 0.2, respectively. This information is summarized in Table 9. Each of the remaining parameters which are to be estimated require an prior distribution. These priors may reflect knowledge about the parameters obtained from previous studies, and often help ensure the estimation routine focuses on “sensible” values e.g. ensuring standard deviations are non-negative. I choose my priors to be consistent with those in Schmitt-Grohe and Uribe (2012). In particular, the parameter θ governing the Frisch elasticity of labor supply is distributed Gamma with mean 5 and standard deviation 2, the curvature of the depreciation function δ2 is distributed inverse- Gamma with mean 0.01 and standard deviation 0.05, and the curvature of the investment 81 adjustment cost function κ is distributed Gamma with mean 10 and standard deviation 2. These correspond to rather loose priors, and the inverse-Gamma on κ ensures it will be strictly positive. The standard deviations for the structural shocks are assumed to follow Gamma distributions to allow for the possibility that some shocks are completely unimportant i.e. have standard deviations equal to zero. The surprise shocks and output’s measurement error have mean and standard deviation equal to 0.1, and the anticipated shocks have mean and standard deviation such that surprise shocks account for 50% of total variance in each exogenous variables at the prior. The internal habit adjustment parameter b as well as the autoregressive parameters for each shock ρx are assumed to follow Beta distributions with mean 0.5 and standard deviation 0.2 which correspond to hump-shaped prior distributions over the unit interval. The habit-adjustment term γ from the JR preferences is uniformly distributed over the unit interval. Finally, the constant-gain parameter g¯ is distributed Gamma with mean and standard deviation 0.035, which is consistent with priors used in recent learning-estimation literature while placing non-trivial probability at near-zero values. Main Results Table 10 reports the median and 95% highest posterior distribution interval (HPDI) implied by the priors and 500,000 draws from the estimated posterior distribution under RE and NRE. The most notable differences between models occur with respect to the estimated standard deviations for the wage-markup shock (σ0ω,σ 4 ω,σ 8 ω) and the growth rate of the relative price of investment (σ0µA ,σ 4 µA ,σ 8 µA). Under RE, surprise shocks are responsible for about half of the variance in the wage-markup shock at the posterior median, while under NRE they are responsible for almost none. A similar shift occurs 82 Prior and Posterior Distribution for Estimated Parameters Prior Posterior - RE Posterior - Learning Param. Distr. 95% Low Med. 95% High 95% Low Med. 95% High 95% Low Med. 95% High σ0z Gamma 0.000 0.069 0.303 0.649 0.730 0.814 0.639 0.733 0.824 σ4z Gamma 0.000 0.049 0.214 0.000 0.057 0.252 0.000 0.062 0.283 σ8z Gamma 0.000 0.049 0.212 0.000 0.051 0.194 0.000 0.051 0.249 σ0ζ Gamma 0.000 0.069 0.300 2.795 3.398 4.081 2.741 3.352 4.043 σ4ζ Gamma 0.000 0.049 0.214 0.000 0.051 0.215 0.000 0.049 0.219 σ8ζ Gamma 0.000 0.049 0.215 0.000 0.050 0.219 0.000 0.051 0.205 σ0zI Gamma 0.000 0.069 0.302 4.701 5.599 6.563 4.634 5.602 6.560 σ4zI Gamma 0.000 0.049 0.214 0.000 0.051 0.230 0.000 0.047 0.215 σ8zI Gamma 0.000 0.049 0.214 0.000 0.051 0.211 0.000 0.047 0.207 σ0ω Gamma 0.000 0.069 0.302 0.000 0.206 1.115 0.000 0.079 0.410 σ4ω Gamma 0.000 0.049 0.212 0.000 0.153 1.103 0.000 0.054 0.245 σ8ω Gamma 0.000 0.049 0.215 0.000 0.133 1.091 0.917 1.038 1.169 σ0g Gamma 0.000 0.069 0.302 0.000 0.073 0.309 0.000 0.067 0.279 σ4g Gamma 0.000 0.049 0.214 3.686 4.236 4.816 3.718 4.289 4.900 σ8g Gamma 0.000 0.049 0.212 0.000 0.051 0.223 0.000 0.053 0.246 σ0µA Gamma 0.000 0.069 0.303 0.015 0.277 0.372 0.001 0.214 0.352 σ4µA Gamma 0.000 0.049 0.212 0.000 0.115 0.341 0.000 0.180 0.352 σ8µA Gamma 0.000 0.049 0.213 0.000 0.083 0.306 0.000 0.096 0.337 σ0µX Gamma 0.000 0.069 0.304 0.032 0.078 0.149 0.024 0.076 0.148 σ4µX Gamma 0.000 0.049 0.214 0.000 0.021 0.065 0.000 0.024 0.074 σ8µX Gamma 0.000 0.049 0.215 0.000 0.044 0.124 0.000 0.044 0.125 σYme Gamma 0.000 0.069 0.305 0.541 0.594 0.651 0.542 0.594 0.652 ρx,g Beta 0.125 0.500 0.871 0.117 0.420 0.733 0.131 0.439 0.738 ρz Beta 0.131 0.501 0.873 0.797 0.858 0.914 0.798 0.861 0.920 ρζ Beta 0.129 0.502 0.869 0.015 0.109 0.231 0.013 0.103 0.225 ρzI Beta 0.127 0.499 0.868 0.729 0.798 0.868 0.724 0.796 0.864 ρω Beta 0.115 0.500 0.858 0.917 0.950 0.980 0.923 0.955 0.985 ρg Beta 0.132 0.499 0.873 0.903 0.942 0.979 0.906 0.944 0.979 ρµA Beta 0.128 0.500 0.869 0.351 0.465 0.575 0.359 0.469 0.580 ρµX Beta 0.132 0.500 0.873 0.889 0.951 0.990 0.877 0.950 0.988 θ Gamma 1.503 4.760 8.961 2.936 3.696 4.490 3.057 3.803 4.645 κ Gamma 3.171 9.479 18.048 4.709 5.889 7.203 4.625 5.713 7.017 δ2 InvGamma 0.000 0.000 0.002 0.030 0.035 0.042 0.029 0.035 0.041 γ Unif 0.001 0.499 0.951 0.000 0.001 0.008 0.000 0.001 0.006 b Beta 0.503 0.707 0.885 0.813 0.841 0.867 0.812 0.840 0.865 g¯ Gamma 0.000 0.024 0.106 0 0 0 0.000 0.007 0.025 TABLE 10. Estimated Parameters 83 in the implied variance for the growth rate of the relative price of investment: under RE the surprise shock is responsible for about 80% of total variance, while it is responsible for only about 50% under NRE. Thus, the learning model ascribes greater importance to anticipated shocks for driving exogenous variables than the model with RE. That the values for mechanical sources of volatility and persistence such as the habit formation parameter b and the rental rate elasticity of capacity utilization δ2 are so similar across models is somewhat surprising. Previous estimation-with-learning exercises as in Milani and Rajbhandari (2012) and Slobodyan and Wouters (2012) found that mechanical sources of volatility and persistence failed to maintain their importance when agents were assumed to be boundedly rational. However, my results are consistent with the earlier finding from Slobodyan and Wouters (2007) that differences in estimates between RE and models with learning are to a large extent driven by differences in the information sets of agents. Because agents’ initial beliefs correspond to the REE the information set of agents under NRE is quite similar to that of their rational counterparts. To assess the quantitative importance of anticipated shocks versus surprise shocks, I conduct a forecast-error variance decomposition (FEVD) exercise on the implied growth rates of output, consumption, investment, hours worked, and government spending. For surprise shocks typical FEVDs have a simple interpretation: because the shock is realized in the same period it is announced, the induced volatility can be directly attributed to the change in fundamentals. Anticipated shocks, however, have two separate impacts on the system. Sims (2016) refers to these impacts as “pure news’ and “realized news”, respectively. Pure news captures volatility resulting from the change in agents’ expectations of future fundamentals before those fundamentals have actually changed. Realized news captures 84 volatility resulting from the actual arrival of the anticipated event. For instance, if news arrives four periods in advance, horizons up to four periods will represent pure news, while horizons after four periods will represent both pure and realized news. Since an unconditional FEVD is an asymptotic result, it will mix these two effects. If we are interested in the relative importance of anticipated versus surprise shocks then we should look at FEVDs conducted on shorter horizons. Figure 4 shows the 95% HPDI of the estimated variance for the growth rates of output, consumption, investment, and hours worked explained by anticipated shocks across models and FEVD horizons from 7500 draws from the posterior distributions, while Figure 5 displays the same information for the growth rate of government expenditure.12 The vertical axis gives the median percent of volatility explained by anticipated shocks, while the horizontal axis gives the FEVD horizon. The light-shaded (dark-shaded) region corresponds to the 95% HPDI from the model under NRE (RE). In all cases the importance of news is greater under the learning model. Furthermore the HPDIs are larger under NRE than under RE. This suggests that news is not merely a proxy for the endogenous learning process, but that in fact the combination of news and learning leads to additional volatility than would otherwise be implied. However it is worth noting that the entirety of this difference occurs from the realized news component: consistent with the results from Sims (2016), the relative importance of pure news is similarly low under both models. Given the observed differences in the relative importance of anticipated vs unanticipated shocks across expectation formation mechanisms, it is natural to wonder 12I calculate FEVDs for horizons h = {0, 1, 2, 3, 4, 5, 6, 7, 8, 16}. Technically an unconditional FEVD is taken in the limit as the forecasting horizon goes to infinity. In practice this convergence occurs relatively quickly. Horizons longer than 16 result in variance shares which are nearly identical to those generated at 16. 85 Share of Volatility Due to Anticipated Shocks, RE vs NRE 0 1 2 3 4 5 6 7 8 16 0 20 40 60 FEVD Horizon P e rc e n t E x p la in e d Variance of Output Growth 0 1 2 3 4 5 6 7 8 16 0 10 20 30 40 FEVD Horizon P e rc e n t E x p la in e d Variance of Consumption Growth 0 1 2 3 4 5 6 7 8 16 0 20 40 60 FEVD Horizon P e rc e n t E x p la in e d Variance of Investment Growth 0 1 2 3 4 5 6 7 8 16 0 20 40 60 80 FEVD Horizon P e rc e n t E x p la in e d Variance of Growth in Hours Worked FIGURE 4. Relative Importance of News: Key Macroeconomic Variables, RE vs NRE whether there are differences in each model’s ability to fit the data. Table 11 compares the standard deviation (relative to output), correlation with output, and first-order autocorrelation coefficients for the growth rates of output, consumption, investment, hours worked, government expenditure, TFP, and the relative price of investment obtained from the actual data to that obtained from 7500 simulations of each model. The deep structural parameters for each simulation were generated via a random draw of the respective model’s posterior distribution. The summary statistics are the median value across these simulations. The variables Y,C, I, h, g, TFP and A refer to the growth rates of output, consumption, investment, hours worked, government expenditure, TFP, and the relative 86 Share of Volatility Due to Anticipated Shocks, RE vs NRE 0 1 2 3 4 5 6 7 8 16 0 20 40 60 80 100 FEVD Horizon P er ce n t E x p la in ed Variance of Government Expenditure Growth FIGURE 5. Relative Importance of News: Government Spending, RE vs NRE price of investment, respectively, and relative standard deviations are the ratio of the standard deviation of the variable to that of the growth rate of output. The summary statistics produced by both models are extremely similar to each other and are broadly consistent with the data. The growth rates of consumption and investment are estimated to be overly volatile, while the growth rate of the relative price of investment is somewhat less volatile; however, the underlying standard deviation of output growth is almost identical to that of the data. Furthermore the relative standard deviations of growth in hours worked, government spending, and TFP are all quite accurate compared to the data. The estimated correlations with the data are weaker than those observed in the data for the growth rates of investment, hours worked, and TFP, but match quite well for consumption and government spending. Finally, the estimated 87 Model Predictions: Volatility and Persistence, Data vs NRE vs RE Y C I h g TFP A Relative Standard Deviation Data 1.00 0.56 2.52 0.92 1.25 0.83 0.45 Model - NRE 1.00 0.70 3.06 0.97 1.15 0.85 0.35 Model -RE 1.00 0.70 3.05 0.98 1.18 0.88 0.37 Correlation with Output Data 1.00 0.50 0.69 0.72 0.25 0.40 -0.12 Model - NRE 1.00 0.51 0.57 0.40 0.26 0.27 0.01 Model -RE 1.00 0.49 0.55 0.37 0.27 0.29 0.01 Autocorrelation Data 0.28 0.20 0.53 0.60 0.05 -0.01 0.49 Model - NRE 0.32 0.46 0.63 0.18 0.03 -0.01 0.46 Model -RE 0.34 0.48 0.65 0.19 0.03 -0.00 0.46 TABLE 11. Model Predictions: Volatility and Persistence, Data vs NRE vs RE persistence of consumption is too high, while the estimated persistence of hours worked is too low.13 Conclusion This chapter has demonstrated that the estimated relative importance of anticipated news versus unanticipated news, as interpreted by the previous literature, is robust to the inclusion of learning. In particular it appears that news and learning serve to augment each other: in general, the estimated macroeconomic volatility in key variables is more a function of news under NRE than under RE. This is especially interesting given that both models produce quantitatively similar predictions for overall volatility and persistence. 13My estimates are broadly similar but generally closer to the data than those produced in Schmitt- Grohe and Uribe (2012). For example, the sum of squared deviations of data vs estimated relative standard deviation is around 0.33 under both NRE and RE under my estimates compared to 1.6 under theirs. This is driven to a large extent to the relative standard deviation of investment to output growth which is much greater under their estimates due to a much lower estimated standard deviation of output (0.91 in the data and my estimates, 0.73 under their estimates). 88 However, pure news is responsible for almost none of the estimated volatility in either model; it is only once the anticipated shock materializes that news begins to be important. It is possible that this finding is sensitive to the particular form of learning assumed. The NRE approach undertaken in this chapter represents a relatively small departure from RE. Future research will look at the relative impact on model predictions and estimates across a variety of bounded rationality assumptions. Departures from RE such as the infinite-horizon learning of Preston (2005), finite-horizon learning of Branch et al. (2012a), and shadow-price learning of Evans and McGough (2015) all reflect larger departures from RE, and provide a natural avenue for future projects studying the empirical importance of both news shocks and adaptive learning in DSGE model estimation. It will be particularly interesting to observe whether differences emerge between between bounded rationality of the “learning-to-forecast” and “learning-to-optimize” types emerge. 89 APPENDIX A APPENDIX FOR CHAPTER II Competitive Equilibrium Firms The representative firm in the competitive durable goods sector solves max lk,t PtIt −Wk,tlk,t −Rk,tl˜x s.t. It = exp (θk,t)l αk k,t l˜ 1−αk k where Pt the price of the investment good in terms of the consumption good, Wk,t and Rk,t the rents paid to labor lk,t and the fixed factor l˜k, respectively, and θk,t is a measure of technology.1 The FONCs are given by Wk,t = αkPt It lk,t (A.1) Rk,t = (1− αk)pt It l˜k (A.2) Thus labor and the fixed factor in this sector are paid their marginal products in terms of the numeraire good. 1 Technology in the durable goods sector evolves deterministically according to log θk,t = g0,k + g1 ∗ t 90 Consumption Good The representative firm solves max Xt,Kt−1 Ct − Px,tXt −RtKt−1 s.t Ct = (aX v t + (1− a)Kvt−1) 1 v where Px,t the price of the nondurable good Xt in terms of the consumption good and Rt the gross real interest rate. The FONCs are given by Rt = (1− a)C1−vt Kv−1t−1 (A.3) Px,t = aC 1−v t X v−1 t (A.4) This will be used below to determine the value of labor and the fixed factor in the non- durable goods sector Xt. The nondurable good is an intermediate good in production, and hence is only consumed by final goods producers. The representative firm in the competitive nondurable goods sector solves max lx,t,l˜x Px,tXt −Wx,tlx,t −Rx,tl˜x s.t Xt = exp (θx,t)l αx x,t l˜ 1−αx x 91 where Wx,t and Rx,t the rents paid to labor lx,t and the fixed factor l˜x, respectively, and θx,t is a measure of technology. 2 The FONCs are given by Wx,t = αxXt/lx,t (A.5) Rx,t = rx,t (1− αx)Xt/l˜x (A.6) (A.7) We can use the demand from the representative final good producer to determine the wage and rental rates. Equations (A.5) and (A.4) together imply Wx,t = aC 1−v t αx Xvt lx,t (A.8) and equations (A.6) and (A.4) together imply rx,t = aC 1−v t (1− αx) Xvt l˜x (A.9) 2 Technology in the nondurable goods sector evolves according to log θx,t = g0,x + g1 ∗ t+ log θˆx,t log θˆx,t = λ log θˆx,t−1 + t where t an i.i.d mean zero shock which is known to the firms when making their production decision and vt−n is the news shock revealed to households. 92 Households The household problem is given by max Ct,lx,t,lk,t,It,Kt logCt + v0(l¯ − lx,t − lk,t) (A.10) subject to the law of motion for capital and the flow budget constraint Kt = (1− δ)Kt−1 + It (A.11) Ct + PtIt = Wx,tlx,t +Wk,tlk,t +RtKt−1 + Πt (A.12) The LaGrangian is thus L = Eˆ0 ∞∑ t=0 βt { logCt + v0(l¯ − lx,t − lk,t) } + Λt {Wx,tlx,t +Wk,tlk,t +RtKt−1 + Πt − Ct − PtIt} + ΛtQt {(1− δ)Kt−1 + It −Kt} where Λt the marginal value of income, Qt Tobin’s marginal Q, and hence ΛtQt the marginal value of capital in terms of consumption. The first-order necessary conditions 93 for an interior solution to the household’s problem are given by ∂L ∂Ct = 0 ⇐⇒ Λt = 1 Ct (A.13a) ∂L ∂lx,t = 0 ⇐⇒ v0 = ΛtWx,t (A.13b) ∂L ∂lk,t = 0 ⇐⇒ v0 = ΛtWk,t (A.13c) ∂L ∂It = 0 ⇐⇒ Pt = Qt (A.13d) ∂L ∂Kt = 0 ⇐⇒ ΛtQt = βEˆt [Λt+1 (Rt+1 +Qt+1(1− δ))] (A.13e) 94 Equilibrium A temporary equilibrium is a set { Ct, lx,t, lk,t, It, Kt, Xt, Yt, Pt, Rt,Wx,t,Wk,t,Λt, Qt, θk,t, θx,t, θ˜x,t } such that consumers and firms are utility maximizing and markets clear: Yt = Ct + PtIt (A.14a) Ct = ( aXνt + (1− a)Kνt−1 ) 1 ν (A.14b) Xt = θx,tl αx x,tF 1−αx x (A.14c) It = θk,tl αk k,tF 1−αk k (A.14d) Kt = (1− δ)Kt−1 + It (A.14e) Λt = 1 Ct (A.14f) v0 = ΛtWx,t (A.14g) v0 = ΛtWk,t (A.14h) Pt = Qt (A.14i) ΛtQt = βEˆt [Λt+1Rt+1 + Λt+1Qt+1(1− δ)] (A.14j) Rt = (1− a) ( Ct Kt−1 )1−v (A.14k) Wx,t = aαxC 1−v t Xvt lx,t (A.14l) Wk,t = Ptαk It lk,t (A.14m) θk,t = g0,ke g1t (A.14n) θx,t = g0,xe g1tθ˜x,t (A.14o) θ˜x,t = θ˜ λ x,t−1e t (A.14p) (A.14q) 95 where Λt and ΛtQt the marginal value (or shadow price) of income and installed capital, respectively. Qt is the ratio of these marginal values and can be interpreted as the relative price of installed capital in terms of the consumption good, i.e. marginal Tobin’s Q. Equation (A.14a) is the economy’s aggregate resource constraint and equations (A.14b)-(A.14d) are the production functions for the consumption, nondurable, and durable goods respectively. Equation (A.14e) is the capital accumulation equation. Optimal household decision making for consumption, nondurable good labor supply, durable good labor supply, investment, and capital yields equations (A.14f)-(A.14j), respectively. Markets are competitive and hence factors of production are paid the value of their marginal products, as specified by equations (A.14k)-(A.14m). Finally, technology for producing the durable and nondurable goods is given by equations (A.14n)-(A.14p). 96 Detrending Define the for any non-stationary variable x the stationary counterpart xt = xˆte g1t. The temporary equilibrium in terms of detrended variables is Yˆt = Cˆt + PtIˆt (A.15a) Cˆt = ( aXˆνt + (1− a) ( Kˆt−1 eg1 )ν) 1ν (A.15b) Xˆt = θˆx,tl αx x,tF 1−αx x (A.15c) Iˆt = θˆk,tl αk k,tF 1−αk k (A.15d) Kˆt = (1− δ)Kˆt−1 eg1 + Iˆt (A.15e) Λˆt = 1 Cˆt (A.15f) v0 = ΛˆtWˆx,t (A.15g) v0 = ΛˆtWˆk,t (A.15h) Pt = Qt (A.15i) eg1ΛˆtQt = βEˆt [ Λˆt+1Rt+1 + Λˆt+1Qt+1(1− δ) ] (A.15j) Rt = (1− a) ( Cˆt Kˆt−1 eg1 )1−v (A.15k) Wˆx,t = aαxCˆ 1−v t Xˆvt lx,t (A.15l) Wˆk,t = Ptαk Iˆt lk,t (A.15m) θˆk,t = g0,k (A.15n) θˆx,t = g0,xθ˜x,t (A.15o) θˇx,t = θˇ λ x,t−1e t (A.15p) 97 Log-Linearization The entire detrended, log-linearized system of equilibrium conditions is given by Y˜t − ( C Y ) C˜t − ( PI Y ) P˜t − ( PI Y ) I˜t = 0 (A.16) C˜t − a ( X C )v X˜t = (1− a) ( K C )v K˜t−1 (A.17) X˜t − θ˜x,t − αxl˜x,t = 0 (A.18) I˜t − θ˜k,t − αk l˜k,t = 0 (A.19) K˜t − ( I K ) I˜t = ( 1− δ eg1 ) K˜t−1 (A.20) Λ˜t + C˜t = 0 (A.21) Λ˜t + W˜x,t = 0 (A.22) Λ˜t + W˜k,t = 0 (A.23) Q˜t − P˜t = 0 (A.24) Λ˜t + Q˜t = EˆtΛ˜t+1 + ( 1 R +Q(1− δ) ) EˆtR˜t+1 + ( Q(1− δ) R +Q(1− δ) ) EˆtQ˜t+1 (A.25) R˜t + (R(v − 1)) C˜t = (R(v − 1)) K˜t−1 (A.26) l˜x,t + W˜x,t − (1− v) C˜t − vX˜t = 0 (A.27) l˜k,t + W˜k,t − P˜t − I˜t = 0 (A.28) θ˜k,t = 0 (A.29) θ˜x,t − ˜ˇθx,t = 0 (A.30) ˜ˇθx,t = λ ˜ˇθx,t−1 + t (A.31) 98 where a ˜tilde over a variable indicates the percent deviation from steady state and variables without a time subscript indicate their steady state value.3 3Note that the gross interest rate is expressed in “deviation from steady state” rather than “percent deviation from steady state”, since the variable is already expressed in percentage terms. 99 APPENDIX B APPENDIX FOR CHAPTER III Solving the Model Suppose the household was acting optimally and contemplated reallocating a small amount of consumption for investment. The marginal cost of the reduction in consumption is the direct loss of utility today as well as the expected discounted value of changing the geometric average of habit-adjusted consumption. The marginal benefit is the expected discounted value of the additional investment, which affects future investment adjustment costs and increases tomorrow’s capital stock. Hence the household’s first-order necessary condition for Ct is UCt(Vt) + ∂St ∂Ct βEˆtλ S t+1 = ∂It ∂Ct ( βEˆtλ I t+1 + ∂Kt ∂It βEˆtλ K t+1 ) (B.1) Now suppose the household considered a small increase in their labor supply. The additional labor income could be used to increase investment, and hence the marginal benefit is the expected discounted value of the additional investment they can afford, which again affects future investment adjustment costs and increases tomorrow’s capital stock. The marginal cost is the disutility associated with higher labor supply, and hence the household’s first-order necessary condition for ht is −Uht(Vt) = ∂It ∂ht ( βEˆtλ I t+1 + ∂Kt ∂It βEˆtλ K t+1 ) (B.2) 100 If the household were to utilize more of their predetermined capital stock they would receive rental income which could be used to finance additional investment, again affecting tomorrow’s capital stock and investment adjustment costs. However working capital more intensely also increase the rate of depreciation, and thus the first-order necessary condition for ut is ∂Kt ∂ut βEˆtλ K t+1 = ∂It ∂ut ( βEˆtλ I t+1 + ∂Kt ∂It βEˆtλ K t+1 ) (B.3) The optimal choices of Ct, ht, and ut pin down the value for It through the flow budget constraint, which pins down the value for Kt through the capital accumulation equation. Likewise, the optimal choice of Ct generically determines the value of St. Thus equations (B.1), (B.2), and (B.3) fully describe the optimal behavior of the household as a function of their beliefs about the future shadow prices. The expected value of these shadow prices may also be derived using variational arguments. The time t value of additional It−1 in time t holding everything else constant is the resulting change to investment adjustment costs which changes the size of tomorrow’s capital stock, that is λIt = ∂Kt ∂It−1 βEˆtλ K t+1 (B.4) The time t value of additional preinstalled capital in time t is the expected discounted value of the additional rental income (which could be invested) plus the value of directly increasing tomorrow’s capital stock, that is λKt = ∂It ∂Kt−1 ( βEˆtλ I t+1 + ∂Kt ∂It βEˆtλ K t+1 ) + ∂Kt ∂Kt−1 βEˆtλ K t+1 (B.5) 101 Finally, the time t value of a small change to the predetermined level of habit-adjusted consumption in time t is the direct change to utility today, as well as the expected discounted value of the resulting change in the level of habit-adjusted consumption, that is λSt = USt−1(Vt) + ∂St ∂St−1 βEˆtλ S t+1 (B.6) The solution to the household’s optimization problem satisfies the FONCs for the controls given by ( Ct − ψhθtSt )−σ ( 1− γψh θ tSt Ct ) + γSt Ct βEˆtλ S t+1 = 1 At βEˆt ( λIt+1 + ( 1− Φ ( It It−1 ) − Φ′ ( It It−1 )( It It−1 )) λKt+1 ) (B.7a) ( Ct − ψhθtSt )−σ ( θψhθ−1t St Wt ) = 1 At βEˆt ( λIt+1 + ( 1− Φ ( It It−1 ) − Φ′ ( It It−1 )( It It−1 )) λKt+1 ) (B.7b) δ′(ut) Rt βEˆtλ K t+1 = 1 At βEˆt ( λIt+1 + ( 1− Φ ( It It−1 ) − Φ′ ( It It−1 )( It It−1 )) λKt+1 ) (B.7c) λIt = Φ ′ ( It It−1 )( It It−1 )2 βEˆtλ K t+1 (B.7d) λKt = (Rtut) 1 At βEˆt ( λIt+1 + ( 1− Φ ( It It−1 ) − Φ′ ( It It−1 )( It It−1 )) λKt+1 ) + (1− δ(ut))βEˆtλKt+1 (B.7e) λSt = ( Ct − ψhθtSt )−σ (− (1− γ)ψhθtSt St−1 ) + β (1− γ)St St−1 Eˆtλ S t+1 (B.7f) It can be shown that the shadow prices can be calculated as λIt = Φ ′ ( It It−1 )( It It−1 )2( Rt δ′(ut) )( Ct − ψhθtSt )−σ (θψhθ−1t St Wt ) (B.8a) λKt = (1 + utδ ′(ut)− δ(ut)) ( Rt δ′(ut) )( Ct − ψhθtSt )−σ (θψhθ−1t St Wt ) (B.8b) λSt = [( Ct − ψhθtSt )−σ ((1− γ)St St−1 )][(−Ct γSt )( 1− γψh θ tSt Ct − θψh θ−1 t St Wt ) − ψhθt ] (B.8c) The full dynamic system under SP-learning is described by the following equations, comprised of the optimality conditions for households and firms, resource constraints, and 102 identities: ( Ct − ψhθtSt )−σ ( 1− γψh θ tSt Ct ) + γSt Ct βEˆtλ S t+1 = 1 At βEˆt ( λIt+1 + ( 1− Φ ( It It−1 ) − Φ′ ( It It−1 )( It It−1 )) λKt+1 ) ( Ct − ψhθtSt )−σ (− θψhθ−1t St Wt ) = 1 At βEˆt ( λIt+1 + ( 1− Φ ( It It−1 ) − Φ′ ( It It−1 )( It It−1 )) λKt+1 ) δ′(ut) Rt βEˆtλ K t+1 = 1 At βEˆt ( λIt+1 + ( 1− Φ ( It It−1 ) − Φ′ ( It It−1 )( It It−1 )) λKt+1 ) λIt = Φ ′ ( It It−1 )( It It−1 )2 ( Rt δ′(ut) )( Ct − ψhθtSt )−σ ( θψhθ−1t St Wt ) λKt = ( 1 + utδ ′(ut)− δ(ut) )( Rt δ′(ut) )( Ct − ψhθtSt )−σ ( θψhθ−1t St Wt ) λSt = [( Ct − ψhθtSt )−σ ( (1− γ)St St−1 )] ×[(−Ct γSt )( 1− γψh θ tSt Ct − θψh θ−1 t St Wt ) − ψhθt ] Kt = (1− δ (ut))Kt−1 + It [ 1− Φ ( It It−1 )] Ct +AtIt ≤Wtht +Rt (utKt−1) St = C γ t S 1−γ t−1 Yt = Ct +AtIt Yt = zt (utKt−1)1−α (ht)α Wt = α Yt ht Rt = (1− α) Yt utKt−1 Eˆtλ I t = H I t x˜t Eˆtλ K t = H K t x˜t Eˆtλ S t = H S t x˜t RH,t = RH,t−1 + gt ( x˜t−1x˜′t−1 −RH,t−1 ) Ht = Ht−1 + gtR−1H,tx˜t−1 ( λt−1 −H′t−1x˜t−1 )′ 103 Recursive Representation of News-Shocks We can write the evolution of any exogenous variable indexed by w as ln(wt) = ρw ln(wt−1) + ε0w,t (B.9) εw,t = ϕwεw,t−1 +Mwνw,t (B.10) where εw,t = ( ε0w,t, ε 1 w,t, .., ε n w,t ) a vector of auxiliary state variables which carry the news shocks through time, ϕ a lower-shift matrix with 1’s on the super-diagonal and zeros elsewhere, νw,t = ( ν0w,t, ν 1 w,t, ..., ν n w,t )′ a vector of (anticipated and unanticipated) shocks, and Mw = ([ ν0w,t ∈ It ] , [ ν1w,t−1 ∈ It ] , ..., [ νnw,t−n ∈ It ]) a row-vector of 1’s and 0’s respecting the specific assumptions regarding the information obtained by households. 104 APPENDIX C APPENDIX FOR CHAPTER IV The Labor Market Final-goods producers demand a composite labor good hct = [∫ 1 0 ht(j) 1 1+µt dj ]1+µt where ht(j) denotes labor of type j ∈ [0, 1] and µt ≥ 0 is the exogenous stochastic wage markup. The cost minimization problem for a final-goods producer is min ht(j) ∫ 1 0 Wt(j)ht(j)dj subject to hct ≤ [∫ 1 0 ht(j) 1 1+µt dj ]1+µt the solution to which yields the conditional factor demand functions h∗t (j) = hct ( Wt(j) W ct )− (1+µt) µt where W ct = [∫ 1 0 Wt(j) − 1 µt dj ]−µt the cost of a single unit of the composite labor input. Labor ht(j) is provided by monopolistically competitive unions. Since labor is freely mobile all unions pay their members the same wage rate Wt and charge firms Wt(j). The profit maximization problem for these unions is max Wt(j) (Wt(j)−Wt)ht(j) subject to ht(j) = h ∗ t (j) the solution to which is W ∗t (j) = (1 + µt)Wt. It is clear from this expression all unions will charge firms the same price denoted W ∗t , and that this wage is marked up in accordance 105 with the value of market power captured by µt. This implies firms will demand identical quantities of each type of labor, that is ht(j) = h c t for all j. The profits of each union j are thus Πt(j) = µtWth c t , which is the same for all unions. These profits are rebated to member-households as a lump sum. Since the unions determine the hours of labor allocated, firms choose their labor demand, and households choose how much labor to provide, an equilibrium requires ∫ 1 0 ht(j)dj = h c t = ∫ ht(ω)dω. 106 Inducing Stationarity Many of the endogenous variables inherit the non-stationarity of At and Xt. In particular, it can be shown that – Ct, St, Y out t , and Wt grow at rate µ Y t = XYt XYt−1 – Kt and It grow at rate µ K t = XKt XKt−1 where XKt = At XYt – Gt grows at rate µ G t = XGt XGt−1 – Rt and Qt grow at rate µ A t – Πt and Λt grow at rate ( µYt )−σ Thus we define the stationary variables Yˆ outt = Y outt XYt , Cˆt = Ct XYt , Wˆt = Wt XYt , Iˆt = AtIt XYt , Kˆt = AtKt XYt , gt = Gt XGt , Rˆt = Rt At , Qˆt = Qt At , Πˆt = ( XYt )σ Πt, and Λˆt = ( XYt )σ Λt. ut and ht are already stationary variables. By manipulating the equations defining a non-stationary temporary equilibrium one can express the system in terms of the stationary variables. In particular, one can show the stationarized dynamic system of equilibrium conditions to be 107 given by Kˆt = (1− δ (ut)) Kˆt−1 µKt + zIt Iˆt [ 1− Φ ( Iˆt Iˆt−1 µKt )] Yˆt = Cˆt + Iˆt + gtX G,Y t Yˆt = zt ( ut Kˆt−1 µKt )αk (ht) αh (F )1−αk−αh Vˆt = Cˆt − bCˆt−1 µYt − ψhθt Sˆt Sˆt = ( Cˆt − bCˆt−1 µYt )γ ( Sˆt−1 µYt )1−γ Λˆt = ζtVˆ −σt − γSˆt Cˆt − b Cˆt−1µYt Πˆt − bβEˆt (µYt+1)−σ ζt+1Vˆ −σt+1 − γSˆt+1 Cˆt+1 − b CˆtµYt+1 Πˆt+1  Λˆt = θζtVˆ −σ t ψh θ−1 t Sˆt Wˆt/ωt Πˆt = ζtVˆ −σ t ψh θ t + β(1− γ)Eˆt [ Sˆt+1 Sˆt Πˆt+1 ( µYt+1 )1−σ] QˆtΛˆt = βEˆt [ µAt+1 ( µYt+1 )−σ Λˆt+1 ( Rˆt+1ut+1 + Qˆt+1(1− δ(ut+1)) )] Rˆt = δ ′(ut)Qˆt Λˆt = QˆtΛˆtz I t ( 1− Φ ( Iˆt Iˆt−1 µKt ) − Φ′ ( Iˆt Iˆt−1 µKt )( Iˆt Iˆt−1 µKt )) + βEˆt Qˆt+1Λˆt+1zIt+1µAt+1 (µYt+1)−σ Φ′ ( Iˆt+1 Iˆt µKt+1 )( Iˆt+1 Iˆt µKt+1 )2 Wˆt = αh Yˆt ht Rˆt = αk Yˆt ut Kˆt−1 µKt 108 where the functional forms for the depreciation rate and investment-adjustment costs are given by δ (ut) = δ0 + δ1 (ut − 1) + δ2 2 (ut − 1)2 Φ ( Iˆt Iˆt−1 µKt ) = κ 2 ( Iˆt Iˆt−1 µKt − µI )2 The laws of motion for the exogenous stochastic processes can be written ln (zt/z) = ρz ln (zt−1/z) + 0z,t ln (ζt/ζ) = ρζ ln (ζt−1/ζ) + 0ζ,t ln ( zIt /z I ) = ρzI ln ( zIt−1/z I ) + 0zI ,t ln (gt/g) = +ρg ln (gt−1/g) + 0g,t ln (ωt/ω) = ρω ln (ωt−1/ω) + 0ω,t ln ( µAt /µ A ) = ρµA ln ( µAt−1/µ A ) + 0µA,t ln ( µXt /µ X ) = ρµX ln ( µXt−1/µ X ) + 0µX ,t µYt = µ X t ( µAt )αk/(αk−1) µKt = µ X t ( µAt )1/(αk−1) XG,Yt = ( XG,Yt−1 )ρx,g µYt = XGt XYt 109 where the shock structure for any stochastic process x = { z, ζ, zI , g, ω, µA, µX } is given by 0x,t =  1 x,t−1 + ν 0 x,t 1x,t =  2 x,t−1 2x,t =  3 x,t−1 3x,t =  4 x,t−1 4x,t =  5 x,t−1 + ν 4 x,t 5x,t =  6 x,t−1 6x,t =  7 x,t−1 7x,t =  8 x,t−1 8x,t = ν 8 x,t Finally, the growth rates of the seven key macroeconomic variables considered in terms of the stationary variables is gYt = Yt Yt−1 = Yˆt Yˆt−1 XYt XYt−1 = Yˆt Yˆt−1 µYt gCt = Ct Ct−1 = Cˆt Cˆt−1 XYt XYt−1 = Cˆt Cˆt−1 µYt gIt = AtIt At−1It−1 = Iˆt Iˆt−1 XYt XYt−1 = Iˆt Iˆt−1 µYt ght = ht ht−1 gGt = Gt Gt−1 = gt gt−1 ( XG,Yt−1 )ρxg−1 gTFPt = zt zt−1 ( µXt )1−αk gAt = µ A t 110 Steady State Computation and Calibration Let x be the steady-state value of the stationary random variable xˆt. The steady states for z, ζ, and zI are normalized to 1, g is set such that the output share of government spending is 0.20, ω is set to 1.15, and µY and µA are set to the mean gross growth rate of per capita output and relative price of investment observed in the data which, given the stochastic trends in output and government spending, implies µX = µY µA αk 1−αk and XG,Y = ( µY )1/(ρx,g−1) . The parameter µI is set to µK so that investment-adjustment costs (and their derivative) are zero in steady state, while δ1 is set to 1 βµA µY −σ − (1− δ0) to ensure the steady-state capacity utilization rate u is 1. The scale parameter ψ governing the disutility of labor in the utility function is calibrated to ensure the steady-state value of h is 0.20. With these calibrations for the steady-state values of exogenous processes and free parameters, the non-stochastic steady state can be easily computed from the equilibrium conditions and constraints given above. In particular, the 111 steady state can be shown to be R = δ1 h = 0.2 u = 1 Q = 1 K = αkz ( µK u )1−αk hαh R  1 1−αk I = ( 1− 1− δ0 µK ) K Y = RuK αkµK W = αh Y h g = 0.2 C = Y − I − gXGY S = C(1− b/µY ) (µY ) γ−1γ ψ = [( θhθ−1S W/ω )( 1− bβ (µY )−σ)−1 + ( γS (C − bC/µY ) )( hθ 1− β(1− γ) (µY )1−σ )]−1 V = ( C − bC µY ) − ψhθS Π = ( ζV −σψhθ ) ( 1− β(1− γ) (µY )1−σ)−1 Λ = θζV −σψhθ−1S W/ω 112 Log-Linearization The stationarized system can be log-linearized around its non-stochastic steady state. Denote by a t˜ilde the deviation from steady state, that is, for any variable x we have x˜t = log ( xt x ) . 1. The capital accumulation equation is Kˆt = (1− δ (ut)) Kˆt−1 µKt + zIt Iˆt [ 1− Φ ( Iˆt Iˆt−1 µKt )] Taking logs of both sides we have log Kˆt = log ( (1− δ (ut)) Kˆt−1 µKt + zIt Iˆt [ 1− Φ ( Iˆt Iˆt−1 µKt )]) Noting that logK = log ( (1− δ(u)) K µK + zII ( 1− Φ ( I I µK ))) , the first-order Taylor series expansion around the steady state can be shown to be K˜t = ( 1− δ(u) µK ) K˜t−1 − ( δ′(u)u µK ) u˜t − ( 1− δ(u) µK ) µ˜Kt + ( zII K ) z˜It + ( zII K ) I˜t 2. The aggregate resource constraint is Yˆt = Cˆt + Iˆt + gtX G,Y t Taking logs of both sides we have log Yˆt = log ( Cˆt + Iˆt + gtX G,Y t ) 113 Noting that log Yˆ = log ( Cˆ + Iˆ + gXG,Y ) , the first-order Taylor series expansion around the steady state can be shown to be Y˜t = ( C Y ) C˜t + ( I Y ) I˜t + ( gXG,Y Y ) X˜G,Yt + ( gXG,Y Y ) g˜t 3. The production function is Yˆt = zt ( ut Kˆt−1 µKt )αk (ht) αh (F )1−αk−αh Taking logs on both sides yields log Yˆt = log zt + αk log ut + αk log Kˆt−1 − αk log µKt + αh log ht + (1− αk − αh) logF Since the equation is log-linear, subtracting each side by log Y = log z + αk log u + αk log Kˆ − αk log µK + αh log h + (1− αk − αh) logF , the log-linearized production function can be shown to be Y˜t = z˜t + αku˜t + αkK˜t−1 − αkµ˜Kt + αhh˜t 4. The utility function’s argument is Vˆt = Cˆt − bCˆt−1 µYt − ψhθt Sˆt Taking logs of both sides yields log Vˆt = log ( Cˆt − bCˆt−1 µYt − ψhθt Sˆt ) 114 Noting that log Vˆ = log ( Cˆ − b Cˆ µY − ψhθSˆ ) , the first-order Taylor series expansion around the steady state can be shown to be V˜t = ( C V ) C˜t − ( bC/µY V ) C˜t−1 + ( bC/µY V ) µ˜Yt − ( ψhθSθ V ) h˜t − ( ψhθS V ) S˜t 5. The identity for the geometric average of current and past habit-adjusted consumption is Sˆt = ( Cˆt − bCˆt−1 µYt )γ ( Sˆt−1 µYt )1−γ Taking logs of both sides yields log Sˆt = γ log ( Cˆt − bCˆt−1 µYt ) + (1− γ) log ( Sˆt−1 µYt ) Noting that log Sˆ = γ log ( Cˆ − b Cˆ µY ) + (1− γ) log ( Sˆ µY ) , the first-order Taylor series expansion around the steady state can be shown to be S˜t = ( γ 1− b/µY ) C˜t − ( γb/µY 1− b/µY ) C˜t−1 + (1− γ)S˜t−1 + ( γb/µY 1− b/µY − (1− γ) ) µ˜Yt 6. The FONC for consumption is Λˆt = ζtVˆ −σt − γSˆt Cˆt − b Cˆt−1µYt Πˆt − bβEˆt ζt+1Vˆ −σt+1 − γSˆt+1 Cˆt+1 − b CˆtµYt+1 Πˆt+1 (µYt+1)−σ  115 Taking logs of both sides yields log Λˆt = log ζtVˆ −σt − γSˆt Cˆt − b Cˆt−1µYt Πˆt − bβEˆt ζt+1Vˆ −σt+1 − γSˆt+1 Cˆt+1 − b CˆtµYt+1 Πˆt+1 (µYt+1)−σ  Noting that log Λˆ = log (( ζVˆ −σ − γSˆ Cˆ−b Cˆ µY Πˆ ) − bβEˆ [( ζVˆ −σ − γSˆ Cˆ−b Cˆ µY Πˆ )( µY )−σ]) , the first-order Taylor series expansion around the steady state can be shown to be Λ˜t = φ1 ( ζ˜t − σV˜t − φ3 ( Eˆtζ˜t+1 − σEˆtV˜t+1 )) − φ2 { S˜t + Π˜t + ( b/µY 1− b/µY ) C˜t−1 − ( b/µY 1− b/µY ) µ˜Yt − ( 1 + b2β ( µY )−σ−1 1− b/µY ) C˜t −φ3 ( EˆtS˜t+1 + EˆtΠ˜t+1 − ( 1 1− b/µY ) EˆtC˜t+1 )} + [ φ3 ( σ (φ1 − φ2)− φ2b/µ Y 1− b/µY )] Eˆt+1µ Y t+1 where φ1 = ζV −σ Λ , φ2 = γSΠ ΛC(1−b/µY ) , and φ3 = bβ ( µY )−σ . 7. The FONC for labor supply is Λˆt = θζtVˆ −σ t ψh θ−1 t Sˆt Wˆt/ωt Taking logs of both sides yields log Λˆt = log ( θζtVˆ −σ t ψh θ−1 t Sˆt Wˆt/ωt ) This equation is linear in logs; subtracting log Λˆ = log ( θζVˆ −σψhθ−1Sˆ Wˆ /ω ) directly yields Λ˜t = ζ˜t − σV˜t + (θ − 1) h˜t + S˜t − W˜t + ω˜t 116 8. The FONC for the geometric average of current and past habit-adjusted consumption is Πˆt = ζtVˆ −σ t ψh θ t + βEˆt [ (1− γ) Sˆt+1 Sˆt Πˆt+1 ( µYt+1 )1−σ] Taking logs of both sides yields log Πˆt = log ( ζtVˆ −σ t ψh θ t + βEˆt [ (1− γ) Sˆt+1 Sˆt Πˆt+1 ( µYt+1 )1−σ]) Noting that log Πˆ = log ( ζVˆ −σψhθ + β [ (1− γ) Sˆ Sˆ Πˆ ( µY )1−σ]) , the first-order Taylor series expansion around the steady state can be shown to be ΠΠ˜t = ( ζV −σψhθ ) ( ζ˜t − σV˜t + θh˜t ) + ( β(1− γ)Π (µY )1−σ)(EˆtS˜t+1 − S˜t + EˆtΠ˜t+1 + (1− σ)Eˆtµ˜Yt+1) 9. The FONC for capital is QˆtΛˆt = βEˆt [ Λˆt+1 ( Rˆt+1ut+1 + Qˆt+1(1− δ(ut+1)) )( µAt+1 ( µYt+1 )−σ)] Taking logs of both sides yields log Qˆt + log Λˆt = log ( βEˆt [ Λˆt+1 ( Rˆt+1ut+1 + Qˆt+1(1− δ(ut+1)) )( µAt+1 ( µYt+1 )−σ)]) 117 Noting that log Qˆ + log Λˆ = log ( β [ Λˆ ( Rˆu+ Qˆ(1− δ(u)) )( µA ( µY )−σ)]) , the first-order Taylor series expansion around the steady state can be shown to be Q˜t + Λ˜t = ( βµA ( µY )−σ Q ){ (Ru+Q(1− δ(u))) ( EˆtΛ˜t+1 + Eˆtµ˜ A t+1 − σEˆtµ˜Yt+1 ) + (Ru)EˆtR˜t+1 + ((R−Qδ′(u))u) Eˆtu˜t+1 + (Q(1− δ(u))) EˆtQ˜t+1 } 10. The FONC for the capacity utilization rate of capital is Rˆt = δ ′(ut)Qˆt Taking logs of both sides yields log Rˆt = log (δ ′(ut)) + log Qˆt Thus the first-order Taylor series expansion around the steady state can be shown to be R˜t = δ′′(u)u δ′(u) u˜t + Q˜t 11. The FONC for investment is Λˆt = z I t ( 1− Φ ( Iˆt Iˆt−1 µKt ) − Φ′ ( Iˆt Iˆt−1 µKt )( Iˆt Iˆt−1 µKt )) QˆtΛˆt + βEˆt [( zIt+1Φ ′ ( It+1 It µKt+1 )( It+1 It µKt+1 )2 Qˆt+1Λˆt+1 )( µAt+1 ( µYt+1 )−σ)] 118 Taking logs of both sides yields log Λˆt = log ( zIt ( 1− Φ ( Iˆt Iˆt−1 µKt ) − Φ′ ( Iˆt Iˆt−1 µKt )( Iˆt Iˆt−1 µKt )) QˆtΛˆt +βEˆt [( zIt+1Φ ′ ( It+1 It µKt+1 )( It+1 It µKt+1 )2 Qˆt+1Λˆt+1 )( µAt+1 ( µYt+1 )−σ)]) Noting that at steady state log Λˆ = log ( zI ( 1− Φ ( Iˆ Iˆ µK ) − Φ′ ( Iˆ Iˆ µK )( Iˆ Iˆ µK )) QˆΛˆ +βEˆ [( zIΦ′ ( I I µK )( I I µK )2 QˆΛˆ )( µA ( µY )−σ)]) the first-order Taylor series expansion around the steady state can be shown to be Λ˜t = z IQ { z˜It + Q˜t + Λ˜t + Φ′′ ( µK ) ( µK )2 [ I˜t−1 − µ˜Kt − I˜t +βµA ( µY )−σ µK ( EˆtI˜t+1 + Eˆtµ˜ K t+1 − I˜t )]} 12. The wage valuation equation is Wˆt = αh Yˆt ht Taking logs of both sides yields log Wˆt = logαh + log Yˆt − log ht 119 This is linear in logs. Subtracting both sides by log Wˆ = logαh + log Yˆ + log h the log-linearized equation can be shown to be W˜t = Y˜t − h˜t 13. The effective-capital valuation equation is Rˆt = αk Yˆt ut Kˆt−1 µKt Taking logs of both sides yields log Rˆt = logαk + log Yˆt − log ut − log Kˆt−1 + log µKt This is linear in logs. Subtracting both sides by log Rˆ = logαk + log Yˆ − log u − log Kˆt−1 + log µK the log-linearized equation can be shown to be R˜t = Y˜t − u˜t − K˜t−1 + µ˜Kt 14. Output growth is given by µYt = µ X t ( µAt )αk/(αk−1) Taking logs of both sides yields log µYt = log µ X t − ( αk 1− αk ) log µAt 120 This is linear in logs. Subtracting both sides by log µY = log µX − ( αk 1−αk ) log µA the log-linearized equation can be shown to be µ˜Yt = µ˜ X t − ( αk 1− αk ) µ˜At 15. Capital (and investment) growth is given by µKt = µYt µAt Taking logs of both sides yields log µKt = log µ Y t − log µAt This is linear in logs. Subtracting both sides by log µK = log µY − log µA the log- linearized equation can be shown to be µ˜Kt = µ˜ Y t − µ˜At 16. The stochastic trend for government spending is XG,Yt = ( XG,Yt−1 )ρx,g µYt Taking logs of both sides yields logXG,Yt = ρx,g logX G,Y t−1 − log µYt 121 This is linear in logs. Subtracting both sides by logXG,Y = ρx,g logX G,Y − log µY the log-linearized equation can be shown to be X˜G,Yt = ρx,gX˜ G,Y t−1 − µ˜Yt 17. The stationary exogenous stochastic processes x = { z, ζ, zI , g, ω, µA, µX } all evolve according to xt = x 1−ρxxρxt−1 0 x,t Taking logs of both sides yields lnxt = (1− ρx)x+ ρx lnxt−1 + ε0x,t where ε0x,t = ln  0 x,t. This is linear in logs. Subtracting both sides by lnx = (1− ρx)x+ ρx lnx the log-linearized equation can be shown to be x˜t = ρxx˜t−1 + ε0x,t 122 REFERENCES CITED Adam, K. (2005). Learning to Forecast and Cyclical Behavior of Output and Inflation. Macroeconomic Dynamics, 9(01):1–27. Aiyagari, S. R. (1994). ”Uninsured Idiosyncratic Risk and Aggregate Saving”. The Quarterly Journal of Economics, pages 659–684. Alexopoulos, M. (2011). “Read All about It!! What Happens Following a Technology Shock?”. American Economic Review, 101(4):1144–79. Allen, S. G. (1985). “Why Construction Industry Productivity is Declining”. The Review of Economics and Statistics, 67(4):661–69. An, S. and Schorfheide, F. (2007). Bayesian Analysis of DSGE Models. Econometric reviews, 26(2-4):113–172. Angeletos, G.-M., Hellwig, C., and Pavan, A. (2007). “Dynamic Global Games of Regime Change: Learning, Multiplicity, and the Timing of Attacks”. Econometrica, 75(3):711–756. Angeletos, G.-M. and La’O, J. (2010). ”Noisy Business Cycles”. In NBER Macroeconomics Annual 2009, Volume 24, pages 319–378. University of Chicago Press. Angeletos, G.-M. and Pavan, A. (2004). “Transparency of Information and Coordination in Economies with Investment Complementarities”. Technical report, National Bureau of Economic Research. Assenza, T., Bao, T., Hommes, C., Massaro, D., et al. (2014). Experiments on expectations in macroeconomics and finance. Experiments in macroeconomics, 17:11A`70. Barsky, R. B. and Sims, E. R. (2012). “Information, Animal Spirits, and the Meaning of Innovations in Consumer Confidence”. American Economic Review, 102(4):1343–77. Basu, S., Fernald, J., and Kimball, M. (2004). Are Technology Improvements Contractionary? Technical report, National Bureau of Economic Research. Basu, S., Fernald, J. G., and Kimball, M. S. (2006). Are Technology Improvements Contractionary? The American Economic Review, 96(5):1418–1448. 123 Beaudry, P. and Lucke, B. (2010). Letting Different Views about Business Cycles Compete. In NBER Macroeconomics Annual 2009, Volume 24, pages 413–455. University of Chicago Press. Beaudry, P., Nam, D., and Wang, J. (2011). Do Mood Swings Drive Business Cycles and Is It Rational? Technical report, National Bureau of Economic Research. Beaudry, P. and Portier, F. (2004). “An Exploration into Pigou’s Theory of Cycles”. Journal of Monetary Economics, 51(6):1183–1216. Beaudry, P. and Portier, F. (2006). “Stock Prices, News, and Economic Fluctuations”. American Economic Review, 96(4):1293–1307. Beaudry, P. and Portier, F. (2014). “News-Driven Business Cycles: Insights and Challenges”. Journal of Economic Literature, 52(4):993–1074. Becker, R. A. (1980). ”On the Long-run Steady State in a Simple Dynamic Model of Equilibrium with Heterogeneous Households”. The Quarterly Journal of Economics, 95(2):375–382. Benhabib, J. and Farmer, R. E. (1994). “Indeterminacy and Increasing Returns”. Journal of Economic Theory, 63(1):19–41. Bernanke, B. S. and Mishkin, F. S. (1997). “Inflation Targeting: A New Framework for Monetary Policy?”. Technical report, National Bureau of Economic Research. Blanchard, O. J. and Kahn, C. M. (1980). The Solution of Linear Difference Models under Rational Expectations. Econometrica: Journal of the Econometric Society, pages 1305–1311. Blanchard, O. J., L’Huillier, J.-P., and Lorenzoni, G. (2013). “News, Noise, and Fluctuations: An Empirical Exploration”. American Economic Review, 103(7):3045–70. Blinder, A. S. (1999). “Central Banking in Theory and Practice”. Mit press. Blinder, A. S. (2013). “After the Music Stopped: The Financial Crisis, the Response, and the Work Ahead”. Penguin. Branch, W., Evans, G. W., and McGough, B. (2012a). Finite-horizon learning. In CDMA Working Paper. Branch, W., Evans, G. W., and McGough, B. (2012b). “Finite Horizon Learning”. CDMA Working Paper Series 201204, Centre for Dynamic Macroeconomic Analysis. 124 Bullard, J. and Eusepi, S. (2014). ”When Does Determinacy Imply Expectational Stability?”. International Economic Review, 55(1):1–22. Burnside, C., Eichenbaum, M., and Rebelo, S. (1995). “Capital Utilization and Returns to Scale”. In NBER Macroeconomics Annual 1995, Volume 10, pages 67–124. MIT Press. Cho, S. (2015). “Determinacy and E-stability Under Reduced-form Learning”. Economics Letters. Christiano, L. J., Eichenbaum, M., and Evans, C. L. (2005). Nominal Rigidities and the Dynamic Effects of a Shock to Monetary Policy. Journal of political Economy, 113(1):1–45. Dombeck, B. (2016). “Implications of Solving the Co-movement Problem vis-a-vis Expectational Stability”. Ellison, M. and Pearlman, J. (2011). “Saddlepath Learning”. Journal of Economic Theory, 146(4):1500–1519. Eusepi, S. and Preston, B. (2010). “Central Bank Communication and Expectations Stabilization”. American Economic Journal: Macroeconomics, 2(3):235–71. Eusepi, S. and Preston, B. (2011). “Expectations, Learning, and Business Cycle Fluctuations”. American Economic Review, 101(6):2844–72. Evans, G. W. and Honkapohja, S. (2001). Learning and Expectations in Macroeconomics. Princeton University Press. Evans, G. W. and Honkapohja, S. (2006). “Monetary Policy, Expectations and Commitment”. The Scandinavian Journal of Economics, 108(1):15–38. Evans, G. W. and McGough, B. (2015). ”Learning to Optimize”. Working paper. Farmer, R. E. and Guo, J.-T. (1994). “Real Business Cycles and the Animal Spirits Hypothesis”. Journal of Economic Theory, 63(1):42–72. Greenwood, J., Hercowitz, Z., and Huffman, G. W. (1988). “Investment, Capacity Utilization, and the Real Business Cycle”. The American Economic Review, pages 402–417. Guerro´n-Quintana, P. and Nason, J. M. (2012). ”Bayesian Estimation of DSGE Models. Guo, J.-T., Sirbu, A.-I., and Weder, M. (2015). “News About Aggregate Demand and the Business Cycle”. Journal of Monetary Economics, (0). 125 Haario, H., Saksman, E., and Tamminen, J. (2001). An Adaptive Metropolis Algorithm. Bernoulli, pages 223–242. Hellwig, C. (2005). “Heterogeneous Information and the Benefits of Transparency”. Technical report, UCLA mimeo. Hellwig, C. (2010). ”Comment on ”Noisy Business Cycles”. Technical report, National Bureau of Economic Research. Hommes, C., Sonnemans, J., Tuinstra, J., and Van De Velden, H. (2007). “Learning in Cobweb Experiments”. Macroeconomic Dynamics, 11(S1):8–33. Honkapohja, S., Mitra, K., and Evans, G. W. (2012). ”Notes on Agents Behavioral Rules Under Adaptive Learning and Studies of Monetary Policy”. Jaimovich, N. and Rebelo, S. (2009). “Can News about the Future Drive the Business Cycle?”. American Economic Review, 99(4):1097–1118. Karnizova, L. (2010). ”News versus Sunspot Shocks in a New Keynesian Model”. Economics: The Open-Access, Open-Assessment E-Journal, 4. Karnizova, L. V. (2007). ”News versus Sunspot Shocks in Linear Rational Expectations Models”. Department of Economics, University of Ottawa= De´p. de Science e´conomique, Universite´ d’Ottawa. Khan, H. and Tsoukalas, J. (2012). “The Quantitative Importance of News Shocks in Estimated DSGE Models”. Journal of Money, Credit and Banking, 44(8):1535–1561. King, R. G., Plosser, C. I., and Rebelo, S. T. (1988). “Production, Growth and Business Cycles: I. The Basic Neoclassical Model”. Journal of monetary Economics, 21(2):195–232. Klein, P. (2000). Using the Generalized Schur Form to Solve a Multivariate Linear Rational Expectations Model. Journal of Economic Dynamics and Control, 24(10):1405–1423. Krusell, P. and McKay, A. (2010). “News Shocks and Business Cycles”. FRB Richmond Economic Quarterly, 96(4):373–397. Kydland, F. E. and Prescott, E. C. (1982). “Time to Build and Aggregate Fluctuations”. Econometrica: Journal of the Econometric Society, pages 1345–1370. Leeper, E. M., Richter, A. W., and Walker, T. B. (2012). “Quantitative Effects of Fiscal Foresight”. American Economic Journal: Economic Policy, 4(2):115–44. 126 Lorenzoni, G. (2009). “A Theory of Demand Shocks”. American Economic Review, 99(5):2050–84. Lucas, R. E. (1976). “Econometric Policy Evaluation: A Critique”. In Carnegie-Rochester conference series on public policy, volume 1, pages 19–46. Elsevier. Lucas, R. E. (1978). “Asset Prices in an Exchange Economy”. Econometrica: Journal of the Econometric Society, pages 1429–1445. Marcet, A. and Sargent, T. J. (1989a). “Convergence of Least-Squares Learning in Environments with Hidden State Variables and Private Information”. The Journal of Political Economy, pages 1306–1322. Marcet, A. and Sargent, T. J. (1989b). “Convergence of Least Squares Learning Mechanisms in Self-Referential Linear Stochastic Models”. Journal of Economic theory, 48(2):337–368. Marcet, A. and Sargent, T. J. (1989c). “Least-Squares Learning and the Dynamics of Hyperinflation”. In International Symposia in Economic Theory and Econometrics, edited by William Barnett, John Geweke, and Karl Shell, pages 119–137. McCallum, B. T. (2007). “E-stability vis-a-vis Determinacy Results for a Broad Class of Linear Rational Expectations Models”. Journal of Economic Dynamics and Control, 31(4):1376–1391. McFadden, D. (1989). “A Method of Simulated Moments for Estimation of Discrete Response Models Without Numerical Integration”. Econometrica: Journal of the Econometric Society, pages 995–1026. McGough, B. (2006). “Shocking Escapes”. Economic Journal, 116(511):507–528. Milani, F. (2007). “Expectations, Learning and Macroeconomic Persistence”. Journal of Monetary Economics, 54(7):2065–2082. Milani, F. and Rajbhandari, A. (2012). Expectation Formation and Monetary DSGE Models: Beyond the Rational Expectations Paradigm. Advances in Econometrics, 28:253. Mitra, K., Evans, G. W., and Honkapohja, S. (2013). “Policy Change and Learning in the RBC Model”. Journal of Economic Dynamics and Control, 37(10):1947 – 1971. Morris, S. and Shin, H. S. (2002). “Social Value of Public Information”. The American Economic Review, 92(5):1521–1534. 127 Morris, S., Shin, H. S., and Tong, H. (2006). “Social Value of Public Information: Morris and Shin (2002) Is Actually Pro-Transparency, Not Con: Reply”. The American Economic Review, pages 453–455. Muth, J. F. (1961). “Rational Expectations and the Theory of Price Movements”. Econometrica: Journal of the Econometric Society, pages 315–335. Pigou, A. C. (1927). “Industrial Fluctuations”. Macmillan. Preston, B. J. (2005). ”Learning About Monetary Policy Rules When Long-horizon Expectations Matter”. 1. Ramey, V. A. (2009). “Identifying Government Spending Shocks: It’s All in the Timing”. Working Paper 15464, National Bureau of Economic Research. Ramey, V. A. (2011). “Identifying Government Spending Shocks: It’s all in the Timing”. The Quarterly Journal of Economics, 126(1):1–50. Ruge-Murcia, F. J. (2007). Methods to Estimate Dynamic Stochastic General Equilibrium Models. Journal of Economic Dynamics and Control, 31(8):2599–2636. Schmitt-Grohe, S. and Uribe, M. (2012). “What’s News in Business Cycles”. Econometrica, 80(6):2733–2764. Shaby, B. and Wells, M. T. (2010). Exploring an adaptive metropolis algorithm. Technical report. Sims, C. A. (2002). Solving Linear Rational Expectations Models. Computational economics, 20(1):1–20. Sims, E. (2016). What’s news in News? A cautionary note on using a variance decomposition to assess the quantitative importance of news shocks. Journal of Economic Dynamics and Control, 73(1):41–60. Slobodyan, S. and Wouters, R. (2007). Adaptive Learning in an Estimated Medium–Size DSGE Model. Technical report, mimeo. Slobodyan, S. and Wouters, R. (2008). Estimating a Medium–scale DSGE Model with Expectations Based on Small Forecasting Models. Technical report, mimeo. Slobodyan, S. and Wouters, R. (2012). Learning in an Estimated Medium-scale DSGE Model. Journal of Economic Dynamics and control, 36(1):26–46. Smets, F. and Wouters, R. (2007). “Shocks and Frictions in US Business Cycles: A Bayesian DSGE Approach”. American Economic Review, 97(3):586–606. 128 Stiglitz, J. E. (2014). “Reconstructing Macroeconomic Theory to Manage Economic Policy”. Working Paper 20517, National Bureau of Economic Research. Svensson, L. E. O. (2006). “Social Value of Public Information: Comment: Morris and Shin (2002) Is Actually Pro-Transparency, Not Con”. American Economic Review, 96(1):448–452. Tong, H. (2004). “Do Transparency Standards Improve Macroeconomic Forecasting?”. Unpublished Paper, 455. Uhlig, H. et al. (1995). A Toolkit for Analyzing Nonlinear Dynamic Stochastic Models Easily. Veldkamp, L. L. (2011). “Information Choice in Macroeconomics and Finance”. Princeton University Press. Williams, N. (2003a). ”Adaptive Learning and Business Cycles”. Williams, N. (2003b). ”Adaptive Learning and Business Cycles. Manuscript, Princeton University. Woodford, M. (2007). “The Case for Forecast Targeting as a Monetary Policy Strategy”. Journal of Economic Perspectives, 21(4):3–24. Woodford, M. (2013). “Fedspeak: Does it Matter How Central Bankers Explain Themselves?”. Yang, S.-C. S. and Traum, N. (2011). “When Does Government Debt Crowd Out Investment?”. In 2011 Meeting Papers, number 479. Society for Economic Dynamics. 129