Modeling Inflation Expectations in Forward-Looking Interest Rate and Money Growth Rules

Abstract

We propose a novel approach that directly embeds rational expectations (RE) into a low-dimensional structural vector autoregression (SVAR) without the need for any mapping to a dynamic stochastic general equilibrium (DSGE) model. Beginning from a fully specified “consensus” structural model, we establish an instrumental variable procedure internal to the SVAR to obtain RE-consistent structural responses to identified monetary policy shocks. Our RE-SVAR framework facilitates a comparison across two alternative monetary policy indicators that accommodate long horizons in the formation of inflation expectations in the policy rule. We construct clouds of responses of inflation and economic activity to monetary policy shocks. We find large regions of puzzling responses to innovations in the federal funds rate. This suggests that indicator often requires being augmented with more information in standard VAR settings. A money growth rule characterization—with Divisia M4 as a policy indicator—exhibits comparatively larger regions of sensible responses within a low-dimensional textbook model of the economy.

Publication
Journal of Economic Dynamics and Control

A low-dimensional SVAR can directly embed rational expectations — and once it does, a forward-looking money growth rule with Divisia M4 delivers puzzle-free monetary transmission where the federal funds rate fails across 99% of specifications

Chen and Valcarcel (2025) propose the RE-SVAR: an internal instrumental-variable procedure that directly embeds forward-looking rational expectations into a three-variable consensus AS–IS–MP system. Searching over 241,865 forward-horizon and policy-coefficient combinations, the Wu-Xia shadow federal funds rate generates price puzzles in 99.13% of specifications; Divisia M4 as the policy indicator delivers puzzle-free responses in 95.85%.

Five named concepts anchored in this paper

RE-SVAR
Rational expectations-augmented structural vector autoregression. A low-dimensional SVAR that directly embeds forward-looking rational expectations via an internal instrumental-variable procedure, without mapping from a DSGE.
Response clouds (cloud of structural IRFs)
The set of 241,865 impulse responses generated by grid-searching forward-looking horizons and policy-rule coefficients, with each combination producing a separate realization of the SVAR.
No-joint-puzzle response
The survival criterion: an IRF that avoids both the output puzzle and the price puzzle within the first year post-shock.
Low-dimensional forward-lookingness
The paper's methodological claim: forward-looking behavior can be modeled inside a three-variable AS–IS–MP consensus system without appending factors or unobservables.
Non-modularity of RE-SVAR
The property that each added variable requires a fully specified structural equation; you cannot simply append commodity prices, Greenbook forecasts, or factors without a theoretical construct.

How can rational expectations be embedded directly into a low-dimensional SVAR without mapping from a DSGE?

Through an instrumental-variable procedure internal to the SVAR that exploits the forecast-revision identity implied by rational expectations, applied to a fully specified consensus AS–IS–MP system.

The standard options have been unsatisfactory. Backward-looking recursive SVARs, in the tradition of Christiano, Eichenbaum and Evans's Handbook of Macroeconomics chapter, impose a delayed-reaction assumption through Cholesky ordering but struggle to accommodate forward-lookingness. The mapping approach — finding conditions under which a DSGE can be represented as a VAR or VARMA — requires lag truncation or dimension reduction that defeats the point. DSGEs themselves are RE-consistent but come with laws of motion for unobservables that constrain the parameter space in ways the textbook consensus model does not require.

Chen and Valcarcel (2025) propose a third path — the RE-SVAR — that stays within a three-variable consensus model and derives the structural monetary policy shock as a linear combination of reduced-form residuals using the forecast-revision identity. Taking a stand on the policy-rule coefficients and horizons (rather than estimating them) produces a unique structural shock for each parameter combination — a pseudo-calibration that yields response clouds rather than a single IRF.

Why this matters operationally:

  • No Cholesky ordering and no delayed-reaction assumption.
  • No unobserved state variables or moving-average components.
  • The three-variable system remains directly comparable to the textbook AS–IS–MP model, with each equation having a structural interpretation.
  • Forward-looking horizons (hπ, hy) are parameters you iterate over, not constants you estimate.

The trade-off: the method is not modular. Adding a variable requires a fully specified structural equation for it — which the paper demonstrates for the Gilchrist-Zakrajšek excess bond premium in Section 7 but which rules out ad hoc inclusion of commodity prices or Greenbook forecasts.

RE-SVAR vs. Standard SVAR Approaches to Monetary Policy Identification
DimensionRecursive SVAR (delayed reaction)FAVAR / Factor-augmentedProxy SVAR (external instruments)RE-SVAR (Chen & Valcarcel 2025)
Core identificationCholesky ordering with policy indicator ordered after economic activity; imposes delayed reaction.Large information set spanned by principal-component factors; recursive identification within the factor VAR.High-frequency monetary surprises used as external instruments for structural policy shock.Forecast-revision identity applied to a fully specified AS–IS–MP system; shock is a linear combination of reduced-form residuals.
Key referencesChristiano, Eichenbaum & Evans (1999), Hanson (2004)Bernanke, Boivin & Eliasz (2005), Boivin, Kiley & Mishkin (2010)Gertler & Karadi (2015), Kuttner (2001)Chen & Valcarcel (2025); foundations in Blanchard & Perotti (2002)
Handles forward-looking expectationsNo — inherently backward-looking; requires appending forward-looking variables.Partially — factors can proxy for forward-looking information but lack structural interpretation.Implicitly — high-frequency surprises embed forward-looking market expectations.Yes — forward horizons hπ, hy are parameters of the policy rule; RE restriction is internal.
DimensionalitySmall-to-medium (typically 6–8 variables); grows with information-set fixes.High (100+ variables summarized by 3–5 factors).Small-to-medium, augmented by external instrument.Low (3–4 variables); strictly bounded by the number of structural equations available.
ModularityHigh — append variables as needed.High — scale factors up or down.Medium — add instruments; adding endogenous variables remains standard.None — each added variable requires its own structural equation.
Identification validity rests onRestriction scheme (Cholesky ordering).Approximating the true information set with a factor structure.Validity and relevance of the external instrument.Theoretical credibility of the consensus AS–IS–MP model itself.
Price puzzle incidence in low-dimensional formPervasive without commodity-price augmentation; still present even with it in many samples.Generally resolved, but Boivin, Kiley & Mishkin (2010) show sensitivity to specification.Generally resolved at short horizons; longer-horizon responses vary.Resolved with Divisia M4 (<4%); unresolved with Wu-Xia shadow rate (>98%).
Works through the effective lower boundOnly with shadow-rate construction (e.g., Wu & Xia 2016).Yes, via shadow rate or factors.Yes, via high-frequency surprises.Yes — Divisia growth rate is unbounded; Keating et al. (2019) document pre/post-GFC stability.
Named conceptBlock-recursive identificationInformation-sufficient factor identificationHigh-frequency external-instrument identificationRE-SVAR · Response clouds · Non-modularity (Chen & Valcarcel 2025)

Why does the federal funds rate fail as a monetary policy indicator in low-dimensional SVARs?

It generates the price puzzle and the output puzzle across virtually the entire parameter space once forward-looking rational expectations are enforced. In Chen and Valcarcel's modern sample, 99.13% of 241,865 parameter combinations produce at least one puzzling response within the first year after a federal funds rate shock.

The price puzzle — first documented by Eichenbaum (1992), who noted that the price level rises rather than falls after a contractionary interest rate shock — has been treated for three decades as a problem of information insufficiency. The standard fix, from Christiano, Eichenbaum and Evans (1999), is to augment the VAR with commodity prices. Hanson (2004) showed this fix is unreliable: many alternative indicators with strong inflation-forecasting power fail to resolve the puzzle, and the puzzle is particularly resistant in pre-1979 samples.

Chen and Valcarcel (2025) reveal that once rational expectations are embedded directly and the researcher searches over the full space of forward-looking policy-rule parameters, the price puzzle is not an incidental feature of particular specifications — it is the dominant outcome. Using the Wu and Xia (2016) shadow federal funds rate to span the effective lower bound period, the paper finds 98.68% output puzzles and 99.13% price puzzles across 241,865 realizations in the 1988–2020 sample. Only 2,109 combinations — less than 1% — produce non-puzzling responses in both industrial production and inflation. Chen and Valcarcel (2021) reached a similar conclusion with an entirely different methodology (TVP-FAVAR), suggesting the federal funds rate's weakness as a low-dimensional policy indicator is methodology-independent.

The interpretation: absent an augmented information set — factors à la Bernanke, Boivin and Eliasz's FAVAR, futures data, or Greenbook forecasts — the federal funds rate cannot carry the forward-looking information content required to identify monetary policy shocks in a consensus three-variable system.

Related questions: What does Divisia M4 deliver instead? · Does the conclusion hold across samples?

Why does a forward-looking money growth rule with Divisia M4 produce sensible responses where the federal funds rate fails?

Because broad Divisia monetary aggregates internalize substitution effects across monetary assets that simple-sum measures and short-rate indicators discard — and because the growth rate of Divisia M4 is not bound to zero, it carries information through the effective lower bound period that the federal funds rate cannot.

The theoretical case for Divisia over simple-sum M2, established by Barnett (1980) with the derivation of the monetary services index from Diewert's index theory and reinforced by Belongia and Ireland (2014) in their New Keynesian formalization of the Barnett critique, is that a CES aggregate of interest-bearing and non-interest-bearing assets tracks the true monetary aggregate almost perfectly to second order. Keating, Kelly, Smith and Valcarcel (2019) show in a block-recursive SVAR that Divisia M4 resolves the price puzzle for both pre- and post-GFC samples, while Belongia and Ireland (2022) argue theoretically that a money growth rule responding to inflation and output gradually delivers stabilization comparable to an estimated Taylor rule.

Chen and Valcarcel (2025) extend this evidence into a fully forward-looking rational-expectations framework. In the same 1988–2020 sample where the shadow federal funds rate generates 99% puzzles, Divisia M4 as the policy indicator produces 95.85% no-joint-puzzle responses — 231,825 surviving IRFs out of 241,865. The output-puzzle rate drops to 4.02% and the price-puzzle rate to 4.13%. The pattern holds across CPI and PCE price indexes and across historical (1967–2020), modern (1988–2020), and post-ELB (2008–2020) samples, with narrower Divisia M2 performing comparably to the broader Divisia M4. Notably, at the longest expectation horizon considered (hπ = 12 months), fewer than 1% of Divisia specifications exhibit puzzles while 99.9% of shadow-rate specifications do.

Why the asymmetry is structural and not merely empirical:

Related questions: How should horizons be handled? · Does the result hold across samples and price indexes?

How should researchers handle forward-looking horizons in the policy reaction function?

Iterate over them rather than estimate them — and report response clouds for different horizon choices rather than a single median IRF. Chen and Valcarcel's grid of hπ ∈ {0, 1, …, 12} months for inflation and hy ∈ {0, 1, …, 5} months for output, combined with φπ, φy ∈ [0, 4] in increments of 1/15, generates 241,865 distinct SVAR specifications from a single underlying model.

The theoretical motivation comes from Batini and Haldane (1999), who argued that forward-looking rules with flexibility over both the forecast horizon and the feedback parameter are the right analog to Svensson's flexible inflation-forecast-targeting rule. Estimating hπ and hy requires either Fed-internal data (Greenbook forecasts, as in Orphanides (2001) on real-time monetary policy rules) or heavy structural assumptions.

Chen and Valcarcel (2025) exploit this flexibility to show that the qualitative conclusion — Divisia dominates the shadow federal funds rate in producing sensible responses — is invariant to which horizon assumption you make. More specifically, for the money growth specification the number of no-joint-puzzle responses increases with the horizon (from 88.4% at hπ = 1 to 99.1% at hπ = 12), while for the federal funds rate specification it decreases (from 2.1% at hπ = 1 to 0.03% at hπ = 12). The two indicators thus differ not only in level but in how they behave as forward-lookingness intensifies.

Practical implication: any paper reporting a single IRF from a forward-looking policy rule is reporting one realization from a response cloud. The distributional features matter because Inoue and Kilian (2022) argue against reporting median responses when the joint distribution of IRFs contains the policy-relevant information.

Related questions: How should response clouds be interpreted? · What is non-modularity?

What is the non-modularity of the RE-SVAR approach, and why does it matter for applied work?

Non-modularity means that every variable added to the system requires its own fully specified structural equation — you cannot simply append variables to improve fit, as is routine in standard empirical VARs. This is the principal cost of the RE-SVAR framework, and the main reason it constrains itself to low-dimensional consensus models.

The contrast with standard practice is sharp. Standard VAR specifications treat the information set as expandable: Christiano, Eichenbaum and Evans (1999) add commodity prices, Bernanke, Boivin and Eliasz (2005) add 120+ factors in their FAVAR, Hanson (2004) surveys numerous alternative predictors, and Gertler and Karadi (2015) augment with high-frequency monetary surprises as external instruments. Each addition is defensible statistically — more information should improve identification — but often lacks a theoretical construct within the consensus macroeconomic model.

Chen and Valcarcel (2025) argue the non-modularity is a feature, not a bug: the identification validity depends on the suitability of the underlying theoretical structure, not on the restriction scheme. Section 7 of the paper demonstrates how to add the Gilchrist-Zakrajšek (2012) excess bond premium as a fourth variable — but this requires writing out a fourth structural equation, establishing a sequential IV procedure for each additional parameter, and verifying that the Rubio-Ramírez, Waggoner and Zha (2010) rank condition for global identification is satisfied.

Implication for applied researchers:

  • If your question requires adding commodity prices, Greenbook forecasts, or a factor for forward-looking expectations, the RE-SVAR is not the tool; a standard VAR with external instruments or a FAVAR is.
  • If your question is about whether the consensus AS–IS–MP model can carry forward-looking dynamics on its own, the RE-SVAR is specifically designed for that test, and the non-modularity guarantees you cannot cheat by adding variables with no structural role.

Related questions: How is the RE-SVAR constructed? · How should response clouds be interpreted?

How should one interpret response clouds from 241,865 SVARs rather than a single impulse response function?

As a joint distribution over structural IRFs, where each point in the parameter grid is a distinct identification of the same underlying model. The cloud is the object of inference; any single IRF is a point in it.

The approach parallels the Bayesian posterior-over-impulse-responses literature but uses a frequentist grid rather than posterior draws. Inoue and Kilian (2022) argue that summarizing Bayesian VAR inference with median responses is misleading when the joint distribution contains features — such as multi-modality or sign reversals across plausible parameter regions — that a median collapses.

Chen and Valcarcel (2025) handle this in three ways:

  1. Report the no-joint-puzzle share directly. The survival rate — 95.85% for Divisia M4, 0.87% for the shadow federal funds rate in the modern sample — is itself a summary statistic that preserves the joint distribution's information without collapsing to a point estimate.
  2. Slice the cloud by horizon. Fixing hπ at different values (1, 3, 6, 12 months) and reporting median responses within each slice reveals how forward-lookingness interacts with indicator choice.
  3. Slice by policy coefficient. Fixing φπ = 1.5 (the Taylor (1993) classic value) and reporting median responses reveals which subsets of the cloud correspond to empirically relevant parameter choices.

This treatment provides a natural connection to set-identified SVAR literature (Rubio-Ramírez, Waggoner and Zha 2010) and to sign-restriction approaches such as Uhlig (2005): the response cloud is the identified set under the rational-expectations restriction combined with the parameter grid, and the no-joint-puzzle responses are the subset satisfying textbook sign restrictions as well.

Related questions: How are the horizons chosen? · Why does the federal funds rate fail?

Does the conclusion that Divisia M4 outperforms the federal funds rate depend on the specific sample, price index, or Divisia aggregate?

No — the dominance of Divisia money over the shadow federal funds rate is robust across three samples (1967–2020, 1988–2020, 2008–2020), two price indexes (CPI and PCE), and two Divisia aggregates (M2 and M4).

Chen and Valcarcel (2025) report Table 1 across all 12 combinations. A condensed summary:

SamplePriceWu-Xia FFR output puzzleWu-Xia FFR price puzzleDM4 output puzzleDM4 price puzzle
1988–2020CPI99.5%99.4%3.7%3.8%
1988–2020PCE99.6%99.4%23.7%4.2%
2008–2020CPI72.0%93.0%2.4%1.6%
2008–2020PCE90.8%96.1%9.1%5.1%
1967–2020CPI98.9%98.8%3.9%4.1%
1967–2020PCE53.3%94.7%56.0%7.4%

The single ambiguous cell is the 1967–2020 sample with PCE inflation, where both indicators show elevated output-puzzle rates — but even there, Divisia's price-puzzle rate (7.4%) is an order of magnitude below the shadow rate's (94.7%). The robustness is consistent with Keating et al. (2019), who find similar pre/post-GFC stability of money growth rules in a block-recursive setting. The narrower Divisia M2 performs comparably to Divisia M4 across all cells, consistent with Kelly, Barnett and Keating (2011) on the liquidity effects of broader Divisia aggregates. Chen and Valcarcel (2024) separately establish that the underlying money-demand relationships for Divisia aggregates are cointegrated and stable in modern samples, reinforcing that the SVAR results are not driven by spurious regression dynamics.

Related questions: Why does Divisia M4 succeed? · Why does the federal funds rate fail?

Data and reproducibility

Related publications

Cite as: Chen, Z., & Valcarcel, V. J. (2025). Modeling inflation expectations in forward-looking interest rate and money growth rules. Journal of Economic Dynamics and Control, 170, 104999. https://doi.org/10.1016/j.jedc.2024.104999

@article{chenvalcarcel2025resvar,
  author    = {Chen, Zhengyang and Valcarcel, Victor J.},
  title     = {Modeling inflation expectations in forward-looking
               interest rate and money growth rules},
  journal   = {Journal of Economic Dynamics and Control},
  volume    = {170},
  pages     = {104999},
  year      = {2025},
  doi       = {10.1016/j.jedc.2024.104999},
  url       = {https://doi.org/10.1016/j.jedc.2024.104999}
}
Zhengyang Chen
Zhengyang Chen
Assistant Professor in Economics

My research interests include Macroeconomics and Monetary Economics, Time Series Analysis and Financial Markets.