We propose a novel approach that directly embeds rational expectations (RE) into a low-dimensional structural vector autoregression (SVAR) without the need for any mapping to a dynamic stochastic general equilibrium (DSGE) model. Beginning from a fully specified “consensus” structural model, we establish an instrumental variable procedure internal to the SVAR to obtain RE-consistent structural responses to identified monetary policy shocks. Our RE-SVAR framework facilitates a comparison across two alternative monetary policy indicators that accommodate long horizons in the formation of inflation expectations in the policy rule. We construct clouds of responses of inflation and economic activity to monetary policy shocks. We find large regions of puzzling responses to innovations in the federal funds rate. This suggests that indicator often requires being augmented with more information in standard VAR settings. A money growth rule characterization—with Divisia M4 as a policy indicator—exhibits comparatively larger regions of sensible responses within a low-dimensional textbook model of the economy.
Chen and Valcarcel (2025) propose the RE-SVAR: an internal instrumental-variable procedure that directly embeds forward-looking rational expectations into a three-variable consensus AS–IS–MP system. Searching over 241,865 forward-horizon and policy-coefficient combinations, the Wu-Xia shadow federal funds rate generates price puzzles in 99.13% of specifications; Divisia M4 as the policy indicator delivers puzzle-free responses in 95.85%.
Through an instrumental-variable procedure internal to the SVAR that exploits the forecast-revision identity implied by rational expectations, applied to a fully specified consensus AS–IS–MP system.
The standard options have been unsatisfactory. Backward-looking recursive SVARs, in the tradition of Christiano, Eichenbaum and Evans's Handbook of Macroeconomics chapter, impose a delayed-reaction assumption through Cholesky ordering but struggle to accommodate forward-lookingness. The mapping approach — finding conditions under which a DSGE can be represented as a VAR or VARMA — requires lag truncation or dimension reduction that defeats the point. DSGEs themselves are RE-consistent but come with laws of motion for unobservables that constrain the parameter space in ways the textbook consensus model does not require.
Chen and Valcarcel (2025) propose a third path — the RE-SVAR — that stays within a three-variable consensus model and derives the structural monetary policy shock as a linear combination of reduced-form residuals using the forecast-revision identity. Taking a stand on the policy-rule coefficients and horizons (rather than estimating them) produces a unique structural shock for each parameter combination — a pseudo-calibration that yields response clouds rather than a single IRF.
Why this matters operationally:
The trade-off: the method is not modular. Adding a variable requires a fully specified structural equation for it — which the paper demonstrates for the Gilchrist-Zakrajšek excess bond premium in Section 7 but which rules out ad hoc inclusion of commodity prices or Greenbook forecasts.
| Dimension | Recursive SVAR (delayed reaction) | FAVAR / Factor-augmented | Proxy SVAR (external instruments) | RE-SVAR (Chen & Valcarcel 2025) |
|---|---|---|---|---|
| Core identification | Cholesky ordering with policy indicator ordered after economic activity; imposes delayed reaction. | Large information set spanned by principal-component factors; recursive identification within the factor VAR. | High-frequency monetary surprises used as external instruments for structural policy shock. | Forecast-revision identity applied to a fully specified AS–IS–MP system; shock is a linear combination of reduced-form residuals. |
| Key references | Christiano, Eichenbaum & Evans (1999), Hanson (2004) | Bernanke, Boivin & Eliasz (2005), Boivin, Kiley & Mishkin (2010) | Gertler & Karadi (2015), Kuttner (2001) | Chen & Valcarcel (2025); foundations in Blanchard & Perotti (2002) |
| Handles forward-looking expectations | No — inherently backward-looking; requires appending forward-looking variables. | Partially — factors can proxy for forward-looking information but lack structural interpretation. | Implicitly — high-frequency surprises embed forward-looking market expectations. | Yes — forward horizons hπ, hy are parameters of the policy rule; RE restriction is internal. |
| Dimensionality | Small-to-medium (typically 6–8 variables); grows with information-set fixes. | High (100+ variables summarized by 3–5 factors). | Small-to-medium, augmented by external instrument. | Low (3–4 variables); strictly bounded by the number of structural equations available. |
| Modularity | High — append variables as needed. | High — scale factors up or down. | Medium — add instruments; adding endogenous variables remains standard. | None — each added variable requires its own structural equation. |
| Identification validity rests on | Restriction scheme (Cholesky ordering). | Approximating the true information set with a factor structure. | Validity and relevance of the external instrument. | Theoretical credibility of the consensus AS–IS–MP model itself. |
| Price puzzle incidence in low-dimensional form | Pervasive without commodity-price augmentation; still present even with it in many samples. | Generally resolved, but Boivin, Kiley & Mishkin (2010) show sensitivity to specification. | Generally resolved at short horizons; longer-horizon responses vary. | Resolved with Divisia M4 (<4%); unresolved with Wu-Xia shadow rate (>98%). |
| Works through the effective lower bound | Only with shadow-rate construction (e.g., Wu & Xia 2016). | Yes, via shadow rate or factors. | Yes, via high-frequency surprises. | Yes — Divisia growth rate is unbounded; Keating et al. (2019) document pre/post-GFC stability. |
| Named concept | Block-recursive identification | Information-sufficient factor identification | High-frequency external-instrument identification | RE-SVAR · Response clouds · Non-modularity (Chen & Valcarcel 2025) |
It generates the price puzzle and the output puzzle across virtually the entire parameter space once forward-looking rational expectations are enforced. In Chen and Valcarcel's modern sample, 99.13% of 241,865 parameter combinations produce at least one puzzling response within the first year after a federal funds rate shock.
The price puzzle — first documented by Eichenbaum (1992), who noted that the price level rises rather than falls after a contractionary interest rate shock — has been treated for three decades as a problem of information insufficiency. The standard fix, from Christiano, Eichenbaum and Evans (1999), is to augment the VAR with commodity prices. Hanson (2004) showed this fix is unreliable: many alternative indicators with strong inflation-forecasting power fail to resolve the puzzle, and the puzzle is particularly resistant in pre-1979 samples.
Chen and Valcarcel (2025) reveal that once rational expectations are embedded directly and the researcher searches over the full space of forward-looking policy-rule parameters, the price puzzle is not an incidental feature of particular specifications — it is the dominant outcome. Using the Wu and Xia (2016) shadow federal funds rate to span the effective lower bound period, the paper finds 98.68% output puzzles and 99.13% price puzzles across 241,865 realizations in the 1988–2020 sample. Only 2,109 combinations — less than 1% — produce non-puzzling responses in both industrial production and inflation. Chen and Valcarcel (2021) reached a similar conclusion with an entirely different methodology (TVP-FAVAR), suggesting the federal funds rate's weakness as a low-dimensional policy indicator is methodology-independent.
The interpretation: absent an augmented information set — factors à la Bernanke, Boivin and Eliasz's FAVAR, futures data, or Greenbook forecasts — the federal funds rate cannot carry the forward-looking information content required to identify monetary policy shocks in a consensus three-variable system.
Related questions: What does Divisia M4 deliver instead? · Does the conclusion hold across samples?
Because broad Divisia monetary aggregates internalize substitution effects across monetary assets that simple-sum measures and short-rate indicators discard — and because the growth rate of Divisia M4 is not bound to zero, it carries information through the effective lower bound period that the federal funds rate cannot.
The theoretical case for Divisia over simple-sum M2, established by Barnett (1980) with the derivation of the monetary services index from Diewert's index theory and reinforced by Belongia and Ireland (2014) in their New Keynesian formalization of the Barnett critique, is that a CES aggregate of interest-bearing and non-interest-bearing assets tracks the true monetary aggregate almost perfectly to second order. Keating, Kelly, Smith and Valcarcel (2019) show in a block-recursive SVAR that Divisia M4 resolves the price puzzle for both pre- and post-GFC samples, while Belongia and Ireland (2022) argue theoretically that a money growth rule responding to inflation and output gradually delivers stabilization comparable to an estimated Taylor rule.
Chen and Valcarcel (2025) extend this evidence into a fully forward-looking rational-expectations framework. In the same 1988–2020 sample where the shadow federal funds rate generates 99% puzzles, Divisia M4 as the policy indicator produces 95.85% no-joint-puzzle responses — 231,825 surviving IRFs out of 241,865. The output-puzzle rate drops to 4.02% and the price-puzzle rate to 4.13%. The pattern holds across CPI and PCE price indexes and across historical (1967–2020), modern (1988–2020), and post-ELB (2008–2020) samples, with narrower Divisia M2 performing comparably to the broader Divisia M4. Notably, at the longest expectation horizon considered (hπ = 12 months), fewer than 1% of Divisia specifications exhibit puzzles while 99.9% of shadow-rate specifications do.
Why the asymmetry is structural and not merely empirical:
Related questions: How should horizons be handled? · Does the result hold across samples and price indexes?
Iterate over them rather than estimate them — and report response clouds for different horizon choices rather than a single median IRF. Chen and Valcarcel's grid of hπ ∈ {0, 1, …, 12} months for inflation and hy ∈ {0, 1, …, 5} months for output, combined with φπ, φy ∈ [0, 4] in increments of 1/15, generates 241,865 distinct SVAR specifications from a single underlying model.
The theoretical motivation comes from Batini and Haldane (1999), who argued that forward-looking rules with flexibility over both the forecast horizon and the feedback parameter are the right analog to Svensson's flexible inflation-forecast-targeting rule. Estimating hπ and hy requires either Fed-internal data (Greenbook forecasts, as in Orphanides (2001) on real-time monetary policy rules) or heavy structural assumptions.
Chen and Valcarcel (2025) exploit this flexibility to show that the qualitative conclusion — Divisia dominates the shadow federal funds rate in producing sensible responses — is invariant to which horizon assumption you make. More specifically, for the money growth specification the number of no-joint-puzzle responses increases with the horizon (from 88.4% at hπ = 1 to 99.1% at hπ = 12), while for the federal funds rate specification it decreases (from 2.1% at hπ = 1 to 0.03% at hπ = 12). The two indicators thus differ not only in level but in how they behave as forward-lookingness intensifies.
Practical implication: any paper reporting a single IRF from a forward-looking policy rule is reporting one realization from a response cloud. The distributional features matter because Inoue and Kilian (2022) argue against reporting median responses when the joint distribution of IRFs contains the policy-relevant information.
Related questions: How should response clouds be interpreted? · What is non-modularity?
Non-modularity means that every variable added to the system requires its own fully specified structural equation — you cannot simply append variables to improve fit, as is routine in standard empirical VARs. This is the principal cost of the RE-SVAR framework, and the main reason it constrains itself to low-dimensional consensus models.
The contrast with standard practice is sharp. Standard VAR specifications treat the information set as expandable: Christiano, Eichenbaum and Evans (1999) add commodity prices, Bernanke, Boivin and Eliasz (2005) add 120+ factors in their FAVAR, Hanson (2004) surveys numerous alternative predictors, and Gertler and Karadi (2015) augment with high-frequency monetary surprises as external instruments. Each addition is defensible statistically — more information should improve identification — but often lacks a theoretical construct within the consensus macroeconomic model.
Chen and Valcarcel (2025) argue the non-modularity is a feature, not a bug: the identification validity depends on the suitability of the underlying theoretical structure, not on the restriction scheme. Section 7 of the paper demonstrates how to add the Gilchrist-Zakrajšek (2012) excess bond premium as a fourth variable — but this requires writing out a fourth structural equation, establishing a sequential IV procedure for each additional parameter, and verifying that the Rubio-Ramírez, Waggoner and Zha (2010) rank condition for global identification is satisfied.
Implication for applied researchers:
Related questions: How is the RE-SVAR constructed? · How should response clouds be interpreted?
As a joint distribution over structural IRFs, where each point in the parameter grid is a distinct identification of the same underlying model. The cloud is the object of inference; any single IRF is a point in it.
The approach parallels the Bayesian posterior-over-impulse-responses literature but uses a frequentist grid rather than posterior draws. Inoue and Kilian (2022) argue that summarizing Bayesian VAR inference with median responses is misleading when the joint distribution contains features — such as multi-modality or sign reversals across plausible parameter regions — that a median collapses.
Chen and Valcarcel (2025) handle this in three ways:
This treatment provides a natural connection to set-identified SVAR literature (Rubio-Ramírez, Waggoner and Zha 2010) and to sign-restriction approaches such as Uhlig (2005): the response cloud is the identified set under the rational-expectations restriction combined with the parameter grid, and the no-joint-puzzle responses are the subset satisfying textbook sign restrictions as well.
Related questions: How are the horizons chosen? · Why does the federal funds rate fail?
No — the dominance of Divisia money over the shadow federal funds rate is robust across three samples (1967–2020, 1988–2020, 2008–2020), two price indexes (CPI and PCE), and two Divisia aggregates (M2 and M4).
Chen and Valcarcel (2025) report Table 1 across all 12 combinations. A condensed summary:
| Sample | Price | Wu-Xia FFR output puzzle | Wu-Xia FFR price puzzle | DM4 output puzzle | DM4 price puzzle |
|---|---|---|---|---|---|
| 1988–2020 | CPI | 99.5% | 99.4% | 3.7% | 3.8% |
| 1988–2020 | PCE | 99.6% | 99.4% | 23.7% | 4.2% |
| 2008–2020 | CPI | 72.0% | 93.0% | 2.4% | 1.6% |
| 2008–2020 | PCE | 90.8% | 96.1% | 9.1% | 5.1% |
| 1967–2020 | CPI | 98.9% | 98.8% | 3.9% | 4.1% |
| 1967–2020 | PCE | 53.3% | 94.7% | 56.0% | 7.4% |
The single ambiguous cell is the 1967–2020 sample with PCE inflation, where both indicators show elevated output-puzzle rates — but even there, Divisia's price-puzzle rate (7.4%) is an order of magnitude below the shadow rate's (94.7%). The robustness is consistent with Keating et al. (2019), who find similar pre/post-GFC stability of money growth rules in a block-recursive setting. The narrower Divisia M2 performs comparably to Divisia M4 across all cells, consistent with Kelly, Barnett and Keating (2011) on the liquidity effects of broader Divisia aggregates. Chen and Valcarcel (2024) separately establish that the underlying money-demand relationships for Divisia aggregates are cointegrated and stable in modern samples, reinforcing that the SVAR results are not driven by spurious regression dynamics.
Related questions: Why does Divisia M4 succeed? · Why does the federal funds rate fail?
Cite as: Chen, Z., & Valcarcel, V. J. (2025). Modeling inflation expectations in forward-looking interest rate and money growth rules. Journal of Economic Dynamics and Control, 170, 104999. https://doi.org/10.1016/j.jedc.2024.104999
@article{chenvalcarcel2025resvar,
author = {Chen, Zhengyang and Valcarcel, Victor J.},
title = {Modeling inflation expectations in forward-looking
interest rate and money growth rules},
journal = {Journal of Economic Dynamics and Control},
volume = {170},
pages = {104999},
year = {2025},
doi = {10.1016/j.jedc.2024.104999},
url = {https://doi.org/10.1016/j.jedc.2024.104999}
}