An Econometric Model of the Yield Curve
      with Macroeconomic Jump Eﬀects

                             Monika Piazzesi1
                            Stanford University

                              PRELIMINARY

                      First Draft: October 15, 1999
                        This Draft: July 12, 2000


   1I am still looking for words that express my gratitude to Darrell Duﬃe. I would
like to thank Andrew Ang, Michael Brandt, Heber Farnsworth, Lars Hansen, Ken
Judd, Thomas Sargent, Ken Singleton, John Shoven, John Taylor, and Harald
Uhlig for helpful suggestions; and Martin Schneider for extensive discussions. I
am also grateful for comments from seminar participants at Carnegie-Mellon Uni-
versity, University of Chicago, Columbia University, Harvard University, North-
western University, MIT, the NBER Asset Pricing Meeting 2000, NYU, Princeton
University, the ReStud Tour (University of Tel Aviv, Tilburg University, University
of Toulouse, University College London,) University of Rochester, Stanford Univer-
sity, UCLA, the WFA 2000, the Workshop on Mathematical Finance at Stanford
2000, and Yale University. The ﬁnancial support of Doctoral Fellowships from the
Bradley and Alfred P. Sloan Foundations are gratefully acknowledged. Address:
Department of Economics, Stanford University, Stanford CA 94305 (Phone: (650)
281 4812, E-mail: piazzesi@leland.stanford.edu). The paper can be downloaded
from www.stanford.edu/∼piazzesi. All errors are my responsibility.
                                  Abstract

This paper develops an arbitrage-free time-series model of yields that incor-
porates central bank policy. The model introduces a class of linear-quadratic
jump diﬀusions as state variables. A special case of this setup is used to de-
scribe U.S. interest rates, the Federal Reserve’s target rate, and key macroe-
conomic aggregates. The U.S. application captures: (i) target-rate moves on
FOMC meeting days, (ii) ‘exceptional’ policy moves outside of FOMC meet-
ings, and (iii) releases of macroeconomic news that are likely to aﬀect future
Fed actions. To ﬁt the model, the method of simulated-maximum-likelihood
estimation is extended to allow for jump-diﬀusions. Introducing the target
rate as a fourth, observable, factor into a three-latent-factor framework is
shown to be a tractable way of improving the overall term-structure ﬁt, es-
pecially at short maturities. A policy-inertia factor inﬂuences the conditional
probability of target changes. Fed policy is linked to the increased volatil-
ity of yields on FOMC meeting and release days, and to the the observed
“snake-shaped” term structure of yield volatility.
1       Introduction
     Readers of the ﬁnancial press know that meeting days of the Federal Open
Market Committee (FOMC) are marked as special events on the calendars
of many market participants. There are often strong reactions in bond and
stock markets to FOMC announcements. Indeed, a large literature on an-
nouncement eﬀects has documented increased volatility of interest rates at all
maturities, not only on FOMC meeting days, but also around releases of key
macroeconomic aggregates, most prominently nonfarm payroll employment
and the consumer price index. The FOMC is well aware of being closely
watched by the markets, and extracts information about the current state of
the economy from the current yield curve. This yield-based information may
underly the FOMC’s policy decisions.
     These observations suggest that information about policy-related events
could be useful for the pricing of interest-rate dependent claims and to
sharpen our understanding of Federal Reserve policy. Yet, in the litera-
ture to date, there appears to be little attempt at estimating a fully ﬂedged
factor model of the term structure that accommodates policy-related events.1
This paper presents a tractable no-arbitrage framework in continuous time
that captures policy-related events as jumps. While jumps are allowed to
be irregularly spaced and depend on the state of the economy, the model
still has closed-form solutions for bond prices. This permits the estimation
of the model even with long-maturity yield data. Various speciﬁcations are
then estimated to see whether taking into account policy and its timing helps
price U.S. bonds with maturities up to 5 years.
     In continuous-time factor models of the term structure, the short rate r is
speciﬁed as function of a Markov state process X. The price of a zero-coupon
bond is then obtained by computing the conditional expected value of the
payoﬀ of the bond discounted at r, where the expectation is taken under a
risk-adjusted probability measure. A number of features seem desirable when
introducing interest-rate targeting by a central bank in such a setup. The
state vector X should contain observable variables, such as the central bank’s
target rate or variables the central bank cares about (like CPI inﬂation). The
dynamics of X should allow for discontinuous movements that occur when
    1
    Existing studies of interest-rate targeting (for example, Rudebusch (1996), Balduzzi,
Bertola, and Foresi (1996)) typically do not use term structure information in the esti-
mation. An exception is Konstantinov (1999) who uses short interest rates up to 1 year
maturity. The existing literature is discussed further in Section 2.


                                           1
adjustments in the interest-rate target are made or when CPI numbers are
released. The conditional distribution of these jumps will depend on X itself.
It may also be discrete, as targets are typically moved in integer multiples of
quarter percentage points. The timing of jumps should capture the irregular
spacing of macroeconomic news releases. In addition, it must be tailored to
the operating procedures of the central bank, as some central banks change
their target even outside of their scheduled meeting days.2
    The present paper introduces a ﬂexible class of linear-quadratic jump
diﬀusions (LQJD) which allows for two types of jumps in the state vector.
First, jumps with a state-dependent conditional distribution can occur at
deterministic points in time. The moment generating function of the con-
ditional distribution is taken to be an exponential linear-quadratic function
of the state. This type of jumps can be used, for example, for macroeco-
nomic news releases. Second, there can be jumps with state-independent
distributions at random times that arrive with intensities which are linear-
quadratic functions of the state. The quadratic terms can, for example, be
used to introduce negative correlation between the arrival rates of upward
and downward moves in the target.
    The paper studies the post-1994 U.S. policy environment with data on
U.S. LIBOR (London Interbank Oﬀered Rate) and swap rates in which the
Federal Reserve’s target rate and macroeconomic aggregates are observable
factors, along with more traditional latent factors. The speciﬁcation has
several macroeconomic jump eﬀects: (i) target rate moves on FOMC meeting
days, (ii) ‘exceptional’ policy moves outside of FOMC meetings, and (iii)
releases of macroeconomic news that are likely to aﬀect future Fed actions.
The estimation is by the method of simulated maximum likelihood, extended
here to the case of jump-diﬀusions. Two classes of models are presented,
one using as data the Fed target and yields at several maturities, the other
exploiting additional macro variables such as nonfarm payroll employment
   2
    Two types of procedures can be distinguished. With unscheduled announcements of
the target, monetary policy actions may occur essentially on any given business day. This
type of interest-rate targeting was conducted in the U.S. from October 1982 to 1993.
With scheduled announcements, policy actions occur at central bank meeting days. After
February 1994, the target was moved almost exclusively on FOMC meeting days (Thornton
(1997), Meulendyke (1998)). The way that bond markets form expectations about future
short rates r is diﬀerent across operating procedures, since the probability of a target move
on any given business day is zero with scheduled announcements, while it may be positive
with unscheduled announcements.


                                             2
and CPI inﬂation.
    The ﬁrst class of models is designed to explore the role of the target rate
as an observable factor. This estimation uses LIBOR and swap yields with
maturities of up to ﬁve years. The most interesting variant is a 4-factor
model that includes (i) the target rate, (ii) a spread factor that measures
deviations of the short rate from the target, (iii) a traditional stochastic-
volatility factor, and (iv) a policy-inertia factor.
    By incorporating the Fed’s interest-rate targeting behavior, the estimated
4-factor model links the snake-shape of the “volatility curve,” the standard
deviation of yield changes as a function of maturity, to policy inertia. It
also considerably improves the performance of existing 3-factor models, such
as that of Dai and Singleton (2000), especially at the short end. The Fed’s
target rate thus provides a tractable way to improve bond pricing, avoiding
the use of additional latent variables.
    The short rate reverts quickly and continually to the target, while the
target adjusts slowly toward the Fed’s new desired target only through jumps
occurring at FOMC meeting days. The likelihood of a target-rate move at an
FOMC meeting depends crucially on two factors: the current target and the
inertia factor. Persistence in the target holds the target near its old value
(interest-rate smoothing), thereby introducing positive autocorrelation in the
target-rate level. The cross-sectional response of yields at diﬀerent maturities
to target shocks is therefore monotonically decreasing in maturity. The iner-
tia factor slowly pulls the target toward the new desired value of the target
(policy inertia). Shocks to the policy-inertia factor increase the likelihood of
a target move, not only at the next meeting, but also at subsequent meetings.
This leads to positive autocorrelation in target-rate changes.
    The cross-sectional impulse response of yields to shocks in the inertia
factor has a hump at maturities around 2 years, as the anticipated cumulative
eﬀect of pending target changes is largest for those maturities. The combined
eﬀect of money market shocks and inertia-factor shocks leads to a snake-
shaped pattern in the term structure of responses of yields to changes in the
target: high for very short maturities, rapidly decreasing until maturities
of around 6 months, then increasing until maturities of up to 2 years, and
ﬁnally decreasing again. As these shocks are important for yields, this snake-
shaped pattern carries over to the volatility curve. Shocks to the target rate
in the post-1994 environment happen mostly at FOMC meetings and thereby
introduce a seasonality into the volatility of yields.
    Weekly yield information is used to back out, through the 4-factor model,

                                       3
a high-frequency policy rule of the Fed. The identifying assumption here is
that the FOMC reacts to information contained in the yield curve known
before its meeting. The policy rule describes the target better than several
benchmarks, including estimated versions of Taylor-type rules in which the
Fed reacts to current macroeconomic information (Taylor (1993)). An expla-
nation for the good ﬁt of the estimated policy rule is that the policy-inertia
factor implied by yield data anticipates many target moves.
    The second class of models is estimated for the purpose of capturing the
behavior of yields around release days, and also a role for macroeconomic vari-
ables in bond pricing. Release surprises are identiﬁed with analyst-forecast
data by specifying the joint dynamics of analyst forecasts and actual macro
variables (nonfarm payroll employment and CPI inﬂation) in a state-space
system. Models in this class are estimated using LIBOR-rate data for matu-
rities of up to 1 year.
    Release surprises are found to be temporary components of macro vari-
ables, in the sense that the impulse-response of macro variables to these
shocks dies oﬀ after one month. In a model in which the Fed reacts to cur-
rent macroeconomic variables, this means that release surprises can aﬀect
the conditional probability of target moves at only those FOMC meetings
that are scheduled before the next macro release. In other words, release
surprises are not inertia-type factors themselves. In order to replicate the
hump-shaped cross-sectional response of yields to release surprises, the prop-
agation of these surprises would need to ‘live longer.’ This may be achieved
by allowing for correlation between the release surprises and the policy-inertia
factor. Here, the macroeconomic news have an impact on the new desired
target, but the FOMC only gradually implements this desired target over a
number of meetings.
    The paper is structured as follows. Section 2 reviews related literature.
Section 3 provides some institutional background on the operating proce-
dures of U.S. monetary policy. Section 4 presents the theoretical framework,
deﬁning the LQJD state-process and showing how risk adjustment and bond
pricing work. Sections 5 through 8 present the ﬁrst class of models, which ex-
amines the role of the target rate as an observable factor. Section 5 presents
the “base-case” models of this ﬁrst class. Section 6 describes the approxima-
tion of the pricing formula, the simulation-based estimation technique, and
the data. Section 7 presents results for the base-case models, while Section
8 looks at a few extensions. Section 9 presents the second class of models,
those augmented with macroeconomic releases. Section 10 concludes.

                                       4
2       Literature Review
    This work draws from at least four strands of literature. First, an exten-
sive literature, going back to Merton (1993b), Vasicek (1977) and Cox, Inger-
soll, and Ross (1985), has investigated low-dimensional factor speciﬁcations
of the yield curve. The theoretical frameworks of Duﬃe and Kan (1996) and
El Karoui, Myneni, and Viswanathan (1993) nest most of these speciﬁcations
providing tractable bond-pricing formulas, without which it would be compu-
tationally diﬃcult to exploit time series of bond-yield data econometrically.
The dynamics of observable short-rate proxies, such as the federal funds rate,
are typically characterized as extremely volatile, with large outliers at cer-
tain calendar days and other undesirable features (Duﬀee (1996)) that add an
enormous amount of complexity (if properly captured, as in Hamilton (1996))
that is likely to be unrelated to the behavior of longer-maturity yields. In
the context of monetary policy, it is particularly important to include longer
yields in the estimation, as rare policy moves lead to small-sample issues
haunting much of the empirical literature that regresses short-rate changes
on target or discount-rate changes.3
    Empirical factor models of the term structure (for example, those of Bal-
duzzi, Das, Foresi, and Sundaram (1996), Duﬃe and Singleton (1997), An-
derson and Lund (1996), Dai and Singleton (2000), and Ahn, Dittmar, and
Gallant (1999) specify the state vector to be latent, therefore ‘explaining
yields with yields.’ That is, a base set of yields is assumed to (information-
ally) span the entire term structure.4 Moreover, empirical factor modeling
with jumps have treated only the short rate (as, for example, do Das (1998)
    3
     Settlement-Wednesday spikes in short rates lead to substantial bias in these regressions
as many target moves in the past have happened around the end of reserve-maintenance
periods. As documented in Section 3, many moves pre-1994 occurred on the Thursday
after a settlement Wednesday and most moves post-1994 occurred on FOMC meetings,
which are usually scheduled for Tuesdays and Wednesdays. In any regression of short-rate
changes on changes in some policy rate, reserve-maintenance periods need to be handled
carefully, which only adds to the small sample problems arising from infrequent policy
moves. By placing the target rate in a term-structure model, data on longer yields (which
are less aﬀected by settlement Wednesdays) provide additional information about the
parameters governing the reaction of yields to target changes.
   4
     Even for cases in which the state variables are observable, they are not used in the
estimation. For example, Pearson and Sun (1994) estimate a nominal version of the Cox,
Ingersoll, and Ross (1985) model in which inﬂation is speciﬁed as a factor with yield data
only.


                                             5
and Johannes (1999)).
    More recently, theoretical work has begun to incorporate monetary policy
into factor models. Babbs and Webber (1993) specify the short rate as a
pure-jump process with a jump intensity that depends on a latent state
vector. Babbs and Webber (1996) and Farnsworth and Bass (1998) present
target-zone models that capture monetary policy. Their models do not lend
themselves to tractable bond pricing, so that only their implications for the
short rate are investigated empirically (see, for example, Honor´ (1997)).
                                                                     e
Rudebusch (1996) ﬁts a model in which the conditional probability of a target
change on the next day depends on the sign of the last target change and
the number of days since the last change. Balduzzi, Bertola, and Foresi
(1996) estimate a model with a constant conditional probability of a target
change on any given business day. Konstantinov (1999) examines a model
in which the target rate is a regime-switching process. The last three papers
specify a nonstationary target and invoke the expectations hypothesis to
study the predictability of short rates (long yields, however, are not used in
the estimation).
    Fleming and Remolona (1999) investigate high-frequency yield data at
macroeconomic announcements in a Gaussian discrete-time model of the
term structure that is closely related to the model developed here. They
attribute shocks to latent variables to surprises at macro announcements.
The approach in this paper diﬀers, in that macroeconomic variables and Fed
policy are modeled. Moreover, the dynamics of yields are analyzed at all
times (not only around announcements).
    A second strand of literature describes Fed policy by specifying maps from
policy-relevant variables to the Fed’s key policy instrument, the federal funds
rate. Such policy rules can be found by (i) imposing identifying assumptions
in vector autoregressions (VARs, see the references in (Christiano, Eichen-
baum, and Evans 1998)), (ii) specifying a short-rate process whose transition
behavior depends on the entire past path of macro variables (Hamilton and
Jorda (1998), Sims (1999)), and (iii) adopting structural models of Fed be-
havior (see Woodford (1999) and the papers in Taylor (1999)). Although
some of the papers in literatures (i) and (ii) analyze the impact of monetary
policy on both short and long yields (as do Evans and Marshall (1998)), they
typically do not impose cross-equation restrictions implied by the absence of
arbitrage (Sargent (1979)). Another related paper by Campbell and Viceira
(2000) speciﬁes expected inﬂation to be a latent state variable that is ﬁltered
with quarterly bond-yield and CPI data. The nonlinear formulation of the

                                      6
short rate of those papers taking approach (ii) makes it diﬃcult to use them
as a basis for a term-structure model, given the requirement of a tractable
bond-pricing formula.
    A third stand of literature uses a general equilibrium setting, which
does imply the absence of arbitrage (for example, Pennacchi (1991), Berardi
(1998), Buraschi and Jiltsov (1999)). The approach taken in the present
paper also imposes no-arbitrage, while not requiring the speciﬁcation and
estimation of a structural model of the economy, which in any case would be
problematic given the current state of empirical GE models of asset prices
(see, for example, Hansen and Jagannathan (1991). For a recent survey, see
Cochrane (1997)).
    A ﬁnal group of papers analyzes announcement eﬀects on yields. This
can be done by including announcement-day dummies and macroeconomic
news surprises into GARCH-type models of volatility (Jones, Lamont, and
Lamsdaine (1996), Li and Engle (1998), Christiansen (1999)). Another pos-
sibility is to use news surprises as explanatory variables for yield changes
(Balduzzi, Elton, and Green (1998), Fleming and Remolona (1997)). Again,
these papers do not impose the absence of arbitrage.


3       Institutional Background
    Important changes in 1994 to Fed-policy operating procedures underly
the choice of sample period in this paper, which focuses on the policy frame-
work in place today. The Fed conducts monetary policy by targeting the
overnight rate in the federal funds market.5 The FOMC ﬁxes a value for
the target and communicates it to the Trading Desk of the Federal Reserve
Bank of New York, which then implements it through open-market opera-
tions (Meulendyke (1998)). Figure 1 shows the fed funds market rate and
the target rate from 1984 to 1998, illustrating that, on average, the Fed is
able to closely target the federal funds rate, except for occasional spikes.
(Section 6.5 provides a description of the target data that is used in this
    5
    While this statement is true for the ’70s and for the Fed under Greenspan, it does
not apply to the Volcker era. From October 1979 until 1982, the Fed was targeting
nonborrowed reserves. Starting in 1983 and at least until the change in chairmanship from
Volcker to Greenspan in August 1987, the Fed was targeting borrowed reserves, a practice
that has, since then, been increasingly abandoned, especially after the stock-market crash
of October 1987 (Meulendyke (1998)).


                                            7
                               Fed Funds and Target Rate

                   18                                                      Fed Funds
                                                                            Target

                   16

                   14                                   Jan 1, 1994

                   12
         Percent


                   10

                    8
                                                                             Oct 15, 1998
                    6                                                             ↓
                                                        Apr 18, 1994
                                                             ↓
                    4

                    2   1/27/86 12/28/87 11/27/89 10/28/91 9/27/93 8/28/95 7/28/97


  Figure 1: Daily federal funds and target rate from 3/1/1984 to 12/31/1998.


paper.) These spikes are usually associated with “settlement Wednesdays”
and other special calendar eﬀects, such as the end of the year.6 The target
has been changed 114 times over the entire ﬁfteen-year time frame. Figure 1
also shows that target-rate changes are often followed by additional changes
in the same direction. This feature will be referred to as “policy inertia”.7
    Starting with the ﬁrst FOMC meeting of 1994, the Fed made two changes
in monetary-policy operating procedures that eﬀectively divide the past ﬁf-
teen years into two “regimes.” First, the Fed increased transparency by
publicly announcing target moves at FOMC meetings. From 1983 to 1990,
the Fed did not disclose its target rate at all. Since 1994, target moves
have been disclosed right after they were made. More recently, the FOMC
   6
     During bi-weekly reserve-maintenance periods, banks must hold “good funds” in the
form of cash or in accounts at the Fed, or be penalized. These reserve-maintenance periods
end on “settlement Wednesdays”.
   7
     This may indicate the Fed’s unwillingness to move the target immediately and entirely
to its desired rate. Instead, the Fed adjusts the target in small steps to avoid possible
policy mistakes, because of political motives, parameter uncertainty (Sack (1998)) or to
aﬀect long-maturity yields with minimal changes in short yields (Woodford (1999)).


                                                  8
has even published its carefully worded views about the likelihood of a rate
change in the upcoming inter-meeting period.8
                         The Timing of Target Rate Changes

                     pre-1994                                   post-1994
   15                                          15


   10                                          10


       5                                          5
                                                           
                                                      Apr 18, 1994
                                                            Oct 15,1998
                                                                   
                                                           

       0
       0   5   10   15   20   25   30   35   40 0
                                                0     5   10   15   20   25   30   35   40

Figure 2: For any given target rate change between 1984-1993 and 1994-1998, these
graphs show the histogram of days since the last FOMC meeting. In the ﬁrst subperiod,
there have been a total of 100 target moves, while there were 14 in the second subperiod.


    More importantly, the timing and size of policy moves have changed.
Figure 2 illustrates this diﬀerence in timing by showing histograms, pre-1994
and post-1994, of the number of days between a target-rate change and the
preceding FOMC meeting. If, in a given subperiod, the Fed had moved
its target only at FOMC meetings, we would see a single spike at 0 in the
corresponding histogram. One sees a deﬁnite change in 1994 of re-targeting
mainly at FOMC meeting days,9 with two exceptions highlighted in Figure
2. The ﬁrst exception occurred on April 18, 1994 after high car sales in
   8
    For details on the art of reading policy directives, see Meulendyke (1998).
   9
    Does a closer look at the timing of target moves pre-1994 reveal any other calendar
eﬀects that we could use as an alternative to the FOMC meeting calendar? There is a clear
tendency to implement changes in the target at the beginning of a new settlement period,
as 37 out of 100 moves pre-1994 happened on the Thursday after a settlement Wednesday.
Other possible candidates for calendar eﬀects are release schedules of macro information.
In fact, 11 moves occurred on the release days of employment information by the Bureau
of Labor Statistics. Together, the releases of consumer and producer price indices account
for another 8 changes. These releases, however, are on a monthly basis, and on diﬀerent
days respectively, so that they cannot serve as a calendar for target moves. Other variables

                                              9
March, a leading business-cycle indicator. The ﬁnancial press speculated
that the surprise move was intended as a manifestation of authority by Alan
Greenspan, as no vote was held on the move.10 The second exception was
decided upon in a conference call on October 15, 1998, and came in response
to the Asian and Russian ﬁnancial crises.11
                          The Size of Target Rate Changes

                     pre-1994                                   post-1994
  30                                            7

  25                                            6
                                                5
  20
                                                4
  15
                                                3
  10
                                                2
   5                                            1
   0−1
         -0.5    0     0.5      1   1.5   2 0   −1
                                                     -0.5   0      0.5      1   1.5   2

Figure 3: These graphs show the histogram of the size of target changes between 1984-
1993 and 1994-1998.


   Along with the change 1994 in the timing of Fed moves, it can be seen
from Figure 3 that there was a big change in the size distribution of target-
rate changes. While pre-1994 target-rate changes came in multiples of 6.25
basis points,12 after 1994 the Fed used multiples of quarter-percentage points.
such as the Producing Managers’ Index, an index often referred to in the “Minutes” of the
FOMC meetings, and released on the ﬁrst business day of each month, do not coincide
with target moves. Monetary aggregates might seem relevant in this context, but these
are published by the Fed itself.
  10
     The Financial Times, April 19, 1994, page 3, “Greenspan plays an early hand: US
rates rise” by Michael Prowse and The New York Times, April 19, 1994, page 1, “Fed
again raises short-term rate on loans” by Keith Bradsher.
  11
     The New York Times, October 16, 1998, page 1, “Federal Reserve cuts rates again;
Wall St. surges” by Richard W. Stevenson.
  12
     One basis point is 0.01%.


                                           10
4     The Yield Curve Model
   After an overview (Section 4.1), we will provide a number of technical
results about arbitrage-free pricing in a state-space model for the yield curve
that includes both latent and macroeconomic variables (Sections 4.2 and 4.3)
These results will later be used to price the assets used in the estimations
(Section 4.4).

4.1    Overview
    The state of the economy at time t is described by the vector X(t).
The state includes the target rate θ(t), some macro variables m(t), analyst
forecasts mF (t) of these macro variables, and certain latent variables such as
the spread s(t) = r(t) − θ(t) between the riskless short-rate r and the target
θ. The dynamics of X are described by a stochastic diﬀerential equation
(SDE) of the form
             dX(t) = µ(X(t), t) dt + σ(X(t), t) dW (t) + dJ(t),            (1)
whose components will be explained shortly. In the absence of jumps J, this
system may be thought of as a vector-autoregression with a linear mean rate
of change µ. The Gaussian process W is responsible for continuous “small”
shocks to X. These small shocks may translate into a non-Gaussian distribu-
tion of X if the volatility σ(X(t), t) depends on X(t). The pure-jump com-
ponent J of (1) is responsible for discontinuous moves in X, macroeconomic
jump eﬀects. These jumps can be caused, for example, by macroeconomic
releases and monetary-policy events. In the example, the short-rate process
r is a linear function of the state, in that r = θ + s. More generally, r can
be linear-quadratic.
    Arbitrage-free pricing can be done though an exogenous risk-adjustment
speciﬁed in the form of a ‘density process’ ξ. Asset prices are then given by
the conditional expected value of their payoﬀ, weighted by ξ, and discounted
at the riskless rate. In particular, the time-t price of a zero-coupon bond
that matures at time T is
                                                    t
                                  ξ(T )
                 P (t, T ) = Et         exp −           r(u) du   ,        (2)
                                  ξ(t)          0

where Et denotes expectation given the information available to bond in-
vestors at time t. In a Lucas (1978) economy, for example, the term inside

                                      11
the expectation is just the marginal rate of substitution of a representative
agent. We can use the weight ξ to deﬁne a risk-neutral probability measure
Q which satisﬁes Et Zξ(T )/ξ(t) = EtQ (Z) for any random variable13 Z
                            ¯
known at time T . By specifying the dynamics of X and the switch to Q
                 ¯
carefully, the bond-pricing formula (2) can be computed in closed form. This
will be the objective of the remainder of this section.

4.2       Linear-Quadratic Jump-Diﬀusions
    We now specify a particular parametric model for the dynamics of X.
Uncertainty in the economy is described by a complete probability space
(Ω, F , P). The resolution of uncertainty over time is given by a ﬁltration
{F (t) : t ≥ 0} satisfying the usual conditions (Protter (1990)). The process
X satisfying (1) lives in some state space D ⊂ RN . For the SDE (1), W
is an N-dimensional standard Brownian motion on (Ω, F , P, {F (t)}), µ :
D × [0, ∞) → RN is the drift of X, σ : D × [0, ∞) → RN ×N is its “volatility,”
and J is an {F (t)}-adapted pure-jump process further described below. The
value of X ‘just before’ the jump at t is denoted X(t−) = lims↑t X(s). The
jump of X at t is ∆X(t) = X(t) − X(t−). For each ﬁxed t, both µ(x, t) and
σ(x, t)σ(x, t) are aﬃne (constant-plus-linear) in the state, in a manner to
be made precise shortly.
    Except when there is a jump caused by J, the state X has continuous
sample paths driven by W . Two types of jumps contribute to the pure-jump
process J. First, there are jumps (of diﬀerent types) arriving at deterministic
dates counted by a vector Nd of counting processes. Second, there are jumps
arriving at random dates counted by a vector Np of Poisson processes with
stochastic intensity λ. Heuristically, the F (t)-conditional probability that
there is a Poisson jump in the small interval [t, t+∆] is λ(t)∆. More formally,
stochastic intensities are characterized by the fact that the compensated
                              t
process {Mp (t) = Np (t) − 0 λ(t) dt, t ≥ 0} is a martingale. (See Br´maud
                                                                        e
(1981) for further details.)
    We can now deﬁne linear-quadratic jump-diﬀusions (LQJDs) by choos-
ing particular functional forms for the coeﬃcients µ and σ of the SDE (1),
together with additional restrictions on the jump process J. In describing
these parametric speciﬁcations, we can, without loss of generality, partition
the state as X = (X1 , X2 ) so that X2 is a k2 -dimensional process, with
 13
      Here, Z is F (T )-measurable and E Q (|Z|) < ∞.
                    ¯


                                            12
k1 + k2 = N. Assumption 1 will restrict X2 to be Gauss-Markov. It will be
convenient to deﬁne the set

C = {(c0 , c1 , c2 ) ∈ R × RN × RN ×N : c2 is symmetric positive semideﬁnite
    and consists of zeros except possibly the lower right k2 × k2 partition}

of coeﬃcients. We will make repeated use of linear-quadratic (LQ) functions
of the state of the form g : D × C → R+ , with

                            g(x, c) = c0 + c1 x + x c2 x.                      (3)

We are now in the position to specify the LQJD as follows.

Assumption 1 (Characterization of LQJD processes)

 (a) (Functional Form of Drift and Volatility)
        The drift and ‘volatility’ of X are given by are given by

                                µ(x, t) = K(t) (¯(t) − x)
                                                x                              (4)
                                σ(x, t) = Σ(t) S(x, t),                        (5)

        where S(x, t) is a N × N diagonal matrix with i-th diagonal element
        [S(x, t)]i,i = s0i (t) + s1i (t) · x, and where the coeﬃcients s0i (t) ∈ R,
        s1i (t), x ∈ RN and K(t), Σ(t) ∈ RN ×N are deterministic functions of
                 ¯
        time.

 (b) (Functional Form of Stochastic Intensities)
        The jumps J are counted by a p-dimensional counting process Np with
        stochastic intensity, and by a d-dimensional deterministic counting pro-
        cess Nd without explosions,14 and with no common jump times.15 The
        stochastic intensity {λi (t) : t ≥ 0} of Np is given by
                                                  i


                                 λi (t) = g X(t−), li (t) ,                    (6)

        for time-dependent coeﬃcients li (t) ∈ C. The coeﬃcients li (t) and the
        domain D satisfy joint conditions to ensure that λi (t) ≥ 0, as required
        for any intensity process.
 14
      For all t, Nd (t) < ∞ almost surely.
 15                                             j
      This means that ∆Np · ∆Np = 0 and ∆Nd · ∆Nd = 0, i = j almost surely.
                            i     j        i


                                          13
  (c) (Conditional Jump-Size Distributions)
       For any Poisson jump time τ , the F (τ −)-conditional distribution vp,τ of
       the jump size ∆X(τ ) is independent of X(τ −). For any deterministic
       jump time t, the F (t−)-conditional distribution vd,t of the jump size
       ∆X(t) has a Laplace transform which is an exponential LQ function of
       X(t−). More precisely, for all a ∈ RN , we have that

                      E exp(a · J d (t)) = exp (g (X(t−), c(t; a)))                   (7)

       for some c(t; a) ∈ C.

 (d) (Parameter Restrictions)

        (i) All of the time-dependent coeﬃcients are bounded and piece-wise
            constant functions of time.16
       (ii) Joint restrictions on (µ, σ, vp , vd , l) and the domain D apply that
            guarantee a unique (strong) solution to (1).
       (iii) Gaussianity of X2 : The lower left k2 ×k1 partitions of the matrices
             K(t) and Σ(t), labeled K21 (t) and Σ21 (t), consist of zeros only.
             Also, s1i (t) is an N-vector of zeros for all i ∈ {k1 + 1, . . . , N}.

    This deﬁnition of the state process X generalizes in two directions the
concept of an aﬃne jump-diﬀusions introduced by Duﬃe and Kan (1996).
First, jumps are allowed to occur at deterministic points in time. The as-
sociated jump size, or mark, may have a state-dependent conditional distri-
bution provided its Laplace transform is an ELQ function in the state. For
a deterministic jump time t, an example of an F (t−)-conditional jump-size
distribution that satisﬁes this requirement is a Gaussian distribution with
a conditional mean that is a LQ function in the state X(t−) and with a
constant variance. Another example is a jump size that is an LQ function in
X(t−) plus a random variable that has any given state-independent distri-
bution subject to technical integrability conditions.
    Second, the intensity of Poisson jumps may be quadratic in a Gaussian
state vector. This allows jumps to arrive at negatively correlated jump in-
tensities, a property that cannot be accommodated in an aﬃne setting. Neg-
  16
     This particular type of time-dependence of the parameters determining the dynamics
of X is suﬃcient for the seasonality eﬀects studied this paper. Alternatively, the parame-
ters may be bounded continuous functions of time.

                                           14
atively correlated state variables with a positive domain would force a vio-
lation of assumption Condition A in Duﬃe and Kan (1996). In the absence
of jumps, the condition is suﬃcient for the existence of a solution to the
stochastic diﬀerential equation (1) describing the state. As already noted
by Duﬃe and Liu (2000), it is possible to square two Gaussian processes
so that each variable takes only positive values, while allowing for arbitrary
correlation. This idea is applied here to the case of jumps.
    Even without jumps, the state process may exhibit rich dynamics such
as conditional heteroscedasticity (through s1 ).

4.3    Change of Measure
    We assume that there exists a nominal “short-term” riskless-rate process
r, at which agents can borrow and lend, in the sense that r is adapted and
                            T
jointly measurable, with 0 |r(t)| dt < ∞. We consider the existence of an
equivalent probability measure Q under which all security prices, {F i}I , i=1
                                         t
normalized by the price F 0 (t) = exp( 0 r(u) du) of 1 Dollar invested at time
0 and rolled over at the riskless rate, are martingales, in that
              F i (t)       F i(T )   E P [ξ(T )F i(T )/F 0 (T )]
                      = EtQ          = t                          ,       (8)
              F 0 (t)       F 0 (T )            ξ(t)
where ξ denotes the “density” of Q. If such a “risk-neutral” (or equivalent
martingale) measure Q exists, there is no arbitrage, at least under reasonable
restrictions on trading strategies (Harrison and Kreps (1979), Harrison and
Pliska (1981)). Conversely, the absence of arbitrage, and some technical
conditions, implies the existence of such a “risk-neutral” measure (Delbaen
and Schachermayer (1994)).
    Consider as a candidate for the density process ξ of an equivalent mar-
tingale measure the solution of the SDE
           dξ(t)
                 = −σξ (t) dW (t) + Jξ (t) dNd (t) + Jξ (t) dMp (t),
                                     d                p
                                                                          (9)
           ξ(t−)
with the initial condition ξ0 = 1. The construction of exogenous risk premia
proceeds in three steps. First, we show, under speciﬁc assumptions on the
coeﬃcients of SDE (9) (Assumption 2 in Appendix A), that ξ is a square-
integrable P-martingale. This means that we can use ξ(T ), for some ﬁxed
                                                           ¯
time T¯ , as the Radon-Nikodym derivative dQ/dP of an equivalent probabil-
ity measure Q. Allowing for a jump Jξ at a deterministic jump time (such as
                                      d


                                       15
a scheduled announcement date) is unusual in the term-structure literature.
Appendix B provides an example. Second, a generalized Girsanov theorem
(Proposition 2 in Appendix A) provides a representation of the dynamics of
the state process X under Q. For econometric convenience, only parameter-
izations that make X a LQJD under both P and Q are considered in this
paper.17 Third, we establish restrictions on the coeﬃcients of ξ (Proposition
3) that ensure the absence of arbitrage by virtue of condition (8).
    In order to proceed with this 3-step construction of risk-neutral pricing,
suppose we have I asset prices {F i(t)}I of the form F i (t) = exp(f (X(t), t)),
                                       i=1
for smooth f : D × [0, T ] → R. (In our setting, we will see that zero-coupon
bond prices are of just this form.) By Ito’s Lemma,

       dF i(t)                                    d                  p
         i (t−)
                = µF i (t)dt + σF i (t) dW (t) + JF i (t) dNd (t) + JF i (t) dMp (t),(10)
       F

with µF i (t) = F i (t−)−1 AF i(t), where A is the inﬁnitesimal generator18 of X,
                                                                           j
the volatility is σF i (t) = fx (X(t), t)σ(X(t), t), and the jump size is JF i (t) =
exp [f (X(t) + J j (t), t) − f (X(t), t)] − 1 for j = p, d.
   For notational simplicity, the following result is stated for one-dimensional
versions of the counting processes Np and Nd . A proof can be found in
Appendix F.

Proposition 3 (Equivalent Martingale Measure):       Suppose Assump-
tion 2 (stated in Appendix A) holds. Suppose the normalized asset price
  17
     There is considerable evidence that the dynamics of the short rate is nonlinear, at least
in a one or two-factor setting (Ait-Sahalia (1996), Boudoukh, Richardson, Stanton, and
Whitelaw (1998), Ang and Bekaert (1998)). In the present framework, this nonlinearity
may be introduced in two ways: quadratic terms under the data-generating measure P
and market prices of uncertainty that, while preserving a LQJD structure under Q and
therefore tractable pricing formulas, take the state dynamics outside the LQJD class under
P. The latter approach is explored by Duﬀee (1999) in an aﬃne term-structure model.
  18
     For a function f : D × [0, T ] → R, the inﬁnitesimal generator A of X is a function
Af : D × [0, T ] → R given by
                                                          1
              Af (x, t)   = ft (x, t) + fx (x, t)µ(x, t) + fx x (x, t)σ(x, t)σ(x, t)
                                                          2
                                   p
                              +         g(x, li (t)) E [f (x + Jip (t), t) − f (x, t)] ,
                                  i=1

using the fact that the jump Jip (t) is independent of the state.


                                                     16
{F i(t)/F 0 (t) : t ≥ 0} is square-integrable, where F i solves (10). Then, for
                   ¯
any ﬁxed time T > 0, the discounted asset price is a martingale under the
equivalent probability measure Q deﬁned by dQ/dP = ξ(T ) provided:
                                                             ¯

  (i) For any t that is not a deterministic jump time,19

              µF i (t) − r(t) = σF i (t)σξ (t) − λ(t)E P JF i (t) 1 + Jξ (t)
                                                          p            p
                                                                                 .

 (ii) For any deterministic jump time t,
                               P               P
                              Et− JF i (t) = −Et− JF i (t)Jξ (t) .
                                   d               d       d


Proposition 3 provides an interpretation of the coeﬃcients of the SDE (9)
for ξ in terms of market prices of uncertainty that compensate investors for
diﬀerent sources of risk.20 In order to interpret these risk premia, suppose
ﬁrst that there are no Poisson jumps. Then (i) says that on ‘normal days’ (not
deterministic jump times) the instantaneous expected excess rate of return
is the “market price of Brownian motion uncertainty,” σξ , multiplied by the
“factor loading” σF i . In other words, the expected excess rate of return is
proportional to the ‘conditional covariance’ of the return and the density, or
pricing kernel. This is along the lines of the Intertemporal CAPM by Merton
(1993a). In the presence of Poisson jumps, there is an additional premium
which, loosely speaking, is the conditional probability λ(t) of a Poisson jump
in the next “small” time period multiplied by the expectation of the product
  19
       The notation “E P JF i (t) 1 + Jξ (t)
                          p            p
                                               ” actually means the unconditional mean over
the joint distribution of the jump ∆X(t) of the state and the jump ∆ξ(t) of the density ξ
at an arbitrary jump time τ of N p . Because these jumps are of a distribution independent
of X(t−), and because the number of jumps during any time interval is ﬁnite almost surely,
the expectation is unambigous, despite the abuse of notation.
   20
      In an economy with an endowment that follows a diﬀusion process and time-additive
utility (Duﬃe and Zame (1989)), the market price of uncertainty equals minus the co-
eﬃcient of relative risk aversion of a representative agent multiplied by the volatility of
the growth rate of the aggregate endowment process. In this setting, it is usually called
market price of risk. A diﬀerent structural interpretation is oﬀered by recent papers on
uncertainty aversion (Chen and Epstein (1999), Anderson, Hansen, and Sargent (2000)),
in which market prices of uncertainty consist of two terms. The ﬁrst term is the standard
risk adjustment just mentioned, while the second represents a measure of distance be-
tween the true data-generating measure and the probability measure underlying max-min
behavior by the agent.


                                               17
of the “market price of jump uncertainty” −(1 + Jξ ) weighted by the jump-
                                                    p
                              p
conditional “factor loading” JF i . A similar interpretation holds in (ii) for
deterministic jumps which occur on a deterministic schedule.

4.4     Linear-Quadratic Short Rate and Bond Pricing
   The aﬃne structure of Duﬃe and Kan (1996) and the quadratic struc-
ture of the SAINTS model (Constantinides (1992), El Karoui, Myneni, and
Viswanathan (1993)) are combined by the following assumption.

Assumption 3 (Linear-Quadratic Short Rate of Interest)
Fixing a linear-quadratic jump diﬀusion X, the short-rate process
{r(t); t ≥ 0} is assumed to have the linear-quadratic form R(x, t) = g(x, δ(t))
for some given coeﬃcients δ(t) ∈ C.

    The rest of this section is concerned with the computation of a solu-
tion P (t, T ) to (2). If, for example, r is Gaussian under the risk-neutral
probability measure Q, then this just involves taking the expectation of the
exponential of a sum of Gaussians, which can be computed directly (Vasicek
(1977)). For the general case in which X is a LQJD under Q and r is a LQ
function of X, the idea is ﬁrst to guess that bond prices are given by the
exponential LQ form

                        P (t, T ) = exp (g (X(t), c(t, T ))) ,                    (11)

for some c(t, T ) ∈ C, which depends on the particular ordering of determin-
istic jump dates between t and T . This guess is veriﬁed by calculating c(t, T )
using the method of undetermined coeﬃcients and equations (2) and (11).
Note that (11) describes a linear-quadratic model of the term structure of
yields.21
    The computation of c(t, T ) proceeds recursively, starting at the time T
of maturity with the boundary condition c(T, T ) = 0, imposed from the fact
that P (T, T ) = 1, and from the assumption that D contains an open set. Two
steps are needed along the way. Roughly speaking, the ﬁrst step (Lemma 1 in
Appendix C) is to show that if the bond price at the next deterministic jump
  21
     The continuously compounded yield Y (t, T ) at time t of a bond maturing at time T
is deﬁned by Y (t, T ) = − ln(P (t, T ))/(T − t).


                                          18
date is an exponential LQ function in the state vector, as in (11), then the
price of a bond “just before” the jump date is of the same form. The second
step (Lemma 2 in Appendix D) is to demonstrate that if the bond price
“just before” the next deterministic jump date is given by the exponential
LQ form (11), then the price during the entire interim period between two
deterministic jump dates is also an exponential LQ function. Together, these
two steps guarantee that for every t, the price P (t, T ) inherits the postulated
form.

Proposition 4:         Suppose that Assumptions 1 and 3 hold under Q. Let
the coeﬃcient vector c (t, T ) be calculated recursively using the algorithm
shown in Appendix E. If Assumption 4 (Appendix D) holds at all deter-
ministic jump times (ti , c(ti , T )), i ∈ {1, . . . , n}, and also at (T, 0), then
P (t, T ) = exp (g (X(t), c(t, T ))) for all t ≤ T .

Proof:            The proof is by induction over the deterministic jump dates
t1 , . . . , tn between t and T . By assumption, P (T, T ) = 1. Applying Lemma
2 with s = T, c = 0 and ψ(X(T ), c) = 1, we see that P (t, T ) satisﬁes (11)
                    ¯                   ¯
for t ∈ [tn , T ). We can then apply Lemma 1 to obtain the desired prop-
erty for P (tn −, T ). Now suppose, for any deterministic jump time ti , that
P (ti+1 −, T ) is given by (11). We can apply Lemma 2 to establish the desired
property for any time t ∈ [ti , ti+1 ) and then Lemma 1 to get it for P (ti −, T ).
By induction, P (t, T ), t ∈ [0, T ], has the desired property. (Note that Lemma
2 can also be applied to the interval [0, t1 ) ).


5     The Target as an Observable Factor
    This section presents the econometric model with Fed targeting.

5.1    Target Dynamics
    Since the beginning of 1994, the target was usually set at FOMC meetings.
Only in emergency cases (‘Peso events’) has the Fed adjusted the target
between FOMC meeting dates. The timing of these two types of policy events
and the discrete distribution of target changes can be modeled by taking the
                                        ˜
i-th FOMC meeting to be an interval [tM (i), tM (i)]. During this interval, the
Fed may move the target in steps of 25 basis points (in light of the histogram


                                        19
in Figure 2) according to the state of the economy. There may be more than
one move during the i-th meeting, but the econometrician will only observe
the target announced at tM (i). Confronted with important macroeconomic
“Peso” events, the Fed may also decide to move the target outside of a
meeting. Peso events are assumed to be triggered by Poisson processes with
small constant arrival rates. More concretely, the target process solves

                         dθ(t) = 0.0025 dN U (t) − dN D (t) ,                             (12)

where N U and N D are counting processes with stochastic intensities given
by
                                              +
           j            λj + λj · X(t−)
                         0    X                   , for t ∈ [tM (i), tM (i)],
                                                             ˜
         λ (t) =                                                                          (13)
                               λj ,
                                 P                  otherwise,

for j = U (“up”) and D (“down”), and where x+ = max{x, 0}.
    While the truncated linear intensities (13) are outside the LQJD class, this
speciﬁcation has several advantages over a pure LQJD speciﬁcation. First,
it allows for negative correlation among intensities (similar to a quadratic
formulation), while the approximating map from factors to yields is invert-
ible. This means that, even in the presence of latent variables, we can use
an estimation method that relies on the likelihood function of the factors.22
Second, the dependence of the intensities on the target θ itself allows for
interest-rate “smoothing”. Moreover, together with the max-operator, this
dependence permits mean reversion in the target.23
    By deﬁning the martingale M = M U − M D , where dM j (t) = dN j (t) −
λj (t) dt for j = U, D, the dynamics of the target in (12) (for times at which
the positivity constraints on λU and λD are not binding) can be rewritten as
                                   ¯
                   dθ(t) = κθ (t) θ(t−) − θ(t) dt + Jθ dM(t),
                    ¯
                    θ(t) = c0 (t) + cX (t) · X(t),
  22
      Ahn, Dittmar, and Gallant (1999) apply EMM to the Saints Model which similarly
generates a non-invertible map from factors to yields. The eﬃcient method of moments by
Gallant and Tauchen (1996), or more generally the simulated method of moments (Duﬃe
and Singleton (1993)), simulates the state variables, computes yields as a function of the
simulated states, and matches moments of the resulting simulated data to actual data.
This makes inversion unnecessary.
   23
      Unfortunately, this is impossible with a quadratic speciﬁcation, as the target rate itself
is not a Gaussian process. Assumption 1 requires squared variables to be Gaussian.


                                              20
for piecewise constant functions c0 (t), κθ (t) ∈ R and cX (t) ∈ RN , which can
be recovered from the intensity parameters. This representation shows that,
                                                         ¯
during times at which κθ (t) > 0, the target reverts to θ(t).
    There are diﬀerent ways to think about the Fed’s policy rule in this set-
ting. As an illustrative example, the original Taylor (1993) rule may be
represented by letting the target revert instantly at FOMC meetings to a
                    ¯
linear combination θ of measures of inﬂation π and output y given by

                 θ(t) = π(t) + rR + 0.5y(t) + 0.5(π(t) − π ∗ ).
                 ¯                                                         (14)

Here, rR is the real rate and π ∗ is the Fed’s inﬂation target.
     To economize on parameters, it is assumed that the slope parameters in
(13) are symmetric, in that λX := λU = −λD . Mean-reversion of the target
                                       X       X
it is also imposed by assuming λD = λU + 2λX x, where x denotes the long-
                                   0     0        ¯        ¯
run mean of X. The arrival rates of Peso events are ﬁxed to their empirical
frequency. There has been one up and one down move outside of FOMC
meetings in the 5 years from 1994 to 1998, so that we set λU = λD = 0.2.
                                                                P  P
For given long-run mean parameters, we therefore have N + 1 free intensity
parameters: λU and λX .
               0


5.2    Additional Latent Factors of Base Models
    A number of alternative approaches to the use of latent state variables
are introduced. In all of the setups, the target is included as observable state
variable and the spread s = r − θ between the short rate and the target
is among the latent factors, with s mean-reverting to zero. In addition, we
consider a traditional stochastic volatility factor as well as a Gaussian iner-
tia factor that aﬀects only the target dynamics. This inertia factor proxies
for variables (in addition to s, θ, and v) to which the Fed reacts when con-
ducting monetary policy. The stochastic intensity of policy events at FOMC
meetings may depend on all of the state variables in X. These alternative
speciﬁcations form a set of base-case models, in the sense that they are not
maximally ﬂexible in the sense of Dai and Singleton (2000): some of the cor-
relation parameters can be freed up without loss of statistical identiﬁcation.
Extensions allowing for additional correlation between state variables will be
examined in Section 8. These base-case models are summarized next.


                                      21
5.2.1   An Intensity-State Model (The λ Model)
    For this base-case model, the state vector X consists of the target rate
θ and the bivariate Gaussian variable (s, z), where s = r − θ, and z is the
inertia factor, with
                     ds(t) = −κs s(t) dt + σs dWs (t),
                     dz(t) = −κz z(t) dt + dWz (t),
where Ws and Wz are independent standard Brownian motions.

5.2.2   A Model with Stochastic Volatility (The SV Model)
    In this second base-case model, an additional factor v, beyond the spread
s, serves both as the stochastic volatility of s and as a factor aﬀecting the
stochastic intensity of policy events, in that
                ds(t) = −κs s(t) dt +     v(t) dWs (t),
                dv(t) = κv (¯ − v(t)) dt + σv
                            v                    v(t) dWv (t),
where Ws and Wv are independent standard Brownian motions.

5.2.3   A Model with Volatility and λ-Factor (The SVλ Model)
   The ﬁnal setup, the SVλ model, combines the previous two by specifying
the dynamics of three latent variables (s, v, z) by

                ds(t) = −κs s(t) dt +     v(t) dWs (t),
               dv(t) = κv (¯ − v(t)) dt + σv v(t) dWv (t),
                           v
               dz(t) = −κz z(t) dt + dWz (t),
where Ws , Wv , and Wz are independent standard Brownian motions.

5.3     Market Prices of Uncertainty
    In the SVλ model, the market prices of uncertainty σξ appearing in (9)
for the Brownian motions Ws , Wv and Wz are of the form
                      s                        
                        σξ (t)         qs v(t)
                      σξ (t)  =  qv σv v(t) 
                          v
                          z
                        σξ (t)             qz

                                     22
leading to risk premia that are aﬃne in the volatility factor v. For the λ
                             v       z
model and the SV model, σξ and σξ are not needed, respectively. In the λ
                                         s
model, the market price of uncertainty σξ for Ws is constant.
    The parametrization of σξ also captures aversion against target moves
that are driven by s, r and v. This means that even without market prices of
uncertainty for N U and N D , the intensities under the risk-neutral measure
Q may diﬀer24 from their values under P because of the state-dependence in
(13). With only 5 years of data, we choose to not parametrize the market
price of jump uncertainty for target-rate moves (for example, λU is hard to
                                                                 0
estimate even without any risk adjustment).


6      Estimation Technique and Data
    This section describes the simulation-based method used to approximate
the joint likelihood function of the target, LIBOR and swap rates, which is
not available in closed form. Moreover, it presents the approximation of the
pricing map used in the estimation.

6.1     Estimation Problem
     Let fX ( · , t|Xt, t; γ) denote the true density of the state vector Xt con-
                     ˜ ˜
                                                  ˜
ditional on the last observation Xt at some t < t. The parameter vector γ
                                        ˜
contains parameters describing the true distribution of X and parameters
governing the market prices of uncertainty. This density involves the nonlin-
ear stochastic intensities in (13). Let p( · , γ) denote the true mapping from
factors to observed yields and the target for a given γ, in that p(Xt , γ) = Yt ,
where Yt is the vector of observables at time t: yields and the target rate
θt . We assume that, p( · , γ) can be inverted to obtain the factors as function
q( · , γ) of the observables Yt , in that Xt = q(Yt , γ).
     Ideally, we would like to estimate by maximizing the likelihood of the
observations over γ, which can be obtained by a change of variables from
the conditional densities of the state variables. For example, the conditional
  24
    For example, the long-run mean of the short rate r under Q compared to that under
P is higher if qs < 0. If λU positively depends r, then the mean intensity of up-moves is
higher under Q than under P.


                                           23
density f ( · , t | Yt, t; γ) of Yt given Yt at t < t is given by
                     ˜ ˜                   ˜    ˜

          f Yt , t | Yt, t; γ = fX q(Yt , γ), t | q(Yt, γ), t; γ |
                      ˜ ˜                            ˜      ˜        Y q(Yt , γ)| .   (15)

Three problems arise. First, the true density fX of the state variables is
not available in closed form. We therefore extend the simulated maximum
likelihood (SML) method of Pedersen (1995) and Brandt and Santa-Clara
(1999) to jump-diﬀusions (Section 6.2). Second, the true maps p and q are
not available in closed form. In this high-dimensional setting, we can only
recover p( · , γ) by Monte-Carlo integration, which is prohibitively expensive.
This roadblock is bypassed by using an approximating LQJD model, for
which the Jacobian term in (15) can be calculated analytically. A time-
consuming hill-climbing procedure, based on analytical derivatives, inverts
the map from states to LIBOR and swap yields numerically for each obser-
vation (Section 6.3). Third, an exact computation of yield coeﬃcients for
the approximating LQJD model is computationally intensive, so we employ
a time-saving algorithm (Section 6.4).

6.2     Density Approximation (SML)
    The conditional density of the likelihood function of the underlying state
vector solves a partial diﬀerential-integral equation that has a closed-form
solution only for a few special cases, such as Gaussian and square-root diﬀu-
sions (Lo (1988)). To overcome this problem, simulated maximum likelihood
(SML) approach is used, which attains approximate eﬃciency similar to the
eﬃcient method of moments technique by Gallant and Tauchen (1996).25
  25
    EMM implements a simulated method of moments estimator with moments generated
by the scores of an auxiliary semi-nonparametric (SNP) density. The SNP density is a
Hermite expansion (with analytical scores) that approaches the true density as the degree
of the polynomial increases. In the case of SML, the simulated moments are scores from
the discretized model. An alternative approximately eﬃcient estimator for is proposed
by Singleton (1999), who computes explicit moments using the conditional characteristic
function ψ( · ) of X, deﬁned by ψ(u) = Et−1 [exp(i u Xt )]. Eﬃciency is achieved by
increasing the number of diﬀerent values taken on by u with one moment associated with
each choice of u. This estimator can also be used in the LQJD setting, as the characteristic
function can also be obtained in closed form. While SML is used here as a potentially
helpful alternative to EMM, the computational costs of explicit moments as in Singleton
(1999) are prohibitive in the present seasonal setting. The same caveat applies to other
GMM approaches based on explicit moments, such as those of Liu (1999) and Pan (1999).


                                            24
The density fX ( · , t | x, t) of the state X(t) conditional on the last observa-
                         ˜ ˜
       ˜
tion X(t) = x can be written, using Bayes’ Rule and the Markov property of
            ˜
X, as

           fX (x, t | x, t) =
                      ˜ ˜           fX (x, t | w, t − h) fX (w, t − h | x, t) dw,
                                                                        ˜ ˜           (16)
                                D

for any time interval h. (This is sometimes called the Chapman-Kolmogorov
equation.) SML computes (16) by Monte-Carlo integration, replacing the
density fX ( · , t | w, t − h) by the density of a discretization of X. The method
is extended in Appendix D to allow for jump-diﬀusions. Particular care needs
to be taken to accommodate time-dependent stochastic intensities.26

6.3     Pricing-Formula Approximation (LQJD Model)
    Modeling the target-rate with jump intensities deﬁned by (13) introduces
a form of nonlinearity that takes the state vector outside of the LQJD class.
The quality of an approximating LQJD-pricing formula Yt = p(Xt , γ) that
                                                                 ˜
ignores the truncation by the max-operator in (13) depends crucially on
how severely the positivity constraint on the intensities is binding (and on
the average impact of hitting the constraint). The inverse q ( · , γ) of the
                                                                ˜
approximating map p is deﬁned in the obvious way. We let
                     ˜

                  D+0 := {x ∈ D : λj = g(x, lj ) ≥ γ0 , j = U, D}
                   γ                                j


denote the set of states at which the intensity formula g( · , li) is bounded
below by a given constant γ0 , for j = U, D. The approximation q ( · , γ) is
                               j
                                                                      ˜
                                  0
likely to be better at states in D+ .
    Two estimation approaches can be taken. The ﬁrst approach is simply to
replace p in (15) by p and obtain an estimator of γ by maximizing the total
                      ˜
  26
    As any simulation-based technique, SML is computationally intensive. The applica-
tion considered in this paper does not exploit the computational advantages that would
be allowed by analytical gradients and Hessians of the likelihood function, as discussed in
Brandt and Santa-Clara (1999), because the map from the state vector to swap yields in-
volves the numerical computation of ODEs that depend on the parameters. The numerical
optimization procedure is therefore based on the Nelder-Mead simplex method, starting a
gradient-based parameter search only after the simplex algorithm has collapsed.


                                              25
approximate likelihood

                       ˜
                       f (Yt , t | Yt, t; γ) =
                                    ˜ ˜
              ˜
             (t,t)∈I

                            fX q (Yt , γ), t | q(Yt , γ), ˜; γ |
                               ˜               ˜ ˜        t            Y q (Yt , γ)| ,
                                                                         ˜
                   ˜
                  (t,t)∈I


where I denotes pairs of successive observation times in the data set. Here,
the true factor dynamics (captured by fX ) are combined with the inverse q   ˜
of the approximating map from factors to yields. Since there is no restriction
here on the parameter space, the approach is labeled unconstrained estima-
tion. The accuracy of this approximation of the likelihood can be assessed
ex post by checking whether the functions p and p are close at the estimated
                                                  ˜
parameter γ .
            ˆ
    One might be concerned that, without a-priori restrictions on the param-
eter space, it is unlikely that a good approximation of p would obtain at the
estimated parameter value. Even though this turned out not to be a problem
in the application below, an alternative approach was also tried. Speciﬁcally,
consider performing the constrained estimation

                            max{γ,γ0 }      ˜
                                                     ˜
                                                     f Yt , t | Yt, t; γ
                                                                 ˜ ˜
                                           (t,t)∈I
                  subject to             q(Yt , γ) ∈ D+0 , for all t ∈ I.
                                         ˜            γ


In words, the parameter space is restricted to contain only those parameters
at which the observations are explained by a factor realization q (Yt , γ) for
                                                                  ˜
which the γ0 -constraint never binds. The special case that was tried in this
paper is γ0 = 0. Naturally these two problems typically deliver distinct
estimators. Any such diﬀerences will be further discussed when the estimates
are presented.

6.4    Coeﬃcient Approximation
   Time dependencies introduced by scheduled announcements, such as FOMC
meetings and macro releases, immensely increase the computational burden
associated with the solution of the approximating LQJD model for yields, and
render almost impossible an estimation using data for long-maturity yields.
For example, in order to evaluate the likelihood function, we need to com-
pute the 5-year swap rate for each observation in the sample. In the setup of

                                                 26
Section 9, this takes 16 minutes on a SUN workstation.27 In setups with only
one type of scheduled announcement (at FOMC meetings), the coeﬃcients
are therefore computed using the following approximation: the time until the
next FOMC meetings is matched exactly only for the next-to-occur meeting.
The subsequent meetings are assumed to be equally spaced over the year.
For the maturities of the yields used in the estimation (6 months and above),
the errors due to this approximation are virtually undetectable. In setups
with more than one type of scheduled announcement (FOMC meetings and
macro announcements), such an approximation is no longer accurate, and is
not pursued in this paper. Instead, only yields with a maturity of up to one
year are used in the estimation, economizing on computation time.

6.5     Data
    With knowledge of an equivalent martingale measure Q, any claim to
future payoﬀs can in principle be priced. This means that the term-structure
model may be estimated with data on a broad range of assets including swaps,
Treasuries, futures (for example, Fed Funds Futures), and options (swaptions,
Eurodollar options, and Treasury options, for example). In the model of this
paper, the Fed is targeting the federal funds rate, an interbank rate that
reﬂects default risk, implying that the target rate itself is on average higher
than a short Treasury rate. Moreover, Treasury rates are further reduced
relative to interbank rates by liquidity, tax eﬀects, and specials in the market
for repurchase agreements (Duﬃe (1996)). For example, the average daily
target rate from 1994 to 1998 is 5.22%, while the 3-month T-bill and the
3-month LIBOR-rate averaged 5.06% and 5.44%, respectively. Target data
cannot, therefore, be combined with treasury rates without modeling the
spread. The empirical results reported here are therefore based on LIBOR
and swap rates. LIBOR-quality swap rates are minimally aﬀected by credit
risk because of their special contractual netting features (Duﬃe and Huang
(1996)), although they do trade at spreads to Treasuries that have, to this
point, resisted a convincing explanation (see, for example, Collin-Dufresne
and Solnik (1997)). For example, the 2, 5 and 10-year average swap rates
were 6.08%, 6.48% and 6.79% during the post-1994 period, respectively, while
  27
    The computation simulates 9 coeﬃcients (for each of the 8 state variables plus a
constant) for 10 diﬀerent bond maturities (0.5, 1, . . . , 4.5, 5 years) for each of the 261
observations, using Runge-Kutta solutions of the ODEs for the coeﬃcients.


                                            27
the same maturities had average percentage yields in the Treasury market of
5.81, 6.13, and 6.34.
    The sample period considered in the estimation is January 1, 1994 to
December 31, 1998. The dates of FOMC meetings were obtained from the
Board of Governors of the Federal Reserve. The FOMC meets eight times
a year. Two of these meetings, the ﬁrst and the fourth, extend over two
days. In the past, if the Fed changed its target during one of these two-
day meetings, the announcement was always made on the second of the two
meeting days.28 The target-rate series used in this paper diﬀers from the
series in Datastream with respect to the timing of the target change during
the two-day meeting of February 1994. Datastream assigns the change to
the ﬁrst meeting day (February 3), while the change was announced on the
second meeting day (The New York Times, February 5, 1994, page 1, “Federal
Reserve, Changing Course, Raises a Key Rate” by Keith Bradsher).
    LIBOR data are from the British Bankers’ Association, while swap rates
are from Intercapital Brokers Limited. Both series are obtained through
Datastream. LIBOR rates are recorded at 11 a.m. London time, while swap
rates are recorded at the end of the UK business day. Target-rate changes
are typically announced from 10 a.m. to 3 p.m. Eastern time.29 This means
that a move in the target on Tuesday, March 1, aﬀects recorded LIBOR
rates on Wednesday, March 2. The eﬀect on recorded swap yields is not
so precisely separable. The data sample is therefore constructed by using
Thursday (London time) observations of LIBOR and swap yields, together
with Wednesday (Eastern time) observations of the target rate. Figure 6
shows a plot of the data. The asynchronous nature of the observations is
ignored in the estimation. Whenever the respective day was a holiday, the
observation of the previous business day was used.
    The bond-pricing formula (2) extends as written to the case of LIBOR
bonds, treating r as a default-adjusted discount rate (Duﬃe and Singleton
(1997)). The 6-month LIBOR rate rL (t) at time t is in that case deﬁned by
                                                    1
                           P (t, t + 1/2) =                   .                    (17)
                                               1 + rL (t)/2
  28
     This is also true for the target rate increase from 4.75 to 5 that was decided upon
during the 2-day meeting on June 29/30 and that was announced only on Wednesday,
June 30.
  29
     For example: Sep 29, 1998 at 2:15 p.m., Oct 15, 1998 at 3:15 p.m., Nov 17, 1998 at
2:15 p.m.


                                          28
    An interest-rate swap is a contract between two parties to exchange ﬁxed
and ﬂoating coupons for a stipulated time, say τ years. One party receives
a semi-annual ﬂoating payment in form of the 6-month LIBOR rate, and
pays in exchange a ﬁxed coupon rate, the swap rate, denoted Y (t, t + τ ). At
the initiation of the swap contract, the swap rate is set so that the value
of the swap contract is zero. For simplicity, we treat swap rates as par-
coupon rates on LIBOR-quality bonds of the same maturity, putting aside
the distinct institutional features and diﬀerences in default risk of LIBOR
and LIBOR-swap markets, so that

                                        2 (1 − P (t, t + τ ))
                      Y (t, t + τ ) =      2τ                 .          (18)
                                           j=1 P (t, t + j/2)


7     Estimation Results for Base Models
    The three base-case models have diﬀerent numbers of state variables. The
λ and SV models have three, while the SVλ model has four. The same set
of yields (6-month LIBOR, 2 and 5-year swap) and the target are used for
all estimations, creating the need to break the stochastic singularity arising
from the exact map of three factors in four observed variables in the lower-
dimensional systems. This is achieved by assuming that the 2-year swap
rate is observed with measurement error. As this section investigates the
properties of the estimated models, the concept of model-implied factors will
be needed. These are obtained by inverting the map from factors to the
target rate, and to those yields that are assumed to be observed without
error at the SML estimates. The map from factors to yields is given by the
pricing formulas ((17) and (18)) from the approximating model.

7.1    Accuracy of the Approximating LQ Model
    As the true state process X is not actually of the LQJD class, because of
(13), it is important to study the accuracy of the LQ-approximating model.
From the swap-yield formula (18), one can see that it is suﬃcient to investi-
gate the approximation accuracy for zero-coupon yields.
    Zero-coupon yields Y0 (t, T ) for each observation t in the sample im-
plied by the true (nonlinear) model can be computed with Monte Carlo
integration. Consider simulating S paths of the short rate for times i =


                                         29
t + h, t + 2h, . . . , T − h starting with the model-implied state x at time t.
The time t yield of a zero-coupon bond maturing at time T is then
                                              ˆ
                                           ln P (t, T )
                           Y0 (t, T )= −
                                     ˆ                  ,
                                             T −t
where
                                    S             T −h
                                1
                    ˆ
                    P (t, T ) =           exp −          ˆx
                                                         ri [s]h .
                                S   s=1           i=t

These calculations are performed using S = 10, 000 and h = 1/365. FOMC
meeting days are additionally subdivided into 30 intervals. Given these
choices, the standard errors of the Monte-Carlo approximation of the true
yields for even the 5-year yield are suﬃciently small, from 0.93 to 1.79 basis
points.
    These zero-coupon yields implied by the true model can be compared to
the yields from the approximating LQ model that are based on the same
model-implied state vector. Table 1 shows that the mean absolute approxi-
mation error made by the LQ model is around 1-3 basis points, with a stan-
dard deviation of 1-2 basis points for the constrained SV model and an even
smaller error for all SVλ models. The approximation error for these speci-
ﬁcations is thus of similar magnitude as the bid-ask spread of swaps. The
unconstrained estimates of the SV model and all λ models produce about 5
times the approximation error, which seems too large to be acceptable.
    In the following, “SV model” and “λ model” will therefore denote the
constrained version of those models, while the unconstrained 4-factor model
will be called “SVλ model”.

7.2     Parameter Estimates
    Table 5 reports the estimated parameters and t-ratios for all base models.
In all models, the short rate quickly reverts to the target, which in turn
                           ¯
reverts to a parameter θ that is ﬁxed to the sample mean, 5.22%, of the
target rate. The speed κs of the mean-reversion is highest in the SVλ model,
implying a weekly autoregressive coeﬃcient of exp(−κs /52) = 0.83 and a
half life of shocks to the spread of less than 1 month. The mean-reversion of
the volatility process v and inertia process z are roughly equally slow in the
λ and SV models, while the weekly autoregressive coeﬃcients of z and v in

                                          30
    Table 1: Approximation Error made by LQ Model (in Basis Points)

        Maturity                     λ Model           SV Model        SVλ Model
                                    Con. Unc.         Con. Unc.        Con. Unc.
        6-mth    mean abs.AE        4.08     8.56     2.85     11.32    2.60    2.11
                 std of abs.AE      2.73     9.26     2.20     10.69    1.91    1.11
                  average SE        0.53     0.38     0.46     0.29     0.59    0.67
         2-yr    mean abs.AE        11.43    20.99    2.62     19.43    2.17    1.73
                 std of abs.AE      8.78     18.59    1.50     18.18    1.36    0.85
                  average SE        0.93     0.66     0.85     0.59     1.07    1.13
         5-yr    mean abs.AE        37.74    28.92    1.76     9.37     1.54    1.86
                 std of abs.AE      20.16    20.74    0.72     9.30     0.81    0.73
                  average SE        1.19     0.93     1.28     1.07     1.64    1.79

NOTE: This table presents summary statistics about the approximation errors in basis
points made by the approximating LQ models over the sample January 1, 1994 to Decem-
ber 31, 1998. Due to the seasonality introduced by FOMC meetings, the approximation
errors in this setup depend on time t even for a given value of the state vector. The
table therefore reports the mean average absolute approximation error |Y (t, T ) − Y0 (t, T )|
and its standard deviation over the sample (ﬁrst and second row). The table also re-
ports the average standard errors of the Monte Carlo approximation of true yields Y0 (t, T )
(third row). These are obtained using the Delta method by viewing the simulated bond
       ˆ
price P (t, T ) at time t as the estimated mean of an i.i.d. population of random variables
exp( T −h ri [s]h). The table reports the average standard errors over the sample.
        i=t ˆ


the SVλ model are quite diﬀerent, 0.82 and 0.99 respectively, implying half
lives of 1 and 17 years.
    The ﬁtted intensity parameters of the SVλ model imply that a 25-basis-
point increase in the spread at the beginning of an FOMC meeting raises
the conditional probability of an upward move in the target for that meeting
by about 5%, indicating that the Fed does not react much to the short
spread s. A 25-basis-point decrease in the target lowers the probability of an
upward move during the FOMC meeting by about 11%, reﬂecting the slow
mean-reversion of θ. A one-standard-deviation shock to v and z increases the
conditional probability of a target increase by about 14 and 30%, respectively.
This shows that the inertia factor z has a larger eﬀect on the stochastic
intensity of target moves than v. While the magnitude of these eﬀects varies


                                             31
across the base-case models, the directions of the eﬀects are the same.
   The “measurement error” of the 2-year swap rate in the three-factor λ
and SV models is persistent, with a weekly autocorrelation coeﬃcient that
varies across models from 0.95 to 0.98, and a standard deviation that varies
from 12 to 21 basis points.

7.3      Interpretation of Model-Implied Factors
    Across all base-case models, estimates of the correlation between the fac-
tors, LIBOR, swap and target rates are reported in Table 2. These corre-

   Table 2: Correlations of Model-Implied in Factors, Yields and Target

                 λ Model        SV Model             SVλ Model          LIBOR & Swaps       Target
 Model           r    z          r    v          r       z     v      6-mth 2-yr 5-yr         θ
  λ       r        1                                                     .56  .26   .14        .66
          z      .01     1                                               .44  .89   .97        .07
   SV     r      .85 −.18        1                                       .57  .13 −.01         .75
          v      .07  .99     −.09       1                               .55  .93   .99        .12
  SVλ     r      .67 −.24      .77    −.16         1                     .54 −.03 −.07       −.12
          z       24  .63      .12     .64      −.18        1            .44  .81   .65        .13
          v     -.08  .78     −.19     .78      −.01      .01     1      .37  .55   .76      −.03
   DS     r      .07 −.57      .21    −.54       .57     -.90   .05    −.05 −.62 −.50        −.26
          θ      .23  .82      .10     .85      −.16      .93   .34      .63  .96   .86        .29
          v    −.15 −.10      −.26     .62       .02    −.19    .97      .19  .33   .59      −.21

NOTE: This table computes the correlation of the ﬁrst diﬀerences of model-implied factors
(r, z and v) from the unconstrained estimations, the model-implied factors (r, θ, v) of
the DS model (at their estimated parameter vector), the 6-month LIBOR rate, the 2 and
5-year swap rates, and the target rate θ over the sample January 1, 1994 to December 31,
1998: All correlations with the target rate are computed using the subsample of FOMC
meetings.

lation estimates are useful for characterizing the factors as ‘level,’ ‘slope,’
and ‘curvature’ (in the language of Litterman and Scheinkman (1993)), and
in comparing their respective roles in explaining yields across base models.
In addition, these correlations can be used to detect misspeciﬁcation in the
model.


                                           32
    The two model-implied latent factors, for both the λ and the SV model,
are almost uncorrelated. For both models, the model-implied short rate r is
correlated most highly with the shortest yield in the estimation, the 6-month
LIBOR rate, while the second latent factor (z for the λ model and v for
the SV model) behaves much like the longest yield. Table 3 shows that, for

       Table 3: Which Factors drive the Probability of Target Moves ?
                 λ Model                  SV Model                 SVλ Model
            Constr. Unconstr.         Constr. Unconstr.         Constr. Unconstr.
       s        .44           .53         .60           .55         .40           .48
       θ       −.33          −.33        −.48          −.48        −.33          −.36
       z        .85           .83          −             −          .86           .71
       v         −             −          .65           .67        −.09           .26

NOTE: To obtain a measure of importance of a factor for the stochastic intensities of
policy events, this table shows the correlation between changes in the model-implied factors
(s, θ, v, z) and changes in the function λ0 + λs s(t)+ λθ θ(t)+ λv v(t)+ λz z(t) for the weekly
sample from January 1, 1994 to December 31, 1998.

both models, the second latent variables are the main driving force behind
the conditional probability of target-rate changes. The short rate is pulled
towards these during FOMC meeting days. (This is also true under the risk-
neutral measures Q). The high correlation of v and z across λ and SV models
seems to indicate that the role of v as conditional second moment of shocks to
r is dominated by its importance in setting the stochastic intensities of policy
events. In the SV model, r is less related to longer yields than in the λ model.
This can also be seen from the estimated mean-reversion parameter κs , higher
for the SV model than for the λ model, and its correlation with target-rate
changes on FOMC meetings, which is also higher in the SV model. This may
be explained by the fact that, for a nonzero market price qs of uncertainty,
the SV model has the additional ﬂexibility of allowing r to revert under Q
to the continuous variable v. In the λ model, the conditional mean between
FOMC meetings is constant.
    The SVλ model is characterized by latent variables z and v that roughly
correspond to the stochastic mean θ and volatility v factors of the “A1 (3)DS
model” by Dai and Singleton (2000, DS), as can be seen from their correlation
of 0.93 and 0.97, respectively. The comovements with yields indicate that z
behaves much like the 2-year swap rate, and that v is related to the 5-year

                                              33
                     Zero-Coupon Yield Coeﬃcients in SVλ Model
                      1
                    0.9
                                  uz (t, T ) × 100
                    0.8
                    0.7
                    0.6
                           uθ (t, T )         uv (t, T )/100
                    0.5
                    0.4
                    0.3
                    0.2
                                      us (t, T )
                    0.1
                      0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5
                      0

Figure 4: Zero-Coupon Yield Coeﬃcients as a function of time to maturity (T − t =
1, 2, . . . years) taken from the (unconstrained) SVλ Model: Y0 (t) = u0 (t, T )+us (t, T )s(t)+
uθ (t, T )θ(t)+uv (t, T )v(t)+uz (t, T )z(t) at two typical dates t. The solid line is at an FOMC
meeting. The dotted line is at the day after the meeting.


swap rate. The two models imply, however, very diﬀerent short rates. The
sample mean of the model-implied r in the DS model is -0.46%, while the
average r in the SVλ model is 5.02%. While the SVλ model implies that r
is closely related to the short end of the yield curve, the DS model produces
a short rate that behaves like the slope of the short end of the yield curve.
This can be seen from the correlations between the model-implied short rates
r from the two models and the LIBOR rate (0.54 for the SVλ and −0.05 for
the DS model), and the diﬀerence between the 2-year swap and the LIBOR
rate (0.58 for the SVλ and 0.86 for the DS model). From Table 3, the arrival
intensity of Fed moves in the SVλ model is mostly driven by z.
    More insights can be obtained from Figure 7.3, which shows, for the
SVλ model, the linear dependence of zero-coupon yields on all factors, as a
function of maturity. As shocks to the factors are uncorrelated in the base
models, we can interpret these yield coeﬃcients as instantaneous impulse-
responses of yields to the various shocks. We see that the response of yields
to changes in the target θ is monotonically decreasing with maturity, as are
the responses to shocks to the spread s. Both θ and s are “slope factors”,
but act on diﬀerent parts of the yield curve, as the impact of target-rate
increases dies oﬀ more slowly with maturity than do the impacts of shocks

                                               34
to s. Changes in the inertia factor z cause a hump-shaped reaction in yields,
with a peak at 2 years, making z a “curvature factor”. One can interpret
this as a policy-inertia eﬀect: A positive shock to z increases the conditional
probability of moves up in the target not only at the next FOMC meeting, but
also at subsequent meetings, as shocks to z have a half-life of 1 year. Finally,
shocks to v aﬀect yields at all maturities, reﬂecting the strong persistence of
volatility which has a half life to shocks of roughly 17 years. In this sense, v
is a “level factor”.

7.4       Results about Yield Dynamics
    A measure of goodness of ﬁt of the approximating LQ model is the error
with which it determines yields that are not used in the estimation. Pricing
errors are deﬁned as the diﬀerence between the actual yield and the model-
implied yield, which is computed by inserting the model-implied factors into
the LIBOR and swap formulas ((17) and (18)) based on the approximating
LQ model, respectively. The pricing error is thus composed of both model
misspeciﬁcation and approximation error.
    Table 4 reports the average absolute pricing errors and their sample stan-
dard deviations for the base-case models and for the DS model as a point of
reference. The parameters of the DS are estimated with weekly data on the
same LIBOR and swap rates with the exception of using the 10-year instead
of the 5-year swap, but over a sample period that only partially overlaps
with our sample (April 3, 1987 to August 23, 1996). They have not been
reestimated. The pricing errors of the unconstrained λ and SV λ models are
lowest among these. The λ model matches the short end of the yield curve
extremely well with average absolute errors of around 6 to 13 basis points.
In addition to matching the short end well, with 11-26 basis point errors, the
SV λ model produces low errors, of around 2 basis points, at the long end of
the curve. As can be seen from the table, the latter model outperforms the
DS model, especially at the short end. The incorporation of the target as a
fourth factor appears to provide a manageable way of ﬁxing the short end of
the yield curve. The SV model has overall higher pricing errors (ranging from
8 to 18 basis points) when the model is evaluated at the constrained parame-
ter vector, and about 1-7 basis points higher than that for the unconstrained
parameter.30
 30
      While pricing errors are a measure of ﬁrst moments that depends on the realized path


                                            35
  Table 4: Pricing Errors for Yields not used for Fitting (in Basis Points)
                        λ Model          SV Model         SVλ Model         DS Model
                       Con. Unc.        Con. Unc.         Con. Unc.
    1 mth     mean      13.5    12.53   18.48    25.41    33.57   25.72       237.34
               std     11.07    11.29   11.59    13.96    16.69   16.36       115.48
    3 mth     mean      7.90     7.48   11.85    15.48    16.74   11.00       66.17
               std      6.22     6.33    6.65     7.71     7.41    6.75       30.72
   12 mth     mean      6.67     6.84    9.90    12.44     4.72    3.22        9.93
               std      6.48     6.58    7.17     8.77     3.53    2.92        4.83
    3 year    mean     10.68     9.47   14.96    16.94     8.46    1.76        6.41
               std      5.89     5.66    8.79     9.03     2.35    1.13        1.54
    4 year    mean      6.40     5.76    7.98     8.81    10.14    1.78        5.70
               std      3.17     3.08    4.55     4.51     2.45    1.02        1.51

NOTE: This table presents the mean and the standard deviation of the absolute value
of the pricing error in basis points over the weekly data sample from January 1, 1994 to
December 31, 1998 made by the LQ approximating model and, as a reference, for the
DS model (at their parameter estimates). Using their sample period, Dai and Singleton
(1999) report mean pricing errors for weekly 3, 5 and 7 year swaps rates of -11.3, 16.9 and
-12.7 basis points with standard deviations of 9.6, 16.5 and 10.1 basis points.


    Figure 7 shows the standard deviation of yield changes as a function of
maturity, the so-called term structure of volatility (or ‘vol curve’). The vol
curve is “snake-shaped,” in that volatility is high at the very short end,
declines until maturities about 3-6 months, after which it has a “hump” at a
maturity of 2 years. The hump has already been documented by Litterman,
Scheinkman, and Weiss (1988). The statistical key to generate a hump in
a term structure model is negative correlation between the state variables,
which can be attained, for example, by a stochastic mean model (Dai and
Singleton (2000)).
    In informal accounts, monetary policy has been conjectured to be re-
sponsible for the hump (Fleming and Remolona (1999)). This claim is here
validated in the sense that the source of the hump is precisely the factor
of yields through the use of model-implied factors, we can obtain a history-independent
check by simulating 20, 000 samples of weekly yields. The average 6-month LIBOR, 2 and
5-year swap rates in these simulated samples are 5.59, 5.76 and 5.89%, respectively, which
shows better how the high persistence of yields makes it diﬃcult to match ﬁrst moments.


                                            36
that is responsible for the stochastic intensity of target-rate moves. In other
words, the hump in the coeﬃcient on the inertia factor translates into a
hump in the vol curve. This can be seen from Figure 7, which shows the
volatility curve in simulated data from the SVλ model, which reproduces
the overall “snake-shaped” pattern quite well. That is, bond investors may
view the Fed’s policy to be one of slow adjustment of the target rate, so that
certain shocks to the economy are allowed to fully aﬀect short term interest
rates only with some delay. Rates at medium maturities such as 2 years
would respond immediately to the anticipated cumulative eﬀects of short
rates over a two-year period, and thus have greater volatility then do the
Fed-dampened short rates. At suﬃciently long maturities, beyond 2 years,
mean reversion in short-term market conditions causes short term shocks to
have smaller and smaller impacts on longer and longer rates. The net eﬀect
is the hump-shaped vol pattern.
    Another feature of the simulated curve is the high volatility of the short
rate, the ‘head of the snake’, especially at FOMC meetings, which was also
found in the data. When attention is restricted to the subsample of FOMC
meeting days, the base model somewhat overstates the volatility of maturities
around 6 months. This point will be taken up further when model extensions
are discussed in Section 8.

7.5    Model Implications Regarding Target Dynamics
    From each of the yield-curve setups, it is possible to derive a discrete-
choice model in which, at each FOMC meeting, the Fed is viewed as ran-
domizing over three possible choices: moving the target up, down, or not at
all. The conditional probability of a particular choice at the FOMC meeting
at t depends on the state “right before” t and is obtained from its empirical
frequency in a simulated sample of size S = 10, 000 that is generated by
simulating forward in steps of one day’s length starting at the actual value
of the implied state at the last observation. Outside of FOMC meetings,
the discrete-choice model assigns a small and constant probability to Peso
events. The conditional probability of target moves up and down for each
FOMC meeting since January 1994 is plotted in Figure 9. The conditional
likelihood of moves up is very high at the end of 1994, when in fact the Fed
increased the target in several steps, and again quite large around the target
increase in March 1997. The conditional probability of moves down is high
in 1995/96 and 1998, both years in which the Fed lowered the rate on several

                                      37
occasions.
    Do these conditional probabilities provide a good description of target
dynamics? Table 6 compares forecasts by the model-implied discrete choice
to forecasts based on alternative ways to describe target-rate moves. As
there have been only 7 increases and 5 decreases in the target at FOMC
meetings over the sample period 1994-1998, these results suﬀer from small-
sample noise, and are not intended to oﬀer a serious forecasting comparison.
They do provide, however, a device that might help one characterize the
implications of the model regarding Fed moves.
    Following the discrete-choice literature, a forecast is taken to be the alter-
native with the highest conditional probability. The standard reference setup,
usually labeled ‘constant probability model,’ is a version of the discrete-choice
model in which the conditional probabilities are set equal to their empirical
frequencies. For target moves, these frequencies are small (7/40 for “up” and
5/40 for “down”), so that this version always forecasts that the Fed is not
going to change the target. In other words, this speciﬁcation generates the
same forecasts as a random walk for the target, and are reported under ‘No
Change.’ A second reference model that seems useful is a random walk for
the ﬁrst diﬀerenced target; its forecasts are reported under ‘Same Change.’
For example, the ﬁrst column of Table 6 indicates that of the 7 target-rate
increases that occurred at FOMC meetings, the ‘Same-Change’ model would
have predicted 2 correctly, while it would have forecasted no move in the
remaining 5 cases.
    The forecasts made by the unconstrained base-case models vastly outper-
form those of the constrained models in terms of the overall percentage of
correctly forecasted Fed moves (62.5% and 75%, compared to 20% and 30%,
respectively). The reason is that the constrained models are characterized by
large probabilities of nonzero moves. For example, none of the constrained
models is able to predict a zero target-rate move correctly. It is worth noting,
however, that the constrained SV and SVλ models never miss the direction
of the move, leading to a perfect score in forecasting nonzero moves. In other
words, conditional on a Fed move, these models always predict the right sign
of a move. The constrained λ model is special in this regard, as it implies
high probabilities of target-rate increases, and therefore performs badly with
respect to conditional forecasts as well.
    The unconstrained SV and SVλ models produce forecasts that are also
more accurate than those of the reference models in terms of the overall
correct forecasting percentage: both the SV and the SVλ model predict 75%

                                       38
of the target moves on FOMC meetings correctly, compared to only 60%
using the ‘Same-Change’ model and 70% using the ‘No-Change’ model. It
is also interesting to see that the unconstrained λ model forecasts target-
rate increases better than do all other unconstrained models, including the
reference models.

7.6    Model Implications for Policy Rule
    Policy rules are structural equations that specify the map from a set of
variables to the policy instrument of the central bank. Recursively iden-
tiﬁed VARs, for example, typically contain one equation, describing the
data-generating process of the federal funds rate, that can be interpreted
as a policy rule plus some orthogonal monetary policy shock (Christiano,
Eichenbaum and Evans (1998)). This identifying assumption also underlies
regressions that one ﬁnds in the Taylor-rule literature of the funds rate on
current inﬂation and the output gap, with quarterly data. (See, for example,
John Taylor’s article in Taylor (1999)). The use of contemporaneous right-
hand-side variables has been criticized on practical grounds: policy-rule plots
shown by the Fed staﬀ to the FOMC at the meetings cannot be based on
variables that are yet to be released (Orphanides (1998)).
    In the yield model, we can analogously identify a policy rule by calculating
the conditional expected value of the target as a function of model-implied
factors, using the estimated SML parameters. Under this identifying assump-
tion, the Fed reacts to the value of the state ‘just before’ the meeting. This is
a real-time high-frequency policy rule of the Fed. For a given parameter value,
the rule can be backed out by the staﬀ from daily and even higher-frequency
data. With weekly observations, the model-implied policy rule at t is
         ¯                    ˜           ˜           ˜             ˜
         θ(t) = 0.36 + 0.10 s(t) + 0.87 θ(t) + 7.51 v(t) + 0.0033 z(t),

where t = t − 1/52. This shows that the spread is not an important determi-
        ˜
nant of the rule and there is a strong interest-rate smoothing term. While the
volatility and inertia factors might seem unfamiliar as arguments in a policy
rule, they simply represent yield-based information that the Fed considers
when setting a target, which might act as a suﬃcient statistic for macroe-
conomic variables the Fed cares about. As the inertia factor mostly drives
the conditional probability of target moves, it is also the most important
determinant of the policy rule, in addition to interest-rate smoothing.

                                       39
    Figure 9 compares the model-implied rule to three rules. The ﬁrst rule
is the original Taylor rule recommended by Taylor (1993). The rule is given
by equation (14), with the quarterly averaged fed funds rate on the left-
hand side. On the right-hand side, π is taken to be the four-quarter average
inﬂation rate, computed using the GDP deﬂator, while the output gap y is
the percentage deviation of real GDP from its trend (based on a Hodrick-
Prescott ﬁlter). The two other policy rules are a Taylor rule with estimated
coeﬃcients and an estimated extended Taylor rule that additionally includes
the lagged federal funds rate as explanatory variable in (14).31 To mimic the
decision process of the Fed, we plot for each FOMC meeting the policy rule
that corresponds to the quarter in which the meeting took place, leaving us
with 40 data points.
    By eyeballing, the model-implied rule seems to be a better description of
the actual target. This is conﬁrmed by the mean absolute diﬀerence between
actual target and the value of the target prescribed by the policy rule. For the
original Taylor, the estimated Taylor, the extended Taylor and the model-
implied rule, the mean absolute diﬀerence is 162, 41, 22 and 10 basis points,
respectively. The reason for the better ﬁt of the model-implied rule is that the
model-implied state variables, especially the inertia factor, anticipate many
of the target rate changes, while the Taylor-based rules only catch up slowly
with the target.


8      Extensions
    The base-case models restrict many correlation parameters to zero. Free-
ing up certain parameters of the SVλ model, we have estimated special cases
  31
    The regressions use quarterly data from 1994:1 to 1998:12. In the Taylor rule, the
estimated constant is 6.1031, the coeﬃcients on π and y are -0.4871 and -0.8058 with
standard errors 0.5485, 0.3764 and 0.3068, respectively, calculated with 2 Newey-West
lags. The R2 is 23%. In the extended Taylor rule, the estimated constant is -0.3602,
the coeﬃcients on π, y and the lagged fed funds rate are 0.4968, 0.3411 and 0.9105 with
standard errors 0.7473, 0.1907, 0.2251 and 0.0861, respectively. The R2 is 91%. The data
was obtained from the Federal Reserve Database.


                                          40
of:
       ds(t) = −κs s(t) dt + v(t) dBs (t) + σsv            v(t) dBs (t) + σsz dBz (t)
               +Js (dN U (t) − dN D (t)),
       dv(t) = κv (¯ − v(t)) dt + σv
                   v                        v(t)dBv (t) + Jv (dN U (t) − dN D (t)),
       dz(t) = −κz z(t) dt + σzs v(t) dBs (t) + σzv           v(t) dBv (t) + dBz (t)
               +Jz (dN U (t) − dN D (t)).
Especially interesting extensions are those that allow the spread and the
inertia factor to jump at FOMC meetings, as they would introduce a seasonal
correlation between factors that may help in producing a hump in yield
reactions to target-rate changes and in the vol curve in weeks of FOMC
meetings. As the coeﬃcients determining the dependence of yields on the
inertia factor z are hump-shaped, z has the potential to generate just such
hump-shaped patterns. The estimate in Table 7 of Jz is positive, meaning
that an increase in the target is estimated to increase z as well, triggering yet
more future θ-increases. The plot of the resulting yield reactions in Figure 10
shows that, for Jz = 0.3, there would be hump, but the estimated Jz = 0.1
does not suﬃce to generate it nor does it generate a hump in the vol curve
at FOMC meetings.
    Freeing up Js allows for negative correlation between the target and the
spread, opening another channel for a hump. Table 7 shows an estimate for
Js that is negative, −25.29 basis points, and signiﬁcant. Figure 10 shows
that this negative correlation produces humps, but that the hump in the
yield reaction to θ-changes leads to a low impact of θ-changes on the short
end of the yield curve, which is counterfactual (at least from a comparison
with Figure 7).32


9        FOMC Meetings and Macro Variables
    As the Fed reacts to macroeconomic variables (taken to be nonfarm pay-
roll employment33 and CPI inﬂation) when ﬁxing its target, expectations
  32
     Allowing for jumps in the volatility process v turned out to be unnecessary, as the
estimate for Jv was close to zero and not signiﬁcant. Table 10 also shows an estimate of
σsv that turns out to be positive, an old stylized fact that goes back to Cox, Ingersoll, and
Ross (1985): Conditional volatility and the short rate are positively correlated.
  33
     The relevance of this variable can, for example, be seen from the Minutes of past
FOMC meetings. In six out of eight FOMC meetings in 1996, the Board’s discussion of

                                             41
about future macro variables matter for current yields. In this section, time-
series models are explored as possible candidate descriptions of macro dy-
namics, but a state-space system for the joint data-generating process of
analyst forecasts and actual releases is eventually preferred due to its more
accurate measure of ‘macro news.’ The state-space system is set up after es-
tablishing two facts: One cannot reject the unbiasedness of analyst forecasts
at conventional p-values (at least post-1994), and the correlation between
employment and inﬂation is weak. The system is then build into the model,
respecting the exact timing of analyst forecasts and macro releases.

9.1     Data on Analyst Forecasts and Actuals
    Employment and CPI releases are made by the Bureau of Labor Statistics.
Employment releases are at 8:30 a.m. on the ﬁrst Friday of each month, while
CPI ﬁgures are released about two weeks after the end of the reference month,
also at 8:30 a.m. This means that the LIBOR recorded at 11 a.m. London
time is aﬀected by a macro release on the preceding day, while swap rates
recorded at the end of the London business day react on the same day as the
macro release.34
    The actual and released CPI and nonfarm payroll employment (NPE)
series are from Money Market Services (MMS). The raw series obtained from
MMS are the monthly percentage change in the CPI and changes in nonfarm
payroll employment in thousands. The CPI series is multiplied by 1200 to
obtain the annualized inﬂation rate, and changes in employment are divided
by 100 (to obtain a series that is similar in magnitude to CPI inﬂation). MMS
collects data on analyst forecasts each Friday prior to the actual release from
about 40 money market managers and reports their median forecast. These
analyst-forecast data have been used in most studies of release surprises (for
example, Balduzzi, Elton, and Green (1998), Fleming and Remolona (1998),
and Li and Engle (1998)).
the “economic and ﬁnancial outlook” started with a general overview of the state of the
economy and then immediately turned to the value of nonfarm payroll employment.
  34
     This asynchronicity does not matter for estimation results, as they are obtained with
LIBOR rate only. Swap rates are only used to generate stylized yield facts around macro
releases.


                                           42
9.2     Dynamics of Macro Variables
    The monthly time series of changes in nonfarm payroll employment (NPE)
and CPI inﬂation (CPI) for the sample period considered in Section 5 con-
tains only 60 monthly observations. Evidence about the macro dynamics
will therefore be collected using all available data from MMS, which started
surveying NPE forecasts35 in January 1985.
    Are actual investors’ forecast errors well approximated by the time-series
model? Over the sample period 1985:6 to 1998:12, it is not possible to outper-
form analyst forecasts in the mean-squared-error sense with one-step-ahead
forecasts of univariate or bivariate ARMA speciﬁcations (even conditioning
on past target values). The errors of analyst forecasts are positively corre-
lated with those of time-series models, but this correlation not perfect. For
instance, the sample correlation coeﬃcient is at most 0.65 for the CPI and
0.85 for NPE. A reason for the relatively low correlation between analyst er-
rors and model errors is the oversimpliﬁed informational structure assumed
by the low-dimensional time-series model. When forecasting, actual investors
are able to condition on a wealth of state variables. The approach to be taken
here is therefore to explore a state-space model of macro variables and ana-
lyst forecasts that introduces latent variables summarizing the conditioning
information.
    The ﬁrst step in setting up this state-space system is to check for the
unbiasedness of analyst forecast mF = (mCP I , mN P E ) of the vector m =
                                               F      F
(mCP I , mN P E ) of CPI and NPE. We test for each variable whether ci = 0
                                                                         0
and ci = 1 when ﬁtting
      1

              mi (t) = ci + ci mi (t) + i (t),
                        0    1  F                    i = CP I, NP E,               (19)

where i is white noise. Unbiasedness cannot be rejected at the 1% level for
NPE, but is strongly rejected for the CPI series for the period 1985-1998.
Concentrating on the post-1994 subsample, that used for the yield-curve
model, CPI forecasts also “pass” the unbiasedness test at the 1% conﬁdence
level.36 Finally, three lagged values of CPI, NPE and the target rate were
  35
     The extension of the sample period to pre-1994 is justiﬁed if the precise timing of
policy events (target moves at FOMC meetings versus moves at random business days)
does not matter for how policy impacts monthly macro series, an assumption which seems
to be reasonable.
  36
     Balduzzi, Elton, and Green (1997) and Li and Engle (1998) conduct this test with a
short history as well, and fail to reject the null.


                                          43
included on the right-hand side of (19), but none had a signiﬁcant coeﬃcient,
except perhaps for a weak eﬀect of the ﬁrst lagged CPI on NPE. To conclude,
analyst forecasts of CPI and NPE provide a reasonably good description of
the conditional expected values of these variables, at least in the post-1994
period, so that (19) will be used with ci = 0 and ci = 1.
                                         0          1

                Correlation of CPI at time t and NPE at t + j
                         1
                       0.8
                       0.6
                       0.4
                       0.2
                         0
                      -0.2
                      -0.4
                      -0.6
                      -0.8
                        -1 -20 -15 -10 -5 0 5 10 15 20
                                    Lags and Leads j
Figure 5: The cross correlation between CPI √ time t and NPE at time t + j together
                                            at
with approximate 95% conﬁdence bounds (±2/ T ).


    The second step in setting up the state-space system for NPE and CPI
is an examination of their correlation. The contemporaneous correlation be-
tween the two variables is small (0.017). When lagging one of these variables,
the correlation estimates rarely exit approximate 95% conﬁdence bounds
around zero, as can be seen from Figure 5. In fact, NPE does not help much
in predicting future CPI, as shown by Table 8. The CPI might, however,
help in forecasting NPE. Including lagged values of the CPI in an AR(3)
speciﬁcation, for example, of NPE leads to a small gain in adjusted R2 (9.5%
to 12.1%). For each of CPI and NPE, we therefore specify conditionally
independent subsystems, using the target rate as an exogenous variable.
    The ﬁnal requirements of the state-space system are that the macro vari-
ables are part of the state, and that the state is a four-dimensional autore-
gressive process of order 1 (this last choice will be further discussed below).
    In the continuous-time economy, one deterministic counting process N i
                                                   i
records macroeconomic releases and another, NF , counts the times at which
analyst forecasts are made. Selecting a release time τ i and the succeeding


                                        44
                       i
analyst forecast time τF , we can summarize the speciﬁcation as:
       mi (τ i ) = mi (τ i ) + i (τ i )
                      F                                                                 (20)
      mF (τF ) = a0 + ai mi (τF −) + ai mi (τF −) + ai θ(τF −) +
       i    i       i
                           1 F
                                  i
                                        2
                                             i
                                                     3
                                                          i                  j   i
                                                                             F (τF ),

where i (τ i ) and i (τF ) are jointly Gaussian independent across time with
                    F
                         i

mean zero, respective variances σ i and σF , and covariance σmF .37
                                            i                   i

    Maximum-likelihood parameter estimates for (20) are reported in Table
                                             i
9, except for the covariance parameter σmF , which was estimated to be es-
sentially zero for both CPI and NPE. The forecast mi of mi does not depend
                                                        F
on the past value of mi (both aCP I and aN P E are small and insigniﬁcant),
                                   2           2
and depend only slightly on the past target (aCP I is somewhat larger than
                                                   3
aN P E , but neither is signiﬁcant). Based on this, mCP I and mN P E are both
 3
treated as the sum of an AR(1), the forecast mF , and Gaussian white noise.
This means that with this speciﬁcation, we ﬁnd an ARMA(1,1)-structure.
This is also the speciﬁcation selected by comparing Akaike and Schwarz cri-
teria of ARMA(p,q) models that include p lagged values of the target rate.
The restrictiveness of a four-dimensional state can be checked by including
additional lagged values of mF and m in the equation determining analyst
forecasts. We do not ﬁnd additional signiﬁcant terms for the CPI forecasts,
but an additional moving average for NPE forecasts improves the model-
selection criteria. For the term-structure model, we adopt the speciﬁcation
(20), and re-estimate the parameters after setting a2 to zero, but leaving a3
in the speciﬁcation so as to leave a free channel for both an eﬀect of monetary
policy on macro variables and some correlation between the CPI and NPE.

9.3     Estimation
   The state vector X is now augmented by (m(t), mF (t)), of which only
m(t) enters the stochastic intensities (13). If an FOMC meeting and a macro
   37
      Fore more intuition, consider the problem of modeling mF and m in discrete time.
Let the state be denoted by z = (z CP I , z N P E ) with z i ∈ R2 , i = CP I, N P E. Leaving
out the dependence on the target for the moment, the observation equations in the state
space system are (1) mi (t + 1) = α0 + α1 z1 (t) + α2 z2 (t) and (2) mi (t) = α0 + z1 (t).
                          F
                                                    i         i                           i

This shows that z2 is the latent state that summarizes the information used by investors
to form the forecast mF . The state equation is z i (t) = Ai z i (t − 1) + u(t) with u(t) ∼
                     0    1
N(0, Ωi ), Ai =               . This system is the maximally ﬂexible system that imposes
                    αi αi
                      1    2
that (i) mi (t) is the conditional mean of mi (t), (ii) independence between CPI and NPE,
           F
(iii) mi is part of the state z i , and (iv) z i is an AR(1). This is equivalent to (20).

                                            45
jump event happen on the same day, the Fed’s target decisions are able to
condition on the newly released information, as CPI and NPE releases (at
8:30 a.m.) precede FOMC meetings. The approximation of the likelihood
function is derived in Appendix F. The estimation uses 1, 3, 6, and 12-month
LIBOR rates, the target rate, CPI, NPE, and analyst forecasts. In addition to
the weekly observations from Section 5, we include the day of and preceding
CPI and NPE releases into the sample.

9.4    Results with Macroeconomic Variables
    The SML parameter estimates are not reported here. A major diﬀerence
to the estimation with swap yields is the magnitude of the mean-reversion
parameter of the interia factor z. This already hints that the hump in yield-
coeﬃcients at this parameter vector peaks before maturities around 2 years;
in fact, the peak is at 6 months.
    The impact of the surprise component mi (τ i ) − mi (τ i ) of a macroeco-
                                                           F
nomic release on yields is determined by how it aﬀects the stochastic intensity
of policy events. There are two possible channels for the impulse response of
a release surprise. First, the surprise can directly aﬀect λU and λD at FOMC
meetings that are scheduled before the next macro release. The eﬀect is
propagated to future intensities only through the dependence of the inten-
sities on the past target. Second, the surprise can impact the future path
of macro variables that enter the intensities at much later FOMC meetings,
in addition to its direct eﬀect. Given the dynamics (20), the surprise is a
temporary component of the macro variable: it does not aﬀect the path of
future macro variables. Release suprises therefore have a ‘short life’ by being
propagated only through the ﬁrst channel. The estimated parameters λCP I
and λN P E indicate that this eﬀect is small. In the base-case model, we thus
get the result that release surprises are not inertia factors themselves. By
introducing, for example, a jump in the inertia factor z at release days that
is correlated with the release surprise, we can make the release surprise ‘live
longer,’ as a shock to z aﬀects intensities in the farer future.
    Figure 11 shows the cross-sectional contemporanous impulse response of
yields to a one-standard deviation release surprise to NPE and the CPI.
We can compare these model-implied impulse responses to a least-squares
regression of yield changes on release surprises (also in Figure 11). We can
see that a NPE release surprise has a larger impact than a CPI surprise, which
translates into a stronger seasonal eﬀect on the term structure of volatilities

                                      46
of yields. The response of yields monotonically descreases with maturity
because of the shock’s propagation through only the ﬁrst channel.


10     Conclusion
    The estimated yield-curve model explains the “snake-shaped” term struc-
ture of volatility in yields, based on interest-rate smoothing and policy in-
ertia. Macroeconomic surprises are only temporary components of macro
variables. This means that the impact of these surprises on longer yields
needs to occur over time through a “policy-inertia factor.” The model im-
proves the ﬁt of bond prices over a 3-latent-factor model, especially for short
maturities. A policy rule is identiﬁed from weekly yield data and is found to
provide a good description of the target. In fact, model-based forecasts of
future target rates outperform several benchmarks.


                                      47
Appendices

A     Details for Change of Measure
   Suppose that ξ solves the SDE (9). We ﬁrst state Assumption 2, under
which ξ is a square-integrable P-martingale (Proposition 1).

Assumption 2 (Assumptions on Market Prices of Uncertainty): The
                d        p
processes σξ , Jξ , and Jξ are progressively measurable and satisfy:

 (a) Jξ (t) > −1 and Jξ (t) > −1.
      p               d


      P   d
 (b) Et− Jξ (t) = 0.
                                             T
 (c) Novikov’s Condition: E P exp           0
                                                 σξ (t)σξ (t) dt   < ∞.

                    T
 (d) E P exp 2     0
                                p
                        ln(1 + Jξ (t)) dMp (t)      <∞
                    T
     E P exp 2     0
                                d
                        ln(1 + Jξ (t)) dNd (t)     < ∞.

 (e) Given a Poisson jump time τ , the F (τ −)-conditional distribution of
     Jξ (τ ) is F (0)-measurable.
      p


The integrability conditions A2(c) and A2(d) are easily satisﬁed, for example,
              d    p                                                 d
for constant Jξ , Jξ and σξ , but in this case we would need to set Jξ = 0 from
A2(b), meaning that investors do not demand an uncertainty premium for
jump risk at deterministic jump times. Appendix B provides an example in
        d
which Jξ is stochastic.

Proposition 1 (Martingale Property for ξ):                     Under A2, the solution
ξ to (9) is a positive, square-integrable martingale.

Proof: Applying Ito’s Lemma (for semimartingales, Theorem 33 in Prot-
ter (1990), p. 74) to (9), we get
                                   1
       d ln ξ(t) = −σξ (t) dW (t) − σξ (t)σξ (t) dt
                                   2
                              d                      p
                   + ln 1 + Jξ (t) dNd (t) + ln 1 + Jξ (t) dMp (t).


                                          48
The solution for ξ is well deﬁned, due to A2(a). Moreover, it is exponential
and therefore positive. The proof that ξ is a martingale is by induction over
                                d          d
the deterministic jump times τ1 , . . . , τN . We derive now two intermediate
results.
(R1) At the i-th deterministic jump time τ d , from A2(b),
                  P                  P
                 Eτ d − ξ(τid )   = Eτ d − ξ(τid −) + ξ(τid −)Jξ (τid )
                                                               d
                    i                   i
                                             P
                                  = ξ(τid −)Eτ d − 1 + Jξ (τid )
                                                        d
                                                i

                                  = ξ(τid −).                              (A.21)

(R2) The SDE in (9) has no drift term (W and Mp are martingales), so that,
     for t and s in [τi−1 , τid ) with t ≤ s, we have EtP [ξ(s)] = ξ(t) because of
                      d

     the integrability conditions A2(c) and A2(d).
The process ξ is a martingale for t ∈ [τN , T ] because of (R2). Suppose that
                                           d

for any τid , (R1) holds at τi+1 . We can apply (R2) to get the desired property
                              d

for t ∈ [τi , τi+1 ), and then apply (R1) to obtain this property for τid as well.
           d   d

By induction, ξ is a martingale during [0, T ] ((R2) can also be applied to
     d
[0, τ1 )).
     Due to the exponential form of ξ, square integability is implied from
Novikov’s condition A2(c), the corresponding integrability condition for jump
sizes A2(d), and the fact that W , Mp , and Nd are independent.

    We can now ﬁx T and let dQ/dP = ξ(T ) be the Radon-Nikodym deriva-
tive of Q with respect to P. The next proposition provides a representation
of the dynamics of the state process X under Q. The proposition is stated
for one-dimensional versions of Np and Nd , but is easily extended to the mul-
tidimensional case by attaching subscripts i to all jump sizes and intensities
in the statements (b) and (c), so that they hold for each component of Np
and Nd .

Proposition 2 (“Generalized Girsanov Theorem”):            Suppose that
A2 holds and ﬁx T . Under the probability measure Q with density process
ξ, we have:
 (a) X solves an SDE with drift and diﬀusion coeﬃcients
                    µQ (x, t) = K(t)(¯(t) − x) − σ(x, t)σξ (t) ,
                                       x
                    σ Q (x, t) = σ(x, t).

                                        49
 (b) The counting process Np has a stochastic intensity

                             λQ (t) = λ(t) E P 1 + Jξ (t) .
                                                    p


 (c) For any bounded measurable function h : RN → R,
                                                              p
                  Q      p                P       p
                                                        (1 + Jξ (t))
                E [h(J (t))] = E               h(J (t)) P        p    ,
                                                       E [1 + Jξ (t)]
                                                                  d
                Q        d                P        d
                                                             1 + Jξ (t)
               Et−    h(J (t))       =   Et−   h(J (t))    P       d
                                                                           .
                                                          Et− 1 + Jξ (t)

                                 t
Moreover, W Q (t) = W (t) + 0 σξ (u) du deﬁnes a standard Brownian motion
                                t
under Q, and Mp (t) = Np (t) − 0 λQ (u) du is a compensated Poisson process
                Q

under Q.

Proof: First, we need to construct a Brownian motion and the intensities
of the Poisson processes under Q. For this part of the proof, it is possible to
refer to the relevant results in Duﬃe, Pan, and Singleton (1998, Proposition
4).
    Second, we need to show how to obtain the density of jump sizes at
deterministic jump events under Q. For h(·) bounded and measurable, we
have
                                                                       d
 Q                P                   ξ(t)    P
                                                            ξ(t−) 1 + Jξ (t)
Et− h(J d (t)) = Et− h(J d (t))            = Et− h(J d (t))                    .
                                     ξ(t−)                       ξ(t−)

Because of A2(b), we get the result.

The last proposition shows how to specify market prices of uncertainty, so
that X is a LQJD under both measures, P and Q. For example, from (a)
we see that σ(X(t), t)σξ (t) must be aﬃne in X if the aﬃne drift structure
is to be preserved.

                   d
B     Example for Jξ
   This Appendix provides an example in which the market price of jump
uncertainty at scheduled announcements is non-trivial. For a deterministic

                                          50
jump time τ d , let ∼ N(0, 1) be an F (τ d )-measurable random variable, and
let z = µ+ σ for constants38 µ and σ. Suppose that at time τ d the price of an
asset is F (τ d ) = exp u0 (τ d ) + u1 (τ d )X(τ d ) , for time-dependent coeﬃcients
u0 and u1 , so that JF (τ d ) = exp u1 (τ d )∆X(τ d ) − 1. For simplicity, let
                        d

X be one-dimensional and let its jump size at τ d be J d (τ d ) = z. Now let
Jξ (τ d ) = ξ − 1, where ξ = exp −σξ − 1 (σξ )2 .
  d         ˜             ˜               d
                                               2
                                                     d

                                             ˜
    Assumption A2(a) holds because ξ > 0, while A2(b) is satisﬁed because,
for the deterministic jump time τ d ,
                        P                  P     ˜
                       Eτ d − Jξ (τ d ) = Eτ d − ξ − 1 = 0.
                               d


Condition A2(d) is satisﬁed since Nd does not explode. All other parts of A3
are not needed here. Finally, we need to verify that
                        P                         P
 Q
                       Eτ d − ξ(τ d )JF (τ d )
                                      d
                                                 Eτ d − ξ(τ d −)(1 + Jξ (τ d ))JF (τ d )
                                                                        d       d
Eτ d −   JF (τ d )
          d
                     =                         =
                              ξ(τ d −)                          ξ(τ d −)
                       = E Pd ξJ d (τ d ) = E Pd ξ exp(u1(τ d )z) − 1 = 0.
                              τ −
                                 ˜
                                       F           τ −
                                                       ˜

This is equivalent to
                                       1
                                    µ + σ 2 u1 = σξ σ,
                                                  d
                                       2
                       d
which shows that any σξ solving this last equation can be used to adjust for
uncertainty at deterministic jump times.39


C        Statement of Lemma 1
   Let the times at which deterministic jumps occur between t and T be
         d            d
denoted τ1 , . . . , τn .

Lemma 1:         Suppose that Assumptions 1 and 3 hold under Q. Addition-
ally, for t = τid , for some i, suppose that P (t, T ) = exp (g (X(t), c )) and
                                                                       ¯
  38
     For ease of notation, µ and σ are assumed constant. Everything goes through if they
are bounded functions of time.
  39
     Underlying this example is the following basic result about static changes of measure.
Suppose that ∼ N(0, 1) under the original measure P. Deﬁne ξ = exp(−σξ − 1 (σξ )2 )
                                                                   ˜          d
                                                                                    2
                                                                                       d

                                                      ˜
and the (equivalent) probability measure dQ/dP = ξ. Under Q, ∼ N(−σξ , 1).  d


                                             51
for some c = (¯0 , c1 , c2 ) ∈ C. Then there exist coeﬃcients c ∈ C such that
           ¯    c ¯ ¯
P (t−, T ) = lims↑t P (s, T ) is given by

                        P (t−, T ) = exp (g (X(t−), c )) .                  (C.22)

Proof:      From equation (2),

                         Q
           P (t−, T ) = Et− [P (t, T )]
                         Q
                      = Et− [exp (g (X(t), c ))]
                                           ¯
                                             Q
                      = exp (g (X(t−), c )) Et− [exp (¯1 · ∆X(t))]
                                       ¯               c
                      = exp (g (X(t−), c )) exp (g (X(t−), a(t; c1 )))
                                       ¯                        ¯
                      = exp (g (X(t−), c + a(t; c1 ))) ,
                                       ¯        ¯
         Q
where Et− denotes F (t−)-conditional expectation under Q, and the fourth
equality holds for some a(t; c1 ) ∈ C because of Deﬁnition (A1.c.).
                             ¯


D      Statement of Lemma 2
                                                               d
Lemma 2:             Suppose that, for some i such that s = τi+1 , P (s−, T ) =
limt↑s P (t, T ) can be represented as P (s−, T ) = exp (g (X(s−), c )) for some
                                                                   ¯
c ∈ C. Let Assumptions 1, 3 be satisﬁed under Q. Also suppose that
¯
Assumption 4 below is satisﬁed at (s, c). Then for each t ∈ [τid , s), there
                                           ¯
exist coeﬃcients c(t, s) ∈ C such that

                       P (t, T ) = exp (g (X(t), c(t, s) )) ,              (D.23)

where c( · , s) : [τid , s] → C solves the system of ordinary diﬀerential equations
(ODE’s) in (D.27)-(D.29) stated below, with terminal condition c(s, s) = c.      ¯

Proof:
    Lemma 2 applies the standard Feynman-Kac approach to equation (2)
between deterministic jump times. The approach proceeds in two steps. In a
ﬁrst step, the relevant Cauchy problem is stated and solved. In a second step,
integrability conditions are imposed so that the bond price at time t ∈ [τid , s)
can indeed be viewed as the Feynman-Kac solution to the Cauchy problem
of Step 1.

                                          52
Step 1: Set up and solve the relevant Cauchy Problem.
Consider the following Cauchy problem. For all t ∈ [τid , s) and x ∈ D, let
F (t, s, x) solve the partial diﬀerential-integral equation (PDIE)

   0 = Ft (t, s, x) + Fx (t, s, x) · µ(x, t)                                          (D.24)
       1
     + tr Fx x (t, s, x)σ(x, t)σ(x, t)
       2
          p
     +         g(x, li (t))E Q [F (t, s, x + Jip (t)) − F (t, s, x)] − R(x, t)F (t, s, x),
                     Q

         i=1

with terminal condition F (s, s, x) = exp (g (x, c)).
                                                 ¯

We guess a solution of the form

                              F (t, s, x) = exp (g (x, c(t, s))) ,                   (D.25)

where the coeﬃcients c(t, s) = (c0 (t, s), c1 (t, s), c2(t, s)) satisfy terminal con-
ditions at s given by c = (¯0 , c1 , c2 ). Now we verify that the guess in (D.25)
                      ¯    c ¯ ¯
solves the PDIE (D.24) for all t ∈ [ti , s). By applying Ito’s Lemma to (D.25)
and using the fact that F (t, s, x) is strictly positive, we have

            dc0 (t, s) dc1 (t, s)              dc2 (t, s)
      0=               +            ·x+x                  x                    (D.26)
                dt           dt                   dt
         + (c1 (t, s) + 2 c2(t, s) x) · K Q (t)(¯Q (t) − x)
                                                x
           1
         + tr[ (c1 (t, s) + 2c2 (t, s)x)(c1 (t, s) + 2c2 (t, s)x) + 2c2 (t, s)
           2
          Σ(t) S(x, t)S(x, t) Σ(t) ]
                p
         +           g x, li (t) E Q exp(c1 (t, s) · Jp (t)) − 1 − g (x, δ(t)) ,
                           Q                          i

               i=1

where the coeﬃcients with subscripts are subvectors and submatrices of the
coeﬃcients in equations (4), (5), (6), and (7). This equation must hold for
all x ∈ D, which is assumed to contain an open set, so that we can apply
the usual method of undetermined coeﬃcients which equates the coeﬃcients
of x and the quadratic forms in x to zero. This shows that c(t, s) solves the
ODE’s:


                                               53
       dc0 (t, s)
                  = δ0 (t) − c1 (t, s) K Q (t) xQ (t)
                                               ¯
          dt
                        N
                    1
                  −        [c1 (t, s) Σ(t)]2 si0
                                           i                                 (D.27)
                    2 i=1
                  1
                 − tr[2 c3 (t, s)Σ(t)S(x, t)S(x, t) Σ(t) ]
                  2
                      p
                 −         l0,i (t) E Q exp(c1 (t, s) · Jp (t)) − 1
                            Q                            i

                     i=1


       dc1 (t, s)
                  = δ1 (t) + K Q (t) c1 (t, s) − 2 c2 (t, s)K Q (t) xQ (t)
                                                                    ¯        (D.28)
          dt
                        N
                    1
                  −        [c1 (t, s) Σ(t)]2 si1 (t)
                                           i
                    2 i=1
                 − 2 c2 (t, s) Σ(t)S(x, t)S(x, t) Σ(t) c1 (t, T )
                      p
                 −         l1,i (t) E Q exp(c1 (t, s) · Jp (t)) − 1
                            Q                            i

                     i=1


       dc2 (t, s)
                  = δ2 (t) − c2 (t, s)K Q (t) − K Q (t) c2 (t, s)            (D.29)
          dt
                  − 2 c2 (t, s) Σ(t)S(x, t)S(x, t) Σ(t) c2 (t, s)
                      p
                 −         l2,i (t)E Q exp(c1 (t, s) · Jp (t)) − 1 ,
                            Q                           i

                     i=1

with terminal conditions given by c0 , c1 , and c2 , respectively.
                                  ¯ ¯           ¯


                                             54
Step 2: Here, we impose suﬃcient integrability conditions so that, for t ∈
[τid , s),
                                                  s
          P (t, T ) = EtQ exp −                       R(X(u), u)du     exp (g (X(s−), c))
                                                                                      ¯
                                              t

can be viewed as the Feynmac-Kac solution to the Cauchy problem (D.24).

Assumption 4. (Integrability Conditions):
We say that the integrability conditions hold at (s, c) if
                                                     ¯

   1. c(·, s) : [τid , s] → C uniquely solve equations (D.27)-(D.29) with termi-
      nal conditions c at time s.
                          ¯
                s
   2. E Q      0
                    |γ1 | dt < ∞, for all i = 1, . . . , p, where
                      i

                                                                   Q
         γ1 (t) = Ψ (t−) E Q exp(c1 (t, s) · Jp (t)) − 1 g X(t−), li (t) .
          i                                   i


                 s                          1/2
   3. E Q       0
                     |γ2 (t) · γ2 (t)| dt         < ∞, where
         γ2 (t) = Ψ(t−) c1 (t, s) + 2 X(t−) c2 (t, s) σ(X(t−), t).

   4. E Q (|Ψ(s)|) < ∞,

where Ψ(t) is deﬁned for t ∈ [τid , s] by
                               t
               exp −          0
                                 R(X(u), u)du           exp (g (X(t), c(t, s)))   for t ∈ [τid , s)
Ψ(t) =                         s
               exp −          0
                                 R(X(u), u)du           exp (g (X(s−), c))
                                                                        ¯         for t = s.


Lemma 3: If the integrability conditions hold at (s, c), then Ψ(t) given by
                                                     ¯
(D) is a martingale for t ∈ [τid , s].

Proof:       Applying Ito’s Lemma40 to equation (D) for t ∈ [τid , s] and
using the coeﬃcient calculation (D.27)-(D.29) gives

         dΨ(t) = Ψ(t−) c1 (t, s) + 2 X(t−) c2 (t, s) σ(X(t−), t) dW (t) +
                         p
                    +         Ψ(t−) exp c1 (t, s) · Jp (t) − 1 dMP (t),
                                                     i           i

                        i=0

  40
       See Protter (1990), p. 74.

                                                         55
         i
where MP denotes the i-th compensated Poisson process. Duﬃe, Pan, and
Singleton (2000), p. 26, show that with assumptions 4.1 and 4.2, η2 (t)dW
and Ψ(t−) exp c1 (t, s) · Jp (t) − 1 dMP , for i = 1, . . . , p, are martin-
                                  i        i

gales during the interval [τid , s].


E     Recursive Calculation of c(t, T )
    Here, we provide an algorithm for computing c(t, T ).

Step 0 (Initialization):
The terminal condition for c(t, T ) at T consists of a collection of zeros denoted
by cn+1 in C. Let cn (t, T ) solve the ODE’s in (D.27)-(D.29) during the
    ¯                    ˜
           d                                                   d
interval [τn , T ] with terminal condition cn+1 , and deﬁne τn+1 = T .
                                           ¯
Go to Step 1.

Step i, for i = 1, . . . , n :


    • Calculate the new terminal condition for time τn+1−i as
                                                     d


                       d        d               d                d        d
      cn+1−i = cn+1−i τn+1−i , τn+2−i + cn+1−i τn+1−i , cn+1−i (τn+1−i , τn+2−i ) ,
      ¯        ˜                                        ˜

      where cn+1−i τn+1−i , cn+1−i (τn+1−i , τn+2−i ) ∈ C is taken from equation
                     d
                            ˜        d        d
                             d
      (7) evaluated at t = τn+1−i .

    • For a given terminal condition cn+1−i ∈ C, let cn−i(t, T ) solve the
                                      ¯                    ˜
                                                   d      d
      ODE’s in (D.27)-(D.29) during the interval [τn−i , τn+1−i ], with terminal
      condition cn+1−i .
                ¯

Stop if i = n. Go to Step i + 1.

Coeﬃcient Collection:
The coeﬃcients c(t, T ) are then equal to ci (t, T ) for any t ∈ (τid , τi+1 ) and
                                          ˜                              d

equal to ci at any t = τid .
         ¯


                                       56
F     Proof of Proposition 3
    Under condition A2, the solution ξ of (9) is a square-integrable martingale
by Proposition 1, stated in Appendix A. As ξ is the density process for Q,
the state-price density (or pricing kernel) process π is deﬁned by π(t) =
ξ(t)/F 0(t). For the equivalent measure deﬁned by ξ to be a martingale
measure, it suﬃces that, for each i, πF i is a P-martingale. The SDE solved
by π is given by

      dπ(t)
            = −r(t)dt − σξ (t) dW (t) + Jξ (t)dNd (t) + Jξ (t)dMp (t).
                                         d               p
      π(t−)

Using integration by parts (Corollary 2 in Protter (1990), page 60), we get

 d (F i (t)π(t))                                    d                 p
                 = µF i (t)dt + σF i (t) dW (t) + JF i (t) dNd (t) + JF i (t) dNp (t)
(F i (t−)π(t−))
                   −r(t)dt − σξ (t) dW (t) + Jξ (t) dNd (t) + Jξ (t) (dNp (t) − λ(t)dt)
                                                  d                 p


                     −σF i (t)σξ (t)dt + JF i (t)Jξ (t) dNd (t) + JF i (t)Jξ (t) dNp (t).
                                          d       d                p       p


The process πFi is a P-local martingale if and only if (i) this SDE has a
zero drift and (ii) the F (t−)-conditional expected value of ∆(π(t)ξ(t)) at
a deterministic jump time t is zero. Collecting “dt”-terms for condition (i)
and “dNd ”-terms for condition (ii) results in the two equations stated in
Proposition 3 (because of A2(b)). As F i/F 0 and ξ are both assumed to be
square-integrable, we get that πFi is in fact a P-martingale.


G      Simulated Maximum Likelihood with Jumps
    Suppose X contains the target rate θ, modeled as the (observable) dif-
ference of the “up” and “down” counting processes, with state-and time-
dependent intensities as in (12). We abstract for the moment from the time
dependence of stochastic intensities introduced by FOMC meetings, assum-
                                                                        ˜
ing that these intensities are always “active.” Starting from x at time t, we
                                                              ˜
can simulate X with the scheme
                                        √
            ˆ˜            ˜                     ˜
          ∆Xtx = µ(Xt−h , t − h) h + h σ(Xt−h , t − h) t + JtX zt (G.30)
                         ˆx                    ˆx
            ˆx˜
            X˜ = x,
              t      ˜


                                        57
where t is i.i.d. standard normal and J X is the deterministic jump41 in X at
random times, determined by a 2-dimensional vector of Bernoulli variables zt
that determine jumps, up and down. Using the conditional independence of
the counting processes N U and N D , and assuming that the econometrician
observes only the diﬀerence of the two, the simulation rolls a “three-sided die”
to determine zt . The three sides are “up” (“U”, meaning θt − θt−h = Jθ ),
“down” (“D”, meaning θt − θt−h = −Jθ ), and “no change” (“0”, meaning
θt = θt−h ). Their conditional probabilities at time t are approximately

                              λU h (1 − λD h), for j = U,
                   pj =        t−h       t−h
                    h,t       λD h (1 − λU h), for j = D,
                               t−h       t−h


and p0 = pU pD + (1 − pU )(1 − pD ).
     h,t   h,t h,t        h,t       h,t
   We write X θ for all variables in X other than the target θ. The Monte-
Carlo approximation of the conditional density is
                             S
                        1                                     ˜
                                                             ˆx
  fX (X(t), t | x, t) ≈
                ˜ ˜                         φ(Xtθ , t | θt , Xt−h [s], t − h) pi [s]1i,t [s],
                                                                              ˆh,t
                        S   s=1 i={U,D,0}
                                                                                       (G.31)

                   ˜
where φ( · , t | Xt−h[s], t−h) is the Gaussian density of Xt at time t conditional
                 ˆx
                  ˜                          ˜
on the value Xt−h [s] at time t − h, Xt−h [s] denotes the s-th simulated path
                 ˆx                         ˆx
from the scheme (G.30), 1i,t [s] is the indicator for the i-th side of the die at
                                                                      ˆx˜
time t in the s-th simulation, and pi [s] is constructed using Xt−h [s]. Let
                                          ˆh,t
ˆx˜                                     ˆx˜                            ˜
                                                                      ˆx
θt−h be the target component of Xt−h . If the simulated target θt−h at time
t − h cannot reach the observed time t-value of target in at most one jump,
that simulation is assigned zero likelihood.
    We now turn to a case with time-dependent intensities that is relevant
in Section 5. In that application, policy interventions on meeting days are
modeled by activating state-dependent Poisson intensities only during FOMC
meeting days. More precisely, suppose the i-th meeting day is during the
          ˜
interval [tM (i), tM (i)]. It is straightforward to modify the simulation scheme
(G.30) to allow jumps only during such meeting-day intervals. We refer to
                                                                          ˆ˜
the s-th path drawn from this modiﬁed scheme in what follows as Xtx [s]. We
now construct analogues of (G.31) for this time-dependent case. As long as
  41
    The notation goes through with Gaussian jumps J X , but needs to be adjusted in the
case of other jumps size distributions.


                                             58
the observation time t lies within a meeting-day interval, in that tM (i) ≤
                                                                     ˜
t < tM (i), the approximation (G.31) itself still applies. If the observation
time t is made outside an FOMC meeting, however, then one might want
to replace the Bernoulli-density terms with an indicator function for sample
paths leading up to the actual value of the target at t,
                                     S
                             1                      ˜
                                                   ˆx
        fX (Xt , t | x, t) ≈
                     ˜ ˜                 φ Xtθ , t|Xt−h [s], t − h 1θt =θx
                                                                        ˆ˜                  .   (G.32)
                             S                                                    t−h [s]
                                  s=1

    In (G.32), jumps in the target enter the SML objective function only
                                                            ˆx˜
through the indicator function and the simulated values Xt−h . This creates
a serious problem when maximizing the objective: For a given (ﬁnite) number
S of simulations, a small change in the parameter vector does not necessarily
aﬀect the average number of jumps across simulations and may thus leave
the value of the likelihood function unchanged. Only changes in a parameter
that are large enough to aﬀect the number of simulated jumps change the
objective function, but possibly by a large amount.42 In order to overcome
this discontinuity, an alternative to (G.32) is constructed as follows. The
joint conditional density of factors can be written in the form

            fX Xt , t | x, t = fθ θt , t | x, t fX θ |θ Xtθ , t | θt, x, t .
                        ˜ ˜                ˜ ˜                        ˜ ˜                       (G.33)

The ﬁrst term of equation (G.33) can be approximated by
                                              S
                                   1                           ˆ˜
            fθ   θt , t | x, t
                          ˜ ˜    ≈                 fθ θt , t | Xtx (i)−h [s], tM (i) − h
                                   S         s=1
                                                                 M


                                              S
                                         1
                                 ≈                         pi M (i) [s]1i,t [s]
                                                           ˆh,t
                                         S   s=1 i=U,D,0

                                 ≈ S/S,
                                   ¯
   42
      A similar issue is encountered by Anderson, Benzoni, and Lund (1999) who estimate
a jump-diﬀusion model for equity returns with EMM. In their speciﬁcation, the size of the
jump is Gaussian and the occurrence of jumps is not observed, so that they smooth the
mapping from parameters to the estimation’s objective function by allowing for partial
jumps. In the setup considered in this paper, the target only moves in observable 25 basis
points increments, so that a simulation of partial jumps is not feasible. A conjecture is that
the method proposed in the following can also be applied with EMM, since the eﬃciency
results of Gallant and Tauchen (1996) do not rely on a particular leading density.


                                                     59
        ¯
where S denotes the total number of simulated paths that resulted in the
                                ¯
observed value θ(t). In words, S/S is the frequency of “correctly simulated”
target rates in the simulations (starting with x = Xt ), while the expression
                                               ˜     ˜
in the ﬁrst row weights the simulated paths by their likelihoods. In practice,
with time-dependent intensities h must be chosen carefully, as intensities can
become large during an FOMC meeting. Details about the choice of h can
be found in Appendix H.
    The second term in (G.33) can be approximated by

                                         fX Xt , t | x, t
                                                       ˜ ˜
       fX θ |θ (Xtθ , t | θt , x, t) =
                               ˜ ˜
                                          fθ θt , t | x, t
                                                      ˜ ˜
                                             S
                                     1                         ˜
                                                              ˆx
                                   ≈ ¯            φ Xtθ , t | Xt−h [s], t − h   1 θt = θ x
                                                                                       ˆ˜         .
                                     S                                                  t−h [s]
                                            s=1

   Variance-reduction techniques can improve the eﬃciency of the Monte
Carlo integration (see Geweke (1996)). Here, antithetic variates are used in
simulating the paths of the state vector. That is, with each new pseudo-
random Gaussian [s] and uniform u[s], the antithetic variates − [s] and
1 − u[s] are used as a subsequent scenario.


H        Simulation of the Target
    The highest value that is reached by the intensities λU and λD in a typ-
ical estimated model43 is 1225. At this value, the next Figure shows that
a Bernoulli approximation that allows for only one jump during an FOMC
                                                                                 1
meeting is not accurate. We can see that the Bernoulli density for h = 365
overstates the true probability of one jump. If we increase the number of
                                                              1 1
Bernoulli trials during an FOMC meeting so that h ≥ 30 365 , the Bernoulli
approximation becomes accurate. To economize on the number of simulated
steps (and thereby the computation time for the likelihood evaluation), the
FOMC meeting day is divided into Ms +1 intervals, where Ms is a number di-
visable by 5. During 5 subintervals [ti , ti+1 ] of length h = Ms Ms1+1 365 , jumps
                                                                5
                                                                         1

are drawn from a Poisson distribution with constant parameter λj (t − h)h
by truncating the distribution at Ms jumps. In the last subinterval of length
                                   5
  43
    The value is taken from λU in the unconstrained SVλ speciﬁcation introduced in
Section 5.


                                                   60
                          1
                        0.9
                        0.8 M = 1
                        0.7
                        0.6
                        0.5
                        0.4
                            M =5
                        0.3
                        0.2
                        0.1       M = 30
                          0 2 4 6 8 10 12 14 16 18 20
                          0
     Approximation of a Poisson density (solid line) with λ = 1225 for daily data (t −
     t = 1/365) with Bernoulli trials with success probability p = 1 − exp(−λh), with
     ˜
          1 1
     h = M 365 , for diﬀerent choices of M .


h = Ms1+1 , a Bernoulli discretization is applied. This approximation proce-
dure is equivalent to 31 Bernoulli trials (with appropriately chosen success
probability). In the body of the paper, a choice of Ms = 30 is called ‘subdi-
viding the FOMC meeting into 30 intervals.’


I       SML with Macro Variables
    The state vector X is now augmented with the macro-related information
M(t) = (m(t), mF (t)), and we write X (θ,M ) for the vector consisting of all
coordinates of the state X except θ and M. Analogous to the decomposition
(G.33), we can write the density of X conditional on the last observation
Xt = x in the form
  ˜

                                                                   (θ,M )
    fX (Xt , t | x, ˜) = fθ,M (θt , Mt , t | x, t)fX (θ,M ) |θ,M (Xt
                    t                           ˜                           , t | θt , Mt , x, t). (I.34)
                                                                                               ˜

The ﬁrst term in (I.34) can be written as

               fθ,M (θt , Mt , t | x, ˜) = fM (Mt , t | x, ˜)fθ (θt , t|x, Mt , t).
                                      t                    t                    ˜

If an FOMC meeting and a macro jump event happen on the same day, the
Fed’s target decisions are able to condition on the newly released information,

                                                  61
as CPI and NPE releases (at 8:30 a.m.) precede FOMC meetings. Since each
                           ˜
observation interval [t, t] is chosen so that it does not contain more than one
                                                                ˜
FOMC meeting, the density of M conditional on X(t) depends only on θ(t).                    ˜
Moreover, [t  ˜, t] is short enough so as to not contain the release of the current
month’s macro variable together with the analyst survey of forecasts for the
next month. For the i-th macroeconomic release, we therefore have
                       
                        fmF (mF (t), t | mF (t), θt , t),
                                                ˜ ˜˜                              if τF ∈ [t, t],
                                                                                       i       ˜
fM (Mt , t | x, t
                ˜) =      fm (m(t), t | mF (t
                                            ˜), t),
                                                ˜                                 if τ ∈ [t
                                                                                       i      ˜, t],
                       
                          fmF (mF (t) | mF (t ˜
                                             ˜), θt , t)fm (m(t), t | mF (t), t), if τ i , τF ∈ [t, t].
                                                      ˜                   ˜ ˜               i      ˜


                                              62
References
 Ahn, D.-H., R. F. Dittmar, and A. R. Gallant (1999). The extended saints
   model of the term structure of interest rates: Theory and evidence.
   Kenan-Flagler Business School, University of North Carolina.
 Ait-Sahalia, Y. (1996). Testing continuous-time models of the spot interest
    rate. Review of Financial Studies 9, 385–42.
 Anderson, E., L. P. Hansen, and T. J. Sargent (2000). Robustness, detec-
   tion, and the price of risk. Working Paper, Stanford University.
 Anderson, T. G., L. Benzoni, and J. Lund (1999). Estimating jump-
   diﬀusions for equity returns. J.L. Kellogg Graduate School of Man-
   agement, Northwestern University.
 Anderson, T. G. and J. Lund (1996). Stochastic volatility and mean drift
   in the short rate diﬀusion: Sources of steepness, level and curvature in
   the yield curve. J.L. Kellogg Graduate School of Management, North-
   western University.
 Ang, A. and G. Bekaert (1998). Regime switches in interest rates. Working
   Paper, Stanford University.
 Babbs, S. H. and N. J. Webber (1993). A theory of the term structure
    with an oﬃcial interest rate. Working Paper, Warwick Business School,
    Warwick University.
 Babbs, S. H. and N. J. Webber (1996). Term structure modeling under al-
    ternative oﬃcial regimes. In Dempster and Pliska (Eds.), Mathematics
    of Derivative Securities. Cambridge University Press.
 Balduzzi, P., G. Bertola, and S. Foresi (1996). A model of target changes
    and the term structure of interest rates. Journal of Monetary Econ-
    omy 39, 223–49.
 Balduzzi, P., S. R. Das, S. Foresi, and R. K. Sundaram (1996). A simple
    approach to three factor aﬃne term structure models. Journal of Fixed
    Income 6, 43–53.
 Balduzzi, P., E. J. Elton, and T. C. Green (1998). Economic news and the
    yield curve: Evidence from the u.s. treasury market. Working Paper,
    Boston College.
 Berardi, A. (1998). Term structure, non-neutral inﬂation and economic
    growth: a three factor model. Working Paper, London Business School.

                                    63
Boudoukh, J., M. Richardson, R. Stanton, and R. F. Whitelaw (1998). The
   stochastic behavior of interest rates: Implications from a nonlinear,
   continuous-time, multifactor model. Working Paper, Haas School of
   Business, UC Berkeley.
Brandt, M. and P. Santa-Clara (1999). Simulated likelihood estimation
   of diﬀusions with an application to exchange rate dynamics. Working
   Paper, The Wharton School.
Br´maud, P. (1981). Point processes and queues, martingale dynamics.
  e
   Berlin: Springer Verlag.
Buraschi, A. and A. Jiltsov (1999). How large is the inﬂation risk premium
   in the u.s. nominal term structure? Working Paper, London Business
   School.
Campbell, J. Y. and L. M. Viceira (2000). Who should buy long term
  bonds? American Economic Review, forthcoming.
Chen, Z. and L. Epstein (1999). Ambiguity, risk and asset returns in con-
   tinuous time. Working Paper, University of Rochester.
Christiano, L. J., M. Eichenbaum, and C. L. Evans (1998). Monetary
   policy shocks: What have we learned and to what end? prepared for
   the Handbook of Macroeconomics.
Christiansen, C. (1999). Macroeconomic announcement eﬀects on the co-
   variance structure of bond returns. Working Paper, The Aarhus School
   of Business, Denmark.
Cochrane, J. (1997). Where is the market going? uncertain facts and novel
   theories. NBER Working Paper 6207.
Collin-Dufresne, P. and B. Solnik (1997). On the term structure of default
   premia in the swap and libor markets. Working Paper, Carnegie-Mellon
   University.
Constantinides, G. (1992). A theory of the nominal term structure of in-
  terest rates. Review of Financial Studies 5, 531–52.
Cox, J. C., J. E. Ingersoll, and S. A. Ross (1985). A theory of the term
   structure of interest rates. Econometrica 53, 385–408.
Dai, Q. and K. Singleton (2000). Speciﬁcation analysis of aﬃne term struc-
   ture models. Journal of Finance, forthcoming.


                                  64
Das, S. R. (1998). Poisson-gaussian processes and the bond markets. NBER
   Working Paper 6631.
Delbaen, F. and W. Schachermayer (1994). A general version of the fun-
   damental theorem of asset pricing. Math. Ann. 300, 463–520.
Duﬀee, G. R. (1996). Idiosyncratic variation of treasury bill yields. Journal
  of Finance 51, 527–551.
Duﬀee, G. R. (1999). Forecasting future interest rates: Are aﬃne models
  failures? Working Paper, Haas School of Business, UC Berkeley.
Duﬃe, D. (1996). Special repo rates. Journal of Finance 51, 493–526.
Duﬃe, D. and M. Huang (1996). Swap rates and credit quality. Journal of
  Finance 51, 921–950.
Duﬃe, D. and R. Kan (1996). A yield-factor model of interest rates. Math-
  ematical Finance 6, 379–406.
Duﬃe, D. and J. Liu (2000). Floating-ﬁxed credit spreads. Financial An-
  alysts Journal, forthcoming.
Duﬃe, D., J. Pan, and K. Singleton (2000). Transform analysis and option
  pricing for aﬃne jump-diﬀusions. Econometrica, forthcoming.
Duﬃe, D. and K. Singleton (1993). Simulated moments estimation of
  markov models of asset prices. Econometrica 61, 929–952.
Duﬃe, D. and K. Singleton (1997). An econometric model of the term
  structure of interest rate swap yields. Journal of Finance 52, 1287–
  1323.
Duﬃe, D. and W. Zame (1989). The consumption-based capital asset pric-
  ing model. Econometrica 57, 1279–97.
El Karoui, N., R. Myneni, and R. Viswanathan (1993). Arbitrage pricing
   and hedging of interest rate claims with state variables. Working Paper,
   Universit´ de Paris VI, Laboratoire de Probabilit´’.
            e                                          e
Evans, C. L. and D. Marshall (1998). Monetary policy and the term struc-
   ture of nominal interest rates: Evidence and theory. Carnegie-Rochester
   Conference on Public Policy 49, 53–112.
Farnsworth, H. and R. Bass (1998). The term structure with credible and
   semi-credible targeting. Working Paper, Washington University in St.
   Louis.

                                    65
Fleming, J. M. and E. M. Remolona (1997). What moves the bond market?
   FRBNY Economic Policy Review , 31–50.
Fleming, J. M. and E. M. Remolona (1999). The term structure of an-
   nouncement eﬀects. Working Paper, Federal Reserve Bank of New
   York.
Gallant, A. R. and G. Tauchen (1996). Which moments to match? Econo-
   metric Theory 12, 657–681.
Geweke, J. (1996). Monte carlo simulation and numerical integration. In
  Amman, H. M. Kendrick, D. A., and J. Rust (Eds.), Handbook of Com-
  putational Economics. Amsterdam: Elsevier Science, North-Holland.
Hamilton, J. and O. Jorda (1998). The stance of monetary policy: The
  federal funds rate target? searching in a new direction. Working Paper,
  San Diego.
Hamilton, J. D. (1996). The daily market for federal funds. Journal of
  Political Economy 104, 26–56.
Hansen, L. P. and R. Jagannathan (1991). Implications of security mar-
  ket data for models of dynamic economies. Journal of Political Econ-
  omy 99, 225–62.
Harrison, M. and D. Kreps (1979). Martingales and arbitrage in multi-
   period securities markets. Journal of Economic Theory 20, 381–408.
Harrison, M. and S. Pliska (1981). Martingales and stochastic integrals
   in the theory of continuous trading. Stochastic Processes and Their
   Applications 11, 215–260.
Honor´, P. (1997). Modelling interest rate dynamics in a corridor with
     e
  jump processes. Working Paper, Aarhus School of Business.
Johannes, M. (1999). A nonparametric jump-diﬀusion model of the short
   rate. Working Paper, University of Chicago.
Jones, C. M., O. Lamont, and R. Lamsdaine (1996). Macroeconomic news
   and bond market volatility. Journal of Financial Economics 47, 315–
   337.
Konstantinov, V. (1999). Fed funds rate targeting, monetary regimes and
  the term structure of interbank rates. Working Paper, Harvard Univer-
  sity.


                                  66
Li, L. and R. F. Engle (1998). Macroeconomic announcements and volatil-
    ity of treasury futures. Working Paper No. 98-27, UC San Diego.
Litterman, R., J. Scheinkman, and L. Weiss (1988). Volatility and the yield
    curve. Working Paper, Goldman & Sachs, Financial Strategies Group.
Liu, J. (1999). Generalized method of moments estimation of aﬃne diﬀu-
   sion processes. Working Paper, Graduate School of Business, Stanford
   University.
Lo, A. (1988). Maximum likelihood estimation of generalized ito processes
   with discretely sampled date. Journal of Econometric Theory 4, 231–
   247.
Lucas, R. (1978). Asset prices in an exchange economy. Econometrica 46,
   1429–1445.
Merton, R. (1993a). An intertemporal capital asset pricing model. Econo-
  metrica 41, 867–888.
Merton, R. (1993b). On the pricing of corporate debt: The risk structure
  of interest rates. Journal of Finance 29, 449–470.
Meulendyke, A.-M. (1998). U.S. Monetary Policy & Financial Markets.
  New York: Federal Reserve Bank of New York.
Pan, J. (1999). Integrated time-series analysis of spot and options prices.
   Working Paper, Graduate School of Business, Stanford University.
Pearson, N. D. and T. Sun (1994). Exploiting the conditional density in
   estimating the term structure: An application to the cox, ingersoll, and
   ross model. Journal of Finance XLIX, 1279–1304.
Pedersen, A. R. (1995). A new approach to maximum likelihood estima-
   tion for stochastic diﬀerential equations based on discrete observations.
   Scandinavian Journal of Statistics 22, 55–71.
Pennacchi, G. G. (1991). Identifying the dynamics of real interest rates and
   inﬂation: Evidence using survey data. Review of Financial Studies 4,
   53–86.
Protter, P. (1990). Stochastic Integration and Diﬀerential Equations. New
   York: Springer-Verlag.
Rudebusch, G. D. (1996). Federal reserve interest rate targeting, ratio-
  nal expectations, and the term structure. Journal of Monetary Eco-
  nomics 35, 245–74.

                                   67
 Sack, B. (1998). Does the fed act gradually? a var analysis. Working Paper,
    Board of Governors of the Federal Reserve System.
 Sargent, T. J. (1979). A note on maximum likelihood estimation of the
    rational expectations model of the term structure. Journal of Monetary
    Economics 5, 133–43.
 Sims, C. (1999). Drift and breaks in monetary policy. Working Paper,
    Princeton University.
 Singleton, K. (1999). Estimation of aﬃne asset pricing models using the
    empirical characteristic function. Working Paper, Graduate School of
    Business, Stanford University University.
 Taylor, J. B. (1993). Discretion versus policy rules in practice. Carnegie-
    Rochester Conference Series on Public Policy 39, 195–214.
 Taylor, J. B. (Ed.) (1999). Monetary Policy Rules. Chicago University
    Press.
 Thornton, D. L. (1997). The other change in fed procedure. M onetary
   Trends, Federal Reserve of St. Louis.
 Vasicek, O. (1977). An equilibrium characterization of the term structure.
    Journal of Financial Economics 5, 177–188.
 Woodford, M. (1999). Optimal monetary policy inertia. NBER Working
   Paper 7261.


References
 Ahn, D.-H., R. F. Dittmar, and A. R. Gallant (1999). The extended saints
   model of the term structure of interest rates: Theory and evidence.
   Kenan-Flagler Business School, University of North Carolina.
 Ait-Sahalia, Y. (1996). Testing continuous-time models of the spot interest
    rate. Review of Financial Studies 9, 385–42.
 Anderson, E., L. P. Hansen, and T. J. Sargent (2000). Robustness, detec-
   tion, and the price of risk. Working Paper, Stanford University.
 Anderson, T. G., L. Benzoni, and J. Lund (1999). Estimating jump-
   diﬀusions for equity returns. J.L. Kellogg Graduate School of Man-
   agement, Northwestern University.


                                    68
Anderson, T. G. and J. Lund (1996). Stochastic volatility and mean drift
  in the short rate diﬀusion: Sources of steepness, level and curvature in
  the yield curve. J.L. Kellogg Graduate School of Management, North-
  western University.
Ang, A. and G. Bekaert (1998). Regime switches in interest rates. Working
  Paper, Stanford University.
Babbs, S. H. and N. J. Webber (1993). A theory of the term structure
   with an oﬃcial interest rate. Working Paper, Warwick Business School,
   Warwick University.
Babbs, S. H. and N. J. Webber (1996). Term structure modeling under al-
   ternative oﬃcial regimes. In Dempster and Pliska (Eds.), Mathematics
   of Derivative Securities. Cambridge University Press.
Balduzzi, P., G. Bertola, and S. Foresi (1996). A model of target changes
   and the term structure of interest rates. Journal of Monetary Econ-
   omy 39, 223–49.
Balduzzi, P., S. R. Das, S. Foresi, and R. K. Sundaram (1996). A simple
   approach to three factor aﬃne term structure models. Journal of Fixed
   Income 6, 43–53.
Balduzzi, P., E. J. Elton, and T. C. Green (1998). Economic news and the
   yield curve: Evidence from the u.s. treasury market. Working Paper,
   Boston College.
Berardi, A. (1998). Term structure, non-neutral inﬂation and economic
   growth: a three factor model. Working Paper, London Business School.
Boudoukh, J., M. Richardson, R. Stanton, and R. F. Whitelaw (1998). The
   stochastic behavior of interest rates: Implications from a nonlinear,
   continuous-time, multifactor model. Working Paper, Haas School of
   Business, UC Berkeley.
Brandt, M. and P. Santa-Clara (1999). Simulated likelihood estimation
   of diﬀusions with an application to exchange rate dynamics. Working
   Paper, The Wharton School.
Br´maud, P. (1981). Point processes and queues, martingale dynamics.
  e
   Berlin: Springer Verlag.
Buraschi, A. and A. Jiltsov (1999). How large is the inﬂation risk premium
   in the u.s. nominal term structure? Working Paper, London Business

                                  69
   School.
Campbell, J. Y. and L. M. Viceira (2000). Who should buy long term
  bonds? American Economic Review, forthcoming.
Chen, Z. and L. Epstein (1999). Ambiguity, risk and asset returns in con-
   tinuous time. Working Paper, University of Rochester.
Christiano, L. J., M. Eichenbaum, and C. L. Evans (1998). Monetary
   policy shocks: What have we learned and to what end? prepared for
   the Handbook of Macroeconomics.
Christiansen, C. (1999). Macroeconomic announcement eﬀects on the co-
   variance structure of bond returns. Working Paper, The Aarhus School
   of Business, Denmark.
Cochrane, J. (1997). Where is the market going? uncertain facts and novel
   theories. NBER Working Paper 6207.
Collin-Dufresne, P. and B. Solnik (1997). On the term structure of default
   premia in the swap and libor markets. Working Paper, Carnegie-Mellon
   University.
Constantinides, G. (1992). A theory of the nominal term structure of in-
  terest rates. Review of Financial Studies 5, 531–52.
Cox, J. C., J. E. Ingersoll, and S. A. Ross (1985). A theory of the term
   structure of interest rates. Econometrica 53, 385–408.
Dai, Q. and K. Singleton (2000). Speciﬁcation analysis of aﬃne term struc-
   ture models. Journal of Finance, forthcoming.
Das, S. R. (1998). Poisson-gaussian processes and the bond markets. NBER
   Working Paper 6631.
Delbaen, F. and W. Schachermayer (1994). A general version of the fun-
   damental theorem of asset pricing. Math. Ann. 300, 463–520.
Duﬀee, G. R. (1996). Idiosyncratic variation of treasury bill yields. Journal
  of Finance 51, 527–551.
Duﬀee, G. R. (1999). Forecasting future interest rates: Are aﬃne models
  failures? Working Paper, Haas School of Business, UC Berkeley.
Duﬃe, D. (1996). Special repo rates. Journal of Finance 51, 493–526.
Duﬃe, D. and M. Huang (1996). Swap rates and credit quality. Journal of
  Finance 51, 921–950.

                                    70
Duﬃe, D. and R. Kan (1996). A yield-factor model of interest rates. Math-
  ematical Finance 6, 379–406.
Duﬃe, D. and J. Liu (2000). Floating-ﬁxed credit spreads. Financial An-
  alysts Journal, forthcoming.
Duﬃe, D., J. Pan, and K. Singleton (2000). Transform analysis and option
  pricing for aﬃne jump-diﬀusions. Econometrica, forthcoming.
Duﬃe, D. and K. Singleton (1993). Simulated moments estimation of
  markov models of asset prices. Econometrica 61, 929–952.
Duﬃe, D. and K. Singleton (1997). An econometric model of the term
  structure of interest rate swap yields. Journal of Finance 52, 1287–
  1323.
Duﬃe, D. and W. Zame (1989). The consumption-based capital asset pric-
  ing model. Econometrica 57, 1279–97.
El Karoui, N., R. Myneni, and R. Viswanathan (1993). Arbitrage pricing
   and hedging of interest rate claims with state variables. Working Paper,
   Universit´ de Paris VI, Laboratoire de Probabilit´’.
            e                                          e
Evans, C. L. and D. Marshall (1998). Monetary policy and the term struc-
   ture of nominal interest rates: Evidence and theory. Carnegie-Rochester
   Conference on Public Policy 49, 53–112.
Farnsworth, H. and R. Bass (1998). The term structure with credible and
   semi-credible targeting. Working Paper, Washington University in St.
   Louis.
Fleming, J. M. and E. M. Remolona (1997). What moves the bond market?
   FRBNY Economic Policy Review , 31–50.
Fleming, J. M. and E. M. Remolona (1999). The term structure of an-
   nouncement eﬀects. Working Paper, Federal Reserve Bank of New
   York.
Gallant, A. R. and G. Tauchen (1996). Which moments to match? Econo-
   metric Theory 12, 657–681.
Geweke, J. (1996). Monte carlo simulation and numerical integration. In
  Amman, H. M. Kendrick, D. A., and J. Rust (Eds.), Handbook of Com-
  putational Economics. Amsterdam: Elsevier Science, North-Holland.


                                   71
Hamilton, J. and O. Jorda (1998). The stance of monetary policy: The
  federal funds rate target? searching in a new direction. Working Paper,
  San Diego.
Hamilton, J. D. (1996). The daily market for federal funds. Journal of
  Political Economy 104, 26–56.
Hansen, L. P. and R. Jagannathan (1991). Implications of security mar-
  ket data for models of dynamic economies. Journal of Political Econ-
  omy 99, 225–62.
Harrison, M. and D. Kreps (1979). Martingales and arbitrage in multi-
   period securities markets. Journal of Economic Theory 20, 381–408.
Harrison, M. and S. Pliska (1981). Martingales and stochastic integrals
   in the theory of continuous trading. Stochastic Processes and Their
   Applications 11, 215–260.
Honor´, P. (1997). Modelling interest rate dynamics in a corridor with
     e
  jump processes. Working Paper, Aarhus School of Business.
Johannes, M. (1999). A nonparametric jump-diﬀusion model of the short
   rate. Working Paper, University of Chicago.
Jones, C. M., O. Lamont, and R. Lamsdaine (1996). Macroeconomic news
   and bond market volatility. Journal of Financial Economics 47, 315–
   337.
Konstantinov, V. (1999). Fed funds rate targeting, monetary regimes and
  the term structure of interbank rates. Working Paper, Harvard Univer-
  sity.
Li, L. and R. F. Engle (1998). Macroeconomic announcements and volatil-
    ity of treasury futures. Working Paper No. 98-27, UC San Diego.
Litterman, R., J. Scheinkman, and L. Weiss (1988). Volatility and the yield
    curve. Working Paper, Goldman & Sachs, Financial Strategies Group.
Liu, J. (1999). Generalized method of moments estimation of aﬃne diﬀu-
   sion processes. Working Paper, Graduate School of Business, Stanford
   University.
Lo, A. (1988). Maximum likelihood estimation of generalized ito processes
   with discretely sampled date. Journal of Econometric Theory 4, 231–
   247.


                                   72
Lucas, R. (1978). Asset prices in an exchange economy. Econometrica 46,
   1429–1445.
Merton, R. (1993a). An intertemporal capital asset pricing model. Econo-
  metrica 41, 867–888.
Merton, R. (1993b). On the pricing of corporate debt: The risk structure
  of interest rates. Journal of Finance 29, 449–470.
Meulendyke, A.-M. (1998). U.S. Monetary Policy & Financial Markets.
  New York: Federal Reserve Bank of New York.
Pan, J. (1999). Integrated time-series analysis of spot and options prices.
   Working Paper, Graduate School of Business, Stanford University.
Pearson, N. D. and T. Sun (1994). Exploiting the conditional density in
   estimating the term structure: An application to the cox, ingersoll, and
   ross model. Journal of Finance XLIX, 1279–1304.
Pedersen, A. R. (1995). A new approach to maximum likelihood estima-
   tion for stochastic diﬀerential equations based on discrete observations.
   Scandinavian Journal of Statistics 22, 55–71.
Pennacchi, G. G. (1991). Identifying the dynamics of real interest rates and
   inﬂation: Evidence using survey data. Review of Financial Studies 4,
   53–86.
Protter, P. (1990). Stochastic Integration and Diﬀerential Equations. New
   York: Springer-Verlag.
Rudebusch, G. D. (1996). Federal reserve interest rate targeting, ratio-
  nal expectations, and the term structure. Journal of Monetary Eco-
  nomics 35, 245–74.
Sack, B. (1998). Does the fed act gradually? a var analysis. Working Paper,
   Board of Governors of the Federal Reserve System.
Sargent, T. J. (1979). A note on maximum likelihood estimation of the
   rational expectations model of the term structure. Journal of Monetary
   Economics 5, 133–43.
Sims, C. (1999). Drift and breaks in monetary policy. Working Paper,
   Princeton University.
Singleton, K. (1999). Estimation of aﬃne asset pricing models using the
   empirical characteristic function. Working Paper, Graduate School of
   Business, Stanford University University.

                                   73
Taylor, J. B. (1993). Discretion versus policy rules in practice. Carnegie-
   Rochester Conference Series on Public Policy 39, 195–214.
Taylor, J. B. (Ed.) (1999). Monetary Policy Rules. Chicago University
   Press.
Thornton, D. L. (1997). The other change in fed procedure. M onetary
  Trends, Federal Reserve of St. Louis.
Vasicek, O. (1977). An equilibrium characterization of the term structure.
   Journal of Financial Economics 5, 177–188.
Woodford, M. (1999). Optimal monetary policy inertia. NBER Working
  Paper 7261.


                                   74
                                        Table 5: Estimation Results for Base Models
     Parameter                λ Model                                SV Model                              SVλ Model
                         Constr.         Unconstr.           Constr.         Unconstr.             Constr.           Unconstr.
          κs            1.277306          1.556912          2.861167          3.332016            6.437270           9.748661
                       (2.528269)        (4.102793)        (3.895496)        (4.202992)          (3.832683)         (4.693933)
          κv               −                 −             0.1659727          0.129618            0.124326           0.040925
                           −                 −             (3.738001)        (3.796853)          (0.411320)         (0.415643)
          κz            0.276054          0.288299             −                  −               0.466113           0.724400
                       (2.690688)        (3.028778)            −                  −              (4.730788)         (4.336113)
           θ¯           0.052213          0.052213          0.052213          0.052213            0.052213           0.052213
                           −                 −                 −                  −                   −                 −
           v
           ¯               −                 −           4.480144e-05      5.899065e-05        2.281740e-04       4.150809e-04
                           −                 −             (5.694430)        (4.528638)          (0.506240)         (1.067214)
          σs         7.122488e-05      7.989389e-05            −                  −                   −                 −
                      (51.385560)       (12.499803)            −                  −                   −                 −
          σv               −                 −           5.756028e-03      6.525899e-03           0.054158           0.089933
                           −                 −             (2.040981)        (1.367602)          (0.721646)         (0.775665)
          λu0         758.158092         338.590888        341.664950      −323.321781        1.160316e+03          273.665764
                       (0.106748)        (0.395815)        (0.033362)       (−0.020226)          (0.153202)         (0.548656)
          λs        6.769235e+03       7.582229e+03      2.013169e+04      1.868451e+04       6.117227e+03        7.267293e+03
                       (1.477492)        (2.246950)        (2.304910)        (2.135644)          (2.324139)         (1.856327)
                   −5.269678e + 03 −4.876673e + 03 −1.263833e + 04 −1.479579e + 04 −8.738577e + 03 −9.408883e + 03
75


          λθ
                     (−3.895510)        (−4.302838)       (−7.475722)       (−9.228838)        (−6.527465)         (−4.175201)
          λv               −                 −           1.783100e+07      1.861178e+07      −8.919179e + 04      5.483153e+05
                           −                 −             (2.549629)        (2.256711)        (−0.601905)          (1.629651)
          λz          123.034098          119.4507             −                  −             234.823691          237.629397
                      (18.737795)       (18.630551)            −                  −             (21.788983)        (17.870117)
           qs          70.120506         70.285118       −257.530126       −241.323541          −89.933014         −47.618971
                       (3.274451)        (8.673452)       (−5.289520)       (−4.785031)        (−3.222643)         (−2.904623)
          qv               −                 −             499.896256       1051.160708      −1.018602e + 04 −2.537544e + 03
                           −                 −             (0.047090)        (0.105902)        (−1.192666)         (−0.810859)
          qz           −0.209059         −0.213215             −                  −              0.2553252           0.112632
                    (−132.732571)      (−33.360167)            −                  −             (27.818100)         (0.181483)
          σM         1.364800e-03      2.132152e-03      1.155277e-03      1.145397e-03               −                 −
                      (15.119999)       (14.766891)       (17.539221)       (18.142008)               −                 −
          ρM            0.961300          0.955110          0.982425          0.983664                −                 −
                      (58.641087)       (56.126410)       (78.932304)       (88.109031)               −                 −
                                                                                                                          1 1
     NOTE: This table reports the SML parameter estimates and t-ratios (in brackets) obtained with S = 2500, h = M 365 ,
     M = 1, Ms = 30 and weekly observations of the 6-month LIBOR, 2 and 5-year swap rate from January 1, 1994 to December
     31, 1998. σM is the standard deviation of the measurement error contaminating observations of the 2-year swap rate and ρM
                                                                  ¯     ¯
     is its autocorrelation. For the SV Model, λd = 619.6040 and λu = λd = 480.6345 for the constrained model; λd = 327.3511
                                                 0                                                                0
     and λ       ¯                                                                                  ¯     ¯
           ¯ u = λd = 2.0146 for the unconstrained model. For the SVλ Model, λd = 207.0786 and λu = λd = 683.6972 for the
                                                                                 0
                                             ¯    ¯
     constrained model; λd = 273.6658 and λu = λd = 9.9949 for the unconstrained model.
                            0
            Table 6: Forecasting Evaluation of Target Model
                         Same Change                No Change
Predicted Actual   up      no    down total up no down total
        up          2       5      0     7    0     0      0      0
        no          5      20      3    28    7    28      5     40
      down          0       3      2     5    0     0      0      0
     correct        2      20      2    24    0    28      0     28
      total         7      28      5    40    7    28      5     40
   % correct     28.57 71.42      40    60    0 100        0     70
                      Unconstr. λ Model          Constr. λ Model
Predicted Actual up       no    down total up no down total
         up        7      10      0    17    7     28     4     39
         no        0      18      5    23    0      0     0      0
       down        0       0      0     0    0      0     1      1
      correct      7      18      0    25    7      0     1      8
       total       7      28      5    40    7     28     5     40
    % correct     100 64.29       0   62.50 100 0        20     20
                      Unconstr. SV Model        Constr. SV Model
Predicted Actual   up      no    down total up no down total
        up          5        3      0    8    7     17     0     24
        no          2       25      5    32   0      0     0      0
      down          0        0      0    0    0     11     5     16
     correct        5       25      0    30   7      0     5     12
      total         7       28      5    40   7     28     5     40
    % correct    71.43 89.29        0    75  100 0        100    30
                     Unconstr. SVλ Model        Constr. SVλ Model
Predicted Actual   up      no    down total up no down total
        up          4        2      0    6    7     23     0     30
        no          3       26      5    34   0      0     0      0
      down          0        0      0    0    0      5     5     10
     correct        4       26      0    30   7      0     5     12
      total         7       28      5    40   7     28     5     40
    % correct    57.14 92.86        0    75  100 0        100    30

NOTE: The sample used in this table is January 1, 1994 to December 31, 1998.
During this time, there have been 40 FOMC meetings, 8 moves up in the target (1
outside of an FOMC meeting) and 6 moves down (1 outside of an FOMC meeting).
This means that in a constant probability model, the estimated probability of a
move up and down is 7/40 and 5/40, respectively. Forecasting a particular choice
(up, down, no) is deﬁned as the alternative with the highest probability. As to the
2 changes outside of FOMC meetings, all models would have missed them, so they
are not part of the table.
                                        76
    Table 7: Estimation Results for Some Extensions of the SVλ-Model
         Parameter       Jumps in z        Jumps in s          Corr. σsv
            κs             14.967792         7.820412           9.933076
                          (6.398363)        (5.214696)        (4.473048)
              κv           0.0115368         0.054233           0.042497
                          (0.215357)        (1.584036)        (0.454876)
              κz           0.641243          0.679306           0.748360
                          (3.648210)        (7.190619)        (4.523098)
              ¯
              θ            0.052213          0.052213           0.052213
                               −                −                  −
              v
              ¯         1.056887e-03      2.804654e-04      3.659218e-04
                          (2.137352)        (1.342108)        (1.059371)
              σv           0.017776          0.066227           0.085224
                          (0.455049)        (1.390437)        (0.843764)
              λu
               0           62.358567        49.883635        271.189584
                          (0.111839)        (0.073574)       (0.0520345)
              λs        6.998324e+03      5.582408e+03      7.324409e+03
                          (3.115227)        (1.587551)        (1.860092)
              λθ       -1.560489e+04     -1.526898e+04     -9.240203e+03
                         (-4.552850)       (-3.920868)       (-4.836343)
              λv        7.271786e+05      1.073800e+06      6.046136e+05
                          (2.160996)        (1.299366)        (1.659181)
              λz         187.103829         312.683229       240.092789
                          (9.144435)        (9.095217)       (22.691191)
              qs          -27.058453       -101.979831        -46.167988
                         (-3.770240)       (-3.427597)       (-2.961026)
              qv       -1.349147e+03     -3.588000e+03         -2.322817
                         (-1.526325)       (-1.152997)       (-0.741050)
              qz           0.090891          0.067808           0.165448
                          (0.151579)       (-0.069894)        (0.281479)
              Jz           0.103886             −                  −
                          (1.985465)
              Js               −            -0.002529            −
                                           (-6.145468)
             σsv              −                 −            20.150802
                                                             (1.221437)


NOTE: This table resports SML parameter estimates and t-ratios (in brackets) obtained
                     1 1
with S = 2500, h = M 365 , M = 1, Ms = 30 and weekly observations of the 6-month
LIBOR, 2 and 5-year swap rate from January 1, 1994 to December 31, 1998.
                                        77
                             Table 8: Granger Causality

                                                   n=1       n=3       n=6
                 CPI predictable by NPE           0.4650    0.7415    0.9608
                 NPE predictable by CPI           0.1144    0.0592    0.1260
                CPI predictable by target         0.0001    0.0197    0.0300
                NPE predictable by target         0.0862    0.5028    0.8110


NOTE: This table reports the p-values corresponding to the usual F-test that the coeﬃ-
cients on all lags of the indicated regressor are zero. More precisely, the dependent variable
is regressed on n lags of itself and n lags of the regressor. The sample used for this table
is 1985:6 to 1998:12.


     Table 9: Joint Dynamics of Analyst Forecasts and Actual Releases

                                    j = CP I        j = NPE
                             aj
                              0       0.006095        0.005817
                                    (0.603442)      (1.077137)
                             aj
                              1       0.556357        0.549290
                                    (3.078901)      (2.874358)
                             aj
                              2      -0.027615       -0.011684
                                   (-0.187600)     (-0.106604)
                             aj
                              3       0.164741        0.037299
                                    (1.006975)      (0.507469)
                             σj       0.021181        0.016771
                                    (7.303732)      (8.385483)
                              j
                             σF       0.018219        0.009146
                                   (12.146177)     (10.161710)


NOTE: This table reports maximum likelihood estimates of (20) using the sample 1985:2
                                 j
to 1998:12. The target rate θ(τF ) in (20) is the value of the target on the day before the
                         j
analyst forecast survey τF , as releases occur at 8:30 am and FOMC meetings after that.


                                             78
                  Target and Yield Data used in Estimation

        9
                                                                Target
                                                                6−month LIBOR
                                                                2−year Swap
                                                                5−year Swap
      8


        7


      6
  Percent


      5


      4


        3


               12/15/94      11/30/95        11/14/96   10/30/97      10/15/98

Figure 6: This graph shows weekly data on the 6-month LIBOR, 2 and 5-year swap rate
together with the Fed’s target from January 1, 1994 to December 31, 1998.


                                        79
                                  The Eﬀects of FOMC Meetings
          Response to Target Changes                                    Term Structure of Volatility
 1                                                      30

0.8                                                     25

0.6
                                                        20
0.4
                                                        15
0.2
                                                        10
 0

−0.2
                                                          5
  0 0.5   1 1.5    2 2.5      3 3.5      4 4.5      5     0 0.5     1 1.5          2 2.5 3 3.5           4 4.5   5
                                                                                    Maturity
          Figure 7: These graphs show the response of yields to target changes and the term
          structure of volatility in the data. The response of yields to target changes is measured
          by the slope parameter of weekly yield changes regressed on target rate changes (and an
          intercept) in weeks of FOMC meetings, including the weeks of April 24, 1994 and October
          15, 1998. Dotted lines are standard-error bounds computed using a SUR speciﬁcation.
          The term structure of volatility during weeks of FOMC meetings (dotted line) and the
          remaining weeks is measured as standard deviations of yield changes. The weekly data are
          Wednesday observations from 1994 to 1998 on the target rate, overnight repo rate, and
          Thursday observations on the 1, 3, 6, 12-month LIBOR and 2, 3, 4, 5-year Swap Rates.
          Standard-error bounds around the volatility estimates are computed in the following table
          with GMM using 5 Newey-West lags.
                       Standard Errors around Volatility Estimates (in Basis Points)
                        Repo                 LIBOR Rates                            Swap    Rates
                      Overnight    1 mth    3 mth 6 mth       12 mth       2 yr     3 yr      4 yr     5 yr
             Vol        19.90       8.78     7.26   8.87       11.17      13.18    13.27     13.05    12.92
           ‘Normal’    (2.01)      (1.53)   (0.86) (0.84)      (0.79)     (0.75)   (0.76)    (0.70)   (0.67)
             Vol        31.72      11.13     9.47   9.64       11.35      14.01    14.24     14.00    14.12
            FOMC       (2.21)      (1.69)   (1.37) (1.88)      (2.12)     (1.90)   (1.77)    (1.75)   (1.83)


                                                     80
                             Model-Implied Eﬀects of FOMC Meetings
       Model-Implied Response to Target Changes                   Model-Implied Term Structure of Volatility
 1                                                          30                                             Data−Fomc
                                                                                                           Data−Normal
                                                                                                           Model−Normal
                                                                                                           Model−FOMC
0.8                                                         25

0.6                                                         20

0.4                                                         15

0.2                                                         10

 0
                                                              5
−0.2

  0 0.5      1 1.5     2 2.5 3 3.5           4 4.5      5     0 0.5
                                                               0        1 1.5     2 2.5 3 3.5             4 4.5           5
                        Maturity                                                   Maturity
              Figure 8: The left graph plots the response of yields to target changes estimated in an
              unrestricted estimation (solid line, same as in Figure 7) together with the model-implied
              response measured by calculating the analytical derivative dy/dx and multiplying it by
              400JU (dotted line with +). The right ﬁgure is the term structure of volatility during
              weeks of FOMC meetings (dotted line) and the remaining weeks (black line) in the data
              (without +) and in a simulation of the model (with +). The simulations start with
                                         ˆ
              S = 20, 000 initial states X0 that are obtained by simulating the state dynamics for 10
              years, starting at the unconditional mean. Each day is subdivided into 2 intervals and
                                                                              ˆ
              each FOMC meeting is subdivided into 30 intervals. Given X0 , S diﬀerent samples of
              yields are simulated, each sample uses the actual FOMC calendar.


                                                         81
           0.9                                                         Up−Probability
                                                                       Down−Probability
           0.8

           0.7

           0.6
         Probability


           0.5

           0.4

           0.3

           0.2

           0.1

                       00       5   10   15     20     25     30     35               40
                                 FOMC Meetings counted starting in 1994


                  6

        5.5

                  5
                                               Fed Target
                                               Model−Implied Rule
        Percent


                                               Original Taylor Rule
                                               Estimated Taylor Rule
        4.5                                    Extended Taylor Rule


                  4

        3.5

                  3         5     10   15      20    25      30     35                40
                                FOMC Meetings counted starting in 1994
Figure 9: For each FOMC meeting since January 1994, the upper graph plots the condi-
tional probability of target rate moves up (straight line) and down (dotted line) from the
unconstrained SVλ model. The lower graph shows the model-implied policy rule together
with versions of the Taylor rule (see Section 7.6).
                                                  82
                       Extended Model-Implied Eﬀects of FOMC Meetings
       Model-Implied Response to Target with Js                    Model-Implied Term Structure of Vol with Js
 1.2                                                        30

 1
                                                            25
0.8
                                                            20
0.6

0.4                                                         15

0.2
                                                            10
 0
−0.2
                                                               5
  0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5                             0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5
    Model-Implied Response to Target with Jz                   Model-Implied Term Structure of Vol with Jz
 1.2                                                        30

 1
                                                            25
0.8
                                                            20
0.6

0.4                                                         15

0.2
                                                            10
 0
−0.2
                                                               5
  0 0.5      1 1.5    2 2.5 3 3.5 4 4.5 5 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5                                      5
                       Maturity                                                 Maturity
              Figure 10: Analogues of Figure 8 for extended SVλ models using parameters from Table
              7: Js (ﬁrst row) and Jz (second row). The line marked with circles in the lower left corner
              shows sets Jz = 0.3.


                                                          83
                         The Eﬀects of Macroeconomic Releases
                   Reaction to NPE Surprise                   Reaction to CPI Surprise
    12                                          4
    10                                          3
                                                2
        8                                       1
        6                                       0
        4                                      -1
                                               -2
        2                                      -3
    0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5
    0
                                                    −4
                                          0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5
                 Maturity                              Maturity
   Term Structure of Vol at NPE Release  Term Structure of Vol at CPI Release
   16                                    16
   14                                    14
   12                                    12
   10                                    10
    8                                     8
    6                                     6
    4                                     4
    2                                     2
    0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5
    0                                     0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5
                                          0
                 Maturity                              Maturity
 Model-Implied Response to NPE Surprise Model-Implied Response to CPI Surprise
           7                                          7


        6                                          6
        5                                          5
                                               Basis Points
    Basis Points


        4                                          4
        3                                          3
        2                                          2
        1                                          1
        0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5
        0                                          0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5
                                                   0
                   Maturity                                   Maturity

Figure 11: The ﬁrst row of graphs shows the slope parameter of daily yield changes
regressed on NPE and CPI standardized surprises (and an intercept) using the subsample
of the respective release days. Dotted lines are standard-error bounds computed with 5
Newey-West lags. Standardized surprises are deﬁned as the analyst forecast error m(t) −
mF (t), normalized by its standard deviation, so that the regression coeﬃcients can be
interpreted as reactions to a one-standard deviation analyst forecast error. The second
row of graphs is the term structure of volatility at the respective release days. Standard
errors are computed using 5 Newey-West lags. The third row of graphs shows the model-
                                            84
implied cross-sectional (contemporanous) impulse response of yields to NPE and CPI
release surprises. Due to time-zone diﬀerences, the data used for this estimation are same-
day observations from 1994 to 1998 on 2, 3, 4, 5-year swap rates, and next-day observations
of 1, 3, 6 and 12-month LIBOR rates.