Estimating Stochastic Volatility Diffusion
    Using Conditional Moments of Integrated
                  Volatility†


                                   Tim Bollerslev
                                           and
                                      Hao Zhou
                              Department of Economics
                                  Duke University
                              Durham, NC 27708-0097

                                    February 2000


  †We would like to thank Ravi Bansal, Neil Shephard, George Tauchen, and the participants
in the Duke financial econometrics lunch group for many valuable comments and discussions.
Financial support from an NSF grant to the NBER (Bollerslev) is gratefully acknowledged.
An updated version of this paper can be downloaded from the following website:
http://www.duke.edu/~hz/paper/dnld.html. For questions and comments, please contact Tim
Bollerslev, Department of Economics, Duke University, Box 90097, Durham NC 27708 USA, Email
boller@econ.duke.edu, Phone 919-660-1846, Fax 919-684-8974; or Hao Zhou, Department of
Economics, Duke University, Box 90097, Durham NC 27708 USA, Email zhou@econ.duke.edu, Phone
919-660-1800, Fax 919-684-8974.
                                        Abstract
We exploit the distributional information contained in high-frequency intraday data in
constructing a simple conditional moment estimator for stochastic volatility diffusions. The
estimator is based on the analytical solutions of the first two conditional moments for the
integrated volatility, which is effectively approximated by the quadratic variation of the
process. We successfully implement the resulting GMM estimator with high-frequency
five-minute foreign exchange and equity index returns. Our simulation evidence and actual
empirical results indicate that the method is very reliable and accurate. The computational
speed of the procedure compares very favorably to other existing estimation methods in the
literature.


JEL Classification: C13, C22.
Keywords: Stochastic Volatility Diffusions; Integrated Volatility; High-Frequency Data;
GMM Estimation.
1 Introduction
Continuous time methods and no-arbitrage arguments figure prominently in the theoretical
asset pricing literature. However, some of the most influential contributions have been based
upon fairly simple and restrictive assumptions concerning the process for the underlying state
variable(s). Leading examples include the celebrated Black-Scholes option pricing formula,
which assumes that the true process for the underlying asset follows a geometric Brownian
motion, and the CIR model for the term structure of interest rates, which is derived under
the assumption of a square-root process for the short rate. Meanwhile, the burgeoning empirical
literature on discrete time ARCH and stochastic volatility models (see Bollerslev et al., 1994;
Ghysels et al., 1996) has called into question the empirical validity of a time invariant
diffusion, or a single state variable, as a reasonable assumption for most speculative rate of
return series. In response to this, several recent studies have utilized more realistic continuous
time models, explicitly allowing for time varying volatility in the state variables. The Hull
and White (1987) and Heston (1993) stochastic volatility option pricing formulas, and the
exponential-affine class of term structure models in Duffie and Kan (1996) and Dai and
Singleton (2000), are all notable examples.
    Aside from a few special cases, estimation of these continuous time stochastic volatility
models is complicated by the lack of a closed form expression for the transition density
function for the corresponding discretely sampled observations, and numerous competing
estimation strategies have been proposed in the literature. An incomplete list of these
different techniques includes the Markov Chain Monte Carlo (MCMC) methods advanced by
Jacquier et al. (1994), Eraker (1998), Kim et al. (1998) and Elerian et al. (1998); the
simulated method of moments approach in Duffie and Singleton (1993); the indirect inference
procedure of Gourieroux et al. (1993) utilized by Engle and Lee (1997); the Efficient Method
of Moments (EMM) developed by Gallant and Tauchen (1996) and Gallant and Long (1997)
and implemented by Andersen et al. (1999c); the infinitesimal moment generator underlying
the GMM procedure in Hansen and Scheinkman (1995) and Conley et al. (1997); the
non-parametric series expansions of the transition density function advocated by Aït-Sahalia
(1996) and Stanton (1997), and the related kernel estimator in Bandi and Phillips (1999);
the approximation method to the likelihood function building on the Kolmogorov forward
equations in Lo (1988) and Aït-Sahalia (1998); and the spectral GMM estimator utilizing
the empirical characteristic function in Chacko and Viceira (1999) and Singleton (1999).
While all of these procedures yield consistent, and in most cases also asymptotically
efficient, parameter estimates for the various model specifications, they are all computationally
demanding and cumbersome to implement in practice.
    In the present paper we propose a new, much easier to compute, estimation procedure
for stochastic volatility diffusions. The basic idea is straightforward. Instead of integrating
out the latent volatility, as is implicitly done in the estimation procedures in the extant
literature, the strategy proposed here utilizes high-frequency data for explicitly measuring
the realized volatility.
    High-frequency, or tick-by-tick, data have recently become available for a host of different
financial instruments. Following the work of Merton (1980) and Nelson (1992), such data
could in principle be used to construct point-wise consistent filtering measurements for the
instantaneous volatility. Unfortunately, the optimal filter weights depend in complicated
ways on the particular model structure (Nelson and Foster, 1994), and in practice the
continuous record asymptotics underlying the theoretical arguments are corrupted by inherent
discreteness, time-of-day effects, bid-ask spreads, and other market microstructure frictions.^1
Meanwhile, it is possible to construct model-free unbiased estimates of the integrated
volatility over a fixed time interval, say one day, by simply summing the squared returns over
the relevant time-period. Moreover, by the theory of quadratic variation, the sum of the
squared inter-period returns affords increasingly more accurate ex-post volatility
measurements as the length of the return-horizon decreases.^2 Motivated by this idea, Andersen et al.
(1999b, 2000b) offer a detailed descriptive characterization of the salient distributional
features of daily realized foreign exchange and individual stock return volatilities constructed
from high-frequency five-minute returns.
    Here, we go one step further, and show that by matching the sample moments of the
realized volatility to the population moments of the integrated volatility implied by a particular
continuous-time model structure, a standard, and easy-to-compute, GMM-type estimator
for the underlying model parameters is immediately applicable. For concreteness we restrict
the analysis in the paper to the square-root, or affine, class of stochastic volatility models.
This particular class of models arguably constitutes the leading case in the literature, but
the method is general. In particular, Barndorff-Nielsen and Shephard (1999) present
analytical expressions for the moments of the integrated volatility for a general class of continuous
time stochastic volatility models, in which the instantaneous variance is defined by the sum
of multiple Ornstein-Uhlenbeck processes, each of which is driven by a homogeneous Lévy
process.

   1 In a related context, Brandt and Santa-Clara (1999) and Ledoit and Santa-Clara (1999) suggest using
Black-Scholes implied volatilities for short-lived at-the-money options to estimate the instantaneous volatility.
In practice, this implicitly assumes that the volatility is constant over the remaining life of the option, and
that the volatility risk is not priced.
   2 Alternatively, it is possible to extract information about the forward integrated volatility from the
high-low range of the discretely sampled data as in Gallant et al. (1999). Also, Alizadeh et al. (1999) have recently
proposed using the high-low range as a volatility proxy in a Gaussian quasi-maximum likelihood estimation
procedure for a simple stochastic volatility model.

    The rest of the paper is organized as follows. The next section demonstrates how the
population moments for the integrated volatility may be derived from the moments for the
point-in-time volatility. This section also briefly discusses the basic GMM setup employed
in the estimation. The Monte Carlo simulations in Section 3 highlight that the method
works very well in empirically realistic finite sample settings, and that the efficiency of the
parameter estimates compares favorably to that of a non-feasible QML procedure treating
the instantaneous volatility as observable. The statistical inference concerning the true
values of the individual parameters and the overall fit of the model is generally also very
reliable. The only caveat is a negligible upward bias in the estimates of the
variance-of-variance parameter. This is directly attributable to the measurement error in the quadratic
variation as a proxy for the integrated volatility, and we show how a simple adjustment
term in the moment conditions is effectively able to eliminate this bias. Section 4 gives
the empirical results from applying the new estimation procedure to a set of high-frequency
five-minute foreign exchange rates and Japanese equity index returns. Section 5 concludes.
Mathematical details regarding the derivation of the moment conditions for the integrated
volatility are relegated to a technical appendix.

2 Estimating Stochastic Volatility Diffusion
The basic estimation strategy builds on the usual asymptotic theory of GMM assuming an
increasing number of discretely sampled observations (Hansen, 1982). However, the con-
struction of the sample moments explicitly relies on the availability of high-frequency data,
and the almost sure convergence of the quadratic variation to the integrated volatility of the
process. We begin with a general discussion of the main idea, and then proceed to a concrete
illustrative example.

2.1 Integrated Volatility and GMM Estimation
To set out the main idea, let $p_t$ denote the time $t$ logarithmic price for some asset. The
generic continuous time stochastic volatility model may then be written as
$$
\begin{aligned}
dp_t &= \mu(p_t, V_t, t; \xi)\,dt + \sigma(p_t, V_t, t; \xi)\,dB_t \\
dV_t &= \mu_V(p_t, V_t, t; \xi)\,dt + \sigma_V(p_t, V_t, t; \xi)\,dW_t
\end{aligned}
\tag{1}
$$
where $V_t$ denotes a vector of latent volatility factors, $dB_t$ and $dW_t$ denote compatible, possibly
correlated, Brownian motions, and the drift and diffusion functions are assumed to be
sufficiently regular to guarantee the existence of a unique strong solution (see, e.g., Karatzas
and Shreve, 1997). Moreover, the parameters, $\xi$, are restricted to lie within some compact
set, $\Xi$, containing the true parameters of the process, say $\xi_0$. Of course, the dependence of
$p_t$ on $dW_t$ through both $V_t$ and $\mathrm{corr}(dB_t, dW_t)$ may be redundant. Also, for concreteness, in
the subsequent empirical analysis we will normalize the unit time interval to correspond to
one day.
    The exact form of the drift function, $\mu(p_t, V_t, t; \xi)$, is generally irrelevant for the consistent
estimation of the parameters entering the diffusion functions. Meanwhile, the estimation of
these parameters based on discretely sampled observations for the $p_t$ process is complicated
by the $V_t$ process being latent, and the lack of a closed form expression for the corresponding
transition density. As noted in the introduction, this in turn has spurred the development
of several alternative computationally demanding estimation procedures. However, by the
theory of quadratic variation
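$$
\lim_{N \to \infty} \sum_{i=1}^{2^N}
\left[ p_{t + \frac{i}{2^N}(T-t)} - p_{t + \frac{i-1}{2^N}(T-t)} \right]^2
\stackrel{a.s.}{\longrightarrow}
\int_t^T \sigma^2(p_s, V_s, s; \xi)\,ds \equiv V_{t,T}
\tag{2}
$$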
where $V_{t,T}$ denotes the integrated volatility from time $t$ to $T$. Thus, while the
point-in-time volatility, $\sigma(p_t, V_t, t; \xi)$, is generally unobservable, by summing increasingly finer
sampled squared high-frequency returns, it is possible to obtain increasingly more accurate
estimates of the integrated volatility of the process. Importantly, in the limit the integrated
volatility is effectively observable.^3
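The finite-sample counterpart of the limit in (2) can be sketched in a few lines of Python. This is only an illustration under the assumption of equally spaced intraday log prices; the function name `realized_variance` and the argument `n_intraday` are hypothetical and do not come from the paper.

```python
from typing import List, Sequence


def realized_variance(log_prices: Sequence[float], n_intraday: int) -> List[float]:
    """Daily sums of squared intraday log returns.

    log_prices holds n_days * n_intraday + 1 equally spaced log prices,
    so every day contributes exactly n_intraday returns.
    """
    n_returns = len(log_prices) - 1
    if n_intraday <= 0 or n_returns <= 0 or n_returns % n_intraday != 0:
        raise ValueError("need a whole number of days of intraday returns")
    daily = []
    for start in range(0, n_returns, n_intraday):
        total = 0.0
        for i in range(start, start + n_intraday):
            r = log_prices[i + 1] - log_prices[i]  # intraday log return
            total += r * r
        daily.append(total)
    return daily
```

With five-minute data one would set `n_intraday` to the number of five-minute intervals in the trading day; a finer grid brings the sum closer to the integrated volatility, subject to the microstructure caveats noted in the introduction.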


    Explicitly treating the integrated volatility as observable in turn permits the
implementation of a standard GMM type estimator for the underlying model parameters, by minimizing
the weighted distance between the sample moments and the corresponding population
moments of $V_{t,T}$ implied by the particular model structure. Of course, in practice continuously
sampled observations are not available, so that the integrated volatility is not actually
observable. However, the same GMM estimation strategy may be formally justified under the
additional assumption that the number of observations employed in the construction of the
sample moments converges to infinity at a slower rate than the almost sure convergence rate
of $1/2^N$ for the quadratic variation. The validity of this assumption is obviously an empirical
question.
   3 Andersen and Bollerslev (1998a) provide simulation evidence in support of this idea, and argue for the
practical use of the quadratic variation as a meaningful measure of the ex-post realized volatility.

    The next section details the derivation of the first two population moments for a particular
class of stochastic volatility models. For concreteness, we shall focus on the square-root
volatility model, or the single factor affine diffusion, analyzed by Heston (1993) among
others. But the same basic approach employed in the next section could in principle be
extended to any multifactor stochastic volatility process for which the conditional mean
and conditional variance of the point-in-time volatility have tractable analytical expressions.^4
This latter class is quite general, including the affine class of stochastic differential equations
popularized by Duffie and Kan (1996) and Dai and Singleton (2000), as well as the quadratic
stochastic volatility class of models (see, e.g., Kloeden and Platen, 1992).

2.2 Conditional Moments of Integrated Volatility
The square-root volatility model, or scalar affine diffusion, is succinctly defined by
$$
\begin{aligned}
dp_t &= \mu_t\,dt + \sqrt{V_t}\,dB_t \\
dV_t &= \kappa(\theta - V_t)\,dt + \sigma\sqrt{V_t}\,dW_t
\end{aligned}
\tag{3}
$$
where $V_t$ is a scalar latent volatility process. While this first-order parameterization is
obviously somewhat restrictive, it is nonetheless rich enough to illustrate the general idea,
and it has in fact been widely used in the literature. In this parameterization, $\theta$ determines
the long-run (unconditional) mean, $\kappa$ is the mean reversion parameter, and $\sigma$ denotes the
local variance (volatility-of-volatility) parameter. For the process to be well defined, the
parameters must satisfy: $\theta > 0$ (non-negativity), $\kappa > 0$ (stationary in mean), and $\sigma^2 \le 2\kappa\theta$
(stationary in volatility). Note that the drift of the asset returns, $\mu_t$, can be any linear or
nonlinear function of the state variables, $p_t$, $V_t$, or even other unobservable factor(s), without
impeding the estimation of the stochastic volatility component.
    In deriving the conditional moments for the integrated volatility, it is useful to
distinguish between two different information sets: the continuous sigma-algebra
$\mathcal{F}_t = \sigma\{V_s;\, s \le t\}$, generated by the point-in-time volatility process, and the discrete
sigma-algebra $\mathcal{G}_t = \sigma\{V_{t-s-1,t-s};\, s = 0, 1, 2, \ldots, \infty\}$, generated by the integrated volatility
series. Obviously, the coarser filtration is nested in the finer filtration (i.e., $\mathcal{G}_t \subset \mathcal{F}_t$), and by
the Law of Iterated Expectations, $E[E(\cdot|\mathcal{F}_t)|\mathcal{G}_t] = E(\cdot|\mathcal{G}_t)$.
   4 Although the procedure implemented here hinges on the matching of selected population and sample
moments for the integrated volatility, in situations where analytical expressions for the population moments
of $V_{t,T}$ are not directly available, these could easily be evaluated by simulations, and the underlying parameters
estimated by simulated method of moments (Duffie and Singleton, 1993).


2.2.1 Conditional Mean
In deriving the conditional mean for the integrated volatility, it is useful to start with the
conditional mean of the point-in-time volatility. In particular, it follows from the result in
Cox et al. (1985) that
$$
E(V_T|\mathcal{F}_t) = \alpha_{T-t} V_t + \beta_{T-t}
\tag{4}
$$
where $\alpha_{T-t}$ and $\beta_{T-t}$ are functions of the structural parameters and the horizon of the
forecast, $T - t$ (see Appendix A for details). The second step is to express the conditional
mean of the integrated volatility as a (linear) function of the point-in-time volatility by
interchanging the order of expectation and integration,
$$
E(V_{t,T}|\mathcal{F}_t) = E\left(\left.\int_t^T V_s\,ds \,\right|\, \mathcal{F}_t\right) = a_{T-t} V_t + b_{T-t}
\tag{5}
$$
where $a_{T-t}$ and $b_{T-t}$ denote other explicit functions of the drift parameters and the sampling
interval.
    Now, by iteratively substituting the above two results, the conditional mean of the
integrated volatility for the one-day horizon, given the finer information set $\mathcal{F}_t$, is readily
expressed as (see Appendix A),
$$
E(V_{t+1,t+2}|\mathcal{F}_t) = \alpha E(V_{t,t+1}|\mathcal{F}_t) + \beta
$$
where for notational simplicity we omit the subscript for the daily horizon, i.e., $\alpha \equiv \alpha_1$ and
$\beta \equiv \beta_1$. Using the Law of Iterated Expectations, the above relationship can be conditioned
on the coarser information set $\mathcal{G}_t$, yielding
$$
E[E(V_{t+1,t+2}|\mathcal{F}_t)|\mathcal{G}_t] = E(V_{t+1,t+2}|\mathcal{G}_t) = \alpha E(V_{t,t+1}|\mathcal{G}_t) + \beta.
\tag{6}
$$
The first order moments for the multi-period integrated volatility may be derived by similar
reasoning.
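The recursion behind (6) can be checked numerically. The sketch below assumes the standard conditional-mean expressions for a square-root process, $\alpha_\tau = e^{-\kappa\tau}$ and $\beta_\tau = \theta(1 - e^{-\kappa\tau})$, together with their time integrals for $a_\tau$ and $b_\tau$; Appendix A is not reproduced in this section, so these closed forms and all function names are assumptions of the example rather than quotations from the paper.

```python
import math


def mean_coefficients(kappa: float, theta: float, tau: float = 1.0):
    """Return (alpha, beta, a, b) for horizon tau in the square-root model."""
    decay = math.exp(-kappa * tau)
    alpha = decay                      # E(V_T | F_t) = alpha * V_t + beta
    beta = theta * (1.0 - decay)
    a = (1.0 - decay) / kappa          # E(V_{t,T} | F_t) = a * V_t + b
    b = theta * tau - theta * a
    return alpha, beta, a, b


def next_day_mean_direct(v_t: float, kappa: float, theta: float) -> float:
    """E(V_{t+1,t+2} | F_t): forecast V_{t+1} first, then integrate over the day."""
    alpha, beta, a, b = mean_coefficients(kappa, theta)
    return a * (alpha * v_t + beta) + b


def next_day_mean_recursive(v_t: float, kappa: float, theta: float) -> float:
    """Same quantity written as alpha * E(V_{t,t+1} | F_t) + beta."""
    alpha, beta, a, b = mean_coefficients(kappa, theta)
    return alpha * (a * v_t + b) + beta
```

Both functions return the same number for any starting value of $V_t$, which is the identity $a\beta + b = \alpha b + \beta$ that makes the one-day recursion hold.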
2.2.2 Conditional Second Moment
Analogous to the derivation of the conditional first moment above, it is convenient to start
from the expression for the conditional variance for the point-in-time volatility. Again,
following Cox et al. (1985), we have
$$
E(V_T^2|\mathcal{F}_t) = Var(V_T|\mathcal{F}_t) + [E(V_T|\mathcal{F}_t)]^2
= C_{T-t} V_t + D_{T-t} + [\alpha_{T-t} V_t + \beta_{T-t}]^2
\tag{7}
$$
where $C_{T-t}$ and $D_{T-t}$ are functionally dependent on the structural parameters and the
sampling interval. Now by expressing the conditional variance of the integrated volatility as
a linear function of the point-in-time volatility and by exploiting Itô's Lemma, it is possible
to show that
$$
Var(V_{t,T}|\mathcal{F}_t) = A_{T-t} V_t + B_{T-t}
\tag{8}
$$
where $A_{T-t}$ and $B_{T-t}$ represent other functionals of the parameters (see Appendix A for
detailed derivations).
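For the point-in-time variance in (7), the coefficients can be illustrated with the well-known conditional variance of a square-root process. The closed forms below are the standard textbook expressions and are assumed here, since the appendix is not part of this section; the function name is hypothetical. The coefficients $A$ and $B$ in (8) are deliberately left out because their expressions are not given in the text above.

```python
import math


def variance_coefficients(kappa: float, theta: float, sigma: float, tau: float = 1.0):
    """Return (C, D) with Var(V_T | F_t) = C * V_t + D for horizon tau."""
    decay = math.exp(-kappa * tau)
    C = sigma ** 2 / kappa * (decay - decay ** 2)
    D = theta * sigma ** 2 / (2.0 * kappa) * (1.0 - decay) ** 2
    return C, D
```

As the horizon grows, `C` vanishes and `D` approaches $\theta\sigma^2/(2\kappa)$, the unconditional variance of the volatility process.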
    Now combining the conditional variance formula in (8) and the conditional mean formula
in (5), we can derive the second moment of the integrated volatility conditional on the finer
information set $\mathcal{F}_t$. In particular, for the one-day horizon this takes the form
$$
E(V_{t,t+1}^2|\mathcal{F}_t) = Var(V_{t,t+1}|\mathcal{F}_t) + [E(V_{t,t+1}|\mathcal{F}_t)]^2
= a^2 V_t^2 + (2ab + A) V_t + (b^2 + B)
\tag{9}
$$
where we have omitted the "daily" subscript "1" on $a$, $b$, $A$ and $B$ for notational convenience.
Finally, by repeatedly applying the Law of Iterated Expectations on different information sets
and substituting expressions between integrated volatility and point-in-time volatility, it
follows that
$$
E[E(V_{t+1,t+2}^2|\mathcal{F}_t)|\mathcal{G}_t] = E(V_{t+1,t+2}^2|\mathcal{G}_t)
= H E(V_{t,t+1}^2|\mathcal{G}_t) + I E(V_{t,t+1}|\mathcal{G}_t) + J
\tag{10}
$$
where the functions $H$, $I$, and $J$ are again defined in the Appendix. Corresponding moment
conditions for the squared multi-period integrated volatility follow by analogous arguments.
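One way to obtain coefficients of the form in (10) is to apply (9) at time $t+1$, take expectations given $\mathcal{F}_t$ using (4) and (7), and match the resulting polynomial in $V_t$ against $H$ times (9) plus $I$ times (5) plus $J$. The sketch below does exactly that coefficient matching. It is a derivation under the linear forms stated in (4), (5), (7), (8) and (9), not a transcription of the appendix, and the function name is hypothetical.

```python
def second_moment_recursion(alpha, beta, a, b, A, B, C, D):
    """Return (H, I, J) such that, for every V_t,
    E(V_{t+1,t+2}^2 | F_t) = H * E(V_{t,t+1}^2 | F_t) + I * E(V_{t,t+1} | F_t) + J.
    """
    k1 = 2.0 * a * b + A          # coefficient on V_t in (9)
    k0 = b * b + B                # constant term in (9)
    H = alpha * alpha             # matches the V_t^2 terms
    # matches the V_t terms
    I = (a * a * (C + 2.0 * alpha * beta) + alpha * k1 - H * k1) / a
    # matches the constants
    J = a * a * (D + beta * beta) + beta * k1 + k0 - H * k0 - I * b
    return H, I, J
```

Because the identity holds for every value of $V_t$, conditioning both sides on the coarser set $\mathcal{G}_t$ gives a relation of the same shape as (10).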
2.2.3 Conditional Moment Restrictions
The analytical solutions for the conditional first and second moments in equations (6) and
(10) immediately set the stage for the construction of a standard GMM type estimator.
Of course, the efficiency of the resulting estimator defined from these equations will depend
upon the particular choice of instruments (see Hansen, 1985; Hansen et al., 1988; Gallant
and Tauchen, 1996, for additional discussion and formal results along these lines). In the
implementation pursued here, we simply augment the two basic moments with their own
lag-one and lag-one squared counterparts, resulting in the following six moments,
$$
f_t(\xi) \equiv
\begin{bmatrix}
E[V_{t+1,t+2}|\mathcal{G}_t] - V_{t+1,t+2} \\
E[V_{t+1,t+2}^2|\mathcal{G}_t] - V_{t+1,t+2}^2 \\
E[V_{t+1,t+2} V_{t-1,t}|\mathcal{G}_t] - V_{t+1,t+2} V_{t-1,t} \\
E[V_{t+1,t+2}^2 V_{t-1,t}|\mathcal{G}_t] - V_{t+1,t+2}^2 V_{t-1,t} \\
E[V_{t+1,t+2} V_{t-1,t}^2|\mathcal{G}_t] - V_{t+1,t+2} V_{t-1,t}^2 \\
E[V_{t+1,t+2}^2 V_{t-1,t}^2|\mathcal{G}_t] - V_{t+1,t+2}^2 V_{t-1,t}^2
\end{bmatrix}.
\tag{11}
$$
By construction $E[f_t(\xi_0)|\mathcal{G}_t] = 0$, and the corresponding GMM, or minimum chi-square,
estimator is defined by $\hat{\xi}_T = \arg\min g_T(\xi)' W g_T(\xi)$, where $g_T(\xi)$ refers to the sample mean
of the moment conditions, $g_T(\xi) \equiv 1/T \sum_{t=1}^{T-2} f_t(\xi)$, and $W$ denotes the inverse of the
asymptotic covariance matrix of $g_T(\xi_0)$ (Hansen, 1982). Under standard regularity conditions, the minimized
value of the objective function multiplied by the sample size is distributed asymptotically
as a chi-square distribution with three degrees of freedom, which allows for an omnibus test
of the overidentifying restrictions. Moreover, inference concerning the individual
parameters is readily available from the standard formula for the asymptotic covariance matrix,
$(\partial f_t(\xi)/\partial \xi' \, W \, \partial f_t(\xi)/\partial \xi)^{-1}/T$.
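To make the construction concrete, the following Python sketch assembles the six sample moments and the quadratic-form objective. It assumes that the conditional expectations in (11) are replaced by their implied expressions from (6) and (10) evaluated at the observed one-day-earlier integrated volatility, which is my reading of the setup rather than a statement from the text; the function names are hypothetical, the coefficients `alpha`, `beta`, `H`, `I`, `J` are taken as given inputs, and the average is taken over the usable dates rather than dividing by $T$.

```python
from typing import List, Sequence


def moment_vector(v_lag, v_cur, v_next, alpha, beta, H, I, J) -> List[float]:
    """Six sample moments in the spirit of (11) for one date.

    v_lag, v_cur, v_next stand for V_{t-1,t}, V_{t,t+1}, V_{t+1,t+2}.
    """
    e1 = alpha * v_cur + beta - v_next                  # first-moment error
    e2 = H * v_cur ** 2 + I * v_cur + J - v_next ** 2   # second-moment error
    return [e1, e2,
            e1 * v_lag, e2 * v_lag,
            e1 * v_lag ** 2, e2 * v_lag ** 2]


def sample_moments(series: Sequence[float], alpha, beta, H, I, J) -> List[float]:
    """Average of the moment vectors over all dates with a lag and a lead."""
    if len(series) < 3:
        raise ValueError("need at least three daily observations")
    rows = [moment_vector(series[t - 1], series[t], series[t + 1],
                          alpha, beta, H, I, J)
            for t in range(1, len(series) - 1)]
    n = len(rows)
    return [sum(r[k] for r in rows) / n for k in range(6)]


def gmm_objective(g: Sequence[float], W: Sequence[Sequence[float]]) -> float:
    """Quadratic form g' W g."""
    n = len(g)
    return sum(g[i] * W[i][j] * g[j] for i in range(n) for j in range(n))
```

In an actual estimation the structural parameters would enter through the coefficient functions, and the objective would be minimized numerically over the three parameters; that outer loop is omitted here.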
    The one-period lag in the moment conditions in equation (11) implies an MA(1) error
structure. However, in order to avoid any finite sample problems with the sample analogue of
$W$ not being positive definite, in the simulations and the actual empirical estimates reported
below, we used a heteroskedasticity and autocorrelation consistent robust covariance matrix
estimator with a Bartlett kernel and a lag length of five (Newey and West, 1987).^5 The next
section details the results from a Monte Carlo study designed to investigate the finite sample
performance of this particular GMM estimator.
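A Bartlett-kernel covariance estimate of the kind described above can be sketched as follows. This is a generic textbook version written with plain Python lists; the function name, the demeaning of the moment series, and the division by the sample size are choices made for the illustration and are not taken from the paper.

```python
from typing import List, Sequence


def newey_west(moments: Sequence[Sequence[float]], lags: int = 5) -> List[List[float]]:
    """Bartlett-kernel HAC estimate of the long-run covariance of the rows."""
    n, k = len(moments), len(moments[0])
    mean = [sum(row[j] for row in moments) / n for j in range(k)]
    dev = [[row[j] - mean[j] for j in range(k)] for row in moments]

    def autocov(lag: int) -> List[List[float]]:
        # Gamma_lag = (1/n) * sum over t of dev_t dev_{t-lag}'
        return [[sum(dev[t][i] * dev[t - lag][j] for t in range(lag, n)) / n
                 for j in range(k)] for i in range(k)]

    S = autocov(0)
    for lag in range(1, lags + 1):
        w = 1.0 - lag / (lags + 1.0)       # Bartlett weight
        G = autocov(lag)
        for i in range(k):
            for j in range(k):
                S[i][j] += w * (G[i][j] + G[j][i])
    return S
```

The declining Bartlett weights are what keep the estimate positive semi-definite in finite samples; the GMM weighting matrix would then be obtained by inverting the result, a step left out of the sketch.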

3 Monte Carlo Study
One important aspect in evaluating econometric methods for estimating continuous time
processes concerns their finite sample performance. With strong temporal dependence and/or
conditional heteroskedasticity in the data generating process, asymptotically sound
estimators have been shown to exhibit very slow convergence rates (see, e.g., Pritsker, 1998). This
section assesses the small sample efficiency of our GMM estimator, along with the resulting
omnibus specification test, and Wald based parameter inference.

3.1 Experimental Design
We present the results for three benchmark specifications. Scenario A ($\kappa = 0.03$, $\theta = 0.25$,
$\sigma = 0.10$) features a highly persistent volatility process (nearly unit-root); Scenario B
($\kappa = 0.10$, $\theta = 0.25$, $\sigma = 0.10$) has a more stationary variance process; Scenario C ($\kappa = 0.10$,
$\theta = 0.25$, $\sigma = 0.20$) has a higher variance-of-variance and is close to the non-stationary
region ($\sigma^2 > 2\kappa\theta$).
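How close each design sits to the boundary of the stationarity condition can be checked with a small calculation. The sketch reads the three numbers listed for each scenario as the mean reversion, long-run mean, and local variance parameters, in that order, which is an assumption about the ordering; the names `SCENARIOS` and `boundary_ratio` are hypothetical.

```python
SCENARIOS = {
    "A": {"kappa": 0.03, "theta": 0.25, "sigma": 0.10},
    "B": {"kappa": 0.10, "theta": 0.25, "sigma": 0.10},
    "C": {"kappa": 0.10, "theta": 0.25, "sigma": 0.20},
}


def boundary_ratio(kappa: float, theta: float, sigma: float) -> float:
    """Return sigma^2 / (2 * kappa * theta); values below one satisfy the bound."""
    return sigma ** 2 / (2.0 * kappa * theta)


for name, params in SCENARIOS.items():
    print(name, round(boundary_ratio(**params), 3))
```

Under this reading all three ratios are below one, and Scenario C has the largest ratio, in line with its description as being close to the non-stationary region.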


   5 We also experimented with other lag lengths. All of the results were very similar to the ones reported
here, and are available upon request.


    In simulating the data, we utilize a first order Euler scheme with 82 artificial
"five-minute" intervals per day, further partitioning each five-minute interval into 10 segments.^6
The quadratic variation formula (2) is employed to approximate the integrated volatility
series. To check the standard "long-span" asymptotics, the econometric sample sizes are
chosen as $T$ = 1,000 and $T$ = 4,000. Since the true "continuous time" record is known inside
the simulations, we compare the GMM estimator using the "five-minute" quadratic variation
with the corresponding non-feasible estimator based on the true integrated volatility. Lastly,
we also compare the results for the GMM estimator with a QML estimator based on the
"daily" point-in-time volatility assuming the process to be Gaussian.^7 Of course, this latter
estimator is not feasible in practice either. The results are summarized in Table 1 and Figures
1 and 2.
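The simulation design can be sketched as below. The example makes several assumptions that the text does not state: the return drift is set to zero, the two Brownian motions are independent, the process starts at its long-run mean, and negative variance values produced by the Euler step are truncated at zero. The function name and its arguments are hypothetical.

```python
import math
import random


def simulate_daily_volatility(kappa, theta, sigma, n_days,
                              n_intervals=82, n_sub=10, seed=0):
    """Euler simulation of model (3) with zero drift and independent shocks.

    Returns two lists of length n_days: the "five-minute" realized
    volatility and the integrated volatility approximated on the fine grid.
    """
    rng = random.Random(seed)
    dt = 1.0 / (n_intervals * n_sub)       # one day is the unit interval
    v = theta                              # start at the long-run mean
    realized, integrated = [], []
    for _ in range(n_days):
        rv = iv = 0.0
        for _ in range(n_intervals):
            ret = 0.0                      # log return over one interval
            for _ in range(n_sub):
                v_pos = max(v, 0.0)        # truncate so the square root exists
                root = math.sqrt(v_pos * dt)
                ret += root * rng.gauss(0.0, 1.0)
                iv += v_pos * dt
                v += kappa * (theta - v_pos) * dt + sigma * root * rng.gauss(0.0, 1.0)
            rv += ret * ret
        realized.append(rv)
        integrated.append(iv)
    return realized, integrated
```

Comparing the two returned series day by day gives a direct view of the measurement error that arises from using the realized measure in place of the integrated volatility.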

3.2 Parameter Estimate and Efficiency
First, the finite sample results not only corroborate the moment conditions derived earlier for
the integrated volatility, but also indicate that the GMM estimator fares as well as (if not better
than) the other two non-feasible alternatives, which use the "unobserved" point-in-time volatility
or the "continuous time" record. The root-mean-squared-errors (RMSEs) of the drift
parameters, $\kappa$ and $\theta$, decrease roughly at the rate of $\sqrt{4}$ as the sample size increases from
1,000 to 4,000 "days". Meanwhile, the mean-reversion parameter $\kappa$ is upward biased, and
the long-run mean parameter $\theta$ exhibits a small downward bias.
    Second, the accuracy of the local variance parameter estimates is affected by both the
long-span asymptotics and the fill-in asymptotics. Although the RMSE of $\sigma$ does decrease
when the sample size goes from 1,000 to 4,000, the rate is not always $\sqrt{4}$. Also, while the drift
parameter estimates are almost unaffected by the fill-in asymptotics, the RMSE of $\sigma$ clearly
diminishes when the sampling frequency increases from "five-minute" to the "continuous
time" limit. This confirms the theoretical arguments that the diffusion parameter can be
estimated exactly with continuous sampling (see Merton, 1980; Lo, 1988; Nelson, 1992).
    Third, when the process is close to a unit-root (Scenario A), the variance parameter
seems to converge at a faster rate than $\sqrt{T}$ (Table 1 Panel A and Figure 1). Also when the
variance-of-variance parameter is large (Scenario C in Figure 1), the finite sample biases of
the drift parameter estimates are larger than for the more stationary case (Scenario B in
Figure 1). Basically the GMM estimator is not able to distinguish between a very persistent
yet stationary process and a non-stationary, near unit-root process in "small" samples.

   6 Most US financial markets are open between six-and-a-half to seven hours, corresponding to 78-84
five-minute intervals.
   7 This estimator is closely related to the ideas in Fisher and Gilles (1996), who propose a Quasi-Maximum
Likelihood estimator for affine diffusion processes, using closed form solutions for the conditional mean and
variance.

    Lastly, the GMM estimates of the local variance parameter, $\sigma$, are systematically
upward biased in all three scenarios. Interestingly, this bias completely vanishes when the true
integrated volatility is used in place of the "five-minute" quadratic variation. While the
measurement error from using the quadratic variation to approximate the integrated volatility
process is averaged out in the first moment condition, the second moment condition depends
non-linearly on the measurement error. We will investigate this issue further in Section 3.4.
    In terms of relative e ciency, the GMM estimator using the \ ve-minute" realized volatil-
ity actually performs better than the non-feasible QML estimator using the unobservable
point-in-time volatility for the drift parameters in all three scenarios, and better for the
variance parameter in all but the stationary scenario. The middle rows in Table 1 sug-
gest that the RMSEs of the GMM estimator using the true integrated volatility process are
much smaller than those of the QML estimator. However, going to the \continuous time"
limit does not necessarily improve the e ciencies of the GMM drift parameters, but it does
increase the convergence rate of the di usion parameter.

3.3 Statistical Inference
In practice, inference concerning the individual model parameters and the overall specification
of the model will have to be based on the standard GMM type test statistics discussed
in Section 2.2.3. In this regard, the t-statistics for the drift parameters in Figure 1 clearly
indicate that the GMM estimator based on the "five-minute" quadratic variation is close to
normal for both the 1,000 and 4,000 "daily" sample sizes analyzed here. Meanwhile, for the
diffusion parameter, the use of "five-minute" realized volatility in the GMM estimation gives rise
to a systematic upward bias in the t-statistics. This is consistent with the earlier explanation
of the non-dissipating measurement error in the second moment condition.[8]


    Turning to Figure 2 and the GMM tests of overidentifying restrictions, it follows that
except for the near unit-root case (Scenario A in Figure 2), the test performs very well.
Moreover, the slight over-rejection and under-rejection biases largely vanish as the sample
size increases from 1,000 to 4,000.[9]


   8 The corresponding t-tests for the non-feasible QML estimator based on the point-in-time volatility are
generally much more distorted, while the t-tests for the GMM estimates using the true integrated volatility
are all extremely close to normal. These results are available upon request.
   9 Overrejection biases of GMM omnibus tests are widely reported in the literature (Andersen and Sørensen,


3.4 Measurement Error Adjustment
By construction, the quadratic variation based on the simulated "five-minute" returns provides
an unbiased estimate of the true integrated volatility. At the same time, the squared
quadratic variation for any fixed sampling interval yields a biased estimate of the true squared
integrated volatility. Consequently, while the linear expectations operator washes out the
measurement errors in the first conditional moment and the corresponding two augmented
moments in equation (11), the three moment conditions involving the squared quadratic
variation will entail a non-zero measurement error.[10] Although the exact form of the
measurement error is not known, it follows by the almost sure convergence of the quadratic
variation that the expectation of the squared error term is bounded by the local maximum
of the continuous local martingale process (see, e.g., Karatzas and Shreve, 1997; Protter,
1992). In order to conveniently approximate this term, we simply included an additive
nuisance parameter, $\gamma$, in each of the three second order moment conditions, replacing the
squared "five-minute" quadratic variation, $\mathcal{V}_{t+1,t+2}^2$, by $\mathcal{V}_{t+1,t+2}^2 + \gamma$.
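
The source of this bias can be illustrated numerically. The following is a minimal sketch, assuming purely for illustration that the spot variance is constant within the day, so that the realized variance equals the integrated variance times a scaled chi-square variable. The function name, the 78 intraday intervals, and the unit integrated variance are hypothetical choices and are not taken from the simulation design above.

```python
import numpy as np

def realized_variance_moments(iv=1.0, n_intraday=78, n_days=200_000, seed=0):
    """Mean of RV and of RV^2 when spot variance is constant within the day.

    Illustrative assumption: each intraday return is N(0, iv / n_intraday),
    so the daily sum of squared returns is iv * chi2(n_intraday) / n_intraday.
    """
    rng = np.random.default_rng(seed)
    rv = iv * rng.chisquare(n_intraday, size=n_days) / n_intraday
    return rv.mean(), (rv ** 2).mean()

iv, n = 1.0, 78
mean_rv, mean_rv_sq = realized_variance_moments(iv, n)

# Realized variance is unbiased for the integrated variance ...
print("E[RV] / IV      :", mean_rv / iv)
# ... but its square overstates IV^2 by the factor 1 + 2/n in this setting.
print("E[RV^2] / IV^2  :", mean_rv_sq / iv ** 2)
print("1 + 2/n         :", 1 + 2 / n)
```

Under this simplifying assumption the bias in the squared measure is a constant multiple of the squared integrated variance; with genuinely stochastic intraday volatility its form is not known in closed form, which is what motivates treating it as a free nuisance parameter.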

    Not surprisingly, from the results reported in Table 2 and Figure 3, the parameter estimates
for the two drift parameters and the corresponding t-statistics are largely unaffected
by the estimation of this additional nuisance parameter. More importantly, the pervasive
finite sample biases for the local variance parameter estimates have completely disappeared.[11]
Moreover, the rejection frequencies for the GMM specification test for the overidentifying
restrictions appear marginally closer to their nominal sizes. Thus, all in all, the inclusion of
a simple additive correction term for the squared quadratic variation has effectively
eliminated the only notable statistical bias in the procedure. Of course, it is possible that more
advanced measurement error adjustment procedures could result in further improvements,
especially for more complicated models. However, for the square-root volatility diffusion
in equation (3), the GMM estimation procedure proposed here works very well in realistic
fixed-interval finite sample settings.
1996; Hansen et al., 1996), whereas underrejection biases often occur when lag instruments are used to form
the moment conditions (Tauchen, 1986).
   10 Andersen and Bollerslev (1998a) provide some limited simulation evidence on the size of this
measurement error as a function of the sampling frequency. Andersen et al. (2000c) and Bai et al. (1999) discuss
practical considerations related to the inherent market microstructure frictions and the choice of the sampling
frequency with actual high-frequency data.
   11 Meanwhile, the RMSEs for $\sigma$ have increased somewhat.


4 Empirical Illustration
This section provides an empirical illustration of the new estimation procedure using actual
high-frequency data. For expositional purposes, we will focus on the estimation results for
the simple scalar affine diffusion analyzed in the previous two sections. To illustrate the
applicability of the procedure across different markets and institutional arrangements, we
present the results for two separate data sets: spot foreign exchange rates and Japanese
equity index returns. In line with the simulations in the preceding section, we partition
the trading day for each of the markets into five-minute intervals, incorporating an additive
nuisance parameter to correct the inherent measurement error in the resulting five-minute
quadratic variation measures.

4.1 Data Description
The data for the foreign exchange market were obtained from Olsen & Associates in Zurich,
Switzerland, and consist of continuously recorded five-minute returns for the Deutsche
Mark/U.S. Dollar (DM/$) and Japanese Yen/U.S. Dollar (Yen/$) spot exchange rates. The
sample for the exchange rates spans the period from December 1, 1986 through December
1, 1996. After removing missing data, weekends, fixed holidays, and other calendar effects,
as detailed in Andersen et al. (1999b), we are left with a total of 2,445 trading days, each of
which consists of 288 five-minute returns over the 24-hour trading cycle.
    The intraday data for the Nikkei 225 composite stock market index were provided by
Nihon Keizai Shimbun Inc. The five-minute returns for the Nikkei 225 cover the period
from January 2, 1994 through December 31, 1997. Excluding days on which the Japanese
equity market was closed results in a total of 984 trading days. The Tokyo Stock Exchange
opens at 9:00 a.m., closes for lunch from 11:00 to 12:30, and closes for the day at 15:00.
Omitting the first five-minute interval of the day, associated with the special Itayose batched
trading process at the opening, leaves us with 53 five-minute returns per day. In contrast
to the very actively traded foreign exchange rates, the five-minute returns for the Nikkei
225 cash index are plagued by important non-synchronous trading effects (see, e.g., Lo and
MacKinlay, 1990; Chan et al., 1991, for a discussion of non-synchronous trading effects in
equity index returns). While the resulting autocorrelation in the high-frequency returns does
not formally affect the continuous record asymptotics underlying the GMM estimator, any
mean dependencies in the discretely sampled returns will systematically bias the quadratic
variation as an estimate of the true latent integrated volatility. In order to minimize this

bias we pre-whitened the returns by a first order autoregressive model, treating the residuals
from this model as the actual five-minute return series.[12] For a more detailed discussion
of the pertinent institutional arrangements and the pre-whitened five-minute Nikkei 225
returns, we refer to Andersen et al. (2000a).
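
The pre-whitening step can be sketched as follows. This is a minimal illustration on synthetic data, assuming an ordinary least squares AR(1) fit with an intercept over the pooled return series; the function name `prewhiten_ar1` and the simulated coefficient of 0.14 are hypothetical, and the exact fitting details used for the Nikkei 225 returns are not specified here.

```python
import numpy as np

def prewhiten_ar1(returns):
    """Fit r_t = c + phi * r_{t-1} + e_t by OLS; return (phi, residuals)."""
    r = np.asarray(returns, dtype=float)
    y, x = r[1:], r[:-1]
    X = np.column_stack([np.ones_like(x), x])      # intercept and lagged return
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    residuals = y - X @ coef
    return coef[1], residuals

# Synthetic five-minute returns with first order autocorrelation of 0.14.
rng = np.random.default_rng(1)
eps = rng.standard_normal(50_000)
r = np.empty_like(eps)
r[0] = eps[0]
for i in range(1, len(eps)):
    r[i] = 0.14 * r[i - 1] + eps[i]

phi_hat, resid = prewhiten_ar1(r)
resid_autocorr = np.corrcoef(resid[1:], resid[:-1])[0, 1]
print("estimated AR(1) coefficient:", phi_hat)
print("residual autocorrelation   :", resid_autocorr)
```

The residuals, rather than the raw returns, would then be squared and summed within each day to form the quadratic variation measure.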
    Next we transform the three five-minute return series into daily time series of integrated
volatilities, as approximated by the quadratic variations in equation (2). Table 3 provides
the standard set of summary statistics for each series. The means of the integrated volatility
for the two exchange rates imply an annualized standard deviation of approximately
11.5 percent, whereas the annualized volatility for the Japanese stock market equals 14.6
percent.[13] The standard deviations of the integrated volatilities are close to the mean for all
three markets. The higher order moments indicate extremely heavy tails and, most notably
in the case of the Yen/$ spot exchange rate, important skewness to the right. These
distributional features are confirmed by visual inspection of the time series plots in Figure 5. Each
of the panels also reveals a high degree of serial correlation in the integrated volatility series.
The next subsection presents the estimation results from the stochastic volatility model in
equation (3) explicitly designed to capture this volatility clustering effect.
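
The transformation from intraday returns to daily volatility measures, and the annualization described in footnote 13, can be sketched as follows. The data are synthetic, returns are measured in percent, and the function names and the 11.5 percent target are hypothetical choices made only for this illustration.

```python
import numpy as np

def daily_realized_variance(intraday_returns):
    """Sum of squared intraday returns for each day (rows are days)."""
    r = np.asarray(intraday_returns, dtype=float)
    return (r ** 2).sum(axis=1)

def annualized_std(daily_variances, trading_days=250):
    """Square root of 250 times the mean daily integrated volatility."""
    return np.sqrt(trading_days * np.mean(daily_variances))

# Synthetic panel: 1,000 days of 288 five-minute returns, in percent,
# scaled so that the implied annualized standard deviation is 11.5 percent.
rng = np.random.default_rng(2)
target_annual = 11.5
daily_var = target_annual ** 2 / 250
returns = rng.standard_normal((1000, 288)) * np.sqrt(daily_var / 288)

rv = daily_realized_variance(returns)
annual_vol = annualized_std(rv)
print("annualized standard deviation (percent):", annual_vol)
```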

4.2 Estimation Results
Before proceeding to the actual estimation results, we caution that the scalar volatility diffusion
in equation (3) is too simplistic to fully account for the complex dynamic dependencies
in the high-frequency return series. In particular, there are sound theoretical reasons to
expect at least two factors to affect the exchange rates (Bansal, 1997). Also,
a number of recent studies have argued for the empirical relevance of including multiple
factors and/or jump components when modeling equity index returns (e.g., Chacko and
Viceira, 1999; Andersen et al., 1999a; Chernov et al., 1999). Moreover, the model in equation
(3) does not incorporate the strong periodic dependencies in the volatility within the day
documented in several recent studies (see, e.g., Andersen and Bollerslev, 1997).[14] In spite
of these deficiencies, we feel that the square-root stochastic volatility model is rich enough
  12 The estimated AR(1) coefficient for the raw Nikkei 225 five-minute returns equals 0.1429. Details
concerning the model estimates based on the unadjusted five-minute Nikkei 225 returns are available on
request. The resulting daily integrated volatility series is slightly smoother, and the parameter estimates are
marginally lower in level, persistence, and variance.
  13 The annualized standard deviation is obtained by multiplying the mean of the daily integrated volatility
by 250 and taking the square root.
  14 Interestingly, the forecasting results in Andersen and Bollerslev (1998b) and Andersen et al. (2000a)
suggest that the influences of the intraday periodicities are effectively eliminated in the daily integrated
volatility measures utilized in the GMM estimation.

to offer a first meaningful empirical illustration of the applicability of the new estimation
procedure.
    The parameter estimates for the three series are reported in Table 4.[15] With the exception
of the slightly higher values for $\sigma$, the estimates are almost identical to the ones reported
here. As expected, the estimates for the long-run means, or $\theta$, are all fairly close to the sample
means for the three integrated volatility series reported in Table 3. Also, not surprisingly,
the estimates of the variance-of-variance parameter, or $\sigma$, have the largest standard errors
among all of the parameters. Meanwhile, the estimated mean reversion parameters, $\kappa$, are on
the high side relative to the values reported in the extant literature using more complicated
discrete time ARCH and stochastic volatility type formulations. Even though the GMM
omnibus test only rejects the model for the DM/$ exchange rate, the one-factor model
is obviously an oversimplification of the true dynamic dependencies for all three markets.
However, from an overall perspective, the estimation results in Table 4 are generally in line
with the simulation evidence reported in the previous section, and clearly suggest that the
new estimation procedure could effectively be employed in the empirical estimation of more
complicated continuous time diffusions.

5 Concluding Remarks
Exploiting closed form analytic expressions for the conditional moments of integrated volatility
coupled with highly accurate empirical quadratic variation measures constructed from
high-frequency data, we proposed a new class of GMM-type estimators for stochastic volatility
diffusions. In contrast to other computationally demanding estimation procedures routinely
used in the literature, such as the simulation based EMM and MCMC methods, the
GMM estimator developed here is very easy to implement, requiring only the solution to a
standard non-linear optimization problem. Our Monte Carlo evidence shows that the procedure
results in highly accurate parameter estimates and reliable statistical inferences in
realistic finite samples. In implementing the new estimator with actual five-minute rates of
return, our results confirm prior evidence in the literature concerning the existence of strong
volatility clustering at the inter-daily level.
    It would be interesting to extend the estimator developed here to more complicated
continuous time jump-diffusion and multi-factor diffusion processes. More ambitious empirical
applications might also entail the estimation of multivariate diffusions, which in turn would
  15 Details regarding the parameter estimates without the additive measurement error term are available
upon request.

require vector versions of the integrated volatility and quadratic variation measures exploited
here. Another interesting extension would be to use the distributional features of the
integrated volatility in pricing financial options, although this would necessitate additional
assumptions about the price of volatility risk. We leave further work along these lines for
future research.


A Conditional Moments of Integrated Volatilities
A.1 Conditional Mean of Integrated Volatility
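Because of the linear drift specification of the stochastic volatility, the conditional mean of
the integrated volatility can be shown to be a linear function of the point-in-time volatility,
\[
\begin{aligned}
E(\mathcal{V}_{t,T} \mid \mathcal{F}_t)
 &= E\left( \int_t^T V_s \, ds \,\Big|\, \mathcal{F}_t \right) \\
 &= \int_t^T E(V_s \mid \mathcal{F}_t) \, ds \\
 &= \int_t^T \left[ V_t e^{-\kappa(s-t)} + \theta\left(1 - e^{-\kappa(s-t)}\right) \right] ds \\
 &= V_t \frac{1}{\kappa}\left(1 - e^{-\kappa(T-t)}\right) + \theta(T-t) - \frac{\theta}{\kappa}\left(1 - e^{-\kappa(T-t)}\right) \\
 &= a_{T-t} V_t + b_{T-t},
\end{aligned}
\tag{A1}
\]
where $a_{T-t}$ and $b_{T-t}$ are functions of the drift parameters and the time difference $(T-t)$.
For notational simplicity we denote the parameters for the daily horizon, or $T-t=1$, by
$a \equiv \frac{1}{\kappa}(1 - e^{-\kappa})$ and $b \equiv \theta - \frac{\theta}{\kappa}(1 - e^{-\kappa})$. The above derivation explicitly uses the conditional
mean of the point-in-time volatility,
\[
E(V_T \mid \mathcal{F}_t) = V_t e^{-\kappa(T-t)} + \theta\left(1 - e^{-\kappa(T-t)}\right) = \alpha_{T-t} V_t + \beta_{T-t},
\tag{A2}
\]
where $\alpha_{T-t}$ and $\beta_{T-t}$ are also functions of the drift parameters and the time difference $(T-t)$.
Again, for $T-t=1$, we define $\alpha \equiv e^{-\kappa}$ and $\beta \equiv \theta(1 - e^{-\kappa})$.
   Focusing on the one-day horizon with $E(\mathcal{V}_{t+1,t+2} \mid \mathcal{F}_{t+1}) = aV_{t+1} + b$ and
$E(V_{t+1} \mid \mathcal{F}_t) = \alpha V_t + \beta$, it follows that
\[
\begin{aligned}
E[E(\mathcal{V}_{t+1,t+2} \mid \mathcal{F}_{t+1}) \mid \mathcal{F}_t]
 &= a E(V_{t+1} \mid \mathcal{F}_t) + b \\
 &= a(\alpha V_t + \beta) + b \\
 &= \alpha\left[E(\mathcal{V}_{t,t+1} \mid \mathcal{F}_t) - b\right] + a\beta + b,
\end{aligned}
\]
which, because $a\beta + b - \alpha b = \beta$, simplifies as
\[
E(\mathcal{V}_{t+1,t+2} \mid \mathcal{F}_t) = \alpha E(\mathcal{V}_{t,t+1} \mid \mathcal{F}_t) + \beta.
\]
Finally, by the Law of Iterated Expectations,
\[
E[E(\mathcal{V}_{t+1,t+2} \mid \mathcal{F}_t) \mid \mathcal{G}_t] = E(\mathcal{V}_{t+1,t+2} \mid \mathcal{G}_t) = \alpha E(\mathcal{V}_{t,t+1} \mid \mathcal{G}_t) + \beta.
\tag{A3}
\]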
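
The one-day recursion for the conditional mean can be checked numerically. The sketch below evaluates the daily coefficients of the conditional mean formulas for the integrated and point-in-time volatility and confirms that both routes to the conditional mean of next day's integrated volatility agree. The function name and the parameter values are hypothetical and chosen only for illustration.

```python
import numpy as np

def mean_coefficients(kappa, theta, tau=1.0):
    """Coefficients of the conditional means over a horizon tau = T - t.

    a, b       : E(integrated volatility over [t, T] | F_t) = a * V_t + b
    alpha, beta: E(V_T | F_t) = alpha * V_t + beta
    """
    decay = np.exp(-kappa * tau)
    a = (1.0 - decay) / kappa
    b = theta * tau - theta * a
    alpha = decay
    beta = theta * (1.0 - decay)
    return a, b, alpha, beta

kappa, theta = 0.1, 0.25          # hypothetical daily parameter values
a, b, alpha, beta = mean_coefficients(kappa, theta)
V_t = 0.4                         # hypothetical point-in-time volatility

# Route 1: condition on F_{t+1} first, then project V_{t+1} back to time t.
lhs = a * (alpha * V_t + beta) + b
# Route 2: the recursion alpha * E(V_{t,t+1} | F_t) + beta.
rhs = alpha * (a * V_t + b) + beta
print(lhs, rhs)
```

The agreement relies on the identity $a\beta + b - \alpha b = \beta$ at the one-day horizon, which is what allows the latent point-in-time volatility to be eliminated from the moment condition.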


A.2 Conditional Variance of Integrated Volatility
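By definition $\mathcal{V}_{t,T} = \int_t^T V_s \, ds$, and from equation (A1) $E(\mathcal{V}_{t,T} \mid \mathcal{F}_t) = a_{T-t}V_t + b_{T-t}$. The
stochastic differential equation (SDE) for $E(\mathcal{V}_{t,T} \mid \mathcal{F}_t)$ may therefore be generated as a
function of $V_t$ by applying Itô's formula to the affine diffusion in equation (3),[16]
\[
dE(\mathcal{V}_{t,T} \mid \mathcal{F}_t) = \left[ a_{T-t}\,\kappa(\theta - V_t) + \frac{\partial a_{T-t}}{\partial t} V_t + \frac{\partial b_{T-t}}{\partial t} \right] dt + a_{T-t}\,\sigma\sqrt{V_t}\,dW_t,
\]
which may be further simplified as
\[
dE(\mathcal{V}_{t,T} \mid \mathcal{F}_t) = -V_t\,dt + a_{T-t}\,\sigma\sqrt{V_t}\,dW_t.
\tag{A4}
\]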
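
The simplification of the drift term can be verified directly from the definitions of $a_{T-t}$ and $b_{T-t}$ in (A1). As a short worked step, differentiating with respect to $t$ gives
\[
\frac{\partial a_{T-t}}{\partial t} = -e^{-\kappa(T-t)}, \qquad
\frac{\partial b_{T-t}}{\partial t} = -\theta\left(1 - e^{-\kappa(T-t)}\right),
\]
so that, using $a_{T-t}\,\kappa = 1 - e^{-\kappa(T-t)}$,
\[
a_{T-t}\,\kappa(\theta - V_t) + \frac{\partial a_{T-t}}{\partial t} V_t + \frac{\partial b_{T-t}}{\partial t}
= \left(1 - e^{-\kappa(T-t)}\right)(\theta - V_t) - e^{-\kappa(T-t)} V_t - \theta\left(1 - e^{-\kappa(T-t)}\right)
= -V_t.
\]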
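Now fix the upper limit $T$, and let the lower limit $t$ be time varying. The Itô integral implied
by SDE (A4) then takes the form
\[
E(\mathcal{V}_{T,T} \mid \mathcal{F}_T) = E(\mathcal{V}_{t,T} \mid \mathcal{F}_t) + \int_t^T (-V_s)\,ds + \int_t^T a_{T-s}\,\sigma\sqrt{V_s}\,dW_s.
\]
However, $E(\mathcal{V}_{T,T} \mid \mathcal{F}_T) = 0$, which implies that
\[
\mathcal{V}_{t,T} - E(\mathcal{V}_{t,T} \mid \mathcal{F}_t) = \int_t^T a_{T-s}\,\sigma\sqrt{V_s}\,dW_s.
\]
Using standard arguments in stochastic calculus, it follows from the substitution of equation
(A2) that
\[
\begin{aligned}
Var(\mathcal{V}_{t,T} \mid \mathcal{F}_t)
 &= E\left[ \left( \mathcal{V}_{t,T} - E(\mathcal{V}_{t,T} \mid \mathcal{F}_t) \right)^2 \Big|\, \mathcal{F}_t \right] \\
 &= E\left\{ \left[ \int_t^T a_{T-s}\,\sigma\sqrt{V_s}\,dW_s \right]^2 \Big|\, \mathcal{F}_t \right\} \\
 &= \int_t^T a_{T-s}^2\,\sigma^2\,E(V_s \mid \mathcal{F}_t)\,ds \\
 &= \int_t^T a_{T-s}^2\,\sigma^2 \left[ \alpha_{s-t} V_t + \beta_{s-t} \right] ds \\
 &= A_{T-t} V_t + B_{T-t},
\end{aligned}
\tag{A5}
\]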
where
\[
A_{T-t} = \frac{\sigma^2}{\kappa^2} \left[ \frac{1}{\kappa} - 2e^{-\kappa(T-t)}(T-t) - \frac{1}{\kappa} e^{-2\kappa(T-t)} \right],
\]

  16 The simple version of Itô's Lemma for a smooth function $f(V_t, t, T) \in C^2$ of a diffusion process $V_t$ states
that
\[
df(V_t, t, T) = \left[ f_V(V_t, t, T)\,\mu(V_t, t) + f_t(V_t, t, T) + \tfrac{1}{2} f_{VV}(V_t, t, T)\,\sigma^2(V_t, t) \right] dt + f_V(V_t, t, T)\,\sigma(V_t, t)\,dW_t,
\]
where $\mu(V_t, t)$ and $\sigma(V_t, t)$ are the drift and diffusion functions defining the $V_t$ process.

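\[
\begin{aligned}
B_{T-t} &= \frac{\sigma^2}{\kappa^2} \left[ \theta(T-t)\left(1 + 2e^{-\kappa(T-t)}\right) - \frac{3\theta}{\kappa}\left(1 - e^{-\kappa(T-t)}\right) + \frac{\theta}{2\kappa}\left(1 - e^{-\kappa(T-t)}\right)^2 \right] \\
 &= \frac{\sigma^2}{\kappa^2} \left[ \theta(T-t)\left(1 + 2e^{-\kappa(T-t)}\right) + \frac{\theta}{2\kappa}\left(e^{-\kappa(T-t)} + 5\right)\left(e^{-\kappa(T-t)} - 1\right) \right].
\end{aligned}
\]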
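
As a rough numerical check of the closed forms for the two variance coefficients, the sketch below compares them with a direct trapezoidal evaluation of the integral of $a_{T-s}^2 \sigma^2 [\alpha_{s-t} V_t + \beta_{s-t}]$ over $[t, T]$. The function names, the grid size, and the parameter values are hypothetical and serve only as an illustration.

```python
import numpy as np

def variance_coefficients(kappa, theta, sigma, tau=1.0):
    """Closed-form A and B in Var(integrated volatility | F_t) = A * V_t + B."""
    decay = np.exp(-kappa * tau)
    scale = sigma ** 2 / kappa ** 2
    A = scale * (1.0 / kappa - 2.0 * decay * tau - decay ** 2 / kappa)
    B = scale * (theta * tau * (1.0 + 2.0 * decay)
                 + theta / (2.0 * kappa) * (decay + 5.0) * (decay - 1.0))
    return A, B

def variance_coefficients_numeric(kappa, theta, sigma, tau=1.0, n=200_001):
    """Same coefficients from trapezoidal integration over u = s - t in [0, tau]."""
    u = np.linspace(0.0, tau, n)
    a = (1.0 - np.exp(-kappa * (tau - u))) / kappa   # a_{T-s}
    alpha = np.exp(-kappa * u)                       # alpha_{s-t}
    beta = theta * (1.0 - alpha)                     # beta_{s-t}
    du = u[1] - u[0]

    def trapezoid(y):
        return du * (y[0] / 2.0 + y[1:-1].sum() + y[-1] / 2.0)

    return (trapezoid(sigma ** 2 * a ** 2 * alpha),
            trapezoid(sigma ** 2 * a ** 2 * beta))

kappa, theta, sigma = 0.5, 0.25, 0.3   # hypothetical parameter values
A, B = variance_coefficients(kappa, theta, sigma)
A_num, B_num = variance_coefficients_numeric(kappa, theta, sigma)
print(A, A_num)
print(B, B_num)
```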
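In particular, the conditional variance of the integrated volatility is a linear function of the
point-in-time volatility. It also follows from Cox et al. (1985) and equation (A2) above that
\[
\begin{aligned}
E(V_T^2 \mid \mathcal{F}_t)
 &= Var(V_T \mid \mathcal{F}_t) + \left[E(V_T \mid \mathcal{F}_t)\right]^2 \\
 &= \frac{\sigma^2}{\kappa} V_t \left( e^{-\kappa(T-t)} - e^{-2\kappa(T-t)} \right) + \frac{\sigma^2\theta}{2\kappa}\left(1 - e^{-\kappa(T-t)}\right)^2 + \left[\alpha_{T-t}V_t + \beta_{T-t}\right]^2 \\
 &= C_{T-t}V_t + D_{T-t} + \alpha_{T-t}^2 V_t^2 + \beta_{T-t}^2 + 2\alpha_{T-t}\beta_{T-t}V_t \\
 &= \alpha_{T-t}^2 V_t^2 + \left[C_{T-t} + 2\alpha_{T-t}\beta_{T-t}\right]V_t + \left[D_{T-t} + \beta_{T-t}^2\right],
\end{aligned}
\tag{A6}
\]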
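where
\[
C_{T-t} = \frac{\sigma^2}{\kappa}\left( e^{-\kappa(T-t)} - e^{-2\kappa(T-t)} \right), \qquad
D_{T-t} = \frac{\sigma^2\theta}{2\kappa}\left(1 - e^{-\kappa(T-t)}\right)^2.
\]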

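   Focusing on the one-day horizon, the conditional variance formula (A5), $Var(\mathcal{V}_{t,t+1} \mid \mathcal{F}_t) = AV_t + B$,
and the corresponding one-day conditional mean formula (A1), $E(\mathcal{V}_{t,t+1} \mid \mathcal{F}_t) = aV_t + b$,
imply that
\[
E(\mathcal{V}_{t,t+1}^2 \mid \mathcal{F}_t) = Var(\mathcal{V}_{t,t+1} \mid \mathcal{F}_t) + \left[E(\mathcal{V}_{t,t+1} \mid \mathcal{F}_t)\right]^2 = a^2 V_t^2 + (2ab + A)V_t + (b^2 + B).
\tag{A7}
\]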

Leading the arguments by one period and applying the Law of Iterated Expectations
immediately yields
\[
E[E(\mathcal{V}_{t+1,t+2}^2 \mid \mathcal{F}_{t+1}) \mid \mathcal{F}_t] = a^2 E(V_{t+1}^2 \mid \mathcal{F}_t) + (2ab + A)E(V_{t+1} \mid \mathcal{F}_t) + (b^2 + B).
\]

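Now substituting (A2) for $E(V_{t+1} \mid \mathcal{F}_t)$ and (A6) for $E(V_{t+1}^2 \mid \mathcal{F}_t)$, and then substituting out $V_t^2$
by (A7) and $V_t$ by (A1), it is clear that
\[
\begin{aligned}
E(\mathcal{V}_{t+1,t+2}^2 \mid \mathcal{F}_t)
 &= a^2\left[\alpha^2 V_t^2 + (C + 2\alpha\beta)V_t + (D + \beta^2)\right] + (2ab + A)(\alpha V_t + \beta) + (b^2 + B) \\
 &= a^2\alpha^2 V_t^2 + \left[a^2(C + 2\alpha\beta) + \alpha(2ab + A)\right]V_t \\
 &\quad + \left[a^2(D + \beta^2) + \beta(2ab + A) + (b^2 + B)\right] \\
 &= \alpha^2\left[E(\mathcal{V}_{t,t+1}^2 \mid \mathcal{F}_t) - (2ab + A)V_t - (b^2 + B)\right] \\
 &\quad + \left[a^2(C + 2\alpha\beta) + \alpha(2ab + A)\right]V_t \\
 &\quad + \left[a^2(D + \beta^2) + \beta(2ab + A) + (b^2 + B)\right] \\
 &= \alpha^2 E(\mathcal{V}_{t,t+1}^2 \mid \mathcal{F}_t) \\
 &\quad + \left[a^2(C + 2\alpha\beta) + (\alpha - \alpha^2)(2ab + A)\right]\frac{1}{a}\left[E(\mathcal{V}_{t,t+1} \mid \mathcal{F}_t) - b\right] \\
 &\quad + \left[a^2(D + \beta^2) + \beta(2ab + A) + (1 - \alpha^2)(b^2 + B)\right] \\
 &= \alpha^2 E(\mathcal{V}_{t,t+1}^2 \mid \mathcal{F}_t) \\
 &\quad + \frac{1}{a}\left[a^2(C + 2\alpha\beta) + (\alpha - \alpha^2)(2ab + A)\right]E(\mathcal{V}_{t,t+1} \mid \mathcal{F}_t) \\
 &\quad - \frac{b}{a}\left[a^2(C + 2\alpha\beta) + (\alpha - \alpha^2)(2ab + A)\right] \\
 &\quad + \left[a^2(D + \beta^2) + \beta(2ab + A) + (1 - \alpha^2)(b^2 + B)\right].
\end{aligned}
\tag{A8}
\]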
Lastly, applying the Law of Iterated Expectations to (A8) and changing the information set,
we have
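\[
\begin{aligned}
E\left[E(V^2_{t+1,t+2} \mid \mathcal{F}_t) \mid \mathcal{G}_t\right]
 &= E(V^2_{t+1,t+2} \mid \mathcal{G}_t) \\
 &= \alpha^2 E(V^2_{t,t+1} \mid \mathcal{G}_t)
    + \frac{1}{a}\left[a^2(C + 2\alpha\beta) + (\alpha - \alpha^2)(2ab + A)\right]
      E(V_{t,t+1} \mid \mathcal{G}_t) \\
 &\quad - \frac{b}{a}\left[a^2(C + 2\alpha\beta) + (\alpha - \alpha^2)(2ab + A)\right] \\
 &\quad + \left[a^2(D + \beta^2) + \beta(2ab + A) + (1 - \alpha^2)(b^2 + B)\right] \\
 &= H\,E(V^2_{t,t+1} \mid \mathcal{G}_t) + I\,E(V_{t,t+1} \mid \mathcal{G}_t) + J
\end{aligned}
\tag{A9}
\]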
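where $H = \alpha^2$, $I = \frac{1}{a}\left[a^2(C + 2\alpha\beta) + (\alpha - \alpha^2)(2ab + A)\right]$, and
$J = -\frac{b}{a}\left[a^2(C + 2\alpha\beta) + (\alpha - \alpha^2)(2ab + A)\right] + \left[a^2(D + \beta^2) + \beta(2ab + A) + (1 - \alpha^2)(b^2 + B)\right]$.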
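The algebra leading to (A9) can be checked numerically. The following is only a sketch, not the estimation code used in the paper. It assumes the affine forms that the coefficients suggest, namely E(V_{t,t+1}|F_t) = a V_t + b, Var(V_{t,t+1}|F_t) = A V_t + B, E(V_{t+1}|F_t) = alpha V_t + beta, and Var(V_{t+1}|F_t) = C V_t + D. The function names and the coefficient values are hypothetical and chosen purely for illustration.

```python
def hij(alpha, beta, a, b, A, B, C, D):
    """Coefficients H, I, J of the second-moment recursion in (A9)."""
    slope = a**2 * (C + 2 * alpha * beta) + (alpha - alpha**2) * (2 * a * b + A)
    H = alpha**2
    I = slope / a
    J = (-b / a * slope
         + a**2 * (D + beta**2)
         + beta * (2 * a * b + A)
         + (1 - alpha**2) * (b**2 + B))
    return H, I, J


def direct(V, alpha, beta, a, b, A, B, C, D):
    """E(V^2_{t+1,t+2} | F_t), conditioning on F_{t+1} first (assumed affine forms)."""
    m1 = alpha * V + beta            # E(V_{t+1} | F_t)
    m2 = C * V + D + m1**2           # E(V_{t+1}^2 | F_t)
    return a**2 * m2 + (2 * a * b + A) * m1 + (b**2 + B)


def recursive(V, alpha, beta, a, b, A, B, C, D):
    """The same moment written as H*E(V^2_{t,t+1}|F_t) + I*E(V_{t,t+1}|F_t) + J."""
    H, I, J = hij(alpha, beta, a, b, A, B, C, D)
    e1 = a * V + b                   # E(V_{t,t+1} | F_t)
    e2 = A * V + B + e1**2           # E(V^2_{t,t+1} | F_t)
    return H * e2 + I * e1 + J


# Arbitrary illustrative coefficients (not estimates from the paper).
coef = dict(alpha=0.9, beta=0.025, a=0.95, b=0.0125,
            A=0.01, B=0.0002, C=0.009, D=0.0001)
gap = max(abs(direct(V, **coef) - recursive(V, **coef)) for V in (0.05, 0.25, 1.0))
print(gap)
```

The two routes agree up to floating-point rounding for any spot variance V_t, which is what allows the latent V_t to be eliminated from the moment condition.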


References
Aït-Sahalia, Yacine (1996), "Nonparametric Pricing of Interest Rate Derivatives,"
  Econometrica, vol. 64, 527–560.
Aït-Sahalia, Yacine (1998), "Maximum Likelihood Estimation of Discretely Sampled
  Diffusions: A Closed-Form Approach," Working Paper, Department of Economics,
  Princeton University.
Alizadeh, Sassan, Michael W. Brandt, and Francis X. Diebold (1999), "Range-Based
  Estimation of Stochastic Volatility Models," Working Paper, Department of Finance, NYU.
Andersen, Torben G., Luca Benzoni, and Jesper Lund (1999a), "Estimating Jump-Diffusions
  for Equity Returns," Working Paper, Kellogg Graduate School of Management,
  Northwestern University.
Andersen, Torben G. and Tim Bollerslev (1997), "Intraday Periodicity and Volatility
  Persistence in Financial Markets," Journal of Empirical Finance, vol. 4, 115–158.
Andersen, Torben G. and Tim Bollerslev (1998a), "Answering the Skeptics: Yes, Standard
  Volatility Models Do Provide Accurate Forecasts," International Economic Review,
  vol. 39, 885–905.
Andersen, Torben G. and Tim Bollerslev (1998b), "DM-Dollar Volatility: Intraday Activity
  Patterns, Macroeconomic Announcements, and Longer-Run Dependencies," Journal of
  Finance, vol. 53, 219–265.
Andersen, Torben G., Tim Bollerslev, and Jun Cai (2000a), "Intraday and Interday
  Volatility in the Nikkei 225 Index," Journal of International Financial Markets,
  Institutions & Money, forthcoming.
Andersen, Torben G., Tim Bollerslev, Francis X. Diebold, and Heiko Ebens (2000b), "The
  Distribution of Stock Return Volatility," Working Paper, Department of Economics,
  Duke University.
Andersen, Torben G., Tim Bollerslev, Francis X. Diebold, and Paul Labys (1999b), "The
  Distribution of Exchange Rate Volatility," NBER Working Paper, No. 6961.


Andersen, Torben G., Tim Bollerslev, Francis X. Diebold, and Paul Labys (2000c),
  "Microstructure Bias and Volatility Signatures," Work in Progress, Department of
  Economics, Duke University.
Andersen, Torben G., Hyung-Jin Chung, and Bent E. Sørensen (1999c), "Efficient Method
  of Moments Estimation of a Stochastic Volatility Model: A Monte Carlo Study," Journal
  of Econometrics, vol. 91, 61–87.
Andersen, Torben G. and Bent E. Sørensen (1996), "GMM Estimation of a Stochastic
  Volatility Model: A Monte Carlo Study," Journal of Business and Economic Statistics,
  vol. 14, 328–352.
Bai, Xuezheng, Jeffrey R. Russell, and George C. Tiao (1999), "Beyond Merton's Utopia:
  Effects of Non-Normality and Dependence on the Precision of Variance Estimates Using
  High-Frequency Financial Data," Working Paper, Graduate School of Business, University
  of Chicago.
Bandi, Federico M. and Peter C. B. Phillips (1999), "Econometric Estimation of Diffusion
  Models," Working Paper, Department of Economics, Yale University.
Bansal, Ravi (1997), "An Exploration of the Forward Premium Puzzle in Currency Market,"
  The Review of Financial Studies, vol. 10, 369–403.
Barndorff-Nielsen, Ole and Neil Shephard (1999), "Non-Gaussian OU Based Models and
  Some of Their Uses in Financial Economics," Working Paper, Nuffield College, Oxford
  University.
Bollerslev, Tim, Robert F. Engle, and Daniel B. Nelson (1994), "ARCH Models," in
  "Handbook of Econometrics," (edited by Engle, Robert F. and Daniel L. McFadden),
  vol. IV, Amsterdam: North-Holland.
Brandt, Michael W. and Pedro Santa-Clara (1999), "Simulated Likelihood Estimation of
  Multivariate Diffusions with an Application to Interest Rates and Exchange Rates with
  Stochastic Volatility," Working Paper, Wharton School, University of Pennsylvania.
Chacko, George and Luis M. Viceira (1999), "Spectral GMM Estimation of Continuous-Time
  Processes," Working Paper, Graduate School of Business, Harvard University.
Chan, Kalok, K. C. Chan, and G. Andrew Karolyi (1991), "Intraday Volatility in the Stock
  Index and Stock Index Futures Markets," Review of Financial Studies, vol. 4, 1161–1187.
Chernov, Mikhail, A. Ronald Gallant, Eric Ghysels, and George Tauchen (1999), "A New
  Class of Stochastic Volatility Models with Jumps: Theory and Estimation," Working
  Paper, Department of Economics, Duke University.
Conley, Tim, Lars Peter Hansen, Erzo Luttmer, and Jose Scheinkman (1997), "Short Term
  Interest Rates as Subordinated Diffusions," Review of Financial Studies, vol. 10, 525–578.
Cox, John C., Jonathan E. Ingersoll, and Stephen A. Ross (1985), "A Theory of the Term
  Structure of Interest Rates," Econometrica, vol. 53, 385–407.
Dai, Qiang and Kenneth J. Singleton (2000), "Specification Analysis of Affine Term
  Structure Models," Journal of Finance, forthcoming.
Duffie, Darrell and Rui Kan (1996), "A Yield-Factor Model of Interest Rates,"
  Mathematical Finance, vol. 6, 379–406.
Duffie, Darrell and Kenneth Singleton (1993), "Simulated Moments Estimation of Markov
  Models of Asset Prices," Econometrica, vol. 61, 929–952.
Elerian, Ola, Siddhartha Chib, and Neil Shephard (1998), "Likelihood Inference for
  Discretely Observed Non-Linear Diffusions," Working Paper, Nuffield College, Oxford
  University.
Engle, Robert F. and Gary G. J. Lee (1997), "Estimating Diffusion Models of Stochastic
  Volatility," in "Modeling Stock Market Volatility: Bridging the Gap to Continuous Time,"
  (edited by Rossi, Peter E.), Academic Press, New York.
Eraker, Bjorn (1998), "MCMC Analysis of Diffusion Models with Application to Finance,"
  Working Paper, Graduate School of Business, University of Chicago.
Fisher, Mark and Christian Gilles (1996), "Estimating Exponential Affine Models of the
  Term Structure," Working Paper.
Gallant, A. Ronald, Chien-Te Hsu, and George Tauchen (1999), "Using Daily Range Data
  to Calibrate Volatility Diffusions and Extract the Forward Integrated Variance," Review
  of Economics and Statistics, vol. 81, 617–631.
Gallant, A. Ronald and Jonathan R. Long (1997), "Estimating Stochastic Differential
  Equations Efficiently by Minimum Chi-Square," Biometrika, vol. 84, 125–141.

Gallant, A. Ronald and George E. Tauchen (1996), "Which Moments to Match?" Econometric
  Theory, vol. 12, 657–681.
Ghysels, Eric, Andrew Harvey, and Eric Renault (1996), "Stochastic Volatility," in
  "Handbook of Statistics Vol 14., Statistical Method in Finance," (edited by Maddala,
  G. S.), Amsterdam: North-Holland.
Gourieroux, Christian, Alain Monfort, and Eric Renault (1993), "Indirect Inference,"
  Journal of Applied Econometrics, vol. 8, s85–s118.
Hansen, Lars Peter (1982), "Large Sample Properties of Generalized Method of Moments
  Estimators," Econometrica, vol. 50, 1029–1054.
Hansen, Lars Peter (1985), "A Method for Calculating Bounds on the Asymptotic
  Covariance Matrices of Generalized Method of Moments Estimators," Journal of
  Econometrics, vol. 30, 203–238.
Hansen, Lars Peter, John Heaton, and Amir Yaron (1996), "Finite-Sample Properties of
  Some Alternative GMM Estimators," Journal of Business and Economic Statistics, vol. 14,
  262–280.
Hansen, Lars Peter, John C. Heaton, and Masao Ogaki (1988), "Efficiency Bounds Implied
  by Multiperiod Conditional Moment Restrictions," Journal of the American Statistical
  Association, vol. 83, 863–871.
Hansen, Lars Peter and Jose Alexandre Scheinkman (1995), "Back to the Future: Generating
  Moment Implications for Continuous-Time Markov Processes," Econometrica, vol. 63,
  767–804.
Heston, Steven (1993), "A Closed-Form Solution for Options with Stochastic Volatility with
  Applications to Bond and Currency Options," Review of Financial Studies, vol. 6,
  327–343.
Hull, John and Alan White (1987), "The Pricing of Options on Assets with Stochastic
  Volatility," Journal of Finance, vol. 42, 281–300.
Jacquier, Eric, Nicholas G. Polson, and Peter E. Rossi (1994), "Bayesian Analysis of
  Stochastic Volatility Models," Journal of Business and Economic Statistics, vol. 12,
  371–389.


Karatzas, Ioannis and Steven E. Shreve (1997), Brownian Motion and Stochastic Calculus,
  Springer-Verlag.
Kim, S., Neil Shephard, and Siddhartha Chib (1998), "Stochastic Volatility: Likelihood
  Inference and Comparison with ARCH Models," Review of Economic Studies, vol. 65,
  361–394.
Kloeden, Peter E. and Eckhard Platen (1992), Numerical Solution of Stochastic
  Differential Equations, Applications of Mathematics, Springer-Verlag.
Ledoit, Olivier and Pedro Santa-Clara (1999), "Relative Pricing of Options with Stochastic
  Volatility," Working Paper, Anderson Graduate School of Management, UCLA.
Lo, Andrew W. (1988), "Maximum Likelihood Estimation of Generalized Itô Processes with
  Discretely Sampled Data," Econometric Theory, vol. 4, 231–247.
Lo, Andrew W. and A. Craig MacKinlay (1990), "An Econometric Analysis of
  Nonsynchronous Trading," Journal of Econometrics, vol. 45, 181–212.
Merton, Robert C. (1980), "On Estimating the Expected Return on the Market," Journal
  of Financial Economics, vol. 8, 323–361.
Nelson, Daniel B. (1992), "Filtering and Forecasting with Misspecified ARCH Models I:
  Getting the Right Variance with the Wrong Model," Journal of Econometrics, vol. 52,
  61–90.
Nelson, Daniel B. and Dean P. Foster (1994), "Asymptotic Filtering Theory for Univariate
  ARCH Models," Econometrica, vol. 62, 1–41.
Newey, Whitney K. and Kenneth D. West (1987), "A Simple Positive Semi-Definite,
  Heteroskedasticity and Autocorrelation Consistent Covariance Matrix," Econometrica,
  vol. 55, 703–708.
Pritsker, Matt G. (1998), "Nonparametric Density Estimation and Tests of Continuous Time
  Interest Rate Models," Review of Financial Studies, vol. 11, 449–487.
Protter, Philip (1992), Stochastic Integration and Differential Equations: A New Approach,
  Springer-Verlag.


Singleton, Kenneth (1999), "Estimation of Affine Asset Pricing Models Using the Empirical
  Characteristic Function," Working Paper, Graduate School of Business, Stanford
  University.
Stanton, Richard (1997), "A Nonparametric Model of Term Structure Dynamics and the
  Market Price of Interest Rate Risk," Journal of Finance, vol. 52, 1973–2002.
Tauchen, George (1986), "Statistical Properties of Generalized Methods-of-Moments
  Estimators of Structural Parameters Obtained from Financial Market Data," Journal of
  Business and Economic Statistics, vol. 4, 397–416.


B Tables and Figures
                                         Table 1
                                  Monte Carlo Experiment
                                         Panel A
                          Mean                 Median                 RMSE
    True Value     T = 1000  T = 4000   T = 1000  T = 4000   T = 1000  T = 4000
              GMM with Quadratic Variation from High-Frequency Return
    κ = 0.03         0.0352    0.0313     0.0340    0.0310     0.0130    0.0054
    θ = 0.25         0.2430    0.2487     0.2355    0.2460     0.0523    0.0258
    σ = 0.10         0.1016    0.1030     0.1018    0.1030     0.0080    0.0050
                          GMM with Integrated Volatility
    κ = 0.03         0.0382    0.0323     0.0374    0.0319     0.0139    0.0055
    θ = 0.25         0.2338    0.2456     0.2273    0.2437     0.0521    0.0257
    σ = 0.10         0.0992    0.0999     0.0992    0.0998     0.0044    0.0020
                         QML with Point-in-Time Volatility
    κ = 0.03         0.0446    0.0360     0.0434    0.0361     0.0195    0.0095
    θ = 0.25         0.2327    0.2441     0.2271    0.2410     0.0537    0.0290
    σ = 0.10         0.1012    0.1014     0.0999    0.1011     0.0095    0.0052
Note: The table reports the simulation results for the GMM and QML procedures discussed
in the main text applied in estimating the stochastic volatility diffusion in equation (3). The
total number of Monte Carlo replications is 1,000.
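As a reading aid for the Mean, Median, and RMSE columns, the sketch below shows how such summary statistics would typically be computed from a set of Monte Carlo estimates. It assumes the conventional definition of RMSE as the root mean squared deviation from the true parameter value. The function name and the four toy estimates are hypothetical and are not taken from the simulations reported here.

```python
import statistics


def summarize(estimates, true_value):
    """Return (mean, median, RMSE) of Monte Carlo estimates of one parameter."""
    mean = statistics.fmean(estimates)
    median = statistics.median(estimates)
    # RMSE is measured around the true value, so it reflects both bias and dispersion.
    mse = sum((e - true_value) ** 2 for e in estimates) / len(estimates)
    return mean, median, mse ** 0.5


# Toy estimates of a parameter whose true value is 0.03 (illustration only).
draws = [0.02, 0.03, 0.04, 0.05]
mean, median, rmse = summarize(draws, 0.03)
print(mean, median, rmse)
```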


                              Table 1 cont.
                                Panel B
                          Mean                 Median                 RMSE
    True Value     T = 1000  T = 4000   T = 1000  T = 4000   T = 1000  T = 4000
              GMM with Quadratic Variation from High-Frequency Return
    κ = 0.10         0.1057    0.1023     0.1048    0.1016     0.0214    0.0100
    θ = 0.25         0.2478    0.2491     0.2474    0.2489     0.0158    0.0078
    σ = 0.10         0.1059    0.1073     0.1061    0.1072     0.0093    0.0082
                          GMM with Integrated Volatility
    κ = 0.10         0.1102    0.1032     0.1090    0.1027     0.0214    0.0091
    θ = 0.25         0.2460    0.2486     0.2459    0.2483     0.0163    0.0078
    σ = 0.10         0.0994    0.1000     0.0995    0.0998     0.0042    0.0020
                         QML with Point-in-Time Volatility
    κ = 0.10         0.1136    0.1040     0.1134    0.1048     0.0259    0.0138
    θ = 0.25         0.2497    0.2517     0.2480    0.2510     0.0196    0.0097
    σ = 0.10         0.0967    0.0956     0.0967    0.0958     0.0059    0.0054


                              Table 1 cont.
                                Panel C
                          Mean                 Median                 RMSE
    True Value     T = 1000  T = 4000   T = 1000  T = 4000   T = 1000  T = 4000
              GMM with Quadratic Variation from High-Frequency Return
    κ = 0.10         0.1113    0.1035     0.1091    0.1035     0.0253    0.0111
    θ = 0.25         0.2389    0.2468     0.2364    0.2463     0.0326    0.0158
    σ = 0.20         0.2031    0.2051     0.2030    0.2049     0.0122    0.0078
                          GMM with Integrated Volatility
    κ = 0.10         0.1153    0.1048     0.1131    0.1047     0.0270    0.0114
    θ = 0.25         0.2346    0.2455     0.2319    0.2449     0.0341    0.0160
    σ = 0.20         0.1984    0.1997     0.1982    0.1995     0.0097    0.0046
                         QML with Point-in-Time Volatility
    κ = 0.10         0.1257    0.1093     0.1242    0.1107     0.0390    0.0208
    θ = 0.25         0.2459    0.2537     0.2432    0.2520     0.0336    0.0199
    σ = 0.20         0.1977    0.1960     0.1966    0.1958     0.0135    0.0084


                                        Table 2
                Monte Carlo Experiment with Measurement Error Correction

                            Mean                 Median                 RMSE
    True Value       T = 1000  T = 4000   T = 1000  T = 4000   T = 1000  T = 4000
                       Scenario A: GMM with Quadratic Variation
    κ = 0.03           0.0364    0.0317     0.0354    0.0315     0.0138    0.0056
    θ = 0.25           0.2456    0.2491     0.2384    0.2464     0.0520    0.0257
    σ = 0.10           0.0909    0.0994     0.0905    0.0983     0.0230    0.0127
    (correction term)  0.0007    0.0004     0.0006    0.0004     0.0008    0.0005
                       Scenario B: GMM with Quadratic Variation
    κ = 0.10           0.1067    0.1027     0.1061    0.1023     0.0219    0.0104
    θ = 0.25           0.2489    0.2494     0.2484    0.2492     0.0157    0.0078
    σ = 0.10           0.0990    0.1049     0.0986    0.1046     0.0214    0.0121
    (correction term)  0.0007    0.0004     0.0006    0.0003     0.0009    0.0005
                       Scenario C: GMM with Quadratic Variation
    κ = 0.10           0.1133    0.1042     0.1109    0.1043     0.0274    0.0119
    θ = 0.25           0.2435    0.2481     0.2400    0.2473     0.0314    0.0157
    σ = 0.20           0.1893    0.1999     0.1884    0.1987     0.0303    0.0162
    (correction term)  0.0017    0.0010     0.0015    0.0009     0.0019    0.0013
Note: The table reports the GMM estimation results obtained by including an additive
measurement error correction term in the moment conditions involving the squared
integrated volatility. The RMSE column for the correction term gives the sample standard
deviation across the 1,000 Monte Carlo replications.


                                        Table 3
                    Summary Statistics for Daily Integrated Volatility

                 Statistics   DM/$ Rate Yen/$ Rate Nikkei 225
                 Mean            0.5290     0.5383     0.8511
                 Std. Dev.       0.4839     0.5217     0.7757
                 Skewness        3.7083     5.5713     3.0203
                 Kurtosis       24.0505    66.6545    18.1780
                 Minimum         0.0517     0.0280     0.0309
                 5% Quant.       0.1384     0.1382     0.1494
                 25% Quant.      0.2542     0.2533     0.3681
                 Median          0.3990     0.4008     0.6479
                 75% Quant.      0.6252     0.6317     1.0782
                 95% Quant.      1.3450     1.3598     2.2491
                 Maximum         5.2453    10.0971     7.5651
                 Num. of Obs.      2445       2445        984
Note: The daily integrated volatilities are approximated by the quadratic variations
constructed from five-minute returns.
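The construction described in the note can be sketched as follows. This is a minimal illustration, assuming that the quadratic variation for a day is computed as the sum of squared intraday returns over that day. The function names, the number of returns per day, and the toy return series are hypothetical; in practice the number of five-minute intervals per day depends on the trading hours of the market in question.

```python
def realized_variance(intraday_returns):
    """Sum of squared intraday returns for a single day."""
    return sum(r * r for r in intraday_returns)


def daily_realized_variances(returns, per_day):
    """Split a flat return series into days of `per_day` returns and sum the squares."""
    if per_day <= 0 or len(returns) % per_day != 0:
        raise ValueError("series length must be a positive multiple of per_day")
    return [realized_variance(returns[i:i + per_day])
            for i in range(0, len(returns), per_day)]


# Toy series: two "days" of three returns each (percent returns, illustration only).
toy_returns = [0.1, -0.2, 0.1, 0.0, 0.3, -0.1]
daily = daily_realized_variances(toy_returns, per_day=3)
print(daily)
```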


                                      Table 4
                     GMM Estimation of Stochastic Volatility Model

                Parameter        DM/$ Rate Yen/$ Rate Nikkei 225
                Mean Reversion       0.1464       0.2472   0.1236
                (Standard Error)   (0.0387)     (0.0463) (0.0492)
                Long-run Mean        0.5172       0.5190   0.8312
                (Standard Error)   (0.0342)     (0.0240) (0.0950)
                Local Variance       0.5789       0.4242   0.1909
                (Standard Error)   (0.0580)     (0.1804) (0.3992)
                               GMM Specification Test
                Chi-Square (2)      12.1476       3.6182   0.8040
                p-Value              0.0023       0.1638   0.6690
Note: The GMM estimator and the specification test are described in Section 2. The daily
integrated volatilities are approximated by the quadratic variations from five-minute returns.
The variance-covariance matrix is estimated using a Newey-West weighting scheme with a
lag-length of five.
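The p-values in Table 4 can be reproduced from the reported test statistics. For a chi-square variate with two degrees of freedom the upper-tail probability has the closed form exp(-x/2), so no statistical library is needed. The short sketch below is only an illustration; the function name and the dictionary keys are chosen here for convenience.

```python
import math


def chi2_2df_pvalue(stat):
    """Upper-tail probability P(X > stat) for X ~ chi-square with 2 degrees of freedom."""
    if stat < 0:
        raise ValueError("a chi-square statistic cannot be negative")
    return math.exp(-stat / 2.0)


# Test statistics as reported in Table 4.
table4_stats = {"DM/$": 12.1476, "Yen/$": 3.6182, "Nikkei 225": 0.8040}
pvalues = {name: chi2_2df_pvalue(stat) for name, stat in table4_stats.items()}

for name, p in pvalues.items():
    print(f"{name}: p-value = {p:.4f}")
```

At the five percent level the overidentifying restrictions are rejected only for the DM/$ series.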


[Figure 1 omitted. Nine panels, one for each scenario (A, B, C) and parameter (κ, θ, σ);
horizontal axis from −5 to 5, vertical axis from 0 to 0.5.]

Figure 1: t-test Distributions. Dashed line: t-statistics with 1000 observations. Solid
line: t-statistics with 4000 observations. Dash-dotted line: Normal (0,1) reference density.

[Figure 2 omitted. Six panels: Scenarios A, B, and C, each for samples of 1000 and 4000
observations. Horizontal axis: Nominal Level of Test (0 to 100). Vertical axis: Percentage
of Rejection (0 to 100). Each panel plots a Rejection Curve against a Reference Curve.]

Figure 2: GMM Specification Test of Overidentifying Restrictions.

[Figure 3 omitted. Nine panels, one for each scenario (A, B, C) and parameter (κ, θ, σ);
horizontal axis from −5 to 5, vertical axis from 0 to 0.5.]

Figure 3: t-test Distributions with Measurement Error Correction. Dashed line: t-statistics
with 1000 observations. Solid line: t-statistics with 4000 observations. Dash-dotted line:
Normal (0,1) reference density.

                                         Scenario A: 1000 Sample                                                           Scenario A: 4000 Sample
                          100                                                                               100
Percentage of Rejection


                                                                                  Percentage of Rejection
                           80                                                                                80

                           60                                                                                60
                                                                                                                      Reference Curve
                           40 Reference Curve                                                                40

[Figure panels: Scenario B: 1000 Sample; Scenario B: 4000 Sample; Scenario C: 1000 Sample; Scenario C: 4000 Sample. Each panel plots Percentage of Rejection (0 to 100) against Nominal Level of Test (0 to 100) and shows a Rejection Curve alongside a Reference Curve.]

Figure 4: GMM Specification Test of Overidentifying Restrictions with Measurement Error Correction.

[Figure panels: DM/$ Exchange Rate (1987 to 1997); Yen/$ Exchange Rate (1987 to 1997); Nikkei 225 Stock Index (1994 to 1998). Each panel plots Daily Integrated Volatility (axis range 0 to 6) against Date.]

                                         Figure 5: Daily Integrated Volatility on Financial Markets.
