The Simultaneous Equations Model with Generalized Autoregressive Conditional Heteroskedasticity: The SEM-GARCH Model
Abstract
In this paper I generalize the standard simultaneous equations model by allowing the innovations of the structural equations to exhibit Generalized Autoregressive Conditional Heteroskedasticity ( GARCH ). I refer to this new specification as the SEM-GARCH model. I develop two estimation strategies: LIM-GARCH , a limited information estimator, and FIM-GARCH , a full information estimator. I show that these estimators are consistent and asymptotically normal. Following Weiss (1986) I show that when the errors in the SEM-GARCH process are incorrectly assumed to be conditionally normal the likelihood function is still maximized at the true parameters, given certain regularity conditions. This results in the asymptotic variance-covariance matrix being more complex than the usual inverse of the information matrix.
International Finance Discussion Paper Number 322.
May 1988
THE SIMULTANEOUS EQUATIONS MODEL WITH GENERALIZED AUTOREGRESSIVE CONDITIONAL HETEROSKEDASTICITY: THE SEM-GARCH MODEL
Richard Harmon
NOTE: International Finance Discussion Papers are preliminary materials circulated to stimulate discussion and critical comment. References in publications to International Finance Discussion Papers (other than an acknowledgment by a writer that he has had access to
unpublished material) should be cleared with the author or authors.
ABSTRACT
In this paper I generalize the standard simultaneous equations model by allowing the innovations of the structural equations to exhibit Generalized Autoregressive Conditional Heteroskedasticity (GARCH). I refer to this new specification as the SEM-GARCH model. I develop two estimation strategies: LIM-GARCH, a limited information estimator, and FIM-GARCH, a full informaticn estimator. I show that these estimators are consistent and asymptotically normal. Following Weiss (1986) I show that when the errors in the SEM-GARCH process are incorrectly assumed to be conditionally normal the likelihood function is still maximized at the true parameters, given certain regularity conditions. This results in the asymptotic
variance-covariance matrix being: more complex than the usual inverse of the
information matrix.
THE SIMULTANEOUS EQUATIONS MODEL WITH GENERALIZED AUTOREGRESSIVE CONDITIONAL HETEROSKEDASTICITY: THE SEM-GARCH MODEL
* Richard Harmon
In this paper I generalize the standard simultaneous equations model (SEM) by allowing the innovations of the structural equations to exhibit Generalized Autoregressive Conditional Heteroskedasticity (GARCH). A GARCH(p,q) process is a process whose conditional variance at time t is a function of the information available at time t-l. Specifically, it is a function of variances of past innovations and of the squared realizations of past innovations. Unconditionally, the current innovation reverts to the standard specification: white noise with fixed variances over time. I refer to this new specification as the SEM-GARCH model. I develop two estimation strategies: LIM-GARCH, a limited information estimator, and FIM-GARCH, a full information estimator. I show that these estimators are consistent and asymptotically normal.
The outline of the paper is as follows. Section I presents the unrestricted and diagonal SEM-GARCH(p,q) models. Sections II and III derive the LIM-GARCH estimator and its asymptotic properties. Sections IV
and V develop the FIM-GARCH estimator and its asymptotic properties.
*I am grateful to James Albrecht, Dale W. Henderson, and especially Dan Westbrook for their patience and guidance throughout the course of research embodied in this paper. Helpful suggestions are also appreciated from Frank Diebold, Neil Ericsson, David Howard, Ralph Tryon and the participants of a seminar at the Federal Reserve Board. This work was completed while I was an intern in the International Finance Division of the Federal Reserve Board. This paper represents the views of the author and should not be interpreted as reflecting the view of the Board of Governors of the Federal Reserve System. Unfortunately, any remaining errors are my responsibility.
Finally in Section VI the Information Matrix Test of White (1982) is used to ascertain the correct form of the variance-covariance matrix and test for misspecification. This is followed by some concluding remarks. I. THE SEM-GARCH(p,q) MODEL
The Autoregressive Conditional Heteroskedastic model (ARCH) developed by Engle (1982) and its generalization, the GARCH model, developed by Bollerslev (1986) specifies the conditional variance of the current innovation as a function of the available information set; specifically,
the conditional variance is a function of the squared realizations of past
innovations and of variances of past innovations. In the GARCH(‘p,q) model, the conditional variance, denoted ne, has the following specification q P (1.1) he = vw + ae - + & he : t ° i-t-i : i t-i i=l i=l
Kraft and Engle (1982) extend the ARCH model into a multivariate time series framework. Extensions with respect to multivariate GARCH models are given by Bollerslev, Engle, and Wooldridge (1985). The SEM-'SARCH model, to be specified below, generalizes the standard SEM by allowing the innovations of the structural equations to exhibit GARCH processes. This type of specification has many potential applications. For example, in models of foreign exchange rate determination it can be used to model the joint determination of the foreign exchange rate with domestic and foreign
interest rates.
The standard SEM consists of M linear equations: (1.2) yo + XB ~ €& t=$1,...,T,
where T is a M x M matrix of coefficients of current endogenous variables,
B is a K x M matrix of coefficients of predetermined variables, ee is
row vector consisting of M innovations at time t, and Y, and x. are row
a
vectors consisting of observations on M endogenous variables and K
predetermined variables, respectively, at time t. It is assumed that the
vector of innovations, ¢€ is given by
t’
(1.3) e.7 N(O,z) ,
where = is nonsingular. The likelihood function for the sample Y Y
porte conditional on X is given by
(1.4) LOYy yes Vp hR) = (2m M2 | |Taf 7/7 ii -1 exp|- 3 = (v0 + X,A)= (YF + x,p)'| i-1
Leaving aside the variance-covariance matrix 2, which has M(M+1)/2 distinct elements, the likelihood function defined by (1.4) has a + MK parameters. Clearly, without a priori restrictions onTI, B, or X, none of
the parameters of the structural model are identified.
For now I restrict my attention to a model whose predetermined variables are exogenous; dynamic linear SEM remain a topic for future research. Following Kraft and Engle's multivariate ARCH specification,
the SEM-GARCH(p,q) model is given by
(1.5) YF + X86 = «€, (1.6) Ce.[T,3) ~- NOH.) , where Hae Maae oe ime (1.7) H = H,. H H
t 21,t §22,t °"" 2M,t
Hae Hye t ee Ha
= W+ Ee @ cea ]ey[ty @ 4 tose + [H ® cealal te ® co
e
+ [1 @ b_4)>, [* ®@ | ticee + [1 @ be PPa| ®@ h._,| ,
where fey and hey are M-element row vectors consisting of ith lagged innovations and ith lagged conditional standard deviations. Y, and x. are row vectors consisting of M endogenous variables and K _ exogenous variables, respectively. Hist (i,j=-1,...,M) is a scalar and represents the conditional variance of the ith equation when i=j and the conditional
covariance between the ith and jth equations when i»j. The primary
restriction imposed on H. is that it must be’ bounded = and positive-definite. C and D are Mw x m2 symmetric matrices with M x M symmetric blocks of Ci, and Diy? respectively. W, a M x M scalar matrix, nests the hypothesis of homoskedastic ey within the SEM-GARCH specification and allows the conditional variance-covariance matrix to exist when all C and D are zero.
While I explicitly assume that the conditional distribution of the innovations is normal, in many circumstances this may not be the case. If the conditional probability model is non-normal, then the estimators proposecl here are quasi-maximum likelihood estimators (QMLE). White (1982) notes that the asymptotic variance-covariance matrix of the QMLE no longer equals the inverse of Fisher's Information matrix, but can be consistently estimated by a more complex form. For expository purposes I will continue to assume that the conditional probability model is correctly specified and explore the consequences of it being incorrectly
specified when the asymptotic properties of the estimates are derived.
The following assumptions will be made throughout:
= Q , where Q is a finite and nonsingular matrix,
(4) . pain
is,
where = is a bounded and positive-definite unconditional variance-covariance matrix.!
A alternative parameterization of the SEM-GARCH(p,q) inodel facilitates estimation and highlights the alternative variance-covariance specifications available. Let h, be the M(M+1)/2 column vector whose components are the unique elements in He taken by vectorizing the lower triangle of H.- For expository purposes I will focus on_ the
SEM-GARCH(1,1) model in a two equation system (M=2). Here the conditional
variance-covariance matrix is defined to be
(1.8) H, =
+ [yen |r, [nen].
where Cc) and Dd, are 4 x 4 parameter matrices given by
tan excellent discussion of Central Limit Theory for the non-i.i.d. case is provided in White (1984) Ch.5:
(1.9)
(1.10)
1 %12 : “13 “14 “o1 S22 : S23 S24 Cay S39 + &33 O34 Cc. C,:6,. ¢
Dir P12 P13 Pe Poa Daa: O23 Dee. D31 P32 : 233° P34 Dar Pa2 : Pas Pua
Note that the symmetry of C and D require that Co - Cho: C53 - Chay: Ca =
Cao)
(o =
43 ~ ©34°
Diary Pos = Paar Par 7 P32 and Pus = Pgy- The
alternative parameterization consists of vectorizing the lower triangle of
He»
* denot:ed h,,
(1.11)
h*
Ait Hort Hoot w A A A e2 11 11 “12 413 1,t-1 vor | + | Aor Aca Aas |} €2,e-161,e-2 | 52 Az, Azo 433 | Oe
11 Bia Bas |} Baa, e-2 91 Boo Boz |} Horta. | °
31 332 333 |} Boo,e-4
which in matrix notation can be written as
1.12) ht + A + Bh. (1.12) t 7 M1 tl
where 1-47 vec(es 1&1) and f.47 (ey ea? > 1° This setup will be referred to as the "“unrestricted" SEM-GARCH(1,1) model since no restrictions are imposed on matrices A and B except those required to ensure the positive definiteness of H,- Expansion of (1.11) highlights
why this specification is referred to as "unrestricted",
2 (1.13) yy ge = yt Agate * Are (€2,e-281,e-2? + A ez + B..H + B..H 13°2,t-1 18a. t-1 12491 t-1 + By 3Hoo t-1 (1.14) 4 ~ vw. + Ane +a 21,t 21 21°1,t-1 222, t-1°1,t-L. + Ae + B,.H + B,.H 23°2,t-1 21841, t-1 22o1 t-1 +
Bo 342 t-1 ?
. 2 (1-15) Hoa tg 7 Man + Agfa eed * Age€o t-181 ev + Anne? + BH + BoH 33°2,t-1 31°11,t-1 32°21,t-1 + B33Hoo ta °
As shown in Table 1, the conditional variance-covariance matrix of a two equation SEM-GARCH(1,1) model has 21 free parameters. While there should be no problem estimating the parameters of this small system, the number of parameters rapidly increases as the system gets larger. Table 1 shows that a five equation SEM-GARCH(1,1) model has 465 parameters. Many time se‘ries data sets do not have enough observations for estimation; clearly a more parsimonious specification is required.
One such specification models the conditional variance of each equation as a function of its own lagged squared innovations and lagged conditional variances. Similarly, the conditional covariances can be modelled’ as functions of the lagged cross-innovations and lagged conditional covariances. Imposing this structure on the SEM-GARCH(1,1)
model yields (1.8) with Cc) and D) given by
11 14 0 0 :C¢ 0
(1.16) Cc) - beceeeeeeet es’? seen 0 C35 0 0 Cc 0 0 oC
GARCH
Model
(1,1)
(2,2)
(3,3)
Table 1
THE SEM-GAR‘ MODE
The Conditional Variance-Covariance Matrix
Equations Unrestricted” Diagonal” 2 21 9 3 ee: 18 4 210 30 5 465 45 2 ; 39 4s 3 150 30 4 410 50 5 915 75 2 | 57 21 3 222 42 4 610 70 5 . 1365 105
* Unrestricted Var-Cov: # = [M(M+1) /2]2(p+q) + [M(M+1)/2]
M
**k Diagonal Var-Cov: # = (l+ptq) Zi where M is the
number of equations in the system.
10
Dd, 0 0 Ply 0 oOo :D 0
(47) Dp os Jee. 238 0 Ds: 0 0
Di 9 0 Dyy
where symmetry implies Co3 - Cha: Cha - C35: Do3 - Diy? and Dal - D35-
* Under the more parsimonious parameterization, h, becomes
H
1l,t 1.18 h* H (1.18) t 21,t Hoot A 0 oO 2 “11 | “11 “1,t-1 - | %1] + 0 Ag, 0 £9 e-191,t+1 2 “99 0 0 433 9 t-1 Bi, 9 820 Aya t + 0 Boy 0 Hor t-1 0 90 B33 tt Boot Expanding (1.18) yields (1.19) H.. . = w + Ace . + BLAH 11,¢ 11 11°1,t-1 117q1 e-1 °
11
(1.20) H = w + A
21,t + B,.H
21 2262, t-1°1,t-1 22°21,t-1 ’
(1.21) H =
22,t Yoo + A
+ By 3Ho 1
2 33°2,t-1 Specification (1.18) is referred to as the "diagonal" SEM-GARCH(1,1) model. Table 1 summarizes the parametric requirements of the full and diagonal SEM-GARCH(1,1) model. For the SEM-GARCH(1,1) model in a two equation system the number of parameters is reduced from 21 to 9. For a five equation system the number of parameters is reduced from 465 to only 45.
As stated earlier, H, is required to be bounded and positive
t definite. For the SEM-GARCH(1, 1) model this requires that
(1.22) Ait
and
Ww Ol.
(1-23) 0 Aya Hoo t 7 Bore
Clearly, a sufficient condition for He to be positive definite is having every term in w, A, and B be positive definite. A less stringent sufficient condition is established by setting A=B=0 which from (1.22) and
(1.23) implies
12
| 2 (1.24) 14 > 0, Woo >0 and 11 ¥o9 — Wo >o. Similarly, setting w=B=0 yields 2 (1.25) All >o, Ayo > 0 and Ay y409 - Aoy >0 Alternatively, setting w=A=0 yields 2 (1.26) Bit >o, Boo >0 and Bi 1359 ad Boy >o.
For higher order SEM-GARCH models the parameter constraints to ensure the positive definiteness of He are extremely complicated. In those cases, one might impose penalty functions to ensure that H. is positive
definite.
Il. ‘THE LIM-GARCH ESTIMATOR
Since the reduced form innovations involve linear combinations of all the structural innovations, my focus is on structural form estimation to facilitate the conditional heteroskedasticity specification. I develop two estimation strategies for the SEM-GARCH(p,q) model. The first is a limited information approach that concentrates upon a single equation of the simultaneous equations system while disregarding the parametric restrictions that bind the system as a whole. The second is a full system
estimation that makes efficient use of all available information. The
13
limited information approach is useful when the full model is too complex to be estimated by a full information technique or when one suspects specification errors in equations other than the equation of primary interest.”
As described in Section I, the structural model consist:s of M
equations with a single equation, say the first, given by
(2.1) YI. + X84 7 eee
where Ty By: and «¢ are the first columns of I, #, and e,
1t respectively. The variables are arranged so that the usual identifying
restrictions may be shown by the following partitions:
and
The number of included and excluded endogenous variables in the first
2The LIM-GARCH estimator can be viewed as a particular case o FIM-GARCH where the other M-1 equations are just identified and have non-ARCH innovations... ,
14
* equatiion are denoted by m) and m, (= M - m,); respectively. The numbers
*
of included and excluded exogenous variables are k, and k, (= K - k))-. Hence, a, and b, are column vectors with m and ky elements, respectively. The matrices of endogeneous and exogenous variables are partitioned in
(2.3) to correspond to the partitioning of the coefficient vectors in
(2.2). The usual order and rank conditions for identification of the
first equation are given by
* (Order Condition): k) = m ~ 1 (Rank Condition) : Rank(, A, ) -M-1
where: >, is a R X (MtK) selection matrix composed of zeros and ones, and Ay = (T'y> B')) = (at : 0’ : be : 0’)’ is a (M+K)—element column vector composed of all the parameters of the system.
The limited information approach to estimation of the SEM-GARCH(p,q) model is referred to as the LIM-GARCH estimator. Following the standard derivation of the LIML estimator, [c.f. Koopmans, Rubin, and Leipnik (1950), Dhrymes (1970):328-357 and Schmidt (1976):184-195], I begin with the conditional log-likelihood function of the LIM-GARCH estimator for the
structural equation defined in (2.1):
(2.4) L(a,,.b,.v 0,641.4) = t
(2.
where
(2.
(2
(2.
(2.
(2.
(2.
(2.
Note that I
X.. The innovations are assumed to be conditionally normal distributed
t
with the conditional variance, hi following the GARCH(p,q) specification
5)
6)
7)
8)
9)
10)
11)
12)
denotes information available at time t-l, including Y. and
of Bollerslev (1986).
1
2 1 ' 1 plog(h; .) + plog(a W
OW 11, to?
1 ~2 3] 1%. + Xb Ea. FR ePPae |
= Pee)
' -l,, (1 — K,(KEX,) XD)
m4 1 - =| 1n(2r) +1 | + Al 1- 1n|w, | | ;
(YePLY) ) ~ NCO, bh?) t-1 y 1t’% ’ a 2 P42 w + La,e + = 6,h , ° j=l i1,t-i i-1 i1,t-i Yuet. + Xi ,>,
P. is a symmetric and idempotent projection matrix,
16
We is the second moment matrix of residuals of the least squares estimate of the reduced form of the entire system and Wait is the sub-matrix of We pertaining to the included endogenous variables, Y,,.
It should be noted that the standard LIML and 2SLS estimators are consistent due to the conditions (i) — (iii) provided in Section I. The primary benefit of the LIM-GARCH estimator, as with the ARCH and GARCH models, is in terms of efficiency.
Since the conditional log-likelihood function L depends on _ the parameters ao» bo» vw» @ and § in a nonlinear fashion, maximization of L(a,,b,,w,,0,5|I, 1) requires an iterative technique. Maximum Likelihood (ML) estimates of the parameters ao: b5: wo @ and § are derived from the first order conditions of (2.4). These derivatives, given in the Appendix, have a complex recursive structure making it extremely difficult to derive compact analytic expressions. While analytic expressions are in general the desired path to pursue, they are very inflexible to changes in specification and computationally extremely burdensome. Therefore, I rely
on numerical derivatives in the actual estimation procedure.
III. ASYMPTOTIC PROPERTIES OF THE LIM-GARCH ESTIMATOR
This section explores the behavior of the LIM-GARCH estimator in large samples .> In the previous sections conditional normality was explicity assumed. But now, following White (1982) and Weiss (1986), this
assumption is relaxed. As with most QMLE, the likelihood function is
;
“This section is based on Weiss (1986) who derives the asymptotic properties of an extended ARCH model in the context of a dynamic linear regression model with moving average errors.
17
derived as though the innovations are, in fact, conditionally normal. In Theorem I it is shown that in the limit the conditional log-likelihood function is maximized at the true parameters even though the assumption of normality may not be valid. Theorems II and III verify the consistency and asymptotic normality properties of the LIM-GARCH’ estimates, respectively.
As described in Section II, the LIM-GARCH specification involves maximizing the conditional log-likelihood function given by equation (2.4). Throughout this section let #@ to be an s-element column vector (s = m, +k, +p+qt1) containing all the parameters of the 1IM-GARCH
specification, that is,
(3.1) gp’ = (m', v') = (ag, bo, wy, a’, 6")
where @ is partitioned such that m is the parameter vector corresponding to the structural equation under investigation, and v is the parameter vector of the conditional variance specification, given by equation (2.11). Furthermore, I assume # € =, where = is a compact sutspace of Euclidean space, and let o, represent the true parameter vector.
First, a set of lemmas is required to provide the foundation for the ensuing theorems. Lemmas I and II require that certain matrices of
partial derivatives be well defined and positive definite.
18
LEMMA T: For all @ € B, there exists a constant M <o , not depending on @ such that de de (3.2) E _it it < M, dm dm’ and
de de (3.3) Det EI—2= —1t] 5 0. aman!
Proof: See Appendix.
An equivalent requirement for the conditional variance hi, is given
by Lemma II.
LEMMA IT:
Assume that the fourth moment of €1t exists and is bounded.
Then for all 6 € 3, there exists a constant My <o@ , not depending
on. @, such that
de de (3.4) e}—* —#) < mw ov ov’
19
de de
(3.5) Det E|—LE —1t] 5 oo. ava’
Proof: See Appendix.
Lemmas I and II imply that the negative expected value of the matrix of second derivatives of the conditional log-likelihood function is
positive definite:
LEMMA III: Under the same conditions as Lemma II, there exists a constant
M, < © not depending on @ such that
T 22 (3.6) A = —g/9L | M, a0ae'
and (3.7) Det A > 0. Proof: See Appendix.
‘These results provide the basis for the following theorems.
20
THEOREM I:
For the LIM-GARCH specification given by equations (2.4) to
(2.9) and under the same conditions as Lemma II
(3.8) L = lim L(6) (exists a.s. for all 6 € ©)
T-<0
and the lim L(6) is uniquely maximized at O.- Proof: See Appendix.
Taus, in the limit the conditional log-likelihood function is maximized at the true parameters even though the assumption of normality
may not be valid.
THEOREM II: (Consistency) For the LIM-GARCH specification given by equations (2.4) to
(2.9), the maximm likelihood estimate 6 is consistent for 6,
provided o. is interior to =. Proof: See Appendix.
Note that the condition that g, is interior to = ensures that, for T large enough, the first derivatives of L(@) are "well-behaved" at 65:
To consider the asymptotic distribution of the LIM-GARCH estimates, first, define A, and By to be the information matrix in Hessian and outer
product form, respectively.
21
9 (3.9) A, - -E guy. agae’
(3.10) Bo = E qo ob ao 36°
As will be shown, both A, and B) appear in the variance-covariance matrix of the asymptotic distribution of the LIM-GARCH estimates and are
therefore required to be invertible.
THEOREM III: (Asymptotic Normality) For the LIM-GARCH specification under the same conditions as
Theorem I, with the requirement that det (B,) > 0, then
(3.11) po/24 1/2 — 6.) ~ N(O,1)
Furthermore, consistent estimates of A, and B, are given by
2 2 ° ° A T dh a T de de (3.122) A = Qs not Pre Mie + tis yz Et t-1 a0.ao tel 30 a8 and A T of. ae 3.13) B= Try ——& t-1 30 40"
with all derivatives evaluated at 0 = a.
22
Proof: See Appendix.
Thus the asymptotic variance-covariance matrix of @ has the following
general form
(3.14) Asymptotic Var-Cov(#) = ALBA.
since the conditional log-likelihood function may not be correctly specified. When the conditional distribution of eit is normal then A, -
B, and the standard form for the asymptotic variance-covariance matrix is
appropriate, that is
(3.15). Asymptotic Var-Cov(@) = a.
To determine which form of the variance-covariance matrix is
appropriate I rely on White's (1982) information matrix test which is
derived in Section VI.
Iv. ‘THE FIM-GARCH ESTIMATOR
An estimator for the complete SEM-GARCH(p,q) model requires a more complex specification than the LIM-GARCH estimator. The full system estimator will be referred to as the Full Information Model with Generalized Autoregressive Conditional Heteroskedastic error processes, or
the FIM-GARCH estimator. The derivation of the FIM-GARCH estimator will
23
focus on the two equation diagonal SEM-GARCH(1,1) model, defined in Section I, since this is the most tractable model and can easily be generalized.
The alternative parameterization of the diagonal SEM-GARCH(1,1) model, given by (1.18), defines a vector hy consisting of all the unique elements of the conditional variance-covariance matrix. In matrix
notation, this can be written as
* (4.1) ho = wv + An, + Bh.) .
where
2 1,t-1
(4.2) Me. 7 €o,t-1°1,t-1 | ’ € 2 ; 2,t-1
Mya ot 4.3 h” He (4.3) t-l1 7 21,t-1
Hoo t-1
FIM-GARCH estimation of the diagonal SEM-GARCH(1,1) model requires
maximizing the conditional log-likelihood function
24
T (4.4) L-=2 tel where (4.5) 2.v,[t,,) = —Mogcae) + log||r]| — 21og]H_| cre te 208 & 2 Ob
- ame [rr + Xp)‘ (¥,0 + x,prHi."]
where H,I denotes the determinate of H., and [|r] | is the absolute value of the determinate of [. For estimation purposes, H. is constructed from its unique elements, defined by he in equation (1.18).
Since the conditional log-likelihood function L depends on the parameters T, 8, w, A, and B.in a highly nonlinear fashion, maximization of L requires iterative techniques. The FIM-GARCH estimator is derived from the first order conditions of (4.4).
As with the LIM-GARCH estimator, the derivatives of He with respect to [ and B are a function of past derivatives of ey and H.. As a result, the analytic derivatives of the FIM-GARCH estimator have a complex recursive structure which are difficult to calculate. Therefore, I will rely on numerical derivatives for actual estimation.
The difficulties involved in deriving the analytical derivatives can be highlighted by examining the partial derivative of the conditional log-likelihood function with respect to the unrestricted structural rm.
parameters, The superscript y denotes a selection operator, as
defined by Hendry (1976), to choose only the unrestricted elements of a
25
matrix. This is necessary since only the derivatives with respect to the
unknown elements are equated to zero.
(4.6) aL _ palogliri] _ [sents [a
ar ary 21 alu| au | jar”
wCe'e) _ ¢, OH
This can be simplified to yield
ar’ 2 ar H H
But the partial derivative of the conditional variance-covariance matrix, H, with respect to [ is also a function of derivatives of lagged residual variances and covariances as well as derivatives of lagged conditional variances and covariances. Similar recursive structures arise with respect to the other parameters of the system.
In estimation, as with the LIM-GARCH estimator, numerical derivatives are used. As explained in Section III, when the model is correctly
specified and the conditional distribution of e¢ is correctly assumed to
1t be normal, then the information matrix can be equivalently expressed in either Hessian or outer product form. In this case, the Berndt, Hall,
Hall, Hausman (BHHH) (1974) algorithm, based on the outer product form of
26
the information matrix, can be used to maximize the conditional log-likelihood functions for the LIM-GARCH and FIM-GARCH estimators. The BHHH method is an iterative method for calculating the optimal parameters,
6. Let §~ denote the parameter estimates after the ith iteration. gitl
is then calculated fron,
T of! @2.J-1T ag! (4.8) gitl _ Reale ae t
te1 aot aot t=1 get
where As is a variable step length chosen to maximize the likelihood function in the given direction.
An alternative algorithm is the Newton-Raphson method which is based on the Hessian form of the information matrix. As mentioned earlier, both methods are equivalent when the conditional log-likelihood function is correctly specified. The asymptotic variance-covariance matrix is given by (3.15) where A, = B.. Alternatively, if the conditional distribution is incorrectly specified to be normal, then under certain regularity
conditions the asymptotic variance-covariance matrix has the form given in
(3.14).
Vv. ASYMPTOTIC PROPERTIES OF THE FIM-GARCH ESTIMATOR
The derivation of the asymptotic properties of the FIM-GARCH estimator closely follows that of the LIM-GARCH estimator. As with the LIM-GARCH estimator, if the conditional distribution is non-normal then the FIM-GARCH estimator is a QMLE. For the FIM-GARCH estimator of the
diagonal SEM-GARCH(p,q) model let @ now be a S-element column vector
27
M (5.1) 6’ = (m’', v’) with S=M+K + (l+ptq)Zi. i=l
where 6, as in Section III, is partitioned such that m is the parameter vector corresponding to the unrestricted parameters of all the structural
equations of the system, that is
(5.2) n= [veccr,a)|* ,
where vec denotes that the matrices are vectorized by column stacking operations and y» denotes a selection operator that chooses only the a priori unrestricted structural parameters. Similarly, v is the parameter
vector of the conditional variance specification, given by equation (4.5), (5.3) v= [reccw.a,B) |" ,
As before, I assume @ € &, where E is a compact subspace of Euclidean space, and let 6, represent the true parameter vector. Theorems IV and V are the FIM-GARCH equivelent to Theorems II and III which verify the
consistency and asymptotic normality properties of the estimator.
THEOREM IV: (Consistency) For the FIM-GARCH specification given by equations (4.1) to Aa
(4.5), the maximum likelihood estimate @ is consistent for 6,
provided §, is interior to E..
Proof: See Appendix.
THEOREM V: (Asymptotic Normality)
28
Assuning that det (B,)>0, the FIM-GARCH specification is distributed
asymptotically normal, (5.4) Bo /2a 1/706 — 6.) ~ N(O,I)
where corsistent estimates of A, and B, are given by
A T dH. dH T 6.5) aA = pts aya yt tert s aya’ t=1- 0606 «6a8’ t=1 and A T @é@2 rip 4 (6.6) B-Ttts —t-—+
t=-1 30 436°
with all derivatives evaluated at 6 = 0.
Proof: See Appendix.
de, de,
a6 308’
Thus the asymptotic variance-covariance matrix of § has the following
general form
. “ -1, .l (5.7) Asymptotic Var-Cov(#) = A, BoA ;
29
since the conditional log-likelihood function may not be correctly specified. When the conditional distribution of ey is truly normal then A, - B, and the standard form for the asymptotic variance-covari.ance
matrix is appropriate, that is
(5.8) Asymptotic Var-Cov(@) = A As with the LIM-GARCH estimator, White’s Information Matrix Test can be
used to determine the correct form for the asymptotic variance-covariance
matrix.
VI. THE INFORMATION MATRIX TEST
A well known test for misspecification associated with maximum likelihood estimation is the information matrix test of White (1982). The test is based on his information matrix equivalence theorem. This theorem essentially says that when the model is correctly specified and the
conditional distribution of 1 is correctly assumed to be normal, then
t
the information matrix can be expressed either in Hessian forn, ACO.) or
in outer product form, B(O,). White shows that under these conditioris (6.1) A(@.) - BC) = 0.
Thus a test for the null hypothesis of conditional normality and
30
correct model specification is based on the difference between consistent a aA
estimates of A) and B,, denoted Ay and By respectively. For the
LIM-GARCH estimator @ is a s-element parameter vector, following White
(1982) and Weiss (1986), I define q [= s(st+l1)/2] vectors d.(8) such that
d (8) has kth element
an, at, 7e, k=1,....4 (6.2) d(6) = |—— + i,j=-1,...,8 a0, 28, 20,80, i<j
The test is based on what White refers to as the "indicators"
(6.3) De) = T B46), tT taf
which are the elements of Ay - B..- Next, define V(@) as (6.4) vV(é) = e[a, (0) aca)" |
V(6) turns out to be the asymptotic variance-covariance matrix of rt/2y, (6). ' yo additional assumptions are required to meet the preconditions for the test to be valid. First, V(#) must be nonsingular. Secondly, for cases when eit is not normally distributed it is necessary
to assume that
31
@o 1t
(6.5) |, (hy)
teal < for all @geé&.
Based on these conditions, White proves that
(6.6) t/?p (9) * N{0,V(9)] and “ a.s. (6.7) Vp(9) > VCO.) where Vip (8) is the estimate of V(O,). Then it follows that the information test statistic, Th is given by (6.8) Ty =- m,.(0) [v.89 D,(9) ~ Xq
To carry out the Information Matrix Test, one computes Ty and compares it to the critical value of the x distribution for a given size of test. If Ty does not exceed this value, then one can not reject the null hypethesis that the model is correctly specified and a may be used as_ the variance-covariance matrix of the LIM-GARCH estimates. This applies
equally well to the FIM-GARCH estimator
VII. CONCLUSION: In this paper I have extended the GARCH(p,q) model of Bollerslev
(1986) into a simultaneous equations framework and derived two estimation
32
strategies: LIM-GARCH and FIM-GARCH. Furthermore, it has been shown that these two estimation strategies have the desireable asymptotic properties, consistency and asymptotic normality, of Maximum Likelihood estimators. Alternative specifications of conditional variances and covariances that are heteroskedastic remain to be explored. My approach has exclusively focused on the ARCH specification originally developed by Engle (1982) and generalized by Bollerslev (1986). Future research will focus on extending the SIIM-GARCH(p,q) model to a dynamic framework by including lagged endogenous variables. In Harmon (1988) I extend the SEM-GARCH model by incorporating the conditional variances and covariances as variables in the structural equations themselves. This is referred to as_ the SEM-GARCH-M model. It is a logical extension of the ARCH(q)—in-Mean model (c.f., Engle, Lilien, and Robins (1987)) and the GARCH(p ,,q)—in-Mean model
(c.f., Bollerslev, Engle, and Wooldridge (1985)).
33
BIBLIOGRAPHY
Anderson, G.J. "The Structure Of Simultaneous Equations Estimators: A Comment." Journal of Econometrics. Vol. 14 (1980): 271-276.
Basawa, I.V., P.D. Feigin, and C.C. Heyde. "Asymptotic Properties Of Maximum Likelihood Estimators For Stochastic Processes." Sankhya: The Indian Journal of Statistics, Vol. 38, Series A, Part 3 (1976): 259-270.
Berndt, E.K., B.H. Hall, R.E. Hall, and J.A. Hausman. "Estimation and Inference in Nonlinear Strutural Models." Annals of Economic and Social Measurement, Vol. 4 (1974): 653-665.
Bollerslev, Tim. "Generalized Autoregressive Conditional Heteroskedasticity." Journal of Econometrics, Vol. 31 (1986): 307-327.
"A Conditionally Heteroskedastic Time Series Model For Security Price and Rates of Return Data." Discussion Paper #85-32, Department of Economics, University of California, San Diego. (1985).
, R. Engle, and J. Wooldridge. "A Capital Asset Pricing Model With Time Varying Covariances." Discussion Paper #82-4, Department: of Economics, University of California, San Diego. (1982).
Crowder; Martin. "Maximum Likelihood Estimation For Dependent Observations." Journal of the Royal Statistical Society, Series B38, (1976): 45-53.
Dhrymes, P. J. Econometrics: Statistical Foundations and Applications. New York: Springer-Verlag. (1974).
Diebold, Francis X. and M. Nerlove. "ARCH Models of Exchange Rate Fluctuations." Unpublished manuscript, Department of Economics, University of Pennsylvania, (1986).
"Modelling The Persistence Of Conditional Variance: A Comment." Econometric Reviews. Vol.5, No.1 (1986): 51-56.
__. "Temporal Aggregation of ARCH Processes and Distribution of Asset
Returns.," Special Studies Paper #200, Board of Governors of the Federal Reserve System, (1986).
"Testing For Serial Correlation In The Presence Of ARCH." Proceedings of the American Statistical Association, Business and Economic Statistics Section. Washington, D.C.: The American Statistical Society, (1986).
and P. Pauly. "Endogenous Risk In A Portfolio-Balance Rational - Expectations Model Of The Deutschemark-Dollar Rate." Forthcoming, European Economic Review (1988).
34
Domowitz, lan and Craig Hakkio. "Conditional Variance And The Risk Premium In The Foreign Exchange Market." Journal of International Economics,
Vol. 19, (1985): 47-66.
Domowitz, [an and Craig Hakkio. "Testing For Serial Correlation In The Preseence Of Heteroscedasticity With Applications To Exchange Rate
Models." Research Paper #83-11, Federal Reserve Bank of Kansas City. (October, 1983).
Engel, Robert F. “Autoregressive Conditional. Heteroscedasticity With
Estimates Of The Variance Of United Kingdom Inflation." Econometrica, Vol. 50, No. 4 (July, 1982): 987-1007.
"Wald, Likelihood Ratio, And Lagrange Multiplier Tests In Econometrics." Handbook of Econometrics, Volume II. Edited by Z. Griliches and M.D. Intriligator. (1984): 776-826.
. “A General Approach To Lagrange Multiplier Model Diagnostics." Journal of Econometrics, Vol. 20 (1982): 83-104.
and T. Bollerslev. "Modelling The Persistence Of Conditional Variances." Econometric Reviews, Vol.5, No.1 (1986): 1-50.
,C.W.J. Granger and D. Kraft. "Combining Competing Forecasts of
Inflation Using A Bivariate ARCH Model." Journal of Economic Dynamics and Control, Vol. 6 (1984): 151-165.
and D. Kraft. "Multiperiod Forecast Error Variances Of Inflation
Estimated From ARCH Models." In Applied Time Series Analysis of Economic Data, edited by A. Zellner, Bureau of the Census, (1983): 293-202.
, D. Lilien, and R. Robins. "Uncertainty And The Term Structure." Discussion Paper #82-4, Department of Economics, University of California, San Diego. (1982).
, D. Lilien, and R. Robins. "Estimating Time Varying Risk Premia In The Term Structure: The ARCH-M Model." Discussion Paper #85-17, Department of Economics, University of California, San Diego. (1984).
, D. Lilien, and R. Robins. “Estimating Time Varying Risk Premia In The Term Structure: The ARCH-M Model." Econometrica. Vol. 55, No. 2 (1987): 391-407.
Harmon, Richard. “The Simultaneous Equations Model With Generalized Autoregressive Conditional Heteroskedasticity In The Mean: The
SEM-GARCH-M Model. Unpublished Manuscript, Georgetown University. (1988).
Hendry, David F. “The Structure Of Simultaneous Equations Estimators." Journal of Econometrics. Vol. 4 (1976): 51-88.
Koopmans, T.C., H. Rubin and R.B. Leipnik. "Measuring the Equation System of
35
Dynamic Economics." In Ch.2 of Statistical Inference in Dynamic Economic Models, Edited by T.C. Koopmans. New York: Wiley (1950).
Kraft, Dennis and R. Engle. "Autoregressive Conditional Heteroskedasticity In Multiple Time Series Models." Discussion Paper #82-23, Department of Economics, University of California, San Diego. (1982).
Pollock, D.S.G. (1979).
The Algebra of Econometrics. New York: John Wiley & Sons.
"Varieties Of The LIML Estimator."
. Australian Economic Papers. (December, 1983): 499-506. .
Schmidt, P. Econometrics. New York: Marcel Dekker. (1976).
Spanos, Aris. Statistical Foundations Of Econometric Modelling.
Cambridge: Cambridge University Press. 1986. Weiss, Andrew A. "On The Stability Of A Heteroskedastic Process." Journal Of Time Series Analysis. Vol. 7, No. 4, (1985): 303-310. _ “ARMA Models With ARCH Errors." Journal Of Time Series Analysis. Vol. 5, (1984): 129-143.
"ARCH And Bilinear Time-Series Models: Comparison And
Combination." Journal of Business and Economic Statistics, Vol. 4 (1986): 59-70. .
"Asymptotic Theory For ARCH Models: Estimation And Testing." Econometric Theory, Vol. 2, No. 1 (1986): 107-128.
White, Halbert. (1984).
Asymptotic Theory for Econometricians. Academic Press, Inc.
"Maximum Likelihood Estimation Of Misspecified Models." Econometrica, Vol. 50, No. 1 (January, 1982): 1-25.
"Corrigendum." Econometrica, Vol. 51, No. 2 (March, 1983): 513.
Al
APPENDIX
Note: All expectations are conditional on Te unless otherwise stated.
First: Order Conditions For The LIM-GARCH Estimator: Differentiating the conditional log-likelihood function, given by
equation (2.4), with respect to a, yields
2 T a'W. 6h (A.1) dL = s]— fete _ ; lt da A t=1 a’W a 2h da ° ; 1t
When simplified this can be written as
e! Y (A.2) OL p | - [Sta o 1l,t 7 _ ie it da 0M 11, to 2h),
where
A2
2 2 2 dh q de, p oh, .. (A.3) lt _ a, 1,t-i + 5 6, 1,t-i da i=1 da i=l da 2 a Ps Ob) ei = Za,e! Y + = {x1 il,t-i1,t-i i-1 i da
Similarly, differentiating the conditional log-likelihood function
with respect to b, yields
9 | T e! X oh é! € (A.4) éL ~ sie Lett + ; || ut lt _ 1| - 0 db, A t=1 2h), 2h), ab, hit J b =b ° ° where 2 2 2 dh q de, p dh, (sy HE Ze bet » by She db i=l ~ db - i=l ~ dab °o °o 5 q P ahi t-i - = ae xX. . + = 6 2 im i 1,t-i°1,t-i i-1 i ab
Now, differentiating with respect to the parameters Ww? as, and 5, that comprise the conditional variance, hee yields T dh €1 € (A.6) an - sl-— ; | = lt 1 - 0 dw, A t=1 2h). dw, hj,
A3
but noizice that
2 2 oh, . p @éh (A.7) —it _ F + 5 hes]
isl Ow
°
It is clear from (A.7) that a complex recursive structure exists, which
requires expansions in order to derive an analytic expression for (A.6).
Differentiating with respect to a, yields éL T ah? €1 € (A.8) — = & H i [ste - 0, da, t=1 2hs, da, hie where 2 2 dh p oh, ,., (A.9) ee le at zB §,— ed da J ie aa,
Then, differentiating with respect to 6 yields the following first order
condit:ion
2 . dL T dh e! i € (A.10) — = & [P| [Ass - 0, a6 eo1]2ni las, JL bh),
where in this case
A4
2 2 dh q de, ,_ (amy —4£ = [nt + Bae as. J Gel a5,
The expressions for the first order conditions derived in equations (A.6) through (A.11) clearly show their complex recursive nature. The remaining first order conditions require differentiating L with respect to
the conditional variance, hi.
aL T él sé (A.12) — = & tists - 1 - 0. ony, te1}2hy el hy,
It is not possible to derive general compact analytic expressions due
to the recursive structure of these first order conditions .
Proof of LEMMA I:
Expressions for de), /ém, where m’ = (a; bo): are given by
de 1t (A.13) da, = Ye : de 1t (A.14) “ab, - Xie .
Then for bounded constant vectors A < eo, one has
AS
(a.15) ar——tE LL gez
én
1 , = where Ze [Yee X,,]- Now, one can write
de Oe
(a.16) Efar—2=—2t y} - glarz za
; 1¢71t m om
where 21 it is the cross product matrix and the right hand side is thus a
scalar. Hence there exists a constant M, < © such that
1 de de (A.17) Ejar—tt —4t yg] < M, for all 9 EE om om’ Next, to show that de de (A.18) det E|/—2= —4t] > 0 om om’ is equivalent to showing that de de (A.19) Ejar—4= 28] > 0 for all \ = 0. om Om’
Following Weiss (1986), proof is by contradiction. First, assume there
exists ’ * 0 such that (A.19) equals zero. Then it must true that
de “A.20) 4'—— = 0 a.s. for all t. om
A6
This implies from (A.15) that ANZ), = 0 a.s. for all t. Now let Yue
represent the forecast of Y given information available at time t-l,
lt that is
A
(A.21) oY, = ely, lt.
Then specify Y), to be
(A.22) Y,. = e +
1t 1t lt ’ where ele denotes the true errors. Furthermore, using a mean value expansion one can specify €;, to be de "° 1t (A.23) fie 7 “ae + om _ m,)
* where the derivative is evaluated at m , which lies between m and m2:
Equations (A.22) and (A.23) imply that ele is a function of Ted and Zee
since de, ,/dm' =Z If that is the case then
1t°
° (A.24) elet, I
-1°71¢|
But the LIM-GARCH specification assumes that
(A.25) elee[t1| = 0
and therefore it must be the case that
A7
, ° (A.26) elet,
ey Zre| - 0.
This implies that eit = 0 a.s. for all t, which contradicts the fact that
E|ced,)"| = o > 0 (i.e., the unconditional variance is greater than
zero). Therefore, no such X exists and
de de (A.27) Elar—tt—lt,] 5 9 a.s. dom Om’
as required. Oo Proof of LEMMA IT:
lixpressions for ahs, /ov, where v’ = (w a’, 6’), are given by
(A.28) zh Um= (U1
(A.29) -
(A. 30) > «i
For constant vectors » < ©, write (A.28), (A.29), and (A.30) as
A8
any. (A.31) py >yT 7 BY We where 2 2 oh dh 2 2 1,1-1 __1,t-p (A.32) We - f ; 1 t-1 pees, €1 t-q , a5, peeey a6
Since by assumption e[ced,*] < « then eles, | <M, < © for all 6 € &.,
2 ’ Thus, the first part of the lemma is straightforward. For the second part of Lemma II, apply the same method of proof as in
Lemma I. As before, to show that
. ‘ane ahi (A.33) det Ej-——— —— 0 ov év'
is equivalent to showing that
2 52 (A.34) nfs —it \ > 0 for all \ = 0. ov év'
Assume there exists a A»#0 such that (A.34) equals zero. Then, as in Lemma I, this implies that A'W - 0 a.s. for all t. Using a similar expression to (A.23), which is derived from the mean value theoren, yields
: 2 an expression for e
1t
de de de,
2 o 2 o -. lt lt lt
(A.35) fe 7 (ea) + ere =(v - vi twv- vf — ov avi
(v - v,)
ov
A9
°
This is a quadratic function in fit which yields two solutions,
° .
(A. 36) fie 7 £,(t) or f(t) These solutions are functions of Ty and Xie: But eit having two values ocnditional on Tey and Xie is not permitted. Therefore, no such X exists
and (A.34) holds. oO
Proof of LEMMA IIT: Differentiating the conditional log-likelihood function with respect
to 6 yields
2 2 € de oh € (a.37) 2 - <5 ed ee + : 1t de 4 -0 a0, tel{ hy 1 28, any Lae, JL bp,
i lt lt
where the derivatives with respect to 6, are evaluated at their true values 4,: Second order derivatives can be shown to have the following
general form
Al10
2 2 2,2 -
e710] ere] | Pre 1 | ‘1 oh,
+ Io tt cael h 06. 00 2h h 06.00.
i i j-
it j ree Mae 2 2 2 2 ere Piel ofa r ffre} [Pre] Me rr ed ee 0 ee ny A OO 2 0 ny LL aa a8, nrelhyelL oath 20, 2 2 ~ [| as 4 ante L.a0,tL 28,
Expressions for de, ,/am, and ah, ,/8v, are given by (A.15) and (A.31),
respectively. For ahi ,/am, one has
2 2 dh q de _ p <3éh, ,. (a.39) EE - 2a, Et + 36S om, i=l , om, i=1 om
where my, is the jth element of m. From Lemma II and (A.39) it is easy to
see that ah, ,/29, have bounded second moments.
Since every term in ahs ,/38, and a”hi , /20 00 also appears in he
j 1t
itself, the expressions
2 2,2
oh ah (4.40) 4 and + (4 hi, a6, hi, 00,98,
are uniformily bounded from above. Hence, evaluating equation (A.29) at the true parameter values 6, implies that the first, third, fourth, and
fifth terms are zero, with the sixth term being
All
2 2 2 2 2
(A.41) p| 2 [fae] |2Pre| | Pre] | gf 2 [te] [Pz |, <n
nt In2_}] a¢.|| ae.}} to]. n+ | ae.{| ae.{| 2 ues i j arb 94; j
since E(e* IT,_4) = he. Similarly, the second term of (A.38), evaluated
at Oo» ‘Ls bounded
where the derivatives are evaluated at 6: To show the Det A > 0, I follow the method utilized in Lemmas I and
II. For any A * 0, one can transform A such that
teal teal
2 2 dh dh
2 + 00 ae’
Ae
de de hie 06 a6’
Al2
Both terms on the right hand side are nonnegative. What remains to be shown is that these are both greater than zero. First, partition 1’
= (Az A5) to conform with #9’ = (m’,v’). Now, since de, ,/dv = 0 the second term in (A.44) is equal to
i de de de de (A.45) £E Noe —1t}|4*), |r, | - = Ma —1t}|—*)a ta. hi. L a0 JL ae" ny. L ae JL a0"
From Lemma I this is clearly positive unless do = 0. If ry - 0, then
X
1 * 0 and the first term of (A.44) becomes
24 ray 2 ah, [an (A.46) £E Nem —1t}|—+*), |r ne. | a9 || ae
1t which is positive because of Lemma II. O
teal
2 apa 2 1 fe] Pre E a —] | »y +L a6 SL a
Proof of THEOREM I:
From the ergodic theorem, for any # € &,
(A.47) L(@) = lim Ae
to
_ _il 2} _ 1 ' = B(C,) 5E| 10g eel $e | toga; .,)|
1 2 ,2 - slit | a.s.
Al3
if the expectations exist. From Jensen's Inequality it follows that (A.48) logE(X) = E[10g(x) | ’
for all positive random variables X, with equality only when X is a
constant: a.s. This implies that
2 2 (A.49) Loge [he | = z [10g hie |
Since, by definition, e[nt, | < © then from (A.47) it must be true that
£|1ogin; .)| < o. From Lemma I it was shown that E(e.) <o, for all ge
=. Then since
then clearly 2 ,2 | (A. 51) ef t/t | <o, Since Wii _ is the second moment matrix of residuals from the regression
of Yue on Xie the vector of all exogenous variables, it is equal to zero
when evaluated at the true parameter values.
Next, following Weiss (1986), make the following transformations, 2 2 -2,- ° o (2 (A. 52) ete - E [niece + fie 7 61.
—2,.0 2 -2,_. °o (2 - ela et | + efns cere ~ €14) |
Al4
. - fe] Since h — €
2 1t1e it? and E(es, 11.) = 0. From (A.52) it is clear that
2 depends only on information available at time t-l
°
2 ,2 |. 2,2 (A.53) ef ete/he| > z| (ed) os |
This holds’ with equality when fie 7 ee for all t ak.s. Taking expectations of an expression for ce similar to (A.55) and based on Lemma I, yields Oe de 2 °o (2 ' 1t 1t °o (2 (A. 54) eet, | = e| (ed) + (m m,) | om “| m,) = el Cet)
with equality only at m = m,: Thus from (A.54) one can see that fie 7
° ie: for all t a.s., only when m = m,:
1 2 1 , (A.55) L(6) = E(C,) - 32 108 ie | _ se [log(ar¥,, .2,)|
1 2 ,-2 - peel h | ,
1 2 1 ' s E(C,) - 3B [log ni e| - e[togcas¥,, .4,)|
~ Boel 3 | ,
eine e| cet)” ni? | - E|cet.)” hye (hy)? ae]
A15
1 2 1, E(C,) - 38| log ni | — 5 z| Logica; Wi, L? a,)|
- Boefeleei” (hy) ‘I|- Hoe? (hy. i] ; eine ele | > x tel oi" |
s E(C,) -F 38[10g Mel - se z| Logica; a1, to |
tere at ‘I|- prof? (hy) ‘||. E(C,) - se z|togcnt,) *| - 38 |1ogca: Wi, 2 a,)|
- hoe[ei? oto]
IA
BC) ~ 5 3E|Logche “| - telroe[ce 2? (hy 2 ;
L(6))
with equality only at 6 = 9,- Note that
(A. 56) “t fn? (no) i] - ofiefi 2 ey? i] |
only when (hy, 2 - hi, But based on Lemma II, this can only occur at
§=6.. Oo
Al6
Proof of THEOREM II:
This proof follows closely the proofs of Weiss (1986) and Basawa, Feigin, and Heyde (1976). I begin by first applying a Taylor series expansion to the first order derivatives for the conditional
log-likelihood function. This expansion can be written as
2,0 4 3, (as7) Se - By Shiog_yy 4 LabG_4) 00 06 06 36! 86 ° ° ° where |6 — @)| > [6 - al. which requires that @ lie between @, and 6.
The third term can be considered as a remainder term on the Taylor series
expansion. Equation (A.57) can be rewritten, based on (2.4), as
aL T Ee A T a°2, A T ae, (A.58) =x = T-—— + (- 6.) = + (6 - a) = —— 06 t=1 36, oO t=1 36 (48° t=1 46
Basawa, Feigin, and Heyde derive a set of sufficient conditions for
the existence of a consistent root of the log-likelihood equation
oL oe 00
(A.59) = 0.
These sufficient conditions consists of
| T a2 (I) A 5 -t Bo. T tl 36,
Al7
(II) There exists a nonrandom matrix M(6.) > 0 such that for all e>o0O, 1 T a7e, Pri- - = —— 2 M(@.) > l-e T t=1 60 46’ fe] ° for all T > T, Ce). (III) There exists a constant M < © such that
3 ) 4.
86,30 30,
E
< M for all@eEee=.
Then it follows that 9 —4 a.
The method of proof is to show that these three conditions are
satisfied. I begin by focusing on the parameters of the current
endogenous variables a.
Condition (I): From equation (A.1), one has
T a'W e! Y (A.50) gL - =: -E 11,t | _ itis da, A t=1 aoW a1 to 2h), awa,
A18
When this derivative is evaluated at the true ao: the first term is zero
since Wi t is the second moment matrix of residuals from the regression ’
of Yue on Xx: Then, following Weiss (1986), it can be shown that
(A.61) 2a I | - 0 t-1 i da ° since E(e,,]I, ,) = 0 and E(eo 1 )- n2 The ergodic theorem, see it! t-1 1t' t-1 1t° ,
White (1984), then implies that
T aa. , s — 2 o. t=1 da,
Ir
(A.62)
4
Condition (II): By the ergodic theorem, for any constant vector \ 0,
17 a7a, a°a, (A.63) = =A!‘ » — EIA'—— A a.s. T tel da _ da! da da!’ ° ° ° ° = -rA\'A X °
where A, is defined to be
2 (A.64) A, = -E io da da’ ° °
This second order derivative, after some simplification, has the following
Al19
form,
2 2 T W 'W a yey (a.65) LH = ¢ 11,t polite le le
, = ,
da da’ t=1 aoWi7 to (30M11 £30) 2h5, U — te f1e%1t sily , Ly} on? in. 1t-i 1t-i
a €. € T-1 1 {1e%1t i-l ., * “| - 1 oft “iea%1e-t
T-1 a €1 ¢€ i-l_ , 1 j,, _ o_lt 1t * an 5) dette ne beta * 21 2 2 ° 1t 1t T-1 i-1 >a) Y3 € i-y 1 itt te Taking negative expectations of (A.65) yields 2 T YIY (A.66) - >| | -~ ptt > o, da da? t=1 2h), since E(e,,|I,__,) - E(e |Z...) = 0 and E(e2 JI. ,) = he Now lt! t-1 1t-i' t-1 lt’ t-1 1t’ ,
following Weiss (1986), let
A20
(A.67) 0 < 6(A) <
Nie
ana.
ATE Xr da da' ° fe)
for a given \.Then, for all « > 0, there exists T, = T, Ce) such that
< 6 | > l-e
for all T>T,. Thus, condition (II) is verified by defining M(a,) to be
1
L2 ana, ah, 1s yt y - gh d
(A. 68) Pr . T tel da da! da da’ ° ° ° °
1 1 ak, (A.69) M(a,) - -3 E @a Ja’ . ° ° Then LT ah, (A.70) Pr} — = A'——A > A'M(a,)A > l-e T t=1 da da’
for all T > Ty:
Condition (III): The third condition to be verified requires that the third order derivatives, which can be interpreted to be the remainder term of the Taylor series expansion, be bounded in absolute value. The third order derivative is derived by differentiating equation (A.65) with respect to a. It is clear that the only terms that cannot be dealt with as above are those terms which contain third order derivatives of he ar.d fe: Following Weiss (1986), one is able to show that an extension of the methods applied to the first and second derivatives implies that terms
containing a°hi./a are bounded and that terms containing a°e,,/aa> have
A21
bounded second moments. O
The proof of consistency for the remaining parameters follows as
above.
Proof of THEOREM III:
Besawa, Feigin, and Heyde (1976) again provide a set of sufficient
conditions for asymptotic normality. These verifiable conditions are
T aA (I) T 1/2 = —t A N(O,B ) for nonrandom B > 0. fe] fe] t=1 06 ° = T a7a. (II) °° +T = - Py “A, for nonrandom A, > 0. t=1 36 46° ; ° °o CLIT) Condition (III) of Theorem II.
The method of proof, as before, is to show that these three conditions are
satisfied.
Condition (I): Following Weiss (1986), assume that B, > 0. Then
from Condition (1) of the proof of Theorem I, it follows that
(A.71) | —t
Weiss shows that if
A22
a2. a (A.72) | —t_t | < @
and
(A.73) os ol ok | - Bo <o@,
then a Martingale central limit theorem is applicable. Furthermore, Weiss shows that when (A.71) is true then (A.72) and (A.73) are equivalent. Thus, all that is required is to show that (A.72) holds.
To show that (A.72) is true I follow the method in Lemma III used to
show A< ©. From (A.37) one has
Z 2 (a7) gf ete] 8 gf. Ste f2fte] , 2 [Pael | fae _ | a6 a6" n2_| 90 an-_| ao || n2
1t 1t 1t
2 2 ord Laas 1 [Pre] ] f1e ~ 72 + 721, 2 + ne a9 ane} a6 || h
2 2 ne 06 jj 30° ne 06 |} 00’
1t 1t
2 2 2 2 42 It rf empe Pel] fre 2 TN te ose h ant | a0 || ae'|| nh
1t 1t
where the derivatives are evaluated at §,- Taking expectations yields
A23
aL, aL de, ] [ae (A..75) | —tt| -& a —it}|—4t) | , a0 a0" ho. | a6 || a0"
which from Lemma III is clearly bounded.
Condition (II): Since by definition
teal
then Lemma III verifies condition II. oO
2 ) A.
(A.76) A, = —E|—— 00 00’ o 0
Proof of THEOREM IV:
This proof is nearly identical to Theorem II. Again, one must verify the three sufficient conditions derived by Basawa, Feigin, and Heyde (1976) and given in the proof of Theorem II. I begin by focusing on the parameters of the endogenous variables, I.
Condition I: From equation (4.8), the first order condition is given
by
T |: OH, | lele e'Y cac77) 8 me se fats + EY] JE - 1] - 4]. ar? tel 2ler*} | H, H, |
t
yg 2 2 Clearly, since E(e,, 11, = 0 and E(e}, 11,1) hoy ;
(a.78) EJS ar’
Then, the ergodic theorem implies that
T dh sr — Bo. t=1 er’
Ie
(A.79)
ey
Condition (II): By the ergodic theorem, for any constant vector A « O,
1 T a°a, ana, (A.80) = A'|—, d —_— E|A'———_ - A" ASA 7)
, ’ t-1 |ar“ar’ rear’
re
where A, is defined to be
| 2 (A.81) ALS = - B|—°4_ {ar’ar®
This second order derivative for the conditional log-likelihood function
is given by
2 T Y'e_|oaH €'e oH oH (A.82) oh = > |-(r'r) 1, etj_t) _ Ljtety] lye —. arvar® —s_ t= H. ar’| 2, a H.. ar*| | ar?
2 t e ¢ 2 H. or’ ar H. H H.. or
Taking negative expectations conditional on information available at time
A25
t-l of (A.82) yields
2 (A.83) —E{|—? 4 ar’ar4
T aH.) [aH Y'y Ial- 2 -creryt + 2] 4] )—t}}—t | 4 1 5 0 t=1 2H, ar’ | | ar’ H
t
As in Theorem II, for a given \, let 6 be bounded by
ana. A! E|————a ar*ar’
Then, for all « > 0, there exists T
(A.84) 0 < 6(A) <
Nik
17 T, (e) such that
17 a°e, ao, => XS A'—— A - E]A’'—— A
(4.85) Pr . : Tt=1 ar*ar’ ar“ar’
< j >l-e
for all. T > T)-
Thus, condition (II) is verified by defining M(T) to be
1 ak, ar“ar* Then 1 7 a°s, (A.87) Pr} = = A'——=— A > AMPA] > 1l-e T t=1 ar“ar4
for all T > T.
Condition (III): See Theorem II, condition III. oO
A26
The proof of consistency for the remaining parameters follows as above.
Proof of THEOREM V:
As with Theorem IV, this proof is nearly identical to its single equation counterpart, Theorem III. The proof requires verifying the three sufficient conditions for asymptotic normality outlines by Basawa, Feigin, and Heyde (1976) and given in the proof of Theorem III. As before, I will focus on the parameters of the endogenous variables, TI.
Condition I: Since it is assumed that B09, then from Condition I
from the proof of Theorem IV, it follows that
(A.90) oe
a = 0
ar
As explained in Theorem III, given the (A.90) is true, all that is
required to verify Condition I is to show that
ak. ae (A.91) os t | < «
re ar’ art
From (A.77) condition (A.91) can be written as
Ok, AR fe) st ee e’'Y.|' (A.92) ns — - —E arly: + |—tyjeet— i] -—- tt ar’ ar# | ar’ H. H
e’e e/y ftft _ 4] _ feet H
wee nee
A27
Yie .e'Y 1) dH e'e _ fosa « sete _ [el [te — |
, He. 4
e’e ‘10H oH e'eé H. a ar’ H. e'Y _9 (rly t i He
where the derivatives are evaluated at the true I. Taking expectations
yields
ar” ar® HH,
Condition II: By definition
aa. (A.94) A, = - E|——; ar’ar®
teal
An argument identical to Lemma III verifies Condition II.
Condition III: Similar to Theorem II, Condition III. O
IFDP
NUMBER
322
321
320
319
318
317
316
315
314
312
311
310
309
International Finance Discussion Papers
TITLES 1988
The Simultaneous Equations Model with Ceneralized Autoregressive Conditional Heteroskedasticity: The SEM-GARCH Model
Adjustment Costs and International Trade Dynamics
The Capital Flight "Problem"
1987
Modeling Investment Income and Other Services in the U.S. International Transactions Accounts
Improving the Forecast Accuracy of Provisional Data: An Application of the Kalman Filter to Retail Sales Estimates
Monte Carlo Methodology and the Finite Sample Properties of Statistics for Testing Nested and Non-Nested Hypotheses
The U.S. External Deficit: Its Causes and Persistence
Debt Conversions: Economic Issues for Heavily Indebted Developing Countries
Exchange Rate Regimes and Macroeconomic Stabilization in a Developing Country
Monetary Policy in Taiwan, China
The Pricing of Forward Exchange Rates Realignment of the Yen-Dollar Exchange Rate: Aspects of the Adjustment Process in Japan
The Effect of Multilateral Trade Clearinghouses on the Demand for
International Reserves
Protection and Retaliation: the Rules of the Game
Changing
I,
AUTHDR(s)
Richard Harmon
Joseph E. Gagnon
David B. Gordon Ross Levine
William Helkie Lois Stekler
B. Dianne Pauls
Neil R. Ericsson
Peter Hooper Catherine L. Mann
Lewis S. Alexander
David H. Howard
Robert F. Emery Ross Levine
Bonnie E. Loopesko Robert E. Johnson
Ellen EF. Meade
Catherine L. Mann
Please address requests for copies to International Finance Discussion Papers, Division of International Finance, Stop 24, Board of Governors of the
Federal Reserve System, Washington, D.C.
20551.
Cite this document
Richard Harmon (1988). The Simultaneous Equations Model with Generalized Autoregressive Conditional Heteroskedasticity: The SEM-GARCH Model (IFDP 1988-322). Board of Governors of the Federal Reserve System, International Finance Discussion Papers. https://whenthefedspeaks.com/doc/ifdp_1988-322
@techreport{wtfs_ifdp_1988_322,
author = {Richard Harmon},
title = {The Simultaneous Equations Model with Generalized Autoregressive Conditional Heteroskedasticity: The SEM-GARCH Model},
type = {International Finance Discussion Papers},
number = {1988-322},
institution = {Board of Governors of the Federal Reserve System},
year = {1988},
url = {https://whenthefedspeaks.com/doc/ifdp_1988-322},
abstract = {In this paper I generalize the standard simultaneous equations model by allowing the innovations of the structural equations to exhibit Generalized Autoregressive Conditional Heteroskedasticity ( GARCH ). I refer to this new specification as the SEM-GARCH model. I develop two estimation strategies: LIM-GARCH , a limited information estimator, and FIM-GARCH , a full information estimator. I show that these estimators are consistent and asymptotically normal. Following Weiss (1986) I show that when the errors in the SEM-GARCH process are incorrectly assumed to be conditionally normal the likelihood function is still maximized at the true parameters, given certain regularity conditions. This results in the asymptotic variance-covariance matrix being more complex than the usual inverse of the information matrix.},
}