ifdp · November 30, 1993

Fluctuating Confidence and Stock-Market Returns

Abstract

The drift of two different diffusion processes (asset returns) is determined by a state variable which can take on two values. It jumps between the two according to Poisson increments (this is called a 'regime-switch'). For any given position of the state variable the drift of one process is high and the other is low. I find that the posterior probability that the 1st asset has higher average returns, conditional on observing the path (returns) of each process, follows a diffusion process and calculate its infinitesimal parameters. I also derive analytical expressions for its stationary density and for some of its path properties. I compare the filtering problem to the Kalman Filtering problem and find that even though the dynamics of the mean of the distribution are very similar, the dynamics of the variance are subject to stochastic fluctuations. The model is parsimonious in that the conditional mean and variance are functions of a single variable.

Board of Governors of the Federal Reserve System

International Finance Discussion Papers Number 461

December 1993

FLUCTUATING CONFIDENCE AND STOCK-MARKET RETURNS

Alexander David

NOTE: International Finance Discussion Papers are preliminary materials circulated to stimulate discussion and critical comment. References in publications to International Finance Discussion Papers (other than an acknowledgment that the writer has had access to unpublished material) should be cleared with the author or authors.

Abstract

The drift of two different diffusion processes (asset returns) is determined by a state variable which can take on two values. It jumps between the two according to Poisson increments (this is called a ‘regime-switch’). For any given position of the state variable the drift of one process is high and the other is low. I find that the posterior probability that the Ist asset has higher average returns, conditional on observing the path (returns) of each process, follows a diffusion process and calculate its infinitesimal parameters. | also derive analytical expresssions for its stationary density and for some of its path properties. I compare the filtering problem to the Kalman Filtering problem and find that even though the dynamics of the mean of the distribution are very similar, the dynamics of the variance are subject to stochastic fluctutations. The model is parsimonious in that the conditional mean and variance are functions of a single variable.

I characterize the interest-rate and total-returns processes in a Cox-Ingersoll-Ross{1985] style model where the productivities of assets are unobserved, but inferred as above. I find that this model is capable of reproducing three stylized facts of stock-market returns and interest-rates. These are the skewness and kurtosis of returns and the ‘Predictive-Asymmetry’ of returns: excess-returns and future changes in volatility are negatively correlated. Further negative returns cause reactions of larger magnitude. The success of the model in generating these features depends on the speed of learning about the regime switches. Parameter values which lead to faster learning, are consistent with large negative skewness of returns and the Predictive Asymmetry property. The slower learning version leads to greater kurtosis of returns. I show that a model based on the same fundamentals but with observed ‘regime-shifts’ is not reconcilable with these features. My analysis suggests that learning about the productivities of assets of the kind introduced here may be an important determinant of portfolio choices and observed asset

returns.

Fluctuating Confidence and Stock Market Returns

Alexander David !

This paper has three purposes. Firstly, the presentation of a Filter in continuous time which characterizes the dynamics of Bayesian learning about recurrent ‘regimeswitches’ to be defined shortly. Secondly, to show how ‘fluctuating confidence’ which arises due to this updating process, is reflected in the statistical properties of interest rate and stock-return processes in a Cox-Ingersoll-Ross[1985] (henceforth CIR) stochastic production economy. I also discuss the nature of the risk associated with these fluctuations and the form of the optimally chosen portfolios to hedge this risk. Finally I draw some relationships between the speed of learning and the ability of the

mode. to replicate three stylized facts about stock-market returns.

A ‘regime-switch’ is said to happen when the average productivities of two different sets of assets in the economy are reversed. The switching occurs due to a Poisson process. We assume that the switching is unobserved and that the total output from each asset which is the sum of the average productivity and ‘noise’ is observed. In contrast :o learning models based on Gaussian distributions of the the underlying state variables and ‘noise’, the updating process here exhibits stochastically fluctuating conditional variance. The regime-switch defined here is to be distuingshed from that of Hamilton[{1991], where a switch is a change in the rate of growth of ouptput in an

1 The author is a staff economist in the Division of International Finance. This paper represents the views of the author and should not be interpreted as reflecting the views of the Board of Goverr ors of the Federal Reserve System or other members of its staff. I thank the members of the Internztional Finance Division and specially those in Trade and Financial Studies for helping me make a smooth transition. A large part of this paper is from my PhD dissertation written at UCLA. The fir.ancial support of the National Science Foundation is gratefully acknowledged. I am grateful to Michael Brennan, Bryan Ellickson, Jon Faust, Robert Jones, David Levine, Thomas Mountford, Simon Potter, Keunkwan Ryu, Kwanho Shin and Bill Zame for helpful conversations. I thank David Senouf and Ronald Fedkiw for clarifying some issues in numerical analysis and Yun Sun for research

assistance. I am specially grateful to Joe Ostroy for his guidance, patience and support.

economy. Both here and in Hamilton’s model the regime switching is described by a 2-state Markov chain. However, the implications of a regime-switch on output growth in this model depend on the speed of learning about these switches, the expected-time to retain certain levels of confidence and the resulting portfolio choices. Further, due to unobservability of these switches there are effectively a continuum of states for the decision maker, a state being identified by the belief of the agent regarding the

current regime.

The inference process is described as follows. The agent receives signals coritinuously from a source which jumps ‘infrequently’ between two values. I emphasise that this is different from the assumption used in the Kalman Filtering problem, Jazwinski[1970], and its variants, that the source moves according to a diffusion process. | find that the updating process follows a diffusion process and calculate its infir.itesimal parameters. This implies that the updating process has continuous sample paths even though the source follows a jump process. Furthermore the updating process satisfies certain regularity conditions which enable us to characterize its stationary density and several of its path properties. I show that the dynamics of the conditional mean of the agent’s estimate are very similar to that of the Kalman filtering problem. However as opposed to the Kalman Filtering problem, the dynamics of the conditional variance are stochastic and subject to cyclical fluctuations. This rnakes

my model particularly suitable for studying the effects of fluctuating confidence.

It is somewhat surprising that the updating process has continuous sample paths even though the drift may jump by a large amount, and the process is observed continuously. The increments of a diffusion process are the sum of a term which is proportional to the drift and the the length of the observation period, and a noise term whose increments are distributed like a Brownian motion. The standard deviation of the increments in the noise term are proportional to the square-root of the observation interval. Over intervals of small length, the drift has negligible effect while the ariount

of noise is of a larger magnitude. So over very small intervals the increments of the

observed process are not informative and do not lead to large changes in estimates.

A model of Non-Gaussian learning has been worked out by Detemple[1991]. In that model it is assumed that the prior distribution is Non-Gaussian. In that economy too the conditional variance process is stochastic. The mean and a set of sufficient of statistics for the conditional variance characterize the updated distribution. In the model presented the mean and variance of the distribution are completely characterized by a single parameter. This is completely analagous to the static binomial distribution. This parsimony allows us to make precise several properties of the updating process including the stationary distribution of the updated priors, the boundary behaviour of the process and the amount of time spent in various regions of the state space characterized by hitting times. Further we are able to solve for optimal decision

rules and equilibrium rates of return in a one-factor CIR model.

I point to three stylized facts about stock-market returns. These are the observed kurtosis and skewness of excess-returns and the asymmetric feed-back effect of excessreturns on stock-market volatility. A number of time-series models have been written to replicate these features and to estimate the strengths of these effects. Without attempting an exhaustive survey of the literature I refer readers for a vivid description of the facts to Black{1976] and to the the models of Nelson{1991] and Campbell and Hentschel{1992]. These models are generalizations of ARCH and ARCH-M models (Bollerslev, Engle and Woolridge{1985] and Engle, Lilien and Robins[1987]) which documented the autoregressive fluctuating volatility of stock-returns. In the model economy presented here fluctuations in returns and their feedback to conditional variance arise due to unobservability of the regime shifts. The inherent inertia in Bayesian learning and changing portfolio choices account for the changing means and

the autocorrelation in the volatility of return processes,

I show that in an economy with the same fundamentals and observable regime shifts the interest rate and volatility of returns is constant and the three features

of stock returns cannot be replicated. The ability of the model with unobservable

regime-shifts to replicate these features depends on the parameters of the model chosen. In models where the difference in drifts is large, the level of noise is low and regime switches are less frequent, the agent spends a large proportion of time in states with high confidence regarding his knowledge of the current regime. I call this the ‘Fast Learning’ model and the one with the opposite set of parameters the ‘Slow Learning’ model. I find that the Slow Learning model generates returns which exhibit excess- kurtosis (fatter tails than the Normal Distribution) and that the Fast Learning generates negative- skewness. A model with an intermediate speed of learning is needed to replicate both features simultaneously. I also find that the Fast Learning model generates a negative relationship between realized excess-returns and future increases in volatility (as found in the data) and that the relationship is not so clear

in the Slow Learning model.

Before carrying out the analysis I refer to some empirical evidence for ‘reallocative’ shocks, i.e. shocks which create a desire to move resources between ‘sectors’. The literature has developed since the Sectoral Shifts Hypothesis of Lilien[1982], who argued that employment fluctuations in the U.S. economy can be explained due to shocks which unevenly affect the productivities of different sectors in the U.S. economy. The exact definition of a ‘sector’ has been a subject of debate. I do not attempt an exhaustive survey of the literature . Loungani, Rush and Tave[1990] created a dispersion measure and found evidence of reallocative shocks between 60 industrial indeces constructed by Standard and Poor. Davis and Haltiwanger{1992] argue that the effects of these shocks is experienced at the plant rather than the industry level. The literature is still evolving yet supportive of the Reallocative Shock Hypotheis at different levels of disaggreation. I find it satisfactory to assume for now that the effects of asymmetric shocks to different industries may be a useful paradigm for

understanding various features of stock-market returns.

The plan for the rest of this paper is as follows. In Section 2 the structure of

the two models is presented. In Section 3 I solve the model under the assumption

that regime-switches are observed. In Section 4 the filtering problem associated with unobserved regime-switches is solved. In Section 5 the model with unobserved regimeswitches is analyzed and the nature of optimally chosen portfolios is discussed. In Section 6 some results from the numerical evaluation of the model and statistics from simulations are presented. The success of the model in its ability to replicate stylized facts about stock-market returns is evaluated in Section 7. The conclusion is

in Section 8.

2 Structure of the Models

In this section we introduce the main features of the model. These are closely related

to those in CIR{1985].

Feature 1. Single Good There is a single physical good which may be allocated to

consumption or investment. All values are expressed in terms of units of this good.

Feature 2. Production Technology Production possibilities consist of two linear activities. The transformation of an investment of 3; amount of the good in the ith production process, is governed by a stochastic differential equation of the form

db _ Bi

for 1 = 1,2. ¢; are independent S.B.M.’s ( Simple Brownian Motions ),! where

ai(Z,) dt +o - dit (1)

ay(21) = 2

a2(z4)=a+b—- xz a>b

(1) specifies the growth of an initial investment when the output of each process is continually reinvested in that same process. The production processes have stochastic constant returns to scale in the sense that the distribution of the rate of return on an investment in any process is independent of the scale of the investment. The drift rates of the processes are determined by a state variable zt, which accounts for random productivity switches between technology | and 2.

‘A real valued process ¢; is a S.B.M. if (7) ¢; has continous sample paths with probability one

(ii) ¢; has independent increments and (itt) Cis — Cit has a normal distribution with mean zero and

variance s — t.

Feature 3. Productivity Switches The dynamics of the state variable z, which affects the drift rate of the production technologies is given by the Poisson stochastic

differential equation

dz, =(b~a) (1-212 = 9). gg, (2)

a ‘> b, q is a Poisson Process i.e., gt4+-at — qt = 0 with probability 1 — AAt + o(At)?.

qt4.at — 4 = 1 with probability AAt + o(At)

Gt4-at — Yt = n,n = 2 with probability o( At)

We call (2) The Transition Equation The Transition Equation implies that the unobserved state variable z; can take only the two values a and 6. It switches between the two with Poisson increments. The probability of no switches in a ‘small’ interval

of time is 1 — Adt, of one switch Adt, and of two or more switches is ‘negligible’.

Feature 4. Representative Consumer There is a representative consumer in

the economy. He seeks to maximize an objective function of the form:

Ex” exp(—p- (s ~ t))U{C. ds] (3)

In (3), &; is an expectations operator conditional on the current state of the economy. C; is the consumption flow at time t. Throughout this paper we shall assume that

Cr U[C;] = > (4)

(4) implies that the consumer’s utility function exhibits constant relative risk-

aversion with a risk-aversion coefficient of 1 — y.

*We shall use the standard notation, z(At) = o( At) if limai—o A) =0

Feature 5. Two Industries Investment is done through competitive valuemaximizing firms in two different industries. Firms in each industry have access to one of the technologies mentioned above. There is free entry of firms within each

industry.

Feature 6. Financial Assets In Zero Net Supply There is a market for instantaneous borrowing and lending at an interest rate r. The market clearing rate is the rate at which lending is at zero net supply. The market clearing rate is determined

as part of the competitive equilibrium of the economy.

Feature 7. Continous Decision Making Physical investment and tracing in real and financial claims take place continously in time. Trading takes place enly at

equilibrium prices.

3. Model 0. Observable Regime Switches

I first solve the stochastic control problem associated with the social-planning problem. I then define equilibrium in the economy, decentralize the decision-making and

provide explicit expressions for the risk-free rate and the excess-returns.

n ‘this model, I shall assume that the state variable z, is observed by the representative agent. With these assumptions the firm faces, in the terminology introduced by Merton[1973], a constant opportunity set . At any moment of time, there exist two assets with average rates of return of a and b respectively. Since the chance of 2, switching from a to b in a small interval of time At is \- At and asset choices can be revisec. after this small span of time, the firms expected payoffs are unaffected up to an order o(At) by the potential switch. In this situation the only risk in the firm’s return is due to the noise in the payoff of each technology. The aggregate wealth

dynamics are given by 7

Proposition 1 When regime-switches are » observed the value function of the socialplanner’s problem is of the form

J[W.,t} = exept) A-— (5)

where A is a constant determined by the parameters of the problem. Optimal portfolio

choices are constant and satisfy

~~ il (e-4)) 1 = 975 o*(1—7) _ il (=a) m2 = 973 R079) (6)

The P-oof is in Appendix 0. The analysis is completely straightforward and can be

found in for example Ingersoll[1988].

Comments on Proposition 1 The value function does not depend on the state

z;. Both states offer the same opportunities to invest and grow. The portfolio choices

reflect the desire to hedge noise, the only source of risk in this model. The portfolio is more diversified when the gap in average productivities a — b is smaller, the level

of noise is higher and the coefficient of risk-aversion is higher.

Decentralization Investment is done through competitve value maximizing firms. We recall there are three types of firms, each with access to one production process. With free-entry within each industry and stochastic constant returns to scale, there is no incentive for firms to enter or leave the industry if and only if the returns on the shares of each firm (the rate at which it can acquire capital) are identical to the technologically determined physical returns on that process. The equilibrium scale of each industry would then be determined by the supply of investment to that industry. Let wo be the share of his wealth allocated to riskless borrowing /lending.

The agent’s wealth dynamics at t are, Wisa; =

; 2 [W, — C,Ad] - [Do wal + ai(a)At + cAG+) + Wor(1 + r:At)] + o( At) (7) i=l where tj > 0 , wor + [22., Wi] = 1 and wo; can take either sign. Definition 1 An equilibrium is defined as a set of stochastic processes (Ci, tit, 72) satisfying the first order conditions (38), (39) of the social-planning problem in Appendiz 0, and the market clearing conditions wi > 0 and [S~?_, wu] = 1 and ty = 0

The excess returns in the economy are the difference in the rates of return on optimally invested wealth and the riskless security. Ie. the excess returns equal Diet Oi(2) Wit — Te.

Corollary 1 The excess returns in the observable economy are constant and equal

(1—)- (X21 w?)- 0, where w; are the choices in Proposition 1.

The proof is in Appendix 0.

4 The Filtering Problem

The filtering problem is described by two equations. The first describes the dynamics of an unobserved state variable and is called the Transition Equation.. The second, describes the composition of a variable which is observed. It consists of the sum of a drift term which is determined by the state variable and a noise term, which has increments distributed like a Simple Brownian Motion. It is called the Observation

Equation. Both equations fall under the class of Ito stochastic differential equations.

THE ‘TRANSITION EQUATION

dz, = (6— a) (1-19). ay, (8)

a > 6b, q is a Poisson Process i.e.,

0 with probability 1 — AAt + o(At)?.

Gt+At — 4

G+At — Gt = n,n > 2 with probability o( At)

The Transition Equation implies that the unobserved state variable z, can take only the two values a and 6. It switches between the two with Poisson increments. The probability of no switches in a ‘small’ interval of time is 1 — Adt, of 1 switch Adt,

and of two or more switches is ‘negligible’.

THE OBSERVATION EQUATION

o > 0, m 1s a Simple Brownian Motion i.e., if t),t2,t3,... is any sequence of times,

then

3We shall use the standard notation, z(At) = o( At) if lima;_.o A) =0

(i) the increments 7,,, — 2; are independent.

(ii) 7,4; — Me, 1s distributed Normal with mean zero and variance t;4, — ¢;.

The Observation Equation falls under the class of Ito Processes which we define below. Definition 2 (Ito Process) * Let (Q,F,P) be a complete probability space, with (Fi,t € T) being a right continuous filtration defined on it. Let (m,Fi,€ T) bea Brownian motion process. The continuous random process (X;,F;,t € T) is called an Ito Process (relative to the Brownian motion process (W:;, F,,t € T) if there

exist two nonanticipative Fy-measurable random processes a;(w) and b,(w) satisfying

for eacht ET [ |a,(w)|ds < 00 (a.s.) (10)

t [ |b,(w)[2ds < oo (a.s.) (11) 0 with b,(w) being left continuous, and if, with probability 1, X;,(w) satisfies the integral equation

X,(w) = Xo(w) + [ a,(w)ds + [ b,(w)dn, teT

or equivalently its stochastic differential equation (S.D.E)representation as

dX, (ui) = a,(w)dt + bi(w)dn,, teT (12)

Since z; switches by Poisson increments, its paths are piece- wise continuous and 10 is satisfied. Also, for the Observation Equation 11 is trivially satisfied (b; is a nondeterministic and constant process) and hence the integral form of the Observation

Equation is an Ito Process.

THE PROBLEM Let F; be the o-field generated by the sample path (8; )o<r<t. We

are interested in characterizing the infinitesimal dynamics of 7, = Prob[z, = a Fi).

‘Definition 6.2.1 in Krishnan[1984]

Theorem 1 7; 1s the solution of the Ito stochastic differential equation “dm =(1—2+%)- Adt | (13)

Ra td OO) aa, — (am + 8° (1 — m)) + de] To prove the Theorem we shall find limas_.o[t4a1— m,|]. The analytical expression for [71--at — 7] is found in lemmas 1 to 3. We state the lemmas below but leave the proofs for the appendix. Since both the Normal and Poisson distributions satisfy a temporal homogeneity property ( Chung and Williams([1990] ) , the analytical expression does not depend on the length At. The limiting expression gives the dynamics of the inference process when no information is received between t and t+ At, and At tends

to zero.

Lemma 1 Let L,, Ly be the densities of observable increments AB, = Bisar — 2B: conditional on the drift of the output being a,b respectively. Then

(a= 8) (AB ~b- AY, | Ay

L,=Ly-(1+ = Lemma 2 tsa: =

m,-(1—AAt)- La + (1 — 7) AAt- L, me (1 — AAL)- La + (1 ~ me) AAC+ Ly + me AAt- La + (1 — m) + (1 — At) - Ly +o At)

Lemma 3

Trepat — TM = (1-—2-m)-AAt + ™,-(1—7)-(a— 6)

(AB, ~ (a+, + b+ (1 — m)) - At] + o( At)

Lerama | calculates the difference of the densities of the Observation Process arising from the two possible regimes. Lemma 2 is mostly an expression of Bayes Law, with

a tiny twist explained in its proof. Lemma 3 simply completes the algebra.

REMARKS ON THEOREM 1 The Ito S.D.E. is the sum of two components. The first (1 — 2-7) - Adt we call a mean-reverting component. It is an adjustment to accommodate for the constant unobserved Poisson switching probability. The second is a product of two terms. mam) or?) which we call the information weighting term and [d@, — (a-™ + 6-(1 — m))- dt] = dy the innovations component, because it contains new information. We notice that (a- m+ 6-(1—7m)) = E7*z,(w). Result 1, below, from filtering theory is used to show that the process vy; is a Brownian Motion with respect to (F;) and that the S.D.E. 13 can be written ‘in the form

dy(w) = pur( (we) )dt + o74(me(w))dve(w) (14)

In this form the infinitesimal coefficients of the S.D.E. are functions of 7; only. Theorem | only states that the updating process satisfies the S.D.E. 13. It will be useful in defining the updating process as a diffusion process which arises as a solution to a §$.D.E.. Result 2 below provides conditions on the coefficients of the $.D.E. (14) to have a unique solution. Result 3 states that under the same conditions of the coefficients the coefficients of the S.D.E. 14 are the infinitesimal mean ana variance of the process. We will then use results from theory for diffusion processes to provide

several interesting properties of the updating process in Subsection 3.

Result 1° Let {X,,B,,t € T} be an Ito process defined on a complete probability space (Q,F, P) represented by the S.D.E.

where / Ela,(w)|dt < 00 teT

with B, being the o-field generated by (a,,W,,s <t,t ET). Let {F,,t € T} be the right continuous o-field generated by {X,,s < t,t € T}withF, C B, C F, und define

°Theorem 8.1.1 Krishnan[1984]

a functional ar(X1(w)) = E7*a,(w)

Ther, the innovations process (; given by

ee.

is an F,-measurable Brownian motion process, and the Ito process (X;) has a S.D.E.

representation,

As discussed earlier, the process (9) is an Ito process of the form (12) and that (a- x, +6b-(1—™)) = E7*z,(w). Hence, (dB, — (a+ + 6- (1 — m,))- dt] = dy, is ar. “innovations” process in the sense of Result 1 and the process (13) has the

representation

-(l—m)-(a—b dts = (12m) ade EE OW), (16)

To ascertain that a solution to the S.D.E (16) exists and that the solution is a

diffusion process (defined below) we shall use a result from Karlin and Taylor({1982].

Definition 3 ° A continuous time parameter stochastic process which possesses the

(strong) Markov property and for which the sample paths X; are (a.s.) continuous

functions of t is called a diffusion process.

Definition 4 (Growth Condition) The coefficients u(x,t) and o(7,t) satisfy the growth

conaition tf there exists a constant K independent of t and x such that

u(r, t) +07(n,t) < K(1 47°)

5f-om Karlin and Taylor(1982], Chapter 15

Definition 5 (Lipschitz Condition) The coefficients y(z,t) and o(z,t) satisfy the

Lipschitz Condition if there exists a constant L independent of t and x such that Jm(z,t) ~ wy, t)| + |o(2,t) — o(y,t)| < Liz — y|

Result 2 * Let y(x,t) and o(x,t) satisfy the growth and Lipshitz conditions defined

above. Then there exists a unique solution to the S.D.E. (14) as a continucus process.

It is easy to check that the coefficients of (16) satisfy the growth and Lipshitz condi-

tions and therefore the conclusions of Result 2 hold.

Result 3 ° Let m, be the diffusion that arises as a solution of the S.D.E. (14), where the coefficients u(x,t) and o(z,t) satisfy the growth and smoothness conditions. Then,

. 1 lim pe ltta — ™|t, = 2] = u(z,t)

. ol

Result 3 suggests the names ‘infinitesimal mean’ and ‘infinitesimal variance’ for the

coefficients (7,t) and o?(7,t) of the S.D.E. (16)

4.1 Properties Of The Updating Process

We shall briefly introduce some notation and terminology standard to the literature

of diffusion processes?

. We then focus our attention on three properties of the updating process. Finally we look at a numerical example which has been of interest in formulating a model of Business Cycles in David{1992b].

?Theorem 16.5, Chapter 15 Karlin and Taylor{1982]

“Theorem 16.6, Chapter 15, Karlin and Taylor{1982]

°This subsection relies heavily on the analysis in Chapter 15, Karlin and Taylor(19&2]

4.1.1 Terminology

Let u(z,t) and o(x,t) be the coefficients of a S.D.E. as in (14). Define,

= erpi-— ° u(g) s(2) = exp{— [2 de)

S(2) = ["stadn = [" expt fo Sae)an — ft se) oe)

m(xr) =

HEURISTICS: From a classical viewpoint 1/s(z) is an integrating factor for the

differential operator L defined by

L f(x) = w(x) f(a) + 50%) P"(2)

‘A modern view of the function s(x) is the following. Let / and r be the left and right boundaries of the diffusion process X;. Let u(x) = Prob(T,; < T,|Xo = 2), where T, is the hitting time to a. u(z) is the probability that the process hits / before hitting r, starting at x. It can be shown that u(r) = SD OESOE So the function S(z) can be used to rescale the state space (/,r) in terms of probabilities of achieving various

levels and is hence named the scale function. We note that the process Y, = S(X;)

has a linear scale function in that b—y Prob{T,(Y) < Ti(Y)|Yo = y} = ba —a

1.e. its hitting probabilities are proportional to actual distances. The modern and classical views are reconciled when we realize that u(z) satisfies the differential equa-

tion u(r) = 0,u(a) = 0,u(d) = 1.

The function m(z) is called the speed-density of the process. The name is moti-

vated by the fact that . 1

where T,, = min{T,,7,}. The function m(zx) can also be thought of as a measure of ‘volatility’ of the process at z. The functions introduced here will be used to classify

the diffusion process.

4.1.2 Boundary Classification

An entrance boundary is one that cannot be reached from the interior of the state space. It is possible to consider processes that start there. Such processes quickly

move to the interior never to return to the entrance boundary.

Let S{a, 6] and M{a, 6] be the Stieltjes measures induced on the state space by the functions s(x) and m(z) respectively, for example S{a,6] = [° s(x)dz. Let N(l) = fr S{n,z|dM(n) = f/ M(l, €JdS(€). N(0) roughly measures the time it takes to reach an interior point x in (/,r) starting at the boundary /. To show that a boundary / is entrance it suffices to establish '° that S(I,z] = 00 while N(l) < oo ( please note that it is sufficient to establish this for any point in the interior of the state space and so

the argument z is suppressed in the definition of N(J).

Property 1 (Entrance Boundary) 0 and 1 are entrance boundaries of the S.D.E. 7 as defined in (14)

We recall for the updating process ™, u(m,t) = (1 — 2m) -A and that o(m,t) = (rej Qn re) (on) Before proving the statement we argue that these parameters suggest that 0 and 1 are entrance boundaries. The infinitesimal mean of the process is of non-negligible size and pulls the process towards the center as the process moves close to its boundaries, while the infinitsimal variance declines to zero as the process

approaches either boundary. So the process never hits either boundary.

Proof We shall prove that S(0,z] = 00 and N(0) < co The proof for the other

boundary i.e. 1 is similar,because all the functions considered are symmetric about

10 Page 234 Karlin and Taylor{1982]

_ 7 Mie) s(x) = exp{— [ 2a)

r 2 _ exp{——— l

ab Ato) (17)

Since, Ina a aa = © S(0, 2] = 00.

coe K l N(Q) = erp a) a A (0) f ) roa ” ore — 0? cole l

<[° (aa expla dn) 1 d fear "Teas < | ——-_ QO < | Tera <= 4.1.3 Stationary Distribution

If it exists, a stationary density v(z) of a diffusion process X; necessarily satisfies

wy) =/

where pit, z,y) is the transition density function i.e., P(t,z,y) = Prob(X; < y|Xo =

w(x) - p(t, x, y)dzx Vt>0

r) and er eu) = p(t,z.y). It can be shown that the stationary density w(z) satisfies,

Oo a2 OVW) — Hew)

Solving the differential equation yields, ¥(z) = m(z)[C,S(z) + C)]

where C’, and C, are constants that guarantee that (x) = 0 on (1,r) and f7 ¥(E)dé = l

When / and r are both entrance boundaries ( 0 and 1 are both entrance boundaries for ™ ), then S(r) > co ag t + | or x + 1, as discussed in the previous sub-

subsection. In this case, C, is chosen equal to zero and the stationary density is

~ Frm(€)de (18)

where m(z) = mane as defined earlier.

Property 2 (Stationary Distribution) For the updating process 7; the stationary distribution w(x) is

v(t) = C1 exp a} + oop (19)

Proof Substitute s(z) from (17) into (18). O

The shape of the stationary distribution depends on the characteristics of the learning process which depends on the parameters A, 0? and (a—6)?. When 4 is large a ‘lot’ of switching occurs and so a relatively large amount of time is spent with x close to a + When o? is large the signal are noiser and again 7 spends a larger time around a 4. When (a — 6)? is large the diferrence in the drifts from the two regimes is large, so learning occurs faster and a relatively large amount of time is spent near

the boundaries.

4.1.4 Path Properties

So far we have characterized the infinitesimal dynamics of the learning p-ocess in

Theorem 1 and. esguety | ong: term: behavionr of the process as measured by the Soe

stationary distributign in We prévious ‘subsubsection. For some problems in decision

theory it is interesting to have estimates of the ‘intermediate-term’dynamics of t}

sample paths of the process. In this ‘sub:subsection, we estimate the expected time

to be spent in different ‘regions’ of the state space conditional on being in the region.

For example it might be important to calculate how long the agent expects to be in a region of ‘low-confidence’ which may be defined as 7, € (.4,.6) or the length of time the agent expects to be ‘confident’ in a regime as measured by the time spent in m, € (.8,1).

Define v(x) = E[T.»|Xo = z] z € (a,b). v(x) is the length of time the agent expects to be in (a,6) conditional on being at x. It can be shown !! that v(x) is the

solution of the differential equation [v(z)=-1 v(a) = v(b) = 0

The solution to this is

v(z) =

24u(.a,b)- [ 1S(b) ~ S(Ehm(€)dg + [1 ~ ule, 0,6)} [ES(E) — S(a)]m(é)de} (20)

where,

u(z,a.b) = Prob{T, < T,|Xo = x}

We give a numerical example to illustrate these properties next.

4.1.5 Example

Time is measured in ‘years’. \ = 2 which implies that there are approximately two switches in a year and a 9 percent chance of having a switch. a = .07 and 6 = —.045 and o =: .)2 The stationary distribution is almost U-shaped and is drawn in Figure 1. With these parameter values the process spends a relatively large proportion of times a the ends rather than in the middle. When 7, = .5, the expected time to be spent in the interval (.3,.7) = .09 years, roughly a month. Once the process hits the level .9, the expected time spent before it goes back to a level of .6 is .47 years.

The process moves out of regions faster when in the middle. This represents fairly

‘lage 192 Karlin and Taylor[1982]

quick learning. when the level of confidence reaches a low level. An agent in such an environment knows that when he reaches a level of middling beliefs, he expects to receive news relatively soon and move fairly quickly to a level of higher conf. dence. Once he reaches a level of high confidence i in one of the regimes, he expects to retain approximately the same beliefs for a relatively long period. In David(1992b], we find that the environment described here, causes cyclical investment patterns, if there i is some inflexibility regarding the agents investment choices. When all parameters are kept at these values, except that the level of noise is raised to .07, learning is slow and

the stationary distribution has most of its mass around .5. This is shown in Figure 2.

5 Comparison With The Kalman Filter

In this section we briefly discuss some similarities and differences in the assumptions and results between the Kalman Filtering problem, Jazwinski[1970] and the filter introduced in this paper. We will not write the Kalman filtering problem in its most general form, but instead concentrate on a few broad features which are apparent

from the simplified version discussed here.

The Kalman Filtering Problem As in the problem considered in Section 1,

the problem is described by the Transition and Observation equations.

THE TRANSITION EQUATION

dz, = fz + qd6, (21)

f > 0,4 > 0 and ¢, is a Standard Brownian Motion. The unobserved state variable z, follows a diffusion process. This is different from the transition described in (1), where the state variable could take on only two values and switched between the two with Poisson increments. The distribution of z at time zero is assumed

Gaussian with mean Zp and variance vp.

THE OBSERVATION EQUATION

a > 0 and 7 is a Simple Brownian Motion. This is identical to (2), the Observation

Equation for the filtering problem considered in this paper.

Let 2, = E”*(z,], where F, is the o-field generated by the sample path (8, )ocret.

Let vy, = Var7*[z].

Result 4 ( Results of the Kalman Filtering Problem)

di = f-aatt + Cap, — sat) (23) d b Sts (af tg?) (ety (24)

Remarks UPDATING THE CONDITIONAL MEAN The dynamics (23), of the conditional mean are analagous to the dynamics of dz described by (13). Notice that (23) may be decomposed into the sum of two parts. The first is a deterministic drift component and it equals E7*{dz,]. This is the counterpart of the ‘mean-reverting’

component of (13) which equalled E7*{dz;] in that context.

The second component is the product of two terms. +(d;— Zdt) is an innovations process as defined in (15), and is a F;-measurable Brownian Motion by Result 1. We recall that the updating rule (13) had a similar ‘innovations’ term.

botue is the weight give to the ‘innovation’ term in updating. This matches the

‘information weighting’ term in (13) which was mea) (anb) It is easy to show in both cases that the information weighting term equals oe The correspondence of the conditional mean dynamics is complete. This identification of the information weighting term also shows that the estimator 7, which we obtained from Bayesian

Updating, coincides with the optimal Least Squares estimator.

UPDATING THE CONDITIONAL VARIANCE The dynamics of the conditional variance are deterministic and this is reflected in the fact that (24) is an Ordinary Differential Equation. In particular, the path taken by the conditional variance is completely determined by the parameters of the problem and can be pre-computed. The solution

of (24) is!?

2 vu = vol +o? uf w= 0 1 vy, = 2wl{l — ————————————— a 0 25 ‘ 1+ Jon Vo exp)! f we (25)

2See for example Gennotte[1986].

where w = fo* — go. If w < 0, limo = 0. If w > 0, limyoo % = 2w. Intuitively, if the ratio of the speed of the drift to the strength of the signal as in the case w > 0, then the agent is always a step behind in his evaluation. Even asymptotically he does not get an exact estimation of the drift. The tracking problem in this paper also has

this property, since the drift can always jump.

The deterministic path of the conditional variance is a property particular to the Kalman Filtering problem. The deterministic dynamics of the conditional variance for the Kalman Filtering problem is based on a well known result on Normal Distributions in statistics. From (22) d3, and z, have a jointly Normal distribution, conditional on all information until until t. More precisely}? if (X,Y) has a nondegenerate NV [11), {l2, 07,03, p] distribution, then the conditional distribution of X given Y = yis N[u. + p2(y — w2),o7(1 — p?)], ie. the conditional variance of X given Y = y is a constant. Our model lacks the property that makes the Kalman Filter tractable. However. since both the conditional mean and conditional variance depend only on 7, we were able to write the updating as a diffusion process in one variable

and analyze its properties.

'3Theorem 1.4.2, Bickel and Doksum{[1976]

6 Model 1 Unobservable Regime Switches

In this model I shall assume that the state variable z, is unobserved. The agent observes the rates of return on both physical assets which are as in (1). The agent has beliefs about the underlving value of the state variable z;. I assume that he uses Bayes Law in updating his beliefs. The agent updates his belief by observing the difference in the rates of returns from the two investments. If the difference in the rates is positive his beliefs shift in the direction of asset 1 having the higher drift and vice versa. This filtering problem has been solved in Section 4. Unobservability of the regime switches lead to various degrees in ‘confidence’ for the representative agent. These can also be viewed as wealth or ‘prospect’ changes. When the agent is confident about the current regime, he can invest in two assets with similar expected returns, approximately the average productivity of the two assets in the economy. On the other hand if he is confident of the current regime, then one asset has high expected productivity and the agent can receive higher returns by investing mostly in that asset. Furthermore the expected time to be spent in different regions of the state space changes with changes in beliefs. This implies that these chanzes in confidence have payoff consequences for periods larger than an ‘instant’. So unlixe the situation in Model 0 where the agent faced a constant opportunity set, the agent here faces an entire spectrum of opportunity sets. The constantly changing ‘confiderce’ /

‘prospects’ introduces a class of risk, which we outline next.

Recall that the belief updating process is a diffusion process. It contains a disturbance term which has a standard deviation of the order o((At)?). In other words uncertainty about the future position of belief is ‘large’ relative to the length of the interval considered. Lets consider the case where the probability of asset 1 Laving drift a is greater than .5. In this case asset 1 promises a higher average return. When the total return of asset 1, which is the sum of the drift and noise , is high it increases

the updated probability of the regime being in favor of asset 1 When the return of

asset 2 is high it lowers this probability. Effectively, asset 1 pays off in states of greater wealth (or lower marginal utility ) and asset 2 in states of lower wealth (higher marginal utility) Therefore holdings in asset 2 help hedge the risk of wealth changes at this infinitesimal horizon. In particular if the the size of the infinitesimal movements is large, the agent can hedge the risk at the cost of lower expected returns.

This risk-hedging behaviour is illustrated in the portfolio choices in Corollary 2.

The Control Problem Let J(W,,71,t] be the value function of the socialplanning problem at time t, given the wealth and beliefs of the agent. The dynamics of the beliefs are given by (13) in Section 4. The updating depends on d3,, which depends on 2 the underlying productivity variable. (13) is not a Markov process in x. Therefore the consumer's decision problem with (13) defining the state variable dyiamics is not a Markov decision problem. However the identification of the ‘Innovations’ process as a Simple Brownian Motion on the filteration of the agent’s information, in Result | rectifies this. A similar result has been proved for models based oa the Kalman Filter such as Dothan and Feldman[1986], Gennotte{1986] and Sundaresan[1984].

Proposition 2 (Separation of Updating and Optimization) The consumer’s consumption and portfolio decision in the economy with unobservable regime-shifts would be the same as in an economy with an erogenous state variable which follows a diffusion

process (in particular Markov).

Proof }y Result 1, the statisical distribution of (16) is identical to that of (13) on

the filteration of the agent’s information. O.

Henveforth (16) shall be used to define the dynamics of beliefs. As for the previous

model the value function is separable in wealth, beliefs and other state variables.

J(Wi, 71, t] = exp(—pt)—4 T(r] (26)

Furthermore the optimal policy is independent of time and wealth and depend only

on beliefs.

Proposition 3 When regime-switches are unobserved I[7] satisfies the differential equation

0= max (4 (Ie) —p-I{r] + an wy =l,w,>0 Y

Im Yoailmjui + 5(y—1(Qwf)o? ) +

Tele] 22 + (wey = 2) -(2) (= 2) (0-8) ) 4

1 50° (Fe () (27)

The proof is in Appendix 2. The transversality condition for this model has not been explicitly discussed. If parameter values are chosen so that the condit:on is satisfied for Model 0, then it will be satisfied for this model. This is because for any strategy, the value for the unobservable case cannot be larger than for the observable

case.

Boundary Conditions. Its evident that the value function is symmetric about .5. Therefore it reaches a local minimum at .5 and we impose I'[.5] = 0. In Section 4 we showed in Property 1 that the belief process did not hit the boundaries ) and 1 although it got arbitraily close to them. As m approaches either boundary it is pulled inward with probability 1. I therefore use the Reflecting Boundary condition I'(1] = 0.

As in Model 0 the portfolio choices are partly determined by the desire to hedge noise. In this model the agent also attempts to hedge the risk due to wealth changes.

This lowers his demand for asset 1 and raises it for assets 2.

Corollary 2 When regime-switches are unobserved portfolio choices are made to hedge the risk associated with ‘fluctuating-confidence’. In the case of interior so-

lutions the portfolio choices are determined by

_1 ay(7) — a(t) | (w)-(1=7)-(a—8) 1, fr] wmi(™) = +50 o*(1 — 7) * 2-(1-7)-o I {x} 11 axle) ele) (x)=) (a8) lel wo(7) = 373 o*(1—y) 2-(1—y)-¢? [x] 28)

The ezcess-returns vary with the ‘level-of-confidence’ of the agent(s). and are

given by

erfr] = (wy wa)(a)1 = n\(a= FT 41 —a)o? Lou? (29)

The proof is in Appendix 2.

Just as in the observable case the agent diversifies the noise in the payoff from each asset. However, in this model the opportunity cost of diversification, the difference in the drifts depends on the beliefs of the agents varies with the agent’s beliefs. The 2nd term in the portfolio choices reflects the noise diversification. Further, the agent hedges the risk of the changing opportunity set as described in the introduction of this section. The last term in the first portfolio choice equation measures the reduction in hcldings of this asset, because its payoff has positive comovements with growth ‘prospects’.

The characterization of the risk-free rate in this model, the expected returns minus the excess-returns from Corollary 2, is formally identical to that in the Consumption- Based-Asset Pricing literature, for example Lucas{1978]. This follows from Theorem 1 CIR[1985], which only depends on the state variable following a diffusion process. | provide a proof which is similar to theirs, but is specialized to this model. It extends the CIR result to the case of unobservable productivity parameters. Let MU[r]

denose the marginal-utility of consumption, which by the envelope condtion equals

the marginal-utility of optimally invested wealth. Let rf [zm] be the risk-free raze when

the agents beliefs are 7. This is as calculated in Corollary 2. Then

Proposition 4

__ 1 AMU{[r] fel= 3 ote

The proof is in Appendix 2. From Corollary 2, the risk-free rate equals the best net-returns in value terms from any asset in the economy. The CCAPM literature prices a risk-free asset in zero net-supply so that its equilibrium rate equals minus the expected rate of change of the marginal rate of substitution. Propostion 4 shows

that at an optimum these two quantities are the same.

A VERY ROUGH UPPER BounD For THE Excess RETURNS IN THIS MODEL. A very rough bound on the size of (r)(1 — r)(a — 6) can be obtained from the solution of Model 0. Let A be the value when 7 is always .5 and A be the value when m 1s always 1. These numbers can be calculated from (44). Then the above quantity is less than _

25-(a—6): = (30)

This puts a bound on the excess returns arising out of unobservability. For example

ifa = .07, b= —.05, y = —1, o = .03, then the bound equals 0.015 or 1.5 %.

7 Fluctuating Confidence and Properties of Stock Market Returns

I state three stylized facts about stock-market data. The facts are well documented in the literature. Here I do not cite all the papers but direct the reader to Black[1976] for a description of the facts and to Campbell and Hentschel[1992] and Nelson{1991] for formal time-series models which measure these effects. To evaluate the capability of Model | these features, | solve the model numerically and then simulate sample paths of real and financial variables. Equation (27), the value function for the consumer’s

problem is solved using an implicit finite-difference method".

I show that with appropriate parameter choices, the stock-market return process of Model 1 is capable of replicating these features, while Model 0 the case with observable regime-switches is not. Illustrative pictures and informal explanations for the success of the model are provided. I also discuss to what extent models based on Kalman Filter learning models might replicate these facts. I do not argue that the model presented here is the only one capable of replicating these features. The idea is to examine the extent to which learning about the relative productivities of different

assets in the economy affects the qualitative properties of equlibrium returns. THREE STYLIZED FACTS

(2) Kurtosis or Fat-Tails of Excess-Returns. Large realizations of returns happen more often than consistent with normality. Using stock-returns monthly data from CRSP (1/26 - 12/88) and T.Bill Data from Ibbotson Associates, Campbell and

Hentschel[1992] report Excess-kurtosis of 6.82 in excess-returns.

(22) Skewness of Excess-Returns. Large negative returns are more common than large positive ones. Campbell and Hentschel report a skewness parameter of -0.443

for the above series.

14The program is written in Gauss. It is available upon request from the author.

(72) The Predictive Asymmetry of Stock-Market Volatility. Black{1976], Nelson{1991] and Campbell and Hentschel[1992] find a negative correlation between current returns and future returns volatility. Further, reactions to unfavorable news tend

to be larger than reactions to favorable events.

I start by showing that Model 0, the case of observable regime switches is incon-

sistent with these stylized facts.

Proposition 5 The conditional distribution of returns is unchanging in Model 0. The unconditional distribution of returns over any horizon (may be non-infinitesimal)

is Gaussian.

Proof. In Proposition 1 in Section 3 I showed that the fractions of resources invested in the high and low productivity is constant over time and independent of z the productivity variable and wealth. Therefore the statistical distribution of rate of return to optimally invested wealth is unchanging. The rate of return over horizon At therefore has a normal distribution, with a mean (w, -a + w2-b)- At and variance (wi + w)-o- At, where w; and we are given by (6). QO.

The success of Model 1 in generating these stylized facts depends on the parameter chosen. I distinguish between parameters which lead to a U-shaped staionary distribution of beliefs from one which lead to an inverse U-shape. The former is called the Fast and the latter Slow learning model. Numerical results from the two models

are presented in Figures 3.1 - 3.8 and 4.1 - 4.8 respectively which I discuss now.

Figures 3.1 and 4.1 show the empirical densities of beliefs for the two models respectively. The shapes are the same as the theoretical shapes discussed in Section 4. Figures 3.2 and 4.2 show the conditional variance of beliefs. The concitional variance is given by wii=n¥ (a) The shapes of the two curves are the same, :hough the absolute values differ. Figures 3.3 and 4.3 show the percentage of the portfolio allocated to the high productivity asset. The portfolio choices are as given in (28)

in Section 6. I discussed the motives to diversify in that section. The figures show

that the agent plunges less readily in the Slow Learning model. This reflects both the greater level of noise and the larger variance of the belief process. Figures 3.4 and 4.4 give the risk-free rate in the two situtions as calculated in Section 6. The interest rate in both models is lowest when confidence is lowest, i.e. = .O, This reflects because in this state beliefs are most volatile. This leads to a large volatility of the consumption process and by Proposition 4 a low risk-free rate. Another way of interpreting this is that the risk-free rate is a measure of the opportunity cost of lending. Notice that the agent completely diversifies under these conditions and effectively faces a market

with poor prospects.

Figures 3. 5 and 4. 5 show the conditional ¢ excess-returns in the two models. Figures 3.6 and 4. 6 show the expected market returns. The market returns reflect the fact ‘that wien the agent 1s confident of the current regime and therefore allocates his asset in the high- productivity asset, he recieves high expected returns. In the Slow learning case the agent waits to get more confident before he plunges and correspondingly the expected rate of returns rises at a | slower rate than 1 in the Fast learning case. The excess returns reflect that the risk j in the market portfolio does essentially i increases as the agent becomes more confident. The relationship i is not monotonic however. As beliefs approach | 0 or 1, the agent has already plunged and taken the maximum risk with respect to noise. However, because the volatility of the belief process declines, the risk due to “prospect” changes as discussed in Section 4 also declines. So the

excess-returns due to this effect taper off as 7 approaches either 0 or 1.

Figures 3.7 and 4.7 show the relationship between the speed of learning and the negative skewness and excess-kurtosis of unconditional realized excess-returns. The pictures reveal that the Fast Learning model leads to. negative skweness while the Slow Learning model leads to eXxcess- kurtosis. Further results on these statistics are

“given it in Table 1. In the Table ] consider various sets of parameters and watch the “effects of changing only the level of noise. A large level of noise gives slow learning.

The same finding as the pictures are also found for other parameter choices. Besides

the Z-values show that the statistics are statistically significant.

The relationship between the speed of learning and the potential negative skewness or excess-kurtosis of the unconditional distribution is as follows. The ‘eturns from the two assets over any given time horizon are given by normal distributions with different means. In the slow learning case, the agent spends a large amount of times in regimes with low confidence. Because his beliefs are rational, this means that essentially he is getting returns from the high and low mean asset with about equal probability. Therefore the distribution of returns is obtained from norraal distributions with different means!> The resulting distribution has fatter tails than a Normal density. In the Fast Learning case, the agent spends a large proportion of time with high confidence regarding his knowledge of the current regime. In this case he allocates his investment to the high productivity asset. Because his beliefs are rational, he gets returns from a Normal density with a high mean most of the time and an occasional small return from the smaller mean density. This makes his distribution negatively skewed. In Figure 5, the approximate modal positions in the

Fast and Slow Learning cases is depicted.

Figures 3.8 and 4.8 show the conditional variance of realized returns for different levels of the belief variable. Again the pattern for the Slow and Fast learning cases are different. For the Fast learning model the conditional variance of returns is highest when the agent is least confident or 7 = .5. For the Slow learning model the opposite is true. The reason this happens is due to the relative impor:ance of noise and different drifts as well as the portfolio choices in the two models. Over any horizon, the variation in returns is caused by two different sources of fluctuations. (1) The difference in the drifts of the assets aud (2) the amount of unhedged noise. In both models when confidence is low, the agent choses diversified portfolios. Therefore the noise in the assets is hedged against. Returns are generated with close to equal

151f the means are very far apart, the unconditional distribution will be bi-modal. For the param- —

eter choices which I made the distribution has a single mode.

probabilites with either drift a or drift 6. The variance due to (2) is relatively large. When confidence is high, the returns are generated by the same drift with a high probability. Therefore the variation due to (2) is low. Most of the variation in returns is caused by (1), because noise is unhedged in high confidence states. In the Slow learning model, the level of noise is large relative to the difference in drifts. Therefore variance of the realized returns is smallest at = .5 and largest when 7 is

near 0 or 1. In the Fast learning model the opposite holds.

The connection to stylized fact (222) is as follows. In the Fast learning model, the agent is mostly in a state of high confidence and low conditional variance of returns. A large negative realized returns leads to a a loss of confidence and his beliefs move closer to .5 where realized returns exhibit higher conditional variance. There ‘s therefore a negative correlation between realzied returns and future changes in volatility. Further a large positive returns, merely confirms the agents’ belief regarding the underlying regime and leads to small revisions. Therefore negative returns have lager effects in absolute value. Things are not so clear for the Slow learning case. Here the conditional variance of realized returns is high when the agent is confident. Therefore the relationship between realized returns and future changes in volatility is in contradiction with the stylized fact for the Slow learning

Case.

In Dassing | compare the success of my model to the explanation put forward by Clark[1973] regarding the kurtosis of returns. Clark showed that a process of returns which was ‘Subordinated’ to the Normal distribution could explain the kurtosis of returns. The subordinated process has a constant conditional mean but a varying condtional variance. The fact that large conditonal variance is followed by large conditional variance is enough to generate fat-tails. He motivated the fluctuating conditional variance by differing amount of trading volume in different periods. Large trades are motivated by differing ‘news’ recieved by traders as well as other

idiosyr.cratic factors. However, the conditional mean of returns was constant. My

explanation, has a clearer expostion of the ‘news arrival’ process, with the updating explicitly described. The results are driven by changing conditional means as well as changing conditional variance of returns both of which leads to differential portfolio

choices. Further my model is also consistent with the other stylized facts discussed.

8 Conclusion

I have constructed a Cox-Ingersoll-Ross model which is capable of replicating three stylzed facts about the U.S. stock-market. The model is built around the dynamics of the beliefs of agents in the economy who are tracking unobserved regime-switches. | find that parameter values which permit faster learning are better able to replicate the observed stylized facts. The paper complements the Conditional Heteroskedasticity literature which argues that several U.S. economic time series are best described by ARCH and GARCH models and their variants. The analysis here has the added advantage of being in a General Equilibrium framework. The required persistence

properties of stock-returns are inherited from the inertia in Bayesian Updating.

Appendix 0 Proof of Proposition 1.

The aggregate wealth dynamics are given by

2 dp; Werac = (Wi ~ CAd [i wall + F*)] + At)

t=1

+(\- At): 3 wit(a;(a + 6 — %)At + ¢AC,)] + o( At)

1=1

Therefore, AW, = —C,At +[W —C,At] - DS wie(a,(z,)At + cAC,,)] + o( At) 1=1 therefore. E,[AW,] = —C,At + W, - oS wa; (21) At} + o( At) i=1 2 Var [AW,] = W? - [D> w3] - o? At + o( At) i=l E,[(AW,)*] = o( At) for k > 2.

J[W, z,t] the derived utility of wealth function satisfies

1=1

= max s.t[>?_ 1 wit]=1

(31)

(32)

(33)

(35)

(36)

w? 2 . | + Jww- [Do wi)o? + o( At)] (37) . 1=1 The first order conditions are Uc = Jw (38)

for 2 = 1,2,3. \° is the Lagrange multiplier associated with the constraint wit = 1. The complementary slackness condition is

Jw WO a,(2)wa] + JwwW?[S- w2 Jo? = » wi)A° = d° (40)

As noted at the beginning of Section 3, the model has a constant opportunity set i.e. even though the productivities of activity 1 and 2 change over time, there is always one with an average productivity of a and one with an average productivity of 6. This implies that the production decision may depend on the wealth at time t, but not on the value of t. Also with a power utility fuction and opportunities to scale up or down wealth proportionately that optimal decision rules are independent

of wealth. We now guess and verify that the value function is of the form

Ww J(W,, t] = exp(—pt) - A» — (41)

Y With this guess J, = —pJ , Jw = a and Jww = ay Subcripts of J denote partial derivatives. Substituting these into the first order conditions (39) gives us the

conditions

9 WIA and equality holding whenever w,;, > 0 for 7 = 1,2,3 which imply that the portfolio

a(24) + (y — l)wyo? <

(42)

choices w,, are independent'of time. From now on we shall avoid the time subscript t.

Let asset 1 have productivity a and asset, 2 have productivity 6 with the understanding

that the choices given below are for the case z = a and that w,; and w2 would be

reversed when z = b. We explicitly write the portfolio choices in the case of interior

solutions. 1 1 (a—6) m= ot Bay 1 1 (b—a) ™ = 949 Ray “)

From (38) we get C = ATW. Substituting the optimal choices of consumption

and portfolio shares and the expressions for the partial derivatives into (37) yields

1—- 1

which implies that

A=[T(C+ Sloe, ax(a)] + (7 1)o? owt) (44)

The transversality condition!® for this problem is that 2 2 p > max{0,7 DI [w;a;(a —1)o? S"[w?}} Qo i=l 1=1

Proof of Corollary 1. The first order conditions for the C, w. and wo are

same as before. The condtion for wg is JwWr = \° (45)

d° is the Lagrange muliplier associated with the constraint 72, wz = 1. Let J*(|W,, t] be the value to the social planning problem, solved in the previous subsection \*° be the Lagrange multiplier associated with the constraint [7?_, wy] =: 1 , w%, the

portfolio shares in the two industries, and C7 the consumption flow rate chosen by

‘©The transversality condition rules out strategies which make the value function unbounded.

the social-planner. Then we claim that C, = Cl, Wie = wi, and r, = we Tey constitute

a competitive equilibrium. Inspection of (45) reveals that Wo; = 0 is optimal, so there

is no borrowing / lending. Confirmation of the other conditions is straightforward.

Substituting the value of r, into the complementary slackness condition and col-

lecting terms implies 2 2 [Do ax( 21) Wi] — he] = Www (D2 2 (46) i=] t=1

For the Power Utility function and the resulting value function the right-hand side

of (46) is (1 — (Xi wi. O

Appendix 1 Lemma 1 Let L,, Li be the densities of observable increments AB, = isar — 2:

conditional on the drift of the output being a,b respectively. Then , (a — b)- (AB, — b- At)

Ly = Ly: [1+ : J+ of At) oO Proof 1 = Ly = = - - expl——— - (AB — b- At)? ‘oat Gane lami O% y) Ly =

+ o(At) = L, els -[(a —b)?- (At)? —2- (a —b)- At: (AB, — b- At)] + o(At)

. 2 since, e=l+r+ Ft...

Ly =

Ly (+ OTA) (AB = PAL) (a8)? | (a 8)?

ao? 2-07 2-a4

“(AB,)"]

+ o(At)

but (AZ,)*? = o7 At a.s., since the Quadratic variation process (Chung and Will-

iams(1990]) of a diffusion process is indistinguishable from {o?-t,t € R,}. Therefore,

(a — b)- (AB, — bAt)

L,=l,-(1+ ]+ o(At)o

Lemma 2

TMtep at =

te (1 = AAL)- La + (1— 7) AAt- Ly + AAt- L, + (1—™)- (1 — AAL)- Ly

+ol At)

Proof By Bayes Law,

T+ at =

m-(1—AAt)- La +(1— 7) AAt- Ly vy-(1—AAt)- La + (1—m)-AAt- Ly +m AAL- Ly + (1 —7,)-(—- AAD) Ly (47)

+ o(At)

We have used the property of the Poisson process that one switch occurs with probability AAt+ — o( At) no switches with probability (1—- AAt)+ — 0(At) and n 2: 2 switches with probability o(At). Please recall from lemma | that L, and L,

are conditional densities.

Now in (47) interchange L, and Ly, whenever they are multiplied by the AA¢ term, but not when they are multiplied by the (1 — AAt) term. This gives an error

of size o(At), since L, — L, = o((At)?) by lemma 1. O

Lemma 3

Tee At 7 Tt =

t+ (1 —7)-(a—b)

-[AB, — (a-m +6-(1 —™))- At]

+ o(At) Proof By lemma 2,

Ttpat =

m+ (1 — AAt) + G2 + (1 — mm) - AAL my (1— AAt)- FE + (1 — m1) AAL+ me AAT. Fe + (1 — m4) > (1 — AA)

+0(At) Tar =

m > (1 — AAt) #2 + (1 — 7m) - AAt

o(At)

substituting for ia from Lemma 1,

me (1 — AAt) + [1 + GPHAB 89) 4] — a) AAt l— Th: [oP Bet)

+ o(At)

since ~~ =1l+a¢+a2’+...

TMtt at = ‘(a—6)- _ 2.(4 —ph\2. 42. (Numerator) - [1 — m(a~ 5) (hr — bAL) 4 milan bye Ab o a + o(At) = (Numerator): m:(a—b)-A 2. (q —b)?—m,-(b—a)- b)At pA AG (HP (a= Pom (ba) BAL gg g oO + o(At) Numerator = NDE = 2-my AME my te [ELE OD, + o(At)

Numerator =

T™,:(b—a)-b

t:(a—b re gg OS oy yom ala (49)

Te + + o(At)

substituting (49) into (48),

Ttpat = m,:-(b-—a)- AR, 1?-(b—a)?—m,-(b—a)-6 ug SOR e) DA Oro oe OW) bag x fr, — ED agp OK) oA Ly Go aag + o(At) a—b -(b—a)-b =n, +o” ie ag, 4 toa) eo) At 20 (h 2. (ha)? $A-(L=2-m)dt¢ ERD ag ea)" A, o o +n} (a —6)?At — 1? -(b—a)- bAt + o(At) (l—m)-(a—b app EO) ng 4 (em) dt 2 2 tm *(l—™):(a—6?) m-(b—a)-b-(1—™) re a eG +o(4\t) collecting terms, Mpa: — Tt = -(l—m)-(a—b (1-2-m)-\At 4 EE ( * 6 ) (AB, —(a+m +b (1 —m)) Ad + o( At)O

Appendix 2

Proof of Propostion 3 The Ist moment of AW, is analagous to (34) with z;

replaced by 7m, its expected value. The 2nd moment and kth moments are the same

as (35) and (36). In addition, from (16) and because the agent takes the diterence

in output from the two industries to update

CovjAW, Ar] = W - (w; — wa): (mt) + (1 — 2) - (a —) Using Bellman’s Principle of Optimality,

2 0= max [U[Ci] + Ji + +Uw -[-Cr + Wi[S> wires]

s.tCr,)~_, wit=1.w;>0 i=l

tn (1 = 20) A+ Jwe Ww, — w2)- 7 (1 — 1) + (a — 6)

Ww? 3 + = Jww (So who? + Inet? (1 ™)?(a — b)? + of At)]

w=1 The first order conditions are

Uc = Jw

Jw W-ay(%)+Jwe- Wm —1)(a—b) + JwwW2wyo? < AO

Jw W on(z) — Jwe: W-x(1 —2)(a — b) + JwwW?wyo? <

(50)

(51)

(52)

(53)

A) is the Lagrange muliplier associated with the constraint yr Wie = 1. Equality

holds for 7 only when w; > 0. Summing the complementary slackness conditions for

the assets implies

2 2 Jw wid; a; (2) wi} + JwwW?[S> wejo*

1=1 1=1

+ Iwas (wie — wa) (1 — 7) + (a — 6) = AM)

(54)

“ow guessing that the value function is separable as in (26) and substituting

completes the proof. O

Proof of Corollary 2 The decentralization for this model is the same as in Model 0 (and that in CIR) and we shall be brief. The risk-free rate r,, the rate at which borrowing and lending are in zero net supply equals Woe: This is the same characterization as in Model 0 and for the general case considered in CIR. Using the

complementary slackness condition (54) yields the result. QO

Proof of Proposition 4 The value function is of the form (26). Suppose the

statement is true.

MUrsailt + An] = exp(—p- (t+ At))-I[m + An] -(W + AW)"! (55) Using Taylor’s theorem,

(W+ AW) = Wh 4 (7-1). Wr

2 2

{ Dewvai(mAt to (Lo wid — (Ula) At + (y— 2) S (Sowa) |

i=1 +o(At) and ising the dynamics of x from (13)

I{x + Ar] = I[r]

+1'[x]-[ (1 -—2r)-AAt+ maken?) -(a(d¢; — d¢,) — (24 — 1)(a— b)At ]

1 (a—6b)?

“(1 — 9) I"[r]At + o(At)

and exp(—p(t + At)) = exp(—pt) - (1 — pAt)

Substituting these into (55) and taking expectations conditonal on 7 implies that

—1*[n] + an — n)(a—b)(wy —w,) +2. Cll, EO = 7)"

From Proposition 3, Tal re rf{x] = Y wasn) + (w; — w2)r(1 — r)(a — 6) —— 4+ = ow)

These two characterizations of the risk-free rate are the same only if the differential

equation (27) is satisfied. QO

Table 1 Model 1 Skewness and Kurtosis

Unobservable Regime Switches

# a b A oa kr Zk sk Zs

l 09 -.04 2 02 2.94 -.6 -.32 -9.23 2. 09 -.04 2 04 299 -008 -05 -1.5 3. 09 -04 2 O07 3.19 211 -.06 -1.9 4. 09 -.04 2 09 3.116 1.79 .04 1.24 3. 09 -.04 1 02 299 -.004 -.36 -10.37 6. .09 -.04 1 04 3.005 05 -.10 -2.9 7. 09 -04 1 O7 3.08 93 -.02 -.58 8. 09 -.04 1 09 3.23 2.488 -.014 = -.39

High Productivity Drift Value

Low Productivity Drift Value

Poisson Regime Switching Parameter

Noise Level

Kurtosis

Z value for Kurtosis. Null Hypothesis of Normality Skewnesss

Z value for Skewness.Null Hypothesis of Normality Impatience Parameter. 0.04 for all simulations

Risk-Aversion Parameter. CRRA 4 for all simulations

Figure | Implementation of Filter

oe o r--s- -— pee eee eee ar | | sg | | Wof | | I | | No | | \ ove NL Lo | -v. |” _ 8 00 100 200 00 400 500 Time in 1/100 years Figure 2.1 Fast Learning >e ra fa ae »o Lo 0 Cy A oo Nogg 0. 02 03 0.4 05 06 07 08 tg 1.0 Belief Figure 2.2 Slow Learning 7 vo a? fe] o3 Oo co 9° ~8 05 rm) 5 0.0 0 0.2 0.3 0.4 05 0.6 07 0.8 09 1.0

Belief

49a

+4] ~~” £O ~~ ty _ ek ee

we oo fe oF w wr eo 0 8 we» t Fore 33 Fure 24 é ; f : * t er w ) u u r) o ") 7) v : ar) v 0 *® SG se vo uw 8 Figure 35 Figure 36 i i . i aL 6 ke o uu & ww of ww oO © i uo sts eu uu 8 ws wo ww » “ oat Bata Figure 27 ' Faure 38 ; ei i “= ery wma “ s 8 a v r) a ) v rl

Fast Learning d=.10 b=.02 A= 2 0 =.02

49b

o_stmPices! Perel, poteg gncoseshetucrs é ¢ f | E 8 CRREES anes nas ete g - k

Slow Learning 0=.19 b=.02 A= 2 0 =

Figure 5

Understanding the Unconditional Distribution of Stock-Market Returns

! '

: \ /

NZ . 0.1 bat YY aAt Portfolio Mean: | Portfolio Mean: Slow Learning | | Fast Learning

0.1

49d

References P.J. Bickel and K.A. Doksum[1976]. “Mathematical Statistics. Basic }deas and Selected Topics.” Holden-Day, Inc. F. Black[1976]. “Studies of Stock Price Volatility Changes.” Proceeding of the Business and Economics Statistics Section, American Statistical Association. T. Bollerslev, R.F. Engle, and J.M. Woolridge[1988]. “A Capital Asset Pricing

Model with Time Varying Covariances.” Journal of Political Economy, 10.

J.Y. Campbell and L. Hentschel[1992]. “No News is Good News: An Asymmettic Model of Changing Volatility in Stock Returns.” Journal of Financial Economics, K. Chung and R. Williams[1990]. “Introduction to Stochastic Integratioa.” Birk-

hauser.

P. Clark[{1973]. “A Subordinated Stochastic Process Model With Finite Variance

for Speculative Prices.” Econometrica. Vol. 41, No. 1.

J. Cox, J. Ingersoll and S. Ross[1985]. “An Intertemporal General Equilibrium

Model of Asset Prices.” Econometrica, 53.

A. David[1992]. “Business Cycle Risk and the Equity Premium.” Working Paper, Department of Economics. UCLA.

S. Davis and J. Haltiwanger. “Gross Job Creation. Gross Job Destruction and

Employment Reallocation.” Quarterly Journal of Economics, 1992.

J.B. Detemple[1991]. “Further Results on Asset Pricing with Incomplete Informa-

tion.” Journal of Economic Dynamics and Control, 15.

U. Dothan and D. Feldman[1986]. “A Theory of Asset Prices and the Term

Structure of Interest Rates in a Partially Observable Economy.” The Journal of

Finance, VOL XLI, No.3.

R.F. Engle, D.M. Lilien and R.P. Robbins[1987]. “Estimating Time Varying

Risk Premia in the Term Structure.” Econometrica, 50.

G. Gennotte[1986]. “ Optimal Portfolio Choice Under Incomplete Information.” The Journal of Finance, VOL XLI, No. 2.

J.D. Hamilton[1989]. “A New Approach to the Economic Analysis of Non-Station-

ary Time Series and the Business Cycle.” Econometrica, Vol 57 # 2. A. Jazwinski[1970]. “Stochastic Processes and Filtering Theory.” Academic Press.

S. Karlin and H. Taylor[{1982]. “A Second Course in Stochastic Processes.” Aca-

demic Press.

V. Krishnan{1982]. “Non-Linear Filtering and Smoothing: An Introduction to

Martingales. Stochastic Integrals and Estimation.” Wiley.

D. Lilien. “Sectoral Shifts and Cyclical Unemployment.” Journal of Political Econom'y,90 1982.

P. Loungani, M. Rush and W. Tave. “Stock Market Dispersion and Unemploy-

ment.” Journal of Monetary Economics, 25, 1990. R.E. Lucas{1978]. “Asset Prices in an Exchange Economy. Econometrica, 46.

R. Merton[{1973]. “An Intertemporal Capital Asset Pricing Model.” Econometrica, 41. D. Nelson[1991}. “Conditional Heteroskedasticity in Asset Returns: A New Ap-

proach.” Econometrica, Vol. 59 # 2.

M. Sundaresan[1984]. “Consumption and Equilibrium Interest Rates in Stochastic Production Economies.” The Journal of Finance, VOL XXXIX.

IFDP Number

461

460

459

458

457

456

455

454

453

452

451

450 449

448

International Finance Discussion Papers

itle

1993

Fluctuatinng Confidence and Stock-Market Returns

Dollarization in Argentina

Union Behavior, Industry Rents, and Optimal Policies

A Comparison of Some Basic Monetary Policy Regimes:

Implications of Different Degrees of Instrument Adjustment and Wage Persistence

Cointegration, Seasonality, Encompassing, and the Demand for Money in the United Kingdom Exchange Rates, Prices, and External Adjustment

in the United States and Japan

Political and Economic Consequences of Alternative Privatization Strategies

Is There a World Real Interest Rate?

Macroeconomic Stabilization Through Monetary and Fiscal Policy Coordination Implications for Monetary Union

Long-term Banking Relationships in General Equilibrium

The Role of Fiscal Policy in an Incomplete Markets Framework

Internal Funds and the Investment Function

Measuring International Economic Linkage with Stock Data

Macroeconomic Risk and Asset Pricing: Estimating the APT with Observable Factors

Author(s)

Alexander David

Steven B. Kamin Neil R. Ericsson

Phillip Swagel Dale W. Henderson Warwick J. McKibbin

Neil R. Ericsson David F. Hendry Hong-Anh Tran

Peter Hooper Jaime Marquez

Catherine L. Mann Stefanie Lenway Derek Utter

Joseph E. Gagnon Mark D. Unferth

Jay H. Bryson

Michael S. Gibson

Charles P. ““homas

Guy V.G. Stevens

John Ammer Jianping Mei

John Ammer

Please address requests for copies to International Finance Discussion Papers, Division of International Finance, Stop 24, Board of Governors of the Federal Reserve System, Washington, D.C. 20551.

446

445

444

443

442

44]

440

439

438

437

436

435

434

International Finance Discussion Papers

Titles

1993

Near observational equivalence and unit root processes: formal concepts and implications

Market Share and Exchange Rate Pass-Through in World Automobile Trade Industry Restructuring and Export Performance:

Evidence on the Transition in Hungary

Exchange Rates and Foreign Direct Investment: A Note

Global versus Country-Specific Productivity Shocks and the Current Account

The GATT’s Contribution to Economic Recovery in Post-War Western Europe

A Utility Based Comparison of Some Models of Exchange Rate Volatility

Cointegration Tests in the Presence of Structural Breaks

1992

Life Expectancy of International Cartels: An Empirical Analysis

Daily Bundesbank and Federal Reserve Intervention and the Conditional Variance Tale in DM/$-Returns

War and Peace: Recovering the Market’s Probability Distribution of Crude Oil Futures Prices During the Gulf Crisis

Growth, Political Instability, and the Defense Burden

Foreign Exchange Policy, Monetary Policy, and Capital Market Liberalization in Korea

The Political Economy of the Won: U.S.-Korean Bilateral Negotiations on Exchange Rates

Author(s)

Jon Faust

Robert C. Feenstra Joseph E. Gagnon Michael M. Knetter

Valerie J. Chang Catherine L. Mann

Guy V.G. Stevens Reuven Glick Kenneth Rogoff Douglas A. Irwin Kenneth D. West Hali J. Edison Dongchul Cho Julia Campos

Neil R. Ericsson David F. Hendry

Jaime Marquez

Geert J. Almekinders Sylvester C.W. Eijffinger William R. Melick Charles P. Thomas Stephen Brock Blomberg

Deborah J. Lindner

Cite this document

APA

Alexander David (1993). Fluctuating Confidence and Stock-Market Returns (IFDP 1993-461). Board of Governors of the Federal Reserve System, International Finance Discussion Papers. https://whenthefedspeaks.com/doc/ifdp_1993-461

BibTeX

@techreport{wtfs_ifdp_1993_461,
  author = {Alexander David},
  title = {Fluctuating Confidence and Stock-Market Returns},
  type = {International Finance Discussion Papers},
  number = {1993-461},
  institution = {Board of Governors of the Federal Reserve System},
  year = {1993},
  url = {https://whenthefedspeaks.com/doc/ifdp_1993-461},
  abstract = {The drift of two different diffusion processes (asset returns) is determined by a state variable which can take on two values. It jumps between the two according to Poisson increments (this is called a 'regime-switch'). For any given position of the state variable the drift of one process is high and the other is low. I find that the posterior probability that the 1st asset has higher average returns, conditional on observing the path (returns) of each process, follows a diffusion process and calculate its infinitesimal parameters. I also derive analytical expressions for its stationary density and for some of its path properties. I compare the filtering problem to the Kalman Filtering problem and find that even though the dynamics of the mean of the distribution are very similar, the dynamics of the variance are subject to stochastic fluctuations. The model is parsimonious in that the conditional mean and variance are functions of a single variable.},
}