feds · July 31, 2014

A Tale of Two Option Markets: Pricing Kernels and Volatility Risk

Abstract

Using prices of both S&P 500 options and recently introduced VIX options, we study asset pricing implications of volatility risk. While pointing out the joint pricing kernel is not identified nonparametrically, we propose model-free estimates of marginal pricing kernels of the market return and volatility conditional on the VIX. We find that the pricing kernel of market return exhibits a decreasing pattern given either a high or low VIX level, whereas the unconditional estimates present a U-shape. Hence, stochastic volatility is the key state variable responsible for the U-shape puzzle documented in the literature. Finally, our estimates of the volatility pricing kernel feature a U-shape, implying that investors have high marginal utility in both high and low volatility states.

Finance and Economics Discussion Series Divisions of Research & Statistics and Monetary Affairs Federal Reserve Board, Washington, D.C. A Tale of Two Option Markets: Pricing Kernels and Volatility Risk Zhaogang Song and Dacheng Xiu 2014-58 NOTE: Staff working papers in the Finance and Economics Discussion Series (FEDS) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the Finance and Economics Discussion Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers.

A Tale of Two Option Markets: Pricing Kernels and (cid:3) Volatility Risk y z Zhaogang Song Dacheng Xiu Federal Reserve Board University of Chicago This Version: January, 2014 Abstract Using prices of both S&P 500 options and recently introduced VIX options, we study asset pricing implications of volatility risk. While pointing out the joint pricing kernel is not identified nonparametrically, we propose model-free estimates of marginal pricingkernelsofthemarketreturnandvolatilityconditionalontheVIX.Wefindthat the pricing kernel of market return exhibits a decreasing pattern given either a high or low VIX level, whereas the unconditional estimates present a U-shape. Hence, stochastic volatility is the key state variable responsible for the U-shape puzzle documented in the literature. Finally, our estimates of the volatility pricing kernel feature a U-shape, implyingthatinvestorshavehighmarginalutilityinbothhighandlowvolatilitystates. Key Words: Pricing Kernel, State-Price Density, VIX Option, Volatility Risk JEL classi(cid:12)cation: G12,G13 ∗We benefited from discussions with Yacine A¨ıt-Sahalia, Andrea Buraschi, Bjorn Eraker, Peter Carr, Peter Christoffersen, Fousseni Chabi-Yo, George Constantinides, Jianqing Fan, Ren´e Garcia, Kris Jacobs, Jakub Jurek, Ilze Kalnina, Ralph Koijen, Nicholas Polson, Eric Renault, Jeffrey Russell, Neil Shephard, GeorgeTauchen(discussant),ViktorTodorov,GrigoryVilkov(discussant),HaoZhou,aswellasseminarand conferenceparticipantsattheUniversityofChicago,Northwestern,Princeton,ToulouseSchoolofEconomics, Liverpool School of Management, the 2012 CICF, the 5th Annual SoFiE Conference, the Measuring Risk conference 2012, the 2012 Financial Engineering and Risk Management International Symposium, and the 2012 International Symposium on Risk Management and Derivatives. Xiu acknowledges research support by the Fama-Miller Center for Research in Finance at Chicago Booth. The views expressed herein do not reflect those of the Board of Governors of the Federal Reserve System. †BoardofGovernorsoftheFederalReserveSystem,MailStop165,20thStreetandConstitutionAvenue, Washington, DC, 20551. E-mail: Zhaogang.Song@frb.gov. ‡University of Chicago Booth School of Business, 5807 S. Woodlawn Avenue, Chicago, IL 60637. Email: dacheng.xiu@chicagobooth.edu. 1

1 Introduction In addition to the uncertainty of market returns, volatility risk has been well documented as an essential component of time-varying investment opportunities. Together with the preferences of economic agents, a priced volatility factor leads to a pricing kernel (or stochastic discount factor) which depends on both market returns and volatility. Nevertheless, because volatility is neither tradable nor observable, existing studies on pricing kernels either impose strong parametric restrictions, or ignore the unobservable volatility factor in nonparametric analysis. The pricing kernel estimates produced by these studies exhibit a puzzling U-shape as a function of market return, in conflict with a standard expected utility theory. The lack of tradable and observable volatility has changed substantially since the introduction of the Volatility Index (VIX) in 1993 by the Chicago Board of Options Exchange (CBOE),1 and the introduction of VIX derivatives such as futures and options in 2004 and 2006, respectively. The VIX, derived from S&P 500 options as the square root of the expected average variance over the next 30 calendar days, provides investors with a direct measure of volatility; and VIX derivatives offer investors convenient instruments for trading on the volatility of S&P 500 index.2 As a result, the VIX is constantly exposed in the media spotlight, and VIX options have achieved huge liquidity and become the third most active contracts at CBOE as of October 2011. TakingadvantageoftheS&P500andVIXoptionmarkets, wenonparametricallyidentify and estimate the marginal pricing kernel of market returns and volatility, which equals the ratio of state-price density (or risk-neutral density) to physical density. We show that information in the two option prices is fully captured by the two marginal state-price densities of market returns and volatility separately, whereas the joint state-price density and hence joint 1The VIX, from its inception, was calculated from S&P 500 index options by inverting the Black-Scholes formula. In2003,theCBOEamendedthisapproachandadoptedamodel-freemethodtocalculatetheVIX. 2Previously, investors have to take positions in option portfolios, such as straddles or strangles, in order to trade volatility. 1

pricing kernel cannot be identified nonparametrically as a result of incomplete markets.3 We then provide nonparametric estimates of pricing kernels with respect to return and volatility respectively. Our estimates not only shed light on the puzzling U-shaped pricing kernel, but also provide new empirical stylized facts on the pricing kernel of volatility. In particular, we make several important findings regarding asset pricing implications of volatility. First,ourestimatesofpricingkernelswithrespecttothemarketreturnshowthatstochastic volatility is the key state variable responsible for the “pricing kernel puzzle.” More specifically, we find that a pricing kernel of market return conditional on either a high or low VIX level presents a decreasing pattern, whereas an unconditional pricing kernel (i.e. the one that ignores volatility) may become U-shaped. In fact, marginal utility (the pricing kernel up to a scaling factor) conditional on high volatility is above that conditional on low volatility, as low volatility signals a good investment opportunity and hence is preferred by investors. As a mixture of pricing kernel estimates conditional on different volatility levels, unconditional estimates can exhibit an increasing pattern over the high return region (right tail), where high volatility is prevalent. Our finding echoes the conclusions of parametric models in Chabi-Yo et al. (2008) and Christoffersen et al. (2010), which show that missing state variables in the pricing kernel may result in a U-shape. Without restricting the specification of pricing kernels, however, we show that including volatility as a state variable is the solution to this puzzle. Second, we provide nonparametric estimates of pricing kernels with respect to volatility, for the first time to the best of our knowledge. Our estimates exhibit a pronounced U-shape conditional on either a high or low VIX, indicating that investors attach high marginal utility to payoffs received in both high and low future volatility states, regardless of today’s 3We emphasize that the joint pricing kernel, though not identifiable nonparametrically using the S&P 500 and VIX options, can be estimated with certain parametric correlation restriction on the two marginal pricing kernels. We do not explore this approach because our focus is to recover the pricing kernels without anyparametricrestrictions. Afollow-uppaperofourstudy,JackwerthandVilkov(2013),implementedsuch an exercise using the parametric Frank copula for the two marginal distributions. 2

volatility level. Bakshi et al. (2010) also document a U-shape for the unconditional volatility pricing kernel, but indirectly, by exploring the link between the monotonicity of pricing kernel and returns of VIX option portfolios. In contrast, we provide direct estimates of the conditional volatility pricing kernel by nonparametric methods, which provide further information about the shape and tail behavior of the pricing kernel. In particular, we find that the volatility pricing kernel is asymmetric, and the asymmetry conditional on a current high volatility is much stronger than that conditional on a low volatility. This finding implies that market investors price the volatility risk differently according to different scenarios of the economy, which presents new empirical regularities that need to be incorporated into models of volatility risk. Finally, we evaluate the performance of our nonparametric estimator for in-sample fitting and out-of-sample forecasts against two alternative methods: the nonparametric approach of A¨ıt-Sahalia and Lo (1998) without a volatility factor and a martingale approach commonly used by practitioners that simply predicts tomorrow’s implied volatility by interpolating today’s implied volatility surface. We find that our estimator outperforms both alternative methods for density and implied volatility forecasts, which again highlights the importance of conditioning on volatility. Estimating pricing kernels from option prices is discussed in A¨ıt-Sahalia and Lo (1998), A¨ıt-Sahalia and Duarte (2003), Jackwerth (2000), and Rosenberg and Engle (2002), which ignore the volatility risk and discover a puzzling U-shape.4 Hereafter, many studies have proposed different explanations for the U-shaped pricing kernel, including models with missing state variables in Chabi-Yo et al. (2008), Chabi-Yo (2011), and Christoffersen et al. (2010), and models with heterogeneous agents in Bakshi and Madan (2008) and Ziegler (2007). Our empirical study contributes to this literature by showing, without imposing any parametric restrictions, that volatility is the missing state variable responsible for the puzzle. 4Arelatedstudy, FanandMancini(2009), proposesnonparametricmethodsforpricingderivativesbased on state price distributions. 3

Our paper is also related to the large literature on models with volatility risk, including bothreduced-formoptionpricingmodels, e.g. Bakshietal.(1997), Bates(2000), Pan(2002), Eraker(2004),andBroadieetal.(2007),andequilibriummodels,suchasBansaletal.(2012), Bollerslev et al. (2012), and Campbell et al. (2012). Unlike these studies, our framework does not depend on any parametric restrictions on volatility dynamics that may obscure the empirical characteristics of pricing kernels. Several recent studies have constructed modelfree measures of risk-neutral volatility from S&P 500 options, e.g. Bakshi and Kapadia (2003), Bollerslev et al. (2009), Carr and Wu (2009), and Todorov (2010), and compared them with measures of realized volatility. Their focuses are on the sign, time variation, and return predictability of variance risk premium, which only relates to the conditional mean of variance distributions under different measures. In contrast, we recover the entire volatility pricing kernel. Methodologically, our paper is also related to Boes et al. (2007) and Li and Zhao (2009) who estimate pricing kernels of stock market returns and interest rates, respectively, conditional on an ex-post volatility proxy filtered from historical time series. Our strategy differs from their approach by using the VIX, which possesses a monotonic functional relationship with the unobservable volatility for almost all state-of-the-art volatility models. Therefore, our method avoids estimation errors from the filtering stage, while making it possible to study volatility pricing kernels with the help of VIX options. Furthermore,severalrecentstudiesdocumenttheimportanceofmultiplevolatilityfactors in capturing dynamics of option prices or the term structure of variance swaps, see e.g. Christoffersen et al. (2008), Egloff et al. (2010), Mencia and Sentana (2012), and Bates (2012). Our nonparametric framework can be extended to nest these models by including additional regressors such as a VIX future contract, or CBOE S&P 500 3-Month Volatility Index (VXV). Such an extension, though being interesting and important itself, is beyond the scope of the current focus, to which term structure of volatility is less relevant. 4

Section 2 discusses the nonparametric identification of pricing kernels of both the market returnandvolatility. Section3providesournonparametricestimationframeworkandMonte Carlo simulations. Empirical estimates of pricing kernels are presented in Section 4. Section 5 concludes the paper. 2 Pricing Kernels with a Volatility Factor The pricing kernel equals the ratio of risk-neutral density, also known as state-price density (SPD), to the density under the physical measure. To study pricing kernels, we first discuss the identification of state price densities, by exploring the underlying connection of S&P 500 options, VIX, and VIX options through the latent volatility factor. 2.1 Identification of State-Price Densities To fix ideas, we denote the log price of the S&P 500 index as S , the VIX as Z , and the t t unobserved volatility as V . The information in the derivative markets is driven by the joint t evolution of S and V , which determines Z endogenously. As V is not observable, there t t t t exist no Arrow-Debreu securities traded on V directly. t In fact, the payoffs of S&P 500 and VIX options depend on their own underlying indices at maturity T. Therefore, we focus on the marginal state-price densities with respect to S andZ separately. Weshowthatthemarginaldensitiestogetherspanthetwooptionmarkets, and provide sufficient and necessary information about the dynamics of the market return and its volatility. The joint dynamics, nevertheless, cannot be identified nonparametrically unless certain options whose payoff depends on both S and Z are traded. T T 5

We write the time-t price of a S&P 500 call option with maturity T and strike x as: 5 [ ] C((cid:28);f ;v ;x;r ) =e −rt;(cid:28)(cid:28)E Q (eST (cid:0)x) +jF = f ;V = v t;(cid:28) t t;(cid:28) t;(cid:28) t;(cid:28) t t ∫ =e −rt;(cid:28)(cid:28) (esT (cid:0)x)+p ∗ (s j(cid:28);f ;v )ds T t;(cid:28) t T R where F denotes the log forward price of the S&P 500 index, (cid:28) = T (cid:0) t is the time-tot;(cid:28) maturity, and r is the deterministic risk-free rate between t and T at time t. Similarly, the t;(cid:28) price of a VIX call option with strike y is given by: [ ] H((cid:28);f ;v ;y;r ) =e −rt;(cid:28)(cid:28)E Q (ezT (cid:0)y)+jF = f ;V = v t;(cid:28) t t;(cid:28) t;(cid:28) t;(cid:28) t t ∫ =e −rt;(cid:28)(cid:28) (ezT (cid:0)y)+q ∗ (z j(cid:28);f ;v )dz T t;(cid:28) t T R Observe that the two SPDs p∗(s j(cid:28);f ;v ) and q∗(z j(cid:28);f ;v ) completely determine T t;(cid:28) t T t;(cid:28) t these option prices. Building upon the insight of Breeden and Litzenberger (1978), they can be estimated as the second order derivative of option prices with respect to different strikes. In particular, we can recover (cid:12) @2C((cid:28);f ;v ;x;r )(cid:12) p ∗ (s j(cid:28);f ;v ) = ert;(cid:28)(cid:28)+sT t;(cid:28) t t;(cid:28) (cid:12) ; (1) T t;(cid:28) t @x2 x=esT from S&P 500 options and (cid:12) @2H((cid:28);f ;v ;y;r )(cid:12) q ∗ (z j(cid:28);f ;v ) = ert;(cid:28)(cid:28) t;(cid:28) t t;(cid:28) (cid:12) ; (2) T t;(cid:28) t @y2 y=zT from VIX options. It is apparent that p∗(s j(cid:28);f ;v ) and q∗(z j(cid:28);f ;v ) summarize the T t;(cid:28) t T t;(cid:28) t entire information about these two option markets, hence the joint density of s and z T T cannot be identified from the data without additional parametric assumptions. Nevertheless, these two densities p∗(s j(cid:28);f ;v ) and q∗(z j(cid:28);f ;v ) are not practically T t;(cid:28) t T t;(cid:28) t feasible to estimate as V is unobservable. Alternatively, with the observed VIX from the t 5Inoursetting,thetime-tinformationsetF containsstockprices,instantaneousvolatility,interestrates t and dividends, which can be summarized by the log forward price F and the volatility V . t;(cid:28) t 6

market,6 we may rewrite the option prices with z as a state variable, i.e., C((cid:28);f ;z ;x;r ) t t;(cid:28) t t;(cid:28) and H((cid:28);f ;z ;y;r ),7 and take second order derivatives to obtain t;(cid:28) t t;(cid:28) (cid:12) @2C((cid:28);f ;z ;x;r )(cid:12) p ∗ (s j(cid:28);f ;z ) =ert;(cid:28)(cid:28)+sT t;(cid:28) t t;(cid:28) (cid:12) T t;(cid:28) t @x2 (cid:12) x=esT @2H((cid:28);f ;z ;y;r )(cid:12) q ∗ (z j(cid:28);f ;z ) =ert;(cid:28)(cid:28) t;(cid:28) t t;(cid:28) (cid:12) : (3) T t;(cid:28) t @y2 y=zT Infact, writingoptionsintermsoff andz amountstoassumingthatV canbedetermined t;(cid:28) t t from Z and F , which is rigorous under most models of volatility risk in the literature (see t t;(cid:28) Section 2.3 below for details). With state variables being fully observable, p∗(s j(cid:28);f ;z ) T t;(cid:28) t and q∗(z j(cid:28);f ;z ) can be identified from the data. T t;(cid:28) t In summary, state-price densities p∗(s j(cid:28);f ;z ) and q∗(z j(cid:28);f ;z ) encapsulate all the T t;(cid:28) t T t;(cid:28) t information in the two option markets. They complement each other to reveal an intact picture of the market return, its volatility dynamics and the interactions of the two markets. 2.2 From State-Price Densities to Pricing Kernels We now discuss how to obtain the pricing kernels by combining the risk-neutral and physical densities of S and Z . We denote (cid:25)(s ;z j(cid:28);f ;z ) as the pricing kernel and use (cid:25) for t t T T t;(cid:28) t short. Not surprisingly, for the same reason described in Section 2.1, the joint pricing kernel (cid:25)(s ;z j(cid:28);f ;z ) cannot be identified nonparametrically. We therefore study the T T t;(cid:28) t projections of pricing kernel (cid:25) on S 8 and Z , denoted as (cid:25)(s j(cid:28);f ;z ) and (cid:25)(z j(cid:28);f ;z ), T T T t;(cid:28) t T t;(cid:28) t respectively. They are called the pricing kernel of the market return and the pricing kernel 6The CBOE constructs Z form a portfolio of options weighted by strikes according to the formula: t (Z /100)2 =E Q (QV jF )= 2ert;(cid:28)(cid:28) (∫ eft;(cid:28) P(τ,x) dx+ ∫ ∞ C(τ,x) dx ) +ϵ t t;(cid:28) t τ x2 x2 0 eft;(cid:28) where QV denotes the quadratic variation of the log return process from t to t+τ, P(τ,x) and C(τ,x) t;T are put and call options with time-to-maturity τ and strike x, and f is the log price of forward contracts, t;(cid:28) see e.g. Britten-Jones and Neuberger (2000) and Carr and Wu (2009). 7Strictly speaking, the function C((cid:1)) here is a composite function, which is different from the previous call option pricing function. We recycle it to simplify our notations. 8The projection of π on S is defined as EP(πjS =s ,F =f ,Z =z ). T T T t;(cid:28) t;(cid:28) t t 7

of the VIX in the following. In fact, the price of a S&P 500 call option can be written as [ ] C((cid:28);f ;z ;x;r ) =e −rt;(cid:28)(cid:28)E P (cid:25) (cid:1)(eST (cid:0)x) +jF = f ;Z = z t;(cid:28) t t;(cid:28) t;(cid:28) t;(cid:28) t t ∫ =e −rt;(cid:28)(cid:28) (cid:25)(s j(cid:28);f ;z )(esT (cid:0)x)+p(s j(cid:28);f ;z )ds ; (4) T t;(cid:28) t T t;(cid:28) t T R and the price of a VIX call option is [ ] H((cid:28);f ;z ;y;r ) =e −rt;(cid:28)(cid:28)E P (cid:25) (cid:1)(ezT (cid:0)y)+jF = f ;Z = z t;(cid:28) t t;(cid:28) t;(cid:28) t;(cid:28) t t ∫ =e −rt;(cid:28)(cid:28) (cid:25)(z j(cid:28);f ;z )(ezT (cid:0)y)+q(z j(cid:28);f ;z )dz ; (5) T t;(cid:28) t T t;(cid:28) t T R where p(s j(cid:28);f ;z ) and q(z j(cid:28);f ;z ) are conditional densities of S and Z under the T t;(cid:28) t T t;(cid:28) t T T physical measure, respectively. Note that the law of iterated expectation is used in the second equality of both (4) and (5). Similar to (3), equations (4) and (5) imply that the second order derivatives of the S&P 500andVIXcallpriceswithrespecttotheirstrikesarealsoequalto(cid:25)(s j(cid:28);f ;z )p(s j(cid:28);f ;z ) T t;(cid:28) t T t;(cid:28) t and(cid:25)(z j(cid:28);f ;z )q(z j(cid:28);f ;z ), respectively. Thisfact, combinedwith(3), furtherimplies T t;(cid:28) t T t;(cid:28) t that p∗(s j(cid:28);f ;z ) (cid:25)(s j(cid:28);f ;z ) = T t;(cid:28) t T t;(cid:28) t p(s j(cid:28);f ;z ) T t;(cid:28) t q∗(z j(cid:28);f ;z ) (cid:25)(z j(cid:28);f ;z ) = T t;(cid:28) t T t;(cid:28) t q(z j(cid:28);f ;z ) T t;(cid:28) t That is, by combining the risk-neutral and physical densities of S and Z , we obtain the t t projections of (cid:25) onto S and Z , respectively. These two pricing kernels contain rich infor- T T mation on how risks, especially those associated with volatility shocks, are priced in financial markets. In the equilibrium setup of A¨ıt-Sahalia and Lo (2000) with a representative agent, these pricing kernels represent—up to a scaled factor—the marginal rate of substitution. While A¨ıt-Sahalia and Lo (2000) and Jackwerth (2000) estimate the pricing kernels of S&P 500returns, our(cid:25)(s j(cid:28);f ;z )includes the VIXz intheconditional informationset so that T t;(cid:28) t t 8

volatility becomes relevant to the price of risk regarding the expected returns. In addition, we are able to identify the pricing kernel of the VIX. 2.3 Nested Models As discussed in Section 2.1, we employ the information set generated by F and Z to t;(cid:28) t replace the information generated by F and V , because Z is directly observable. In t;(cid:28) t t fact, the information set of F and Z is coarser than the set generated by F and V , t;(cid:28) t t;(cid:28) t and equating these two effectively assumes that V is an invertible function of F and Z . t t;(cid:28) t We now show that this assumption is satisfied in most parametric models proposed in the literature, including both reduced-form option pricing models and equilibrium models with a priced stochastic volatility factor. Unlike Boes et al. (2007) and Li and Zhao (2009) who use an ex-post volatility proxy filtered from historical time series, we use VIX instead, which bears no approximation errors in most cases. We first consider the class of option pricing models that induce an affine relationship between the unobservable variance and squared VIX. This class of models has the following risk-neutral dynamics:9 √ 1 dS = (r(cid:0)d(cid:0) V )dt+ V dWQ +dLS t 2 t t t t dV = (cid:20)((cid:24) (cid:0)V )dt+(cid:27)(V )dBQ +dLV (6) t t t t t where dLS and dLV may be driven by finite activity compound Poisson processes with t t correlated jump sizes JS and JV. Such models include those discussed in Bakshi et al. t t (1997), Bates (2000), Pan (2002), Chernov and Ghysels (2000), Eraker (2004), Carr et al. (2003), Eraker et al. (2003), and Broadie et al. (2007). Jumps can be driven by L´evy processes such as the CGMY process in Carr et al. (2003) and Bates (2012). Note that this class also includes non-Gaussian OU processes, as introduced in Barndorff-Nielsen and 9The discontinuous part of the quadratic variation of S is assumed to be linear in V. t 9

Shephard (2001); see Shephard (2005) for a collection of similar models. For models of this class, we have Z2 = aV +b; t t where a and b are functions of model parameters (see Carr and Wu (2009) for details). That is, Z2 is a linear function of V , hence Z and V deliver the same information set. t t t t The second class of models introduces a non-affine structure between the squared VIX and variance, such as the exponential-OU-L models in Shephard (2005). In particular, under the risk-neutral measure, such models specify the volatility process as logV = (cid:11)+(cid:12)F ; dF = (cid:20)F dt+dL ; t t t t t The squared VIX, as calculated by Tauchen and Todorov (2011), is ∫ ( ) 1 (cid:28) Z2 = (cid:13) +((cid:17) +1)exp (cid:11)+e(cid:20)u(logV (cid:0)(cid:11))+C(u) du; (7) t (cid:28) t 0 where C(u) is determined by the characteristic exponent of the L´evy process L , and (cid:13) and t (cid:17) are constants determined by the quadratic variation of LS. Observe that the function t V 7! Z is invertible, so that the information sets generated by V and by Z are equivalent. t t t t Finally, we consider a stylized general equilibrium model, which is a simplified version of Bollerslev et al. (2009) and Drechsler and Yaron (2011) that builds on the long-run risk framework of Bansal and Yaron (2004).10 Specifically, the representative agent’s preference over consumption is recursive (Epstein and Zin (1989)). Therefore, the log pricing kernel at time t+1 is m = (cid:18)log(cid:14) (cid:0)(cid:18) −1∆c +((cid:18)(cid:0)1)r ; (8) t+1 t+1 c;t+1 where (cid:18) = (1(cid:0)(cid:13))=(1(cid:0) −1), 0 < (cid:14) < 1 is the subjective discount factor, (cid:13) is the risk 10Other equilibrium models that satisfy the invertibility between Z and V include Bansal et al. (2012), t t Bollerslev et al. (2009), and Campbell et al. (2012). We choose to present the framework of Bollerslev et al. (2009) and Drechsler and Yaron (2011) for simplicity of illustration. 10

aversion coefficient, is the intertemporal elasticity of substitution, ∆c is the growth t+1 rate of log consumption, and r is the time t to t+1 return on the aggregate wealth c;t+1 claim.11 The state vector of the economy follows ∆c =(cid:22) +(cid:27) z +J t+1 c c;t c;t+1 c;t+1 (cid:27)2 =(cid:22) +(cid:26) (cid:27)2 +(cid:27) z +J (9) c;t+1 (cid:27) (cid:27) c;t c;t (cid:27);t+1 (cid:27);t+1 where fz g and fz g are independent i.i.d. N(0,1) processes, J is a compound Poisson c;t (cid:27);t c;t+1 process with intensity (cid:21) and i.i.d. jump size (cid:16)c, J is a compound Poisson process with c;t i (cid:27);t+1 intensity (cid:21) and i.i.d. jump size (cid:16)(cid:27), and both jump processes are independent of each other (cid:27);t i and of the Gaussian shocks. Note that both the Gaussian process z and jump process (cid:27);t+1 J contribute to volatility shocks. (cid:27);t+1 By the standard log-linearization approach following Campbell and Shiller (1988), we have r = (cid:20) +(cid:20) w (cid:0)w +∆c ; (10) c;t+1 0 1 t+1 t t+1 where the price-wealth ratio w is conjectured to be affine in the state vector: t w = A +A (cid:27)2 (11) t 0 (cid:27) c;t with A > 0 and A < 0 as functions of the model parameters we suppress for notational 0 (cid:27) brevity. With (10) and (11), we have r = ∆c +(cid:20) A (cid:27)2 (cid:0)A (cid:27)2 +(cid:20) +(cid:20) A (cid:0)A : c;t+1 t+1 1 (cid:27) c;t+1 (cid:27) c;t 0 1 0 0 Therefore, the volatility factor (cid:27)2 shows up in r , and hence in the pricing kernel m c;t+1 c;t+1 t+1 given in (8). Following the standard practice to proxy the aggregate wealth (consumption) by the aggregate stock market S (see A¨ıt-Sahalia and Lo (2000), Bansal and Yaron (2004), t 11The literature usually assumes that γ > 1 and ψ > 1, which implies θ < 0. This assumption ensures that the representative agent has a preference for early resolution of uncertainty, which is the key for the price of volatility risk. 11

and Campbell et al. (2012)), the return S (cid:0)S corresponds to ∆c , and Z corresponds T t t+1 t to the square root of the risk-neutral expectation of the consumption growth variance (cid:27)2 . c;t Although the state vector dynamics are specified in discrete time, the model (9) is actually a special case of the affine model in Bollerslev et al. (2012). Therefore, Z is an invertible t function of V = (cid:27)2 , which represents the variance of consumption growth rate under this t c;t+1 equilibrium model. In summary, most parametric models with volatility risk proposed in the literature, whether reduced-form or structural, can be nested within our nonparametric framework. As a result, we do not lose any information about the dynamics of S and V when incort t porating Z into the information set; instead, the implementation becomes feasible with the t information set fully observable. 3 Estimation Strategy 3.1 Multivariate Local Linear Estimators for Densities HereweintroduceournonparametricestimationstrategiesforSPDs. Tofixideas, weassume the observed prices, C ˜ and H ˜ , are contaminated with observation errors, such that12 ( (cid:12) ) (cid:12) C((cid:28);f ;z ;x) = E C ˜(cid:12)(cid:28)˜ = (cid:28);F = f ;Z = z ;X = x t;(cid:28) t t;(cid:28) t;(cid:28) t t ( (cid:12) ) (cid:12) H((cid:28);f ;z ;y) = E H ˜(cid:12)(cid:28)˜ = (cid:28);F = f ;Z = z ;Y = y : t;(cid:28) t t;(cid:28) t;(cid:28) t t We then construct nonparametric estimators of C and H, and take derivatives to estimate the SPDs. Different from the multivariate kernel regression approach adopted by A¨ıt-Sahalia and Lo (1998), we prefer the local linear estimator (Fan and Gijbels (1996)) for two main reasons. First of all, the bias and variance of local polynomial estimators are of the same 12Hereafter,wemultiplyalloptionpricesbythecorrespondingert;(cid:28)(cid:28),sothatwecanomitr t;(cid:28) inC andH, and reduce one state variable in the following regressions. Again, we recycle the notations C and H without ambiguity. 12

order of magnitude in the interior or near the boundary, whereas kernel estimators are notorious for the boundary effects. As our empirical studies focus on the tail of pricing kernels, it is advantageous to adopt more efficient estimators. Second, local polynomial regression provides estimates of derivatives, in addition to option prices, which makes it more convenient for our purpose. Theoretically, itisbettertousealocalcubicestimatortoobtainsecond-orderderivatives. Since we have more than one state variable, including all cross-terms of cubic polynomials into the regression is cumbersome. We avoid this by applying the local linear estimator, so that estimators for SPDs can be obtained simply by a first-order differentiation with respect to the strike. We write the option price C as a function of u = ((cid:28);f;z;x)′, and consider the following minimization problem, ∑n min fC (cid:0)(cid:11)(cid:0)(cid:12) ′ (u (cid:0)u)g2 K (u (cid:0)u) i i h i (cid:11);(cid:12) i=1 ′ where u = ((cid:28) ;f ;z ;x ) and C are the characteristics and price respectively of the i i ti;(cid:28)i ti i i i-th option in the sample. K is a kernel function scaled by a bandwidth vector h = h (h ;h ;h ;h )′: (cid:28) f z x ( ) ( ) ( ) ( ) 1 (cid:28) (cid:0)(cid:28) 1 f (cid:0)f 1 z (cid:0)z 1 x (cid:0)x K (u (cid:0)u) = k i k ti;(cid:28)i k ti k i (12) h i h h h h h h h h (cid:28) (cid:28) f f z z x x where k((cid:1)) is, for example, the density a of standard normal distribution. The minimizer has a closed-form representation:    (cid:11)b    = (Ω ′ KΩ) −1Ω ′ KC (13) b (cid:12) (1+4)×1 13

where        1 (u 1 (cid:0)u) ′   C 1   K h (u 1 (cid:0)u)        Ω =   . . . . . .  ; C =   . . .  ; K =   ...  :       1 (u (cid:0)u) ′ C K (u (cid:0)u) n n h n The nonparametric local linear estimator for the option pricing function C((cid:28);f;z;x) is C b ((cid:28);f;z;x) = (cid:11)b = e ′ (Ω ′ KΩ) −1Ω ′ KC; (14) 1 with e = (1;0;0;0)′ and the estimator p b∗(s′j(cid:28);f;z) for the SPD of S is 1 t ( ) @(cid:12) b (cid:12) (cid:12) @ e′ (Ω′KΩ) −1Ω′KC (cid:12) (cid:12) p b∗(s ′j(cid:28);f;z) = es′ 4(cid:12) = es′ 5 (cid:12) : (15) @x x=es′ @x x=es′ where e = (0;0;0;0;1)′. The nonparametric estimator H b ((cid:1)) and q b∗(z′j(cid:28);s;z) can be con- 5 structed similarly. b It may be worth pointing out that (cid:12) in our local linear regression (13) provides estimates of option Greeks. Specifically, option Theta is given by by e′(cid:12) = @C=@(cid:28), Delta by e′(cid:12) (cid:1)es, 1 2 and Vega by e′(cid:12). 3 3.2 Dimension Reduction One of the major issues of nonparametric estimation is the curse of dimensionality. The rate of convergence decreases rapidly as the dimension of state variables increases. In the most general forms, the pricing functions C((cid:1)) and H((cid:1)) depend not only on time-to-maturity, strike, VIX, and the S&P 500 index, but also on interest rates and dividends. Instead of regressing on additional interest rate and dividend variables, we assume that option prices multiplied by ert;(cid:28)(cid:28) depend on these variables only through forward prices. As mentioned in A¨ıt-SahaliaandLo(1998), modelsthatviolatethisassumptionseemveryremoteempirically. Furthermore, following many existing studies such as A¨ıt-Sahalia and Lo (1998) and Li and Zhao (2009), we assume that the S&P 500 option price is homogeneous of degree one in 14

the forward price level: C((cid:28);f;z;x) = efC((cid:28);0;z;x=ef) = efC ¯ ((cid:28);z;m) (16) where m = x=ef represents the moneyness of the option. Consequently, we obtain the estimate of C((cid:28);f;z;x) through multiplying the nonparametric estimate of C ¯ ((cid:1)) by ef, and write the SPD of S as T (cid:12) @2C ¯ ((cid:28);z ;m)(cid:12) p ∗ (s j(cid:28);f ;z ) = esT −ft;(cid:28) t (cid:12) : T t;(cid:28) t @m2 m=esT=eft;(cid:28) As for VIX options, we assume that the information about Z in F is fully incorporated t′ t;(cid:28) into Z . In other words, conditional on Z , Z is independent of F , for any t′ > t. This t t t′ t;(cid:28) assumption further implies that the SPD of Z , obtained from VIX option prices, depends T on F only through Z , i.e., q∗(z j(cid:28);f ;z ) = q∗(z j(cid:28);z ). Thus, the number of state t;(cid:28) t T t;(cid:28) t T t variables for the SPD of VIX is also decreased by one. We conduct robustness checks for these assumptions in Section 4.5, and find supportive evidence. Our dimension reduction strategyismotivatedfromtheeconomicintuition, whichisinsharpcontrasttothestatistical approach proposed by Yao and Hall (2005), who discuss an alternative method in the context of conditional density estimation. 3.3 Estimation of Pricing Kernels Given the homogeneity assumption in (16), the risk neutral density of the return R can be T estimated using the following formula: (cid:12) @2C ¯ ((cid:28);z ;m)(cid:12) p ∗ (R j(cid:28);z ) = eRT −rt;(cid:28)(cid:28) t (cid:12) ; T t @m2 m=eRT −rt;(cid:28)(cid:28) where R = s (cid:0)f . Note that homogeneous of degree one in option prices is equivalent to T T t;(cid:28) that the conditional density of the log returns is independent of s , see, e.g. Joshi (2007) for t more details. This property is satisfied by all parametric models discussed in Section 2.3. While estimating the risk neutral density from option prices, we estimate the physical 15

density p(R j(cid:28);z ) using the time series of the S&P 500 index and VIX based on the local T t linear method. A similar strategy has been adopted by A¨ıt-Sahalia et al. (2009). We collect time series of (R ;z ), i = 1;:::;n, with (cid:28) = T (cid:0) t fixed. We then construct the local Ti ti i i linear estimator of the conditional density of returns p(Rj(cid:28);z) by minimizing: ∑n min fK (R (cid:0)R)(cid:0)(cid:13) (cid:0)(cid:17) ′ (z (cid:0)z)g2W (z (cid:0)z) (cid:13);(cid:17) bR Ti ti bz ti i=1 where b and b are the bandwidths to be selected, and K ((cid:1)) = 1=b (cid:1)k((cid:1)=b ) and W ((cid:1)) = R z bR R R bz 1=b (cid:1)w((cid:1)=b ) are kernels. Therefore, our density estimator is, z z pb(Rj(cid:28);z) = (cid:13)b: (17) Consequently, our pricing kernel estimator can be constructed as pb∗(Rj(cid:28);z) (cid:25)b(Rj(cid:28);z) = : pb(Rj(cid:28);z) Similarly, we can construct the estimator for the pricing kernel of VIX. 3.4 Asymptotic Theory To provide theoretical guidance for our approach, we derive the asymptotic distribution of the option price and density for S&P 500 options as an example. Suppose the sample size of the S&P 500 options is n. Using the equivalent kernels introduced in Fan and Gijbels (1996) 16

and following the derivation in A¨ıt-Sahalia and Lo (1998), we obtain:13 ( ) n1=2(h h h h )1=2 C b ((cid:28);f;z;x)(cid:0)C((cid:28);f;z;x) (18) (cid:28) f z x ( ) [∫ ] 3 (cid:0)!d N 0; k2(c)dc s2((cid:28);f;z;x)=(cid:25)((cid:28);f;z;x) ; as nh h h h (cid:0)! 1; (cid:28) f z x ( ) n1=2h2 (h h h h )1=2 pb(s ′j(cid:28);f;z)(cid:0)p(s ′j(cid:28);f;z) (19) x (cid:28) f z x ( ) [∫ ] [∫ ( ) ] [∫ ] 3 2 2 (cid:0)!d N 0; k2(c)dc ck ˙ (c)+k(c) dc = k(c)c2dc s2((cid:28);f;z;s ′ )=(cid:25)((cid:28);f;z;s ′ ) ; as nh h h h5 (cid:0)! 1; (cid:28) f z x where s2((cid:28);f;z;x) is the conditional variance for the local linear regression of C on the state variables, and (cid:25)((cid:28);f;z;x) is the joint density of these variables. The estimator for s2((cid:1)) can be constructed using similar nonparametric regressions of squared fitting errors on these state variables. The same asymptotic distributions apply to estimators for VIX option prices and their SPDs. Similar technique has been adopted in Ruppert and Wand (1994). In addition to estimating the option price and its first and second order derivatives, we 13We sketch a proof here for the asymptotic theory as part of it is non-standard. Notice from (13) that [ ] αb ′ −1 ′ b =(ΩKΩ) ΩKC. (cid:12) (1+4)×1 Using the properties of Gaussian kernel, we have   ∑n ∑n 1 1 n 1 (Ω ′ KΩ) −1 =     1 ∑n n i=1 K h (u i (cid:0)u) 1 ∑n n i=1 K h (u i (cid:0)u)(u i (cid:0)u) ′     K (u (cid:0)u)(u (cid:0)u) K (u (cid:0)u)(u (cid:0)u)(u (cid:0)u) ′ n h i i n h i i i [ i=1 i=1 ] (cid:0)P! f(u) ∫ 0′ , as h!0,n!0. 0 f(u)(cid:1) c2k2(c)dc(cid:1)diag(h2,h2,h2,h2) (cid:28) f z x Therefore, we can write the estimators in their equivalent kernel forms: ∑n 1 αb (cid:25) K (u (cid:0)u)(cid:1)C nf(u) h i i i=1 ∑n β b (cid:25) ∫ 1 K (u (cid:0)u)(x (cid:0)x)(cid:1)C 4 nh2f(u) c2k2(c)dc h i i i x i=1 Using the standard kernel asymptotic results, we can obtain the above asymptotic theory. 17

applyalocallinearmethodtoestimatetheconditionaldensityin(17). Itsasymptotictheory is given by (see, e.g. Fan et al. (1996)): ( [∫ ][∫ ] ) ( ) n1=2(b b )1=2 b pe(r ′j(cid:28);z)(cid:0)pe(r ′j(cid:28);z) (cid:0)!d N 0; k2(c)dc w2(c)dc pe(r ′j(cid:28);z)=(cid:25)(z) : r z The asymptotic theories provided here are applied to construct confidence bands in our empirical studies. 3.5 Bandwidth Selection Bandwidth selection is important especially for multivariate nonparametric regressions. In theory, the optimal rate of bandwidth for estimating the option price is n−1=(4+d), whereas to estimate densities, we need to adopt a bandwidth with rate n−1=(6+d) due to the curse of differentiation. These bandwidth choices ensure that the nonparametric pricing function achieves the optimal rate of convergence in the mean-squared sense. Empirically, we can choose a bandwidth h (j = (cid:28);z; and m for S&P 500 options, or y for VIX options) as j h = c (cid:27) n−1=(4+d+2(cid:23)), where (cid:27) is the unconditional standard deviation of the regressor j, j j j j d is the number of regressors, and (cid:23) = 0 and 1 for option prices and SPDs, respectively. The constant c is chosen by minimizing the mean-squared error of option prices via crossj validation. Thecross-validationobjectivefunctionforregression(14)isgivenbytheweighted mean squared errors: ∑n ( ) 1 b 2 min C (cid:0)C ((cid:28) ;f ;z ;x ) !((cid:28) ;f ;z ;x ) i h;−i i i i i i i i i h n i=1 where (cid:0)i means leaving the ith observation out, and !((cid:1)) is the weighting function. To further accelerate the cross-validation, we adopt the popular K-fold cross-validation, which is faster compared with this leave-one-out method. The bandwidths of our nonparametric conditional density estimator (17) under the phys- 18

ical measure are chosen by the cross-validation following Fan and Yim (2004): ∫ ∑n ∑n 1 2 min !(s ;z ) (pb(s ′j(cid:28);s ;z ))2ds ′ (cid:0) pb (s j(cid:28);s ;z )!(s ;z ): b n ti ti b ti ti n b;−i Ti ti ti ti ti i=1 i=1 where the first integral can be calculated in closed-form from (17). Alternative choices of bandwidths have been discussed in Yao and Tong (1998) and Ruppert et al. (1995). 3.6 Monte Carlo Simulations Here we provide simulation studies of our local linear estimators. The Monte Carlo experiments are designed to match our empirical studies. First, we select the same option characteristics as those traded on CBOE in our sample. Second, we select a sample path generated from the following stochastic volatility models with both jumps in volatility and prices: √ 1 dS = (r(cid:0)d(cid:0) V )dt+ V dWQ +JQdN (cid:0)(cid:22)(cid:21) dt t 2 t t t S t t √ dV = (cid:20)((cid:24) (cid:0)V )dt+(cid:27) V dBQ +JQdN t t t t V t where WQ and BQ are standard Brownian motions satisfying E(dWQdBQ) = (cid:26)dt, JQ and t t t t S JQ are random jump sizes, dN is a pure-jump process with intensity (cid:21) = (cid:21) + (cid:21) V , and V t t 0 1 t (cid:22) = E(eJQ (cid:0)1). The jump sizes follow: S    exp((cid:12) ) with probability q JQ (cid:24) exp((cid:12) ); JQ (cid:24) + V V S   (cid:0) exp((cid:12) ) with probability 1(cid:0)q − The parameters are taken from Amengual and Xiu (2012), where (cid:20) = 2, (cid:27) = 0:3, (cid:26) = (cid:0)0:8, (cid:24) = 0:04, (cid:12) = 0:01, (cid:12) = 0:03, q = 0:3, (cid:12) = 0:02, (cid:21) = 2, and (cid:21) = 30. We then + − V 0 1 calculate S&P 500 and VIX option prices, according to the closed-form formulae given in Amengual and Xiu (2012). Finally, we pollute the prices with multiplicative measurement errors following log-normal distribution with a 5% standard deviation. Basedonthegeneratedsample,weevaluateournonparametricestimatorsofoptionprices 19

on the grid of time-to-maturity and current index level, with the VIX Z and strike X fixed t attheirsamplemedian. Wealsocalculatetheindexdensitiesonthegridof(cid:28) andS , withS T t fixed at the sample mean, to evaluate our density estimators. The nonparametric estimators of VIX option prices and densities are evaluated similarly. All of these quantities and their percentage errors are reported in Figure 1, averaged over 1000 replications. We observe that the nonparametric estimates are within 5% and 10% of their theoretical Black-Scholes implied volatilities for S&P 500 and VIX options, respectively. The errors for densities are slightlylarger, duetothefactthatderivativesareestimatedwithslowerratesofconvergence, i.e., the so-called curse of differentiation. 4 Empirical Results In this section, we estimate nonparametric SPDs and pricing kernels using both S&P 500 and VIX options, and present our empirical findings. Before delving into the details, we introduce the dataset. 4.1 Data We obtain daily bid and offer prices of S&P 500 and VIX options, quoted between 3:59 p.m. and 4:00 p.m. EST from the OptionMetrics. Our sample period is chosen as June 1, 2009–May 31, 2011, during which the liquidity of VIX options is satisfactory. We plot the daily open interests of VIX options in Figure 2, along with those of S&P 500 options for comparison. It is obvious from the figure that the liquidity of VIX options has improved dramatically since introduced in 2006, and their open interests have achieved roughly 1/4 of those of S&P 500 options. As a result, our choice of sample ensures that our empirical results are not subject to liquidity issues. Figure 3 plots the joint time series of the S&P 500 index and VIX over the sample period, 20

Figure 1: Monte Carlo Simulations SPX Option Pricing Error 1 0 −1 120 −0.2 100 −0.1 80 0 60 0.1 40 0.2 20 Time−to−Maturity Log Moneyness ytilitaloV deilpmI fo rorrE % SPX Density Error 1 0 −1 120 1000 100 1100 80 1200 60 1300 1400 40 1500 20 Time−to−Maturity SPX Level at T XPS fo DPS fo rorrE % VIX Option Pricing Error 1 0 −1 120 −0.2 100 −0.1 80 0 60 0.1 40 0.2 20 Time−to−Maturity Log Moneyness ytilitaloV deilpmI fo rorrE % VIX Density Error 1 0 −1 120 100 18 80 20 60 22 40 24 20 Time−to−Maturity VIX Level at T XIV fo DPS fo rorrE % Note: This figure plots the nonparametric estimation error in the Monte Carlo simulations. The left panel plotsthepricingerrormeasuredintermsofdifferenceinimpliedvolatility, whereastherightpanelplotsthe percentage error in density estimates. The number of Monte Carlo samples is 1000. 21

Figure 2: Open Interests of the S&P 500 and VIX Options x 108 3.5 3 2.5 2 1.5 1 0.5 0 Mar06 Nov07 Aug09 May11 tseretnI nepO S&P 500 Options x 107 12 10 8 6 4 2 0 Mar06 Nov07 Aug09 May11 tseretnI nepO VIX Options Note: This figure plots the monthly time series of the open interests of S&P 500 and VIX options from March 1, 2006 to May 31, 2011. Figure 3: Time Series of the S&P 500 Index and the VIX 1400 1300 1200 1100 1000 900 800 Jun 09 Nov 09 Jun 10 Nov 10 Jun 11 leveL 005 P&S 45 25 5 Jun 09 Nov 09 Jun 10 Nov 10 Jun 11 leveL XIV S&P 500 VIX Note: This figure plots the time series of the S&P 500 index and VIX from Jun 1, 2009 to May 31, 2011. 22

while Table 1 provides their summary statistics. We observe that the VIX ranges between 14.62 and 45.79, which is large enough to have both relatively low and high volatility levels. Moreover, Table 2 also presents summary statistics of option prices. It is worth pointing out that the differences between adjacent strikes of VIX options range from $0.50 to $5 for smaller strikes and from $1 to $10 for large strikes, which are significantly larger percentagewise than their counterparts for S&P 500 options. Therefore, the impact of price discreteness on the nonparametric estimation of VIX densities could be more severe than on the density estimation of the S&P 500 index, as discussed in Section 3.6. Table 1: Summary Statistics of the S&P 500 Index and VIX Mean Std Skew Kurt Min 25% 75% Max Index 1141.670 115.806 0.065 2.414 879.130 1070.453 1221.178 1363.610 Return 0.001 0.011 -0.330 4.822 -0.040 -0.004 0.006 0.043 VIX 22.307 4.900 0.891 4.173 14.620 18.000 25.153 45.790 Note: This table reports the summary statistics of the time series of S&P 500 index, return, and VIX from June 1, 2009 to May 31, 2011. Wefollowthedata-cleaningroutinecommonlyusedintheliterature; see, e.g., A¨ıt-Sahalia and Lo (1998). First, observations with bid or ask prices smaller than $0.025 are eliminated tomitigatetheeffectofpricingerrors. Foreachoption, wetakethemidquoteastheobserved option price. Due to liquidity concerns, we eliminate any options with zero open interests or trading volumes as well as options with time-to-maturity of less than 5 days. In addition, we only consider options with maturity of less than 136 days, because only VIX option contracts with maturities shorter than 6 months are offered by the CBOE after 2009. It is wellknownthatin-the-moneyS&P500optionsarelessliquidthanout-of-the-moneyoptions. Therefore, we delete in-the-money options, and use the put-call parity to construct prices of in-the-money call options from out-of-the-money put options. There is no such pattern 23

Table 2: Summary Statistics of S&P 500 and VIX Options SPO VXO Moneyness ITM ATM OTM ITM ATM OTM # of Records 72161 27144 33245 5573 16823 17726 Volume 102.12 82.73 35.76 2.43 37.55 32.86 Open Interest 1665.98 616.39 513.05 44.57 356.76 471.44 min 41.07 0.13 0.05 2.03 0.03 0.03 25% 120.19 18.15 0.35 6.40 1.83 0.18 Derivative Prices 50% 186.63 33.36 1.73 8.55 3.05 0.50 75% 284.04 49.00 6.40 11.70 4.55 1.03 max 1128.50 110.67 60.00 25.90 11.70 4.75 min 100 845 915 10 14 22.5 25% 825 1075 1160 15 21 35 Strike Prices 50% 940 1135 1225 17 25 42.5 75% 1040 1250 1325 20 30 50 max 1305 1415 3000 40 65 100 min 5 5 5 5 5 5 25% 18 17 21 21 27 28 Time-to-Maturity 50% 32 32 35 44 50 50 75% 53 51 56 77 82 78 max 136 136 136 128 128 126 min 16.05 9.39 9.48 25% 27.56 16.67 15.31 Implied Volatility 50% 33.85 19.84 18.13 75% 43.41 23.37 20.84 max 143.48 45.81 49.44 Note: This table reports the summary statistics (minimum, quantiles, and maximum) for selected S&P 500 andVIXoptionquotesfromJune1,2009toMay31,2011,includingthenumberofrecords,tradingvolume, open interest, option price, strike price, time-to-maturity, and implied volatility. In total, there are 132,550 trading records for S&P 500 options, and 40,122 records for VIX options. All options are call options. The prices of S&P 500 ITM call options are computed from OTM put options using the put-call parity for liquidity concerns. For S&P 500 options, ATM is defined as K/F 2[0.96,1.04], whereas for VIX options, it is defined as K/VIX2[0.9,1.5], with F as the forward price and K as the generic strike. 24

for VIX options and hence we only consider VIX call options. The last step is to eliminate option contracts that violate no-arbitrage conditions. The resulting sample covers a broad cross section of options, including 420,711 S&P 500 call options, and 53,530 VIX call options, which account for 50.84% and 54.18% of their total number of records, respectively. 4.2 Pricing Kernels of the Market Return The upper panels of Figure 4 provide nonparametric SPD estimates of the S&P 500 index for both low and high levels of VIX, fixed at 18.00 and 25.15 that correspond to the 25% and 75% quantiles of the VIX time series in our sample, respectively. We observe that index densities strongly depend on the VIX level Z . Conditional on a low Z , p(s j(cid:28);s ;z ) has t t T t t pronounced spikes, while the density becomes more dispersed when Z rises to a high level, t suggesting that volatility is a key state variable that should be included in the SPDs. To further demonstrate the importance of volatility in studying the SPDs of market return, the bottom panels of Figure 4 compare the nonparametric SPD estimates proposed by A¨ıt-Sahalia and Lo (1998) (AL) who neglect the volatility variable, with p(s j(cid:28);s ;z ) T t t conditional on the two different VIX levels of 18.00 and 25.15. We choose the time-tomaturity as 42 days, and compute the 95% confidence intervals by the asymptotic theory given in (19). Observe that our SPDs differ from the AL densities substantially, with the former more compact and showing higher spikes for a low Z , which confirms the importance t of incorporating volatility into the SPDs. Given the importance of volatility in SPDs of the S&P 500 index documented above, we now study whether and how volatility affects the shape of the pricing kernel p(R j(cid:28);z ). T t According to Section 3.3, we further estimate the physical densities of the S&P 500 return conditional on VIX and obtain the pricing kernel estimates. The top two panels of Figure 5 report the pricing kernel estimates with Z equal to 18.00 (left) and 25.15 (right) and a t maturity of 42 days. We observe that the pricing kernels conditional on either a low or high 25

Figure 4: State-Price Densities of the S&P 500 Index Z = 18 t x 10−3 8 6 4 2 0 120 100 800 80 1000 60 1200 40 1400 20 1600 0 Time−to−Maturity S&P Level S T P&S fo DPS Z = 25.15 t x 10−3 8 6 4 2 0 120 100 800 80 1000 60 1200 40 1400 20 1600 0 Time−to−Maturity S&P Level S T P&S fo DPS Z = 18, τ = 42 Days Z = 25.15, τ = 42 Days x 10−3 t x 10−3 t 6 6 SX Density SX Density AL Density AL Density 5 5 4 4 3 3 2 2 1 1 0 0 800 900 1000 1100 1200 1300 1400 1500 800 900 1000 1100 1200 1300 1400 1500 Note: The top panels provide our nonparametric estimates of SPDs of the S&P 500 index at various timeto-maturities, with volatility levels at 18.00 (left) and 25.15 (right) that correspond to the 25% and 75% quantilesoftheVIXtimeseriesinoursample, respectively. Thebottompanelscompareourestimates(SX) of index SPDs (black, solid) with those using the A¨ıt-Sahalia and Lo (1998) (AL) method (blue, dashed) for the maturity of 42 days, and two current VIX levels at 18.00 and 25.15. Dotted lines around each SPD estimate are the 95% confidence intervals constructed by the asymptotic distribution theory in (19). The interest rate and dividend are fixed at their averages, 2.15% and 2.06%, respectively. 26

VIX level exhibit a decreasing shape, consistent with a standard expected utility theory, which prescribes that the pricing kernel decrease when expected returns are increasing. In contrast, the bottom left panel of Figure 5 shows that the unconditional estimator of the pricing kernel shows a pronounced U-shape, consistent with what have been found in the literature (Jackwerth (2000) and Bakshi et al. (2010)). Therefore, it is the volatility factor, missing in the unconditional estimates, that may lead to the puzzling U-shape. Specifically, high volatility signals bad future investment opportunities, and investors should have high marginal utility in such a state. Hence, the pricing kernel of market return conditional on a high volatility, which equals the marginal utility up to a re-scaling, is higher than that conditional on a low volatility, as shown in the bottom left panel of Figure 5. The unconditional pricing kernel, however, is a mixture of pricing kernels conditional on different volatility levels, and could exhibit a U-shape when volatility switches from low to high levels. Our finding echoes the conclusions of parametric models in Jackwerth and Brown (2001), Chabi-Yo et al. (2008), Chabi-Yo (2011), and Christoffersen et al. (2010), that missing state variables in the pricing kernel may result in the U-shape. Without restricting the specification of pricing kernels, however, our result shows that stochastic volatility is the key but missing state variable of pricing kernels estimated in the literature. The pricing kernels conditional on low and high values of Z have different supporting t regions on the left and right tail. For instance, over the interval (0.08, 0.15), we only have the pricing kernel estimates conditional on a high Z . The reason is that the realized return t R never exceeds 8% given Z = 18:00, as can be seen from the scatter plot of (R ;Z ) on T t T t the bottom right panel of Figure 5. In fact, this observation implies that the unconditional pricing kernel estimates around high levels of market return R are dominated by high level T volatility, which shifts the unconditional estimates upwards, and explains why they present a U-shape. In other words, large market returns R are accompanied by high current volatility T Z ,becauseofwhichinvestorshaveahighmarginalutilitythatleadstotheincreasingportion t 27

Figure 5: Pricing Kernels of the S&P 500 Z = 18, τ = 42 t 0.4 0.3 0.2 0.1 0 −0.1 −0.2 −0.3 −0.4 −0.15 −0.1 −0.05 0 0.05 0.1 Return of S&P 500 Index lenreK gnicirP goL Z = 25.15, τ = 42 t 0.4 0.3 0.2 0.1 0 −0.1 −0.2 −0.3 −0.4 −0.1 −0.05 0 0.05 0.1 0.15 Return of S&P 500 Index lenreK gnicirP goL Comparison with Unconditional Pricing Kernel 0.6 0.5 0.4 0.3 0.2 0.1 0 −0.1 −0.2 −0.3 −0.4 −0.15 −0.1 −0.05 0 0.05 0.1 0.15 Return of S&P 500 Index lenreK gnicirP goL 0.2 Z = 25.15 t Z t = 18.00 0.15 Unconditional 0.1 0.05 0 −0.05 −0.1 −0.15 −0.2 15 20 25 30 35 40 45 50 VIX Level snruteR 005 P&S Scatter Plot 75% Quantile 25% Quantile Note: ThetoppanelsplotthenonparametricestimatesofpricingkernelsoftheS&P500indexreturn(black, solid)forthematurityof42days,withtwocurrentVIXlevelsat18.00and25.15thatcorrespondtothe25% and 75% quantiles of the VIX time series in our sample, respectively. Dotted lines are the 95% confidence intervals. The bottom left figure compares the unconditional pricing kernel (red, solid) with the previous two conditional pricing kernels. The bottom right panel presents the scatter plot of S&P 500 returns R T against the current VIX level Z . t 28

of the unconditional pricing kernel on the right tail. Overall, our nonparametric state-price density estimates differ significantly from those without conditioning on volatility, and confirms that volatility is a key state variable that shouldbeincludedinthepricingkernel. Moreimportantly,withoutimposinganyrestrictions on the dynamics of the market return and volatility, our pricing kernel estimates conditional on VIX show that stochastic volatility is the key variable responsible for the “pricing kernel puzzle.” 4.3 Pricing Kernels of the VIX We now present nonparametric estimates of SPDs and pricing kernels of the VIX and investigate their implications for the pricing of volatility risk. The top panels of Figure 6 present the VIX SPDs at various maturities conditional on two different levels of Z equal to 18.00 t and 25.15. We find first that the VIX SPDs are all positively skewed, with the probability of achieving higher VIX levels decreasing given a low time-t VIX level. Second, the SPD of VIX conditional on a high Z (right panel) has a spike around median volatility levels, t consistent with the conventional wisdom that volatility reverts to its long-run mean. Furthermore, we estimate the pricing kernel (cid:25)(Z j(cid:28);z ) by combining estimates of both T t risk-neutral and physical densities of the VIX. The bottom panels of Figure 6 provide nonparametric estimates of (cid:25)(Z j(cid:28);z ) for a maturity of 42 days and two different levels of Z T t t at 18.00 and 25.15. We observe that the pricing kernel exhibits a pronounced U-shape as a function of future VIX levels. Therefore, volatility risk is priced, and the price of volatility risk increases when volatility deviates from its median level. In other words, investors attach high marginal utility to payoffs received when the future volatility is either extremely high or low. Bakshi et al. (2010) document the U-shape for the volatility pricing kernel indirectly, by exploring the link between the monotonicity of the pricing kernel and returns of VIX option portfolios. They further provide a model with heterogeneity in beliefs to account for 29

Figure 6: State-Price Densities and Pricing Kernels of the VIX Z = 18 t 0.06 0.04 0.02 0 120 10 100 20 80 60 30 40 40 20 50 0 Time−to−Maturity VIX Level Z T XIV fo DPS Z = 25.15 t 0.06 0.04 0.02 0 120 10 100 20 80 60 30 40 40 20 50 0 Time−to−Maturity VIX Level Z T XIV fo DPS Z = 18, τ = 42 Days t 0.8 0.6 0.4 0.2 0 −0.2 −0.4 −0.6 −0.8 15 20 25 30 35 40 Z T lenreK gnicirP goL Z = 25.15, τ = 42 Days t 0.8 0.6 0.4 0.2 0 −0.2 −0.4 −0.6 −0.8 15 20 25 30 35 40 Z T lenreK gnicirP goL Note: ThetoppanelsprovidethenonparametricestimatesofSPDsoftheVIXatvarioustime-to-maturities, with volatility level at 18.00 (left) and 25.15 (right) that correspond to the 25% and 75% quantiles of the VIX time series in our sample, respectively. The bottom panels plot the nonparametric estimates of VIX pricing kernels (black, solid) for the maturity of 42 days, and two current VIX levels at 18.00 and 25.15. Dottedlinesarethethe95%confidenceintervals. Theinterestrateanddividendarefixedattheiraverages, 2.15% and 2.06%, respectively. 30

the U-shape, in which the volatility market is dominated by investors with zero market risk. In contrast, we provide direct estimates of the volatility pricing kernel by nonparametric methods, which provide more robust information about the shape. In particular, we find that the volatility pricing kernel is asymmetric, and the asymmetry conditional on a high time-t volatility is much stronger than that conditional on a low volatility. This finding implies that investors price the volatility risk differently according to different scenarios of the economy, which presents new empirical regularities that need to be incorporated into models of volatility risk. In summary, our SPD estimates of VIX document empirical features of risk-neutral dynamics of volatility such as positive skewness and mean reversion. Although the volatility process under the physical measure is well documented as displaying a mean-reverting pattern using historical time series, its risk-neutral behavior is not crystal clear. Our findings uncover the risk-neutral dynamics of volatility without any parametric restrictions. More importantly, our estimates of the volatility pricing kernel show that investors have high marginal utility even in low volatility states, which supports the model with heterogeneity in beliefs. 4.4 In-Sample Fitting and Out-of-Sample Forecasts We evaluate the performance of our nonparametric estimator (SX) by comparing in-sample fitting and out-of-sample forecasts with two alternative methods discussed in A¨ıt-Sahalia and Lo (1998): the nonparametric approach without volatility factor (AL) in terms of both density and option implied volatility forecasts, and the martingale approach (MKT) for option implied volatility forecasts only. As it is widely used by practitioners, the MKT approach simply forecasts tomorrow’s implied volatility by interpolation using today’s implied volatility surface. Intuitively, a potential advantage of our estimator over the MKT method lies in the 31

inclusion of historical options with similar characteristics. As opposed to the MKT approach that relies exclusively on the cross section of options on the previous day, the SX estimator is ableto capture a more stablepricing function overtime, and hence is expected tooutperform the MKT approach in out-of-sample forecasting, although not surprisingly, the SX estimator may fit the cross section of option prices worse on certain days, but better on other days. With historical option prices incorporated, the AL estimator is also capable of capturing certain stability in the historical data, which helps make predictions. However, it misses an important volatility factor that is incorporated into the SX approach. Panel A of Table 3 reports the forecasting performance of the SX, AL, and MKT methods for option prices (quoted in implied volatility). For each date t, we adopt a preceding 16month window, within which the SX, AL, and MKT estimators for the target options are obtained. The selected target options have a maturity of 42 days with moneyness ranging between (cid:0)0:15 and 0:15. We forecast such options on day t+(cid:13), for (cid:13) = 0 (in-sample), and (cid:13) =7, 14, 21, 28, 35, 42, 63 and 84 days (out-of-sample) progressively. We repeat the procedure for each day t in the last 8 months of our sample period and average across days to obtain the root-mean-squared percentage difference between the predictions and the realized option prices. We observe first that the MKT approach outperforms the AL approach uniformly in forecasting option prices for the sample period we consider, which is in contrast with findings of A¨ıt-Sahalia and Lo (1998). However, this is not surprising as the AL estimator does not includevolatilityasaconditioningvariablewhichchangedsubstantiallyoverthesampleperiod we consider, i.e., June 1, 2009 – May 31, 2011. In contrast, the SX estimator outperforms both the AL and MKT methods especially for longer horizons. The superior performance of the SX estimator highlights the benefit of predicting by capturing certain stable price patterns in the historical data and incorporating the volatility factor. Not surprisingly, the MKT approach has a better in-sample performance given its implementation. 32

stsaceroF elpmaS-fo-tuO dna gnittiF elpmaS-nI :3 elbaT )%( rorrE tsaceroF ytilitaloV deilpmI :A lenaP 48 36 24 53 82 12 41 7 0 (cid:13) 65.71 65.71 88.81 29.61 70.71 28.51 86.61 12.41 23.71 XS 16.63 55.53 51.43 25.23 72.33 41.23 01.23 39.82 52.82 LA 03.22 36.02 08.81 22.81 62.71 77.51 81.61 49.51 47.31 TKM )%( rorrE tsaceroF ytisneD :B lenaP 48 36 24 53 82 12 41 7 0 (cid:13) 02.2 67.1 43.1 51.1 69.0 37.0 25.0 82.0 00.0 XS 53.7 57.6 52.6 70.6 96.5 74.5 22.5 50.5 00.5 LA rotamitse ruo ,)LA( rotamitse )8991( oL dna ailahaS-t¨ıA eht yb decudorp ytilitalov deilpmi fo srorre tsacerof egareva stroper A lenaP :etoN LA eht gnisu seitisned lartuen-ksir fo esoht stroper B lenaP elihw ,dohtem )TKM( noitalopretni elagnitram a dna ,)XS( XIV no lanoitidnoc ,shtnom 61 fo wodniw-gnillor a htiw detamitse era sDPS gnidnopserroc rieht dna ytilitalov deilpmi noitpo cirtemarapnon ehT .sdohtem XS dna ehT .1102 ,13 yaM ot 9002 1 enuJ morf sisab gnillor yliad a no γ snoziroh tsacerof suoirav rof detareneg era stsacerof elpmas-fo-tuo dna .syad 24 sa nesohc si secirp noitpo dna sDPS htob rof ytirutam-ot-emit 33

Panel B of Table 3 reports the forecasting performance of the SX and AL estimators for state-price densities. The empirical design is similar to the forecasting exercise of option implied volatility, with a 16-month window, a target maturity of 42 days, and horizons of (cid:28) =7, 14, 21, 28, 35, 42, 63, and 84 days for the out-of-sample performance. We compute the average forecast error (root-mean-squared percentage difference) as a percentage of the mode value of the realized density over the last 8 months of our sample period. The realized density is computed by the SX approach using a 16-month window including the target day. ResultsinPanelBshowthattheSXestimatoroutperformstheALdensitysubstantiallyover all horizons, due to the missing volatility factor in AL densities. For example, the forecast error for (cid:28) = 84 is 2.2% and 7.4% for the SX and AL density estimators, respectively. 4.5 Robustness Checks As robustness checks, we verify the two dimension-reduction assumptions employed in our nonparametric procedure: the homogeneity of degree one for S&P 500 options, and the conditional independence of state-price densities of the VIX with respect to S . t Figure 7 plots nonparametric estimators of the implied volatility surface of S&P 500 options across both log-moneyness and time-to-maturities: one with the assumption of homogeneity of degree one (left panel) and the other without using it (right panel). We observe that the shape of the two surfaces match each other well in general, although there are slight differences around the boundaries where nonparametric estimators usually incur relatively large biases. Moreover, the estimator without dimension reduction is noiseier as its convergence rate is lower due to the “curse of dimensionality.” Figure 8 plots estimates of VIX SPDs against the S&P 500 index S and VIX Z for t T (cid:28) = 42, and for Z = 18:00 and 25:15 respectively. We observe that conditional densities do t not vary much with S conditional on either the low or high level of Z , especially for the part t t away from the boundary. Overall, the dimension-reduction assumption for VIX options, i.e., 34

Figure 7: Robustness Check I 0.6 0.4 0.2 0 120 −0.2 100 −0.1 80 60 0 40 0.1 20 0.2 0 Time−to−Maturity Log Moneyness ytilitaloV deilpmI 005 P&S 0.6 0.4 0.2 0 120 −0.2 100 −0.1 80 60 0 40 0.1 20 0.2 0 Time−to−Maturity Log Moneyness ytilitaloV deilpmI 005 P&S Note: This figure plots the nonparametric estimates for the implied volatility surface of S&P 500 option prices. The left panel plots the estimates based on dimension reduction techniques, whereas the right panel plots the estimates without such techniques. Figure 8: Robustness Check II Z = 18 t 0.06 0.04 0.02 0 1400 10 1300 20 1200 30 1100 40 1000 50 900 S&P 500 Index VIX Level Z T XIV fo DPS Z = 25.15 t 0.06 0.04 0.02 0 1400 10 1300 20 1200 30 1100 40 1000 50 900 S&P 500 Index VIX Level Z T XIV fo DPS Note: This figure plots the nonparametric estimates of VIX state-price densities with both Z and S as t t conditioning variables. The time-to-maturity is τ =42, and Z is fixed at 18.00 and 25.15 respectively. t 35

the dependence of VIX SPD on S mainly through Z , seems valid for the sample period we t t consider. 5 Conclusion Volatilityhasbeenwelldocumentedasapricedriskfactor, andhenceanessentialcomponent of pricing kernels. Taking advantage of the rapidly developed volatility derivative markets, we provide nonparametric estimates of both SPDs and pricing kernels with volatility. We show that volatility is the key but missing state variable in the unconditional pricing kernel estimates that exhibit the puzzling U-shape. Moreover, we document a U-shaped pricing kernel of volatility, which cannot be captured by standard models with volatility risk, such as Bollerslev et al. (2009) and Drechsler and Yaron (2011). Therefore, it remains important to develop extensions of these models that are in compliance with our empirical findings. In addition, our framework extends the nonparametric option pricing method to allow for stochastic volatility, by exploring additional information from the VIX. Existing parametric stochastic volatility models face an unfortunate compromise between model flexibility and tractability. In contrast, our method enjoys several advantages, such as being model-free, robusttomodelmisspecificationandpricingmeasures, andcomputationallyefficient. Hence, our nonparametric option pricing approach with VIX alleviates the compromise to a great extent. 36

References A¨ıt-Sahalia, Y. and Duarte, J. (2003), “Nonparametric Option Pricing Under Shape Restrictions,” Journal of Econometrics, 116, 9–47. A¨ıt-Sahalia, Y., Fan, J., and Peng, H. (2009), “Nonparametric Transition-Based Tests for Jump-Diffusions,” Journal of the American Statistical Association, 104, 1102–1116. A¨ıt-Sahalia, Y. and Lo, A. (1998), “Nonparametric Estimation of State-Price-Densities Implicit in Financial Asset Prices,” Journal of Finance, 53, 499–547. —(2000), “NonparametricRiskManagementandImpliedRiskAversion,” Journal of Econometrics, 94, 9–51. Amengual, D. and Xiu, D. (2012), “Delving into Risk Premia: Reconciling Evidence from the S&P 500 and VIX Derivatives,” Tech. rep., CEMFI and University of Chicago Booth School of Business. Bakshi, G., Cao, C., and Chen, Z. (1997), “Empirical Performance of Alternative Option Pricing Models,” Journal of Finance, 52, 2003–2049. Bakshi, G.andKapadia, N.(2003), “Delta-HedgedGainsandtheNegativeMarketVolatility Risk Premium,” Review of Financial Studies, 16, 527–566. Bakshi, G. and Madan, D. (2008), “Investor Heterogeneity and the Non-Monotonicity of the Aggregate Marginal Rate of Substitution in the Market Index,” working paper, University of Maryland. Bakshi, G., Madan, D., and Panayotov, G. (2010), “Returns of Claims on the Upside and the Viability of U-Shaped Pricing Kernels,” Journal of Financial Economics, 97, 130–154. Bansal, R., Kiku, D., Shaliastovich, I., and Yaron, A. (2012), “Volatility, the Macroeconomy, and Asset Prices,” Tech. rep., University of Pennsylvania. 37

Bansal, R. and Yaron, A. (2004), “Risks for the Long Run: A Potential Resolution of Asset Pricing Puzzles.” Journal of Finance, 59. Barndorff-Nielsen, O. E. and Shephard, N. (2001), “Non-Gaussian Ornstein-Uhlenbeck- Based Models And Some Of Their Uses In Financial Economics,” Journal of the Royal Statistical Society, B, 63, 167–241. Bates, D. S. (2000), “Post-’87 Crash Fears in the S&P 500 Futures Option Market,” Journal of Econometrics, 94, 181–238. — (2012), “U.S. Stock Market Crash Risk, 1926-2010.” Journal of Financial Economics, 105, 229–259. Boes, M., Drost, F., and Werker, B. J. (2007), “Nonparametric Risk-Neutral Return and Volatility Distributions,” Tech. rep., Tilburg University. Bollerslev, T., Sizova, N., and Tauchen, G. (2012), “Volatility in Equilibrium: Asymmetries and Dynamic Dependencies,” Review of Finance, 16, 31–80. Bollerslev, T., Tauchen, G. E., and Zhou, H. (2009), “Expected Stock Returns and Variance Risk Premia,” Review of Financial Studies, 22, 4463–4492. Breeden, D. and Litzenberger, R. H. (1978), “Prices of State-Contingent Claims Implicit in Option Prices,” Journal of Business, 51, 621–651. Britten-Jones, M. and Neuberger, A. (2000), “Option Prices, Implied Price Processes, and Stochastic Volatility,” Journal of Finance, 55, 839–866. Broadie, M., Chernov, M., and Johannes, M. S. (2007), “Model Specification and Risk Premia: Evidence from Futures Options,” Journal of Finance, 62. Campbell, J., Christopher, P., Turley, B., and Giglio, S. (2012), “An Intertemporal CAPM with Stochastic Volatility,” Tech. rep., Harvard University. 38

Campbell, J.Y.andShiller, R.J.(1988), “StockPrices, Earnings, andExpectedDividends,” Journal of Finance, 43, 661–676. Carr, P., Geman, H., Madan, D. B., and Yor, M. (2003), “Stochastic Volatility for L´evy Processes,” Mathematical Finance, 13, 345–342. Carr, P. and Wu, L. (2009), “Variance Risk Premiums,” Review of Financial Studies, 22, 1311–1341. Chabi-Yo, F. (2011), “Pricing Kernels with Stochastic Skewness and Volatility Risk,” Management Science. Chabi-Yo, F., Garcia, R., and Renault, E. (2008), “State Dependence can Explain the Risk Aversion Puzzle,” Review of Financial Studies, 21, 973–1011. Chernov, M. and Ghysels, E. (2000), “A Study Towards a Unified Approach to the Joint EstimationofObjectiveandRiskNeutralMeasuresforthePurposeofOptionsValuation,” Journal of Financial Economics, 57, 407–458. Christoffersen, P., Heston, S., and Jacobs, K. (2010), “Option Anomalies and the Pricing Kernel,” Tech. rep., McGill University. Christoffersen, P., Jacobs, K., Ornthanalai, C., and Wang, Y. (2008), “Option Valuation with Long-Run and Short-Run Volatility Components,” Journal of Financial Economics, 90, 272–297. Drechsler, I. and Yaron, A. (2011), “What’s Vol Got to Do with It?” Review of Financial Studies, 24, 1–45. Egloff, D., Leippold, M., , and Wu, L. (2010), “The Term Structure of Variance Swap Rates andOptimalVarianceSwapInvestments,”Journal of Financial and Quantitative Analysis, 45, 1279–1310. 39

Epstein, L. and Zin, S. (1989), “Substitution, Risk aversion, and the Temporal Behavior of Consumption and Asset Returns: A Theoretical Framework,” Econometrica, 57, 937–969. Eraker, B. (2004), “Do Stock Prices and Volatility Jump? Reconciling Evidence from Spot and Option Prices,” Journal of Finance, 59. Eraker, B., Johannes, M. S., and Polson, N. (2003), “The Impact of Jumps in Equity Index Volatility and Returns,” Journal of Finance, 58, 1269–1300. Fan, J. and Gijbels, I. (1996), Local Polynomial Modelling and Its Applications, London, U.K.: Chapman & Hall. Fan, J. and Mancini, L. (2009), “Option Pricing with Aggregation of Physical Models and Nonparametric Statistical Learning,” Journal of American Statistical Association, 104, 1351–1372. Fan, J., Yao, Q., and Tong, H. (1996), “Estimation of Conditional Densities and Sensitivity Measures in Nonlinear Dynamical Systems,” Biometrika, 83, 189–206. Fan, J. and Yim, T. H. (2004), “A Crossvalidation Method for Estimating Conditional Densities,” Biometrika, 91, 819–834. Jackwerth, J. (2000), “Recovering Risk Aversion from Option Prices and Realized Returns,” Review of Financial Studies, 13, 433–451. Jackwerth, J. and Brown, P. (2001), “The Pricing Kernel Puzzle: Reconciling Index Option Data and Economic Theory,” Tech. rep., University of Konstanz. Jackwerth, J. and Vilkov, G. (2013), “Asymmetric Volatility Risk: Evidence from Option Markets,” working paper. Joshi, M. (2007), “Log-type models, Homogeneity of Option Prices and Convexity,” Tech. rep., Melbourne University. 40

Li, H. and Zhao, F. (2009), “Nonparametric Estimation of State-Price-Densities Implicit in Interest Rate Cap Prices,” Review of Financial Studies, 22, 4335–4376. Mencia, J. and Sentana, E. (2012), “Valuation of VIX Derivatives,” Journal of Financial Economics, forthcoming. Pan, J. (2002), “The Jump-Risk Premia Implicit in Options: Evidence from an Integrated Time-Series Study,” Journal of Financial Economics, 63, 3–50. Rosenberg, J. V. and Engle, R. F. (2002), “Empirical Pricing Kernels,” Journal of Financial Economics, 64, 341–372. Ruppert, D., Sheather, S., and Wand, M. P. (1995), “An Effective Bandwidth Selector for Local Least Squares Kernel Regression,” Journal of American Statistical Association, 90. Ruppert, D. and Wand, M. (1994), “Multivariate Locally Weighted Least Squares Regression,” Annals of Statistics, 22, 1346–1370. Shephard, N. (2005), Stochastic Volatility, Oxford University Press. Tauchen, G. E. and Todorov, V. (2011), “Volatility Jumps,” Journal of Business and Economic Statistics, 29. Todorov, V. (2010), “Variance Risk Premium Dynamics: The Role of Jumps,” Review of Financial Studies, 23, 345–383. Yao, Q. and Hall, P. (2005), “Estimation for Conditional Distribution Functions via Dimension Reduction,” Annals of Statistics, 33, 1404–1421. Yao, Q. and Tong, H. (1998), “Cross-Validatory Bandwidth Selections for Regression Estimation Based on Dependent Data,” Journal of Statistical Planning and Inference, 68, 387–415. 41

Ziegler, A. (2007), “Why does Implied Risk Aversion Smile?” Review of Financial Studies, 20, 859–904. 42

Cite this document

APA

Zhaogang Song and Dacheng Xiu (2014). A Tale of Two Option Markets: Pricing Kernels and Volatility Risk (FEDS 2014-58). Board of Governors of the Federal Reserve System, Finance and Economics Discussion Series. https://whenthefedspeaks.com/doc/feds_2014-58

BibTeX

@techreport{wtfs_feds_2014_58,
  author = {Zhaogang Song and Dacheng Xiu},
  title = {A Tale of Two Option Markets: Pricing Kernels and Volatility Risk},
  type = {Finance and Economics Discussion Series},
  number = {2014-58},
  institution = {Board of Governors of the Federal Reserve System},
  year = {2014},
  url = {https://whenthefedspeaks.com/doc/feds_2014-58},
  abstract = {Using prices of both S&P 500 options and recently introduced VIX options, we study asset pricing implications of volatility risk. While pointing out the joint pricing kernel is not identified nonparametrically, we propose model-free estimates of marginal pricing kernels of the market return and volatility conditional on the VIX. We find that the pricing kernel of market return exhibits a decreasing pattern given either a high or low VIX level, whereas the unconditional estimates present a U-shape. Hence, stochastic volatility is the key state variable responsible for the U-shape puzzle documented in the literature. Finally, our estimates of the volatility pricing kernel feature a U-shape, implying that investors have high marginal utility in both high and low volatility states.},
}