feds · August 31, 2015

The Effects of Asymmetric Volatility and Jumps on the Pricing of VIX Derivatives

Abstract

This paper proposes a new collection of affine jump-diffusion models for the valuation of VIX derivatives. The models have two distinctive features. First, we allow for a positive correlation between changes in the VIX and in its stochastic volatility to accommodate asymmetric volatility. Second, upward and downward jumps in the VIX are separately modeled to accommodate the possibility that investors react differently to good and bad surprises. Using the VIX futures and options data from July 2006 through January 2013, we find conclusive evidence for the benefits of including both asymmetric volatility and upward jumps in models of VIX derivatives pricing. We do not, however, find evidence supporting downward jumps.

Finance and Economics Discussion Series Divisions of Research & Statistics and Monetary Affairs Federal Reserve Board, Washington, D.C. The Effects of Asymmetric Volatility and Jumps on the Pricing of VIX Derivatives Yang-Ho Park 2015-071 Please cite this paper as: Park, Yang-Ho (2015). “The Effects of Asymmetric Volatility and Jumps on the Pricing of VIX Derivatives,” Finance and Economics Discussion Series 2015-071. Washington: Board of Governors of the Federal Reserve System, http://dx.doi.org/10.17016/FEDS.2015.071. NOTE: Staff working papers in the Finance and Economics Discussion Series (FEDS) are preliminary materials circulated to stimulate discussion and critical comment. The analysis and conclusions set forth are those of the authors and do not indicate concurrence by other members of the research staff or the Board of Governors. References in publications to the Finance and Economics Discussion Series (other than acknowledgement) should be cleared with the author(s) to protect the tentative character of these papers.

The Effects of Asymmetric Volatility and Jumps on the Pricing of VIX Derivatives Yang-Ho Park Federal Reserve Board September 11, 2015 ABSTRACT This paper proposes a new collection of affine jump-diffusion models for the valuation of VIX derivatives. The models have two distinctive features. First, we allow for a positive correlation between changes in the VIX and in its stochastic volatility to accommodate asymmetric volatility. Second, upward and downward jumps in the VIX are separately modeled to accommodate the possibility that investors react differently to good and bad surprises. UsingtheVIXfuturesandoptionsdatafromJuly2006throughJanuary2013,we find conclusive evidencefor the benefitsof including both asymmetric volatility and upward jumps in models of VIX derivatives pricing. We do not, however, find evidence supporting downward jumps. JEL Classification: G12; G13 Keywords: VIX options; VIX futures; stochastic volatility; volatility smile; jump-diffusion We are very grateful for comments and suggestions from Yacine A¨ıt-Sahalia, Michael Gordy, JaideepOberoi,andseminarparticipantsattheIFSIDThirdConferenceonDerivatives. Thispaper benefits from the excellent research assistance of Juliette Lu. Disclaimer: The analysis and conclusionssetfortharethoseoftheauthorsanddonotindicateconcurrencebytheBoardofGovernorsor other members of its research staff. Send correspondence to Yang-Ho Park, Risk Analysis Section, BoardofGovernorsoftheFederalReserveSystem,20th&CStreets,NW,Washington,D.C.20551, Tel: (202) 452-3177; e-mail: yang-ho.park@frb.gov.

1 Introduction Over the past few years, there has been a great deal of financial innovation in volatility trading markets. A new collection of volatility derivatives, such as VIX futures, options, and exchange-traded products, have been introduced, making volatility trading more accessible to a broad range of investors.1 The increasing volume of trading in those products is largely due to the fact that they can be used as a so-called tail risk hedging strategy against stock market downturns. That is, because changes in market volatility are negatively correlated with stock market returns, investors can limit the loss of an equity portfolio by taking a long position in VIX futures or call options. Given the explosive growth in volatility trading, the objective of this paper is to understand the effects of asymmetric volatility of and jumps in the VIX on the pricing of VIX futures and options. We use the term “asymmetric volatility” to refer to the fact that the volatility of the VIX is not merely stochastic but also varies asymmetrically in response to changes in the VIX. Such asymmetry can be seen, for example, by looking at the dynamic relation between the VIX and the VVIX of the CBOE (Chicago Board Options Exchange). The VIX is a risk-neutral, forward-looking measure of market volatilityimpliedbyacrosssectionofS&P500indexoptions,whiletheVVIXisariskneutral, forward-looking measure of market volatility of volatility implied by a cross section of VIX options.2 The scatter plot in Figure 1 shows that there is a positive relation between changes in the VVIX and in the VIX. That is, the volatility of the VIX, as measured by the VVIX, tends to increase (decrease) as the VIX increases (decreases). Animportantimplicationofthisempiricalregularityisthatthestochasticvolatility factor implicit in the VIX options comprises two components—one that can be spanned by VIX futures and another that cannot. In other words, the VIX options market is nonredundant with the VIX futures market. To gauge the extent to which the options market can be spanned by the futures market, we run a regression of VVIX returns onto VIX returns, and find that the VIX changes can explain only 51 percent of the variation in the VVIX based on the adjusted R2 of the regression. 1 See,forexample,Whaley(2013)andAlexanderandKorovilas(2012)asreferencesonexchangetraded volatility products linked to the VIX. 2 The VVIX is computed by applying the VIX formula to the VIX options market. 2

Contrary to the empirical fact just discussed, the existing VIX option papers generally assume that the volatility of the VIX is either completely spanned or completely unspanned. For example, some authors, such as Gru¨nbichler and Longstaff (1996), Detemple and Osakwe (2000), and Goard and Mazur (2013), apply fully spanned volatility models in which VIX options are completely hedgeable by VIX futures, whereas others, such as Menc´ıa and Sentana (2012), introduce a completely unspanned volatility model in which innovations in the VIX index are uncorrelated with those in the volatility of the VIX index. It is evident that all of those models are unable to accommodate the important feature of the data that we have found— namely, asymmetric volatility. Hence, the chief goal of this paper is to contribute to the literature by studying the effects of asymmetric volatility on the pricing of VIX derivatives. Intuitively, a model that allows for asymmetry in volatility may have great potential to improve the pricing of VIX options because it is capable of explaining the positive skewness implicit in the options.3 The VIX options market shows a persistent deviation from a geometric Brownian motion regardless of time to maturity. In particular, out-of-the-money (OTM) VIX calls tend to have higher Black-Scholes implied volatility than OTM VIX puts, a tendency that is sometimes referred to as a volatility smile, implying that the option-implied VIX return distributions are positively skewed. This positive skewness may be in part attributable to asymmetric volatility. On top of asymmetric volatility, we also look at the impact of jumps in the VIX on the valuation of VIX futures and options because they can be another channel for explaining the positive skewness implied by VIX options. Good and bad surprises may arrive with different rates and sizes and investors may react differently to them.4 Hence, our models assume that upward and downward jumps occur independently with different frequencies and magnitudes. In this respect, our jump treatment is close in spirit to the currency option pricing model of Carr and Wu (2007), which separately models upward and downward jumps using time-changed Levy processes. 3 This echoes a well-known finding that the negative skewness implicit in stock index options is partlyassociatedwiththeleverageorvolatilityfeedbackeffect,whichindicatesanegativecorrelation between stock index returns and volatility changes. 4 The market’s asymmetric response to shocks or news announcements has been documented by Andersen, Bollerslev, Diebold, and Vega (2007) and Bakshi, Carr, and Wu (2008), among others. 3

Building on the affine jump-diffusion framework of Duffie, Pan, and Singleton (2000), we introduce a new family of dynamic models for the VIX with and without the features of asymmetric volatility and jumps and derive quasi-analytic solutions to the prices of futures and options. The models are characterized by three dynamic factors: one observed VIX index and two latent factors to capture time variations in the stochastic volatility and central tendency of the VIX. To reflect asymmetric volatility, we allow for a nonzero correlation between innovations in the VIX and in its stochastic volatility. Upward and downward jumps are both assumed to follow independentcompoundPoissonprocesseswitheachhavingitsownjumpintensityand exponential jump-size distribution. The models are then tested on the VIX futures and options data covering July 2006 through January 2013, via an unscented Kalman filter. Turning to the empirical results, we first evaluate the effects of asymmetric volatility on the pricing of VIX derivatives. By comparing the model with symmetric volatility and no jumps (SVV) with the model with asymmetric volatility and no jumps (AVV), we find that the latter is strongly preferred to the former in both in-sample and out-of-sample tests. The performance difference is statistically significant at the 1 percent level based on the Diebold and Mariano (2002) test. That is, allowing for asymmetry in volatility can make large improvements in fitting the prices of VIX futures and options. We next look at the effects of including upward jumps on top of asymmetric volatility by comparing the AVV model with the AVV-UJ model, the model with asymmetric volatility and upward jumps. The comparison shows decisive evidence in favor of the AVV-UJ model over the AVV model, except for in-sample futures pricing. The results are statistically significant at the 1 percent level according to the Diebold and Mariano (2002) test. That is, including upward jumps in the AVV model can make considerable improvements in VIX derivatives pricing. Lastly, we investigate whether downward jumps can have an incremental effect on the pricing of VIX derivatives by comparing the AVV-UJ model with the model with asymmetric volatility and asymmetric upward and downward jumps (AVV-AJ). The comparison does not result in any significant ranking of one model over the other. Including downward jumps makes little, if any, improvement in the pricing of both futures and options as long as asymmetric volatility and upward jumps are accounted 4

for. However, because we consider finite-activity jumps in this paper, we do not interpret our finding as implying that downward jumps are of no use in explaining VIXderivativesprices; rather, wearguethatupwardjumpsplayafarmoreimportant role in the pricing of VIX derivatives than downward jumps do. Aside from the VIX option pricing literature, this paper also adds to the literature on unspanned stochastic volatility. Unspanned volatility has been found to be important in many other asset classes. For example, unspanned volatility in the interest rate market has been reported by Collin-Dufresne and Goldstein (2002), Li and Zhao (2006), and Bikbov and Chernov (2009), among others. Trolle and Schwartz (2009b) emphasize the importance of including unspanned volatility in pricing commodity options. This paper is also related to the literature on the structure of jumps in the VIX. For example, Todorov and Tauchen (2011) and Wu (2011) both find that jumps in the VIX are more likely to have infinite activity, although they rely on different sources of information and estimation. More recently, Todorov, Tauchen, and Grynkiv (2014) examine asymmetry in upward and downward jumps in VIX dynamics and find that upward and downward jumps are approximately symmetric, although the former are slightlymoreactivethanthelatter. Inmorelooselyrelatedpapers, Eraker(2005)and Broadie, Chernov, and Johannes (2007) study the impact of volatility jumps in joint analyses of stock index returns and options data. Our paper is distinct from all of these papers in that our application takes a direct look at VIX options and futures, which may offer a better environment in which to scrutinize the characteristic of volatility jumps. The rest of the paper is organized as follows. Section 2 develops the models and methods used in this paper. The VIX options and futures data are described in Section 3. Section 4 shows the parameter estimates and some preliminary analysis, while Section 5 provides the main empirical results on pricing performance across the different model specifications. Section 6 concludes. 2 Models and Methods The affine jump-diffusion framework of Duffie, Pan, and Singleton (2000) has been widely used in the term structure modeling of interest rates and the valuation of 5

derivatives because it allows for analytic tractability. Dai and Singleton (2000); Heston (1993); Bakshi, Cao, and Chen (1997); and Bates (2000) are well-known examples. In this section, we introduce a new family of affine jump-diffusion models for the dynamics of the logarithm of the VIX and derive semi-closed-form solutions to the prices of futures and options. In addition, we provide a description of our estimation method and introduce a competing model from Menc´ıa and Sentana (2012), which will be used as a benchmark. 2.1 VIX dynamics We assume that the logarithm of the VIX follows an affine jump-diffusion process. This assumption is motivated by a well-known empirical finding that a logarithmic model does a better job of describing the volatility dynamics of a stock index than the square root model of Heston (1993) (see, for example, A¨ıt-Sahalia and Kimmel (2007), Jones (2003), and Durham (2013)). More importantly, Menc´ıa and Sentana (2012)comparelogarithmicandsquarerootmodelswithrespecttothepricingofVIX futuresandoptionsandfindthattheformerispreferredtothelatter. Furthermore,an affine modeling of the logarithm of the VIX is important because it permits negative jumps as well as positive ones, whereas an affine modeling of the VIX itself leaves no room for modeling negative jumps. As stated earlier, the objective of this paper is to understand the effects of asymmetric volatility of and jumps in the VIX on the pricing of VIX derivatives. Toward this end, it is desirable to start with a fairly realistic baseline model upon which new model characteristics are built. Our baseline model contains two latent factors in addition to the logarithmic VIX index, denoted by v = log(VIX ). Specifically, a latent t t factor, u , is introduced to reflect time variation in the long-run mean of the VIX, t which greatly helps to match VIX future prices along increasing times to maturity.5 The volatility of the VIX is also allowed to vary over time, driven by another latent factor, w . t Given a risk-neutral probability space (Ω,F,Q) and information filtration {F }, t 5 See Gallant, Hsu, and Tauchen (1999), Christoffersen, Heston, and Jacobs (2009), and A¨ıt- Sahalia, Amengual, and Manresa (2015), among others, for the importance of a long-run volatility factor. 6

we assume that the (logarithmic) VIX dynamics take the following form: √ Q Q Q Q Q dv = κ (u −v )dt+ w dB +J dN +J dN −λ δ dt−λ δ dt t v t t t 1t 1 1t 2 2t + + − − Q du = κ (u−u )dt+σ dB (1) t u t u 2t √ Q dw = κ (w−w )dt+σ w dB , t w t w t 3t where κ , κ , and κ capture the persistence of v , u , and w , respectively; σ and v u w t t t u σ control for the volatility of u and w , respectively; u and w capture the longw t t Q Q Q run means of u and w , respectively; and B , B , and B are standard Brownian t t 1t 2t 3t motions under the risk-neutral Q measure. To capture the asymmetric behavior of volatility, we allow for a nonzero correla- Q Q tion, denoted by ρ, between dB and dB . Although we expect ρ to be positive, we 1t 3t do not restrict the sign of ρ, letting the data speak as to its direction. We allow for jumps in v . Specifically, as one of our main research goals is to t understand the differential pricing implications of upward and downward jumps, we assume that they are driven by independent compound Poisson processes, with each Q Q having its own jump intensity and jump-size distribution. N and N denote risk- 1t 2t neutral Poisson processes driving upward and downward jumps with jump intensities Q λ and λ , respectively. Upward jump magnitudes, J , are assumed to follow an + − 1 independent exponential distribution with a positive mean, δ > 0, with the prob- + ability density function taking 1 exp(−x/δ ) if x > 0 and 0 otherwise. Similarly, δ+ + Q downward jump magnitudes, J , are assumed to follow an independent exponential 2 distribution with a negative mean, δ < 0, with the probability density function − taking 1 exp(−x/δ ) if x < 0 and 0 otherwise. |δ−| − An affine jump-diffusion model is a Markov process whose drift vector, variance matrix, andjumpintensitiesallhaveaffinedependenceonthestatevector(seeDuffie, Pan, and Singleton (2000)). To satisfy this condition, we assume zero correlations between dB Q and dB Q and between dB Q and dB Q .6 1t 2t 2t 3t The general form described in Equation (1) nests a range of model specifications with different features. Among those, we select four different specifications depending on whether there are restrictions on ρ, λ , or λ . The model specifications considered + − in this paper are summarized in Table 1. 6 Otherwise the conditional variance matrix of the model in Equation (1) will lose affine dependence on the state vector. 7

2.2 Characteristic function Thevirtueofaffinemodelsisthatvarioustransforms,suchasacharacteristicfunction, canbequasi-analyticallycalculatedbysolvingasystemofRiccatiordinarydifferential equations (ODEs). Specifically, given the model in Equation (1), we consider the characteristic function of v at time t under the Q measure, which is denoted by T g(v ,u ,w ,t,T;φ): t t t Q g(v ,u ,w ,t,T;φ) = E [exp(iφv )], (2) t t t t T Q where E denotes the conditional expectation using the information up to time t t √ under the Q measure and i = −1. Proposition 1. Under some regularity conditions, the characteristic function takes an exponentially affine form of (v ,u ,w ): t t t g(v ,u ,w ,t,T;φ) = exp[α(s)+β (s)v +β (s)u +β (s)w ], (3) t t t v t u t w t where s = T −t and the coefficients, α(s), β (s), β (s), and β (s), satisfy the system v u w of ODEs: 1 α˙(s) = κ uβ (s)+κ wβ (s)+ σ2β (s)2 u u w w 2 u u (cid:20) (cid:21) (cid:20) (cid:21) 1 1 −[λ δ +λ δ ]β (s)+λ −1 +λ −1 + + − − v + − 1−δ β (s) 1−δ β (s) + v − v β˙ (s) = −κ β (s) (4) v v v β˙ (s) = κ β (s)−κ β (s) u v v u u 1 1 β˙ (s) = −κ β (s)+ β (s)2+ σ2β (s)2+ρσ β (s)β (s), w w w 2 v 2 w w w v w where the boundary conditions are given as α(0) = 0, β (0) = iφ, β (0) = 0, and v u β (0) = 0. w Proof. See Appendix A. The system of ODEs in Equation (4) can be solved by numerical solvers such as the Runge-Kutta method. Using this method, however, raises a numerical issue. WhenODEsarestiff, anumericalsolverrequiresasmallerstepsizeinordertoachieve absolute stability, impairing computational efficiency (see, for example, Huang and Yu (2007)). The stiffness of a system of ODEs is associated with how much the 8

eigenvalues of the system differ in orders of magnitude; the larger the difference, the stiffer the system. The eigenvalues of our system of ODEs are equal to the persistence parameters, κ , κ , and κ . Among the parameters, κ and κ have a v u w v u wide discrepancy in orders of magnitude, as can be seen in Table 3. Specifically, κ v is roughly 20 to 25 times as large as κ . Because of this stiffness in our system, it is u desirable to use an implicit method instead of an explicit method, and we find that the Matlab ode15s solver works well in our application.7 2.3 Valuation of VIX futures and options Once the characteristic function has been calculated, it is straightforward to compute the prices of futures and options. Let F(t,T) denote the time-t price of a futures contract with a maturity of T. F(t,T) represents the risk-neutral expectation of the time-T VIX index given the information at time t. Proposition 2. F(t,T) is given by F(t,T) = g(v ,u ,w ,t,T;φ = −i). (5) t t t Proof. See Appendix A. Let C(t,T,K) and P(t,T,K) denote the time-t prices of call and put options, respectively, withamaturityofT andastrikepriceofK. Thesepricesaredetermined by Q C(t,T,K) = exp(−r τ)E [max(F(T,T)−K,0)] t t (6) Q P(t,T,K) = exp(−r τ)E [max(K −F(T,T),0)], t t where r is the risk-free rate at time t and τ = T −t is the time to maturity. Note t that the value of a futures contract should be equal to the VIX on its expiration date: F(T,T) = VIX . Because of this relation, the prices of call and put options can be T determined by the characteristic function of v . t 7 Note that the ODEs are state-independent so we do not need to repeat running a numerical ODE solver on every date for a given parameter set. In other words, given a parameter set, we numerically solve the ODEs just once and save them in a storage space. The stored ODE solution can be repeatedly used to compute the characteristic function on each date throughout the sample period. 9

Proposition 3. C(t,T,K) and P(t,T,K) are given by C(t,T,K) = exp(−r τ)[g(v ,u ,w ,t,T;φ = −i)Π (t,T,K)−KΠ (t,T,K)] t t t t 1 2 P(t,T,K) = exp(−r τ)[K(1−Π (t,T,K))−g(v ,u ,w ,t,T;φ = −i)(1−Π (t,T,K))], t 2 t t t 1 (7) where the complementary distribution functions, Π (t,T,K) and Π (t,T,K), may be 1 2 expressed as 1 1 (cid:90) ∞ (cid:20) exp(−iφlog(K))h(v ,u ,w ,t,T;φ) (cid:21) t t t Π (t,T,K) = + (cid:60) dφ 1 2 π iφ 0 (8) 1 1 (cid:90) ∞ (cid:20) exp(−iφlog(K))g(v ,u ,w ,t,T;φ) (cid:21) t t t Π (t,T,K) = + (cid:60) dφ, 2 2 π iφ 0 where h(v ,u ,w ,t,T;φ) = g(v ,u ,w ,t,T;φ−i)/g(v ,u ,w ,t,T;−i) and (cid:60) indicates t t t t t t t t t the real part of a complex number.8 Proof. See Appendix A. Π (t,T,K) indicatesthe probabilitythat a call optionwith a maturity ofT and a 2 strike price of K ends up in the money on its expiration date. Among different kinds of numerical integration, we choose to use the Gauss-Laguerre quadrature method with an order of 20 in our application. 2.4 Estimation The option pricing literature often estimates the models using a quasi maximum likelihood estimation based on a variant of Kalman filtering (see, for example, Carr and Wu (2003, 2007), Trolle and Schwartz (2009a,b), and Menc´ıa and Sentana (2012)). Christoffersen, Dorion, Jacobs, and Karoui (2014) show that unscented Kalman filtering is superior to extended Kalman filtering in the application of interest rate derivatives and comparable to particle filtering, which is computationally far more 8 One may want to replace the term g(v ,u ,w ,t,T;φ = −i) with the observed futures prices t t t (see, for example, the footnote 10 of Menc´ıa and Sentana (2012)). However, we do not take this approachforseveralreasons. First, wefindthattheobservedVIXfuturepriceshaveoftendeparted from those implied by the put-call parity condition to a large extent, although this departure is not reportedinthispaper. Second,whenthetermg(v ,u ,w ,t,T;φ=−i)isreplacedwiththeobserved t t t prices, we face the numerical issue that the correlation parameter is estimated to be virtually one. Lastly,usingtheobservedfuturepricesinlieuofg(v ,u ,w ,t,T;−i)cannotguaranteethepositivity t t t of the option pricing formula in Equation (7). 10

expensive. Hence, in this paper, we apply a quasi maximum likelihood method via an unscented Kalman filter.9 The Kalman filter requires us to recast the models into a state space form that comprises state equations and observation equations. First, it is necessary to introduce state equations for (u ,w ) for the physical P measure. To t t do so, we assume that u and w take the following physical dynamics: t t P du = κ (u−u )dt+η u dt+σ dB t u t u t u 2t √ (9) P dw = κ (w−w )dt+η w dt+σ w dB , t w t w t w t 3t where BP and BP are independent standard Brownian motions under the physical P 2t 3t measure and η u and η w capture the risk premiums for the u and w processes, u t w t t t respectively. Note that we do not need the physical dynamics for v because our t observation equations involve only the risk-neutral dynamics. For this reason, we are unable to pin down jump risk premiums. By applying an Euler approximation to Equation (9), we are able to define discrete-time state equations: √ u = u +κ (u−u )∆+η u ∆+σ ∆ε t+∆ t u t u t u 2,t+∆ (10) (cid:112) w = w +κ (w−w )∆+η w ∆+σ w ∆ε , t+∆ t w t w t w t 3,t+∆ where ∆ is the time interval and ε and ε are independent standard normal 2,t+∆ 3,t+∆ random variables. With respect to observation equations, we assume that VIX futures and options are observed with measurement errors. Specifically, following Trolle and Schwartz (2009b), we assume that measurement errors in log future prices are constant and that measurement errors in options prices are proportional to their market vegas, denoted by ν(t,T,K). That is, (cid:98) log(F(cid:98)(t,T)) = log(F(t,T))+σ F ξ 1,t C(cid:98)(t,T,K) = C(t,T,K)+ν (cid:98) (t,T,K)σ E ξ 2,t (11) P(cid:98)(t,T,K) = P(t,T,K)+ν (cid:98) (t,T,K)σ E ξ 3,t , where the hat symbol indicates the market price; σ and σ capture the sizes of F E measurement errors for futures and options, respectively; and ξ , ξ , and ξ are in- 1,t 2,t 3,t 9 We also implement an extended Kalman filter and find that the results are robust to the type of a Kalman filter. 11

dependent standard normal random variables. Details on unscented Kalman filtering are described in Appendix C. 2.5 Menc´ıa and Sentana (2012) model Recently, Menc´ıa and Sentana (2012) studied the impact of stochastic volatility on the valuation of VIX derivatives. Although ours is not the first paper to address the importance of this subject, we take a different approach to modeling stochastic volatilitythanMenc´ıaandSentana(2012)do. Thatis,stochasticvolatilityisdiffusion driven in our models, whereas it is jump driven in the Menc´ıa and Sentana (2012) model. To compare the diffusion-driven model to the jump-driven model, we estimate one of the extended models of Menc´ıa and Sentana (2012), which is hereafter referred to as the MS model. The MS model is specified under the risk-neutral measure as √ Q dv = κ (u −v )dt+ z dB t v t t t 1t Q du = κ (u−u )dt+σ dB (12) t u t u 2t Q dz = −λ z dt+dq , t z t t Q where z is a volatility factor and q is assumed to follow a compound Poisson process t t with intensity λ and an exponential jump distribution with a positive mean of δ . z z Note that the dynamics of z in Equation (12) belong to a class of the non-Gaussian t Ornstein-Uhlenbeck process of Barndorff-Nielsen and Shephard (2001). 3 Data Our application looks at the valuation of VIX futures and options, among other volatility derivatives. The markets for these derivatives have seen explosive growth in trading activity in recent years, as can be seen in Figure 2. The top panel of the figure shows that the number of VIX futures contracts traded increased dramatically from about 1 million in 2007 to about 24 million in 2012, and that most of the growth occurred after 2009, likely provoked by the recent financial crisis. The bottom panel of the figure shows that the dollar trading volume for the VIX options also increased substantially from around $3.3 billion in 2007 to around $16.1 billion in 2012. Note that VIX call options are more actively traded than VIX put options, which may be 12

associated with the fact that the former can be used as a means of hedging a stock market crash unlike the latter. Our data set comprises the daily prices of VIX futures and options. The futures data come from Thomson Reuters, Datastream, while the options data are from OptionMetrics, Ivy DB. The VIX futures market started on March 26, 2004, and the VIX options market opened about two years later on February 24, 2006. As we decided to include only the dates for which data on both VIX futures and options are available, our sample starts on July 1, 2006, and ends on January 31, 2013, with a total of 1,657 trading days.10 There are no early exercise premiums because VIX options are European. In general, VIX futures are available for a wider range of maturities than VIX options. We delete VIX futures for which there are no paired VIX options with the same maturity. This leaves us a total of 9,330 futures contracts. Option prices are taken from the bid–ask midpoint at each day’s close of the options market (3:15 p.m). A filtering scheme is applied to eliminate inaccurate or illiquidoptions. Specifically, wedeleteVIXoptionsforwhichthemidpriceislessthan 0.1, the time to maturity is less than eight days, the Black-Scholes implied volatility is null, or the relative bid–ask spread (defined as the bid–ask spread divided by the mid price) is larger than 0.3. We also discard the VIX options that violate the lower bound constraints: C(cid:98)(t,T,K) ≥ max(0,exp(−r t τ)(F(cid:98)(t,T)−K)) (13) P(cid:98)(t,T,K) ≥ max(0,exp(−r t τ)(K −F(cid:98)(t,T))), where the risk-free rates are obtained from OptionMetrics, Ivy DB. Option prices comprise intrinsic and time values, and only the time value portion is subject to model specifications. Moreover, OTM options tend to be more liquid than in-themoney ones. For this reason, we use only OTM call and put options in our empirical tests. Finally, a total of 102,525 options are included in our sample. Table 2 shows summary statistics such as the number of observations and the average price for VIX futures and options. In this table, the data are divided into two categories depending on time to maturity τ: short-term contracts with τ ≤ 10 Because the trading of VIX options was inactive in the very first few months, we exclude those months. 13

three monthsandlong-termcontractswithτ > three months. Foroptions, wefurther break down the data into four categories according to their moneyness: deep OTM puts with d < 0.85, moderate OTM puts with 0.85 < d < 1.0, moderate OTM calls with 1.0 < d < 1.3, and deep OTM calls with d > 1.3, where moneyness is defined as d = K/F(t,T). Note that call options have a larger number of observations and higher levels of Black-Scholes implied volatility than put options. Figure 3 shows the time series of the VIX, the VVIX, and the option-implied skewness. The VIX and the VVIX are provided by the CBOE, while the one-month option-implied skewness is computed using a modified version of the model-free approach of Bakshi, Kapadia, and Madan (2003). The detailed procedure is provided in Appendix D. Two features are worth noting. First, the VIX and the VVIX share some pronounced spikes, such as the bankruptcy of Lehman Brothers, which is associated with asymmetric volatility. Second, the implied skewness takes a positive value most of the time, implying that a nonnormal stochastic mechanism is essential for describing the return distribution implied by VIX options. 4 Parameter estimates and preliminary analysis This section starts by looking at two sets of parameter estimates. The first set is obtained using the entire data set from July 2006 through January 2013, whereas the second set is obtained using a subsample of four years from July 2006 through June 2010. Thesecondsetisusedtotestdifferentmodelspecificationsforanout-of-sample period from July 2010 through January 2013. In addition, we will examine whether asymmetric volatility and jumps help to match the observed term structure of the implied volatility and skewness, on average, which will give us a hint as to the pricing performance results discussed in the section that follows. 4.1 Parameter estimates Table 3 presents in-sample parameter estimates across various model specifications. The models with asymmetry in volatility yield very different volatility dynamics than the model without it. To see this, let us compare the SVV model with the AVV-type models (AVV, AVV-UJ, and AVV-AJ). First, we observe that the volatility process is 14

notverypersistentintheSVVmodel(κ = 5.506),whereasitismorepersistentinthe w AVV-typemodels(0.847 ≤ κ ≤ 1.553). Second, theSVVmodel(σ = 7.271)hasfar w w higher volatility of volatility than the AVV-type models (1.976 ≤ σ ≤ 2.610). Third, w we define the long-run mean of volatility under the physical measure as wP = κww κw−ηw and find that the SVV model (wP = 0.262) has a lower level of the long-run mean than the AVV-type models (0.553 ≤ wP ≤ 0.677). Overall, allowing for asymmetric volatility makes a large difference in the estimation of the volatility dynamics of the VIX, which in turn will have a large effect on the pricing of VIX derivatives. The degree of asymmetry in volatility differs depending on whether upward and downward jumps are included. The asymmetry parameter is highest in the AVV model (ρ = 0.870) and lowest in the AVV-UJ model (ρ = 0.422). The AVV-AJ model (ρ = 0.794) falls in between. Because of this result, a jump process would have an impact on the pricing of VIX derivatives not only per se but also through different degrees of asymmetric volatility. Interestingly, the variance risk premium parameter, η , takes a negative value w for each of the models considered, implying that the market volatility of volatility is negatively priced. A similar discovery is also made by Song (2012) and Park (2015) in a non-parametric fashion. This negative pricing is not surprising because a market volatility of volatility asset can be used as a hedge against stock market downturns as it has a negative exposure (or beta) to the stock market index. Now let us look at the frequencies and sizes of upward and downward jumps in the AVV-UJ and AVV-AJ models. The table shows that upward jumps take place about 2.7 times per annum with an average size of about 0.27, regardless of whether downward jumps are included. As expected, in the AVV-AJ model, upward jumps haveahigheroccurrencerateandalargersizethandownwardjumps; thatis,λ > λ + − and δ > |δ |. + − Table 3 also contains two model comparison metrics: Akaike (AIC) and Schwarz (SIC) information criteria. These are based on some penalty scores, which depend on the number of free parameters, less log likelihood values. The lower the information criteria, the better the model. Both information criteria offer decisive rankings of the models: AVV-AJ > AVV-UJ > AVV > SVV > MS. In short, asymmetric volatility, upward jumps, and downward jumps are all critical modeling features, and the SVV model is preferred to the MS model. 15

Each panel of Figure 4 shows a time series of the filtered volatility states that are obtained using the in-sample parameter estimates. The filtered volatility states in the SVV model tend to be higher and more volatile than those in the AVV model. In addition, the filtered volatility states are substantially higher in the models with no jumps (SVV and AVV) than in the models with jumps (AVV-UJ and AVV-AJ), although all models have similar evolutions over time. Table 4 presents out-of-sample parameter estimates across various model specifications. Thistablealsoindicatesthatthedecisiontoallowforasymmetryinvolatility has a large effect on the volatility dynamics and the decision to include jumps affects the extent of asymmetry in volatility. 4.2 Fitting the term structure of the implied volatility Here, we investigate the ability of the models to fit the average term structure of the option-implied volatility. The option-implied volatility is computed by applying the model-free approach of Bakshi, Kapadia, and Madan (2003) to a cross section of VIX options, with little modification. The detailed procedure is provided in Appendix D. The top panel of Figure 5 shows the scatter plot and kernel regression (solid line) of the implied volatility against months to maturity; the kernel regression is applied to capture the average term structure of the implied volatility. An interesting fact is that the term structure of the implied volatility is downward sloping, with the short end being nearly twice as high as the long end. This decreasing pattern is associated with the mean-reverting behavior of the VIX; the stronger the mean reversion of the VIX, the steeper the slope. To see whether the models under consideration can capture the decreasing pattern in the implied volatility, we compute the average term structure of volatility implied by each model. The model-implied volatility, denoted by IV(t,T), can be computed by differentiating the cumulant generating function for each model: (cid:115) 1 ∂2Ψ(t,T;φ)/∂φ2| IV(t,T) = φ=0 , (14) i2 (T −t) wherethecumulantgeneratingfunctionisdefinedasΨ(t,T;φ) = log(g(v ,u ,w ,t,T;φ)). t t t Note that IV(t,T) is annualized. The model-implied volatility is computed on each 16

day and for each time to maturity throughout the sample period using the in-sample parameter estimates in Table 3. We then run a kernel regression of the model-implied volatility onto months to maturity for each model. The bottom panel of Figure 5 shows that all of the four models—SVV (solid line), AVV (dashed line), AVV-UJ (dotted line), and AVV-AJ (dashed-dotted line)— do a good job of capturing the decreasing pattern in the observed implied volatility, although there are some minor discrepancies across the models. In particular, the SVV model tends to slightly overestimate the term structure of volatility, whereas the AVV-type models tend to slightly underestimate it. Overall, the models considered in this paper are all good at capturing the average term structure of the implied volatility, and the decision whether to allow for asymmetric volatility and jumps does not make a notable difference in this regard. 4.3 Fitting the term structure of the implied skewness This subsection addresses whether asymmetric volatility and jumps can help explain the average term structure of the implied skewness. As for the option-implied volatility, the option-implied skewness is computed by applying the model-free approach of Bakshi, Kapadia, and Madan (2003) to a cross section of VIX options, with little modification. The top panel of Figure 6 shows the scatter plot and kernel regression (solid line) of the implied skewness against months to maturity; the kernel regression is applied to capture the average term structure of the implied skewness. This figure reveals a surprising stylized fact: the average term structure of the implied skewness is nearly flat up to a one-year horizon.11 Put simply, the short-term skewness is approximately as high as the long-term skewness. To examine whether the models under consideration can produce such a flat pattern, wecomputethemodel-impliedskewness. Similartothemodel-impliedvolatility, the model-implied skewness, denoted by SKEW(t,T), can be computed by differen- 11 This finding echoes Carr and Wu (2003), who find that the stock index options market has the almost same slope of the implied volatility smile across different times to maturity. To imitate this pattern, they propose a finite moment log stable process. 17

tiating the cumulant generating function: (cid:12) 1 ∂3Ψ(t,T;φ)/∂φ3 (cid:12) SKEW(t,T) = φ=0 . (15) i3 (T −t)3/2IV(t,T)3 The bottom panel of Figure 6 shows that the AVV (dashed line) and AVV-UJ (dotted line) models both do a fairly good job of capturing the flat term structure of skewness. The AVV-UJ model tends to generate more of a positive skewness for short horizons than the AVV model, implying that upward jumps have a greater effect for short horizons than for long ones. In contrast, the AVV-AJ model (dashed-dotted line) does a poorer job of fitting the term structure of the implied skewness than the AVV and AVV-UJ models. Finally, the SVV model (solid line) has no ability to produce a positive skewness at all, (although it is capable of generating a positive excess kurtosis). 5 Pricing performance Our performance analysis focuses on four main comparisons. First, we compare the diffusion-drivenSVVmodelwiththejump-drivenMSmodel. Second, wecomparethe AVV model with the SVV model to test the importance of allowing for asymmetry in volatility. The third comparison is made between the AVV-UJ model and the AVV model inorder to examinethe effectsof includingupwardjumpson topof asymmetric volatility. Lastly, we compare the AVV-AJ model with the AVV-UJ model to investigate whether the addition of downward jumps can make an incremental improvement once asymmetric volatility and upward jumps are included. Tofacilitateouranalysis, wecomputeaperformancemetric, arootmeansquared error (RMSE), which is separately defined for futures and options as (cid:118) (cid:117) NF (cid:117) 1 (cid:88)(cid:16) (cid:17)2 RMSEF i = (cid:116) NF log(F(cid:98)(t n ,T n ))−log(F(t n ,T n )) n=1 (16) (cid:118) (cid:117) NO RMSEO = (cid:117) (cid:116) 1 (cid:88)(cid:16)O(cid:98)(t n ,T n ,K n )−O(t n ,T n ,K n )(cid:17)2 , i NO ν(t ,T ,K ) (cid:98) n n n n=1 where RMSEF and RMSEO denote the RMSEs implied by a model i for futures and i i 18

options, respectively; NF and NO indicate the total numbers of futures and options included in the sample, respectively; and O(cid:98) and O denote the market-implied and i i model-implied option prices, respectively. Hereafter, RMSE is used to denote either i RMSEF or RMSEO according to the context of analysis. Note that the performance i i metric is consistent with the assumption that we make about measurement errors in the observation equations. To measure the extent to which one model is better or worse than another, we compute pricing differences between two models. Let ∆RMSE denote the pricing i|j difference of a model i over a model j. ∆RMSE is defined as i|j ∆RMSE = 100×log(RMSE /RMSE ), (17) i|j i j whereRMSE andRMSE denotestheRMSEsimpliedbymodelsiandj,respectively. i j A negative (positive) value of ∆RMSE means that model i yields lower (higher) i|j pricing errors than model j, implying that the pricing performance of the former is better (worse) than that of the latter by a percentage of that value. Tables 5 and 6 show in-sample and out-of-sample pricing performance, respectively, across the different models. From left to right, the groups of columns show the pricing errors as measured by RMSE ; the pricing improvements relative to the SVV i model as measured by ∆RMSE ; the pricing improvements relative to the AVV i|SVV model as measured by ∆RMSE ; and the pricing improvements relative to the i|AVV AVV-UJ model as measured by ∆RMSE , respectively. The main results for i|AVV-UJ all futures and options are shown in the top two panels, and the options are broken down into calls and puts and by moneyness levels in the lower panels. To assess the statistical significance of a pricing performance difference, we make pairwise model comparisons based on the Diebold and Mariano (2002) test. To do so, we compute a time series of RMSEs for each model, {RMSE }D , where D is the i,t t=1 total number of evaluation days and RMSE is the date-t RMSE implied by a model i,t i. Note that {RMSE }D is separately obtained for in-sample and out-of-sample i,t t=1 tests. Next, we obtain a series of the differences in RMSEs between every pair of the models, d = RMSE −RMSE , t = 1,...,D, and define a test statistic between i,j,t i,t j,t 19

models i and j as d z = i,j,t , (18) i,j σ(d ) i,j,t where d is a sample mean of the differences and σ(d ) is a heteroskedasticityi,j,t i,j,t and autocorrelation-consistent estimate of the sample standard deviation of the differences.12 Diebold and Mariano (2002) show that the test statistic in Equation (18) follows a standard normal distribution under the null hypothesis that there is no statistically significant difference between models i and j. Tables 7 and 8 display in-sample and out-of-sample test statistics, respectively. A negative (positive) statistic in a cell (i,j) indicates that model i outperforms (underperforms) model j. Panels A and B correspond to VIX futures and options, respectively. The symbols *, **, and *** represent statistical significance at the 10, 5, and 1 percent levels, respectively. 5.1 Diffusion-driven versus jump-driven volatility ThissubsectionfocusesoninvestigatingwhethertheSVVmodelcanpriceVIXfutures and options better than the MS model. With respect to futures, the pricing errors and pairwise model comparisons are shown in Panel A of Tables 5–8. The SVV model has an in-sample RMSE of 0.1842 and an out-of-sample RMSE of 0.2252, while the MS model has an in-sample RMSE of 0.1853 and an out-of-sample RMSE of 0.2332. The ∆RMSE metrics indicate that the futures pricing performance of the SVV i|SVV model is better than that of the MS model by 1.18 percent in the in-sample test and by 7.02 percent in the out-of-sample test. The pricing difference is statistically significant at the 1 percent level only in the out-of-sample test, with a z statistic of ±7.58 . With respect to options, the pricing errors and pairwise model comparisons are shown in Panel B of Tables 5–8. The SVV model has an in-sample RMSE of 0.0939 and an out-of-sample RMSE of 0.1199, while the MS model has an in-sample RMSE of 0.1037 and an out-of-sample RMSE of 0.1475. The ∆RMSE metrics indicate i|SVV that the options pricing performance of the SVV model is better than that of the MS model by 10.00 percent in the in-sample test and by 20.71 percent in the out-of- 12 To control for autocorrelations, we use an optimal number of lags following Newey and West (1994). 20

sample test. The performance difference is statistically significant at the 1 percent level with a z statistic of ±6.44 in the in-sample test and at the 1 percent level with a z statistic of ±12.26 in the out-of-sample test. To summarize, the SVV model outperforms the MS model except for the insample futures pricing. The superiority of the SVV model over the MS model implies that the volatility of the VIX is better described by a diffusive process rather than by a finite-activity jump process.13 For this reason, the SVV model is preferred over the MS model as a baseline model upon which to add asymmetric volatility and jumps. 5.2 Effects of asymmetric volatility Thissubsectionfocusesondiscussingtheeffectsofallowingforasymmetryinvolatility on the pricing of futures and options by comparing the AVV model with the SVV model. Withrespecttofutures, theAVVmodelhasanin-sampleRMSEof0.1804and an out-of-sample RMSE of 0.2201. Based on the ∆RMSE metrics, the futures i|SVV pricing performance of the AVV model is better than that of the SVV model by 4.22 percent in the in-sample test and by 4.58 percent in the out-of-sample test. The pricing difference is statistically significant at the 5 percent level with a z statistic of ±2.52 in the in-sample test and at the 1 percent level with a z statistic of ±8.17 in the out-of-sample test. Furthermore, the futures pricing improvements vary by time to maturity. Specifically, the AVV model generates better futures pricing than the SVV model for short-term horizons both in-sample and out-of-sample but not for long-term horizons. With respect to options, the AVV model has an in-sample RMSE of 0.0885 and an out-of-sample RMSE of 0.1154. The ∆RMSE metrics suggest that the i|SVV options pricing performance of the AVV model is better than that of the SVV model by 5.87 percent in the in-sample test and by 3.87 percent in the out-of-sample test. The performance difference is statistically significant at the 1 percent level with a z statistic of ±7.08 in the in-sample test and at the 1 percent level with a z statistic of ±6.49 in the out-of-sample test. Furthermore, we find that allowing for asymmetry in volatility reduces option pricing errors for all horizons in both in-sample and out- 13 ThispaperconsidersonlytwospecificationsofstochasticvolatilityoftheVIX:theMSandSVV models. A further analysis of other specifications, such as infinite-activity jumps or a combination of diffusive and jump processes, would be an interesting topic for future research. 21

of-sample tests. Taking our analysis one step further, we break down option pricing errors into puts and calls. Panels C and D of Table 5 show the in-sample RMSEs for OTM puts and calls, respectively. Compared to the SVV model, the AVV model substantially improves the pricing of put options by 14.60 percent but improves the pricing of call options only by 1.02 percent. Similarly, Panels C and D of Table 6 show the outof-sample RMSEs for OTM puts and calls, respectively. Again, compared with the SVV model, the AVV model makes a large improvement in the pricing of put options by 11.45 percent, but only a very small improvement in the pricing of call options by 1.86 percent. To sum up, we find decisive evidence that asymmetry in volatility has an enormous and statistically significant effect on the valuation of VIX futures and options. The AVV model is strongly preferred to the SVV model in both in-sample and outof-sample tests. Especially, asymmetry in volatility plays a crucial role in fitting the prices of puts and short-term futures. Given the fact that OTM put prices are usually found to be cheaper than OTM call prices with an equivalent moneyness, our result implies that the underpricing of VIX puts is more likely to be driven by asymmetric volatility rather than upward jumps. 5.3 Effects of upward jumps Wenowinvestigatetheeffectsofincludingupwardjumpsontopofasymmetricvolatility on the pricing of futures and options by comparing the AVV-UJ model with the AVV model. With respect to futures, the AVV-UJ model has an in-sample RMSE of 0.1799 and an out-of-sample RMSE of 0.2163. The ∆RMSE metrics indicate i|AVV that the futures pricing performance of the AVV-UJ model is better than that of the AVV model by 0.53 percent in the in-sample test and by 3.51 percent in the out-ofsample test. Hence, the statistical significance is obtained only for the out-of-sample test at the 1 percent level with a z statistic of ±6.50. Furthermore, the futures pricing improvement depends on time to maturity in both in-sample and out-of-sample tests. Incorporating upward jumps improves the pricing of long-term futures but worsens the pricing of short-term futures. This finding can be contrasted with the previous finding that allowing for asymmetry in 22

volatility helps to improve the pricing of short-term contracts only. Taken together, asymmetric volatility and upward jumps have opposing yet complementary influences on futures pricing performance. With respect to options, the AVV-UJ model has an in-sample RMSE of 0.0817 and an out-of-sample RMSE of 0.1094. The ∆RMSE metrics suggest that the i|AVV options pricing performance of the AVV-UJ model is better than that of the AVV model by 7.96 percent in the in-sample test and by 5.33 percent in the out-of-sample test. The performance difference is statistically significant at the 1 percent level with a z statistic of ±8.16 in the in-sample test and at the 1 percent level with a z statistic of ±11.71 in the out-of-sample test. Furthermore, we find that including upward jumps is effective regardless of option maturities in both in-sample and out-of-sample tests, although the improvement is greater for long-term options than for short-term ones. We now break down option pricing errors into puts and calls. Panels C and D of Tables 5 and 6 show the in-sample and out-of-sample RMSEs for OTM puts and calls, respectively. The AVV-UJ model greatly improves the pricing of cal in both insample and out-of-sample tests, but there is no significant pricing difference between the two models with respect to the pricing of put options. Based on the empirical results above, we conclude that upward jumps have a sizableandstatisticallysignificantimpactonthevaluationofVIXfuturesandoptions. The AVV-UJ model is strongly preferred to the AVV model in both in-sample and out-of-sample tests, except for the in-sample futures pricing. Especially, upward jumps play a pivotal role in pricing calls and long-term futures. Given the fact that OTM call prices are usually more expensive than OTM put prices with an equivalent moneyness, our result implies that high levels of call prices are more likely to be driven by upward jumps rather than by asymmetric volatility. 5.4 Effects of downward jumps Lastly, we evaluate the incrementaleffect of downward jumps on thepricing of futures and option by comparing the AVV-AJ model with the AVV-UJ model. Based on the ∆RMSE metrics, including downward jumps makes little improvement i|AVV-UJ in the pricing of futures and options to the extent that asymmetric volatility and 23

upward jumps are already accounted for. The statistical significance is obtained only for in-sample futures pricing but the size of improvement is quite small with ∆RMSE = −0.86. i|AVV-UJ To see if the inefficacy of downward jumps relies on our particular choice of jump specifications, we also consider an alternative specification of upward and downward jumps. In particular, we estimate the AVV model with double exponential jumps as in Kou (2002) and Kou and Wang (2004), where upward and downward jumps are driven by a single compound Poisson process (see Appendix E for details on the model specification). Once again, we fail to find evidence that the AVV model with double exponential jumps is better at pricing the VIX derivatives than the AVV-UJ model, although the results are not reported in this paper. 6 Conclusion Given the growing interest in volatility trading, this paper suggests a class of affine jump-diffusion models for the pricing of VIX derivatives. The models have two innovative features. First, to assess the effects of asymmetric volatility on the pricing of VIX derivatives, we allow for a positive correlation between changes in the VIX and in its stochastic volatility. Second, we separate out the roles of upward and downward jumps to better consider the reality that investors react differently to good and bad shocks, which may arrive independently with different rates and sizes. Our empirical analysis shows decisive evidence for the benefits of including both asymmetricvolatilityandupwardjumpsinpricingmodelsofVIXfuturesandoptions. In particular, we find that asymmetric volatility is essential for pricing put options and short-term futures, whereas upward jumps are crucial for pricing call options and long-term futures. That is, low levels of put prices are associated with asymmetric volatility, whereas high levels of call prices are likely to be driven by the possibility of upward jumps. In sum, asymmetric volatility and upward jumps both are required to price VIX derivatives, having opposing yet complementary effects on them. However, we fail to find consistent evidence for the benefit of including downward jumps in the pricing of the VIX futures and options. Given that all the jump specifications that we consider are of finite activity, we do not interpret our results as implying that downward jumps are of no use in explaining the VIX derivative 24

prices. Indeed, authors such as Todorov and Tauchen (2011) and Wu (2011) show that volatility jumps may be better described by an infinite-activity jump process than a finite-activity one. Therefore, it would be interesting to extend our research to models with infinite-activity jump processes (on top of asymmetric volatility). Lastly, we report some interesting features of the term structures of the implied volatility and skewness. Specifically, the term structure of the implied volatility has a downward slope, whereas the term structure of the implied skewness is nearly flat. Althoughallofthemodelsconsidereddoanicejobofcapturingthedecreasingpattern intheimpliedvolatility,onlytheAVVandAVV-UJmodelssuccessfullymatchtheflat pattern in the implied skewness, consistent with the result that asymmetric volatility and upward jumps are both crucial for pricing the VIX options. 25

References A¨ıt-Sahalia, Y., D. Amengual, and E. Manresa (2015): “Market-based estimation of stochastic volatility models,” Journal of Econometrics, 187, 418–435. A¨ıt-Sahalia, Y., and R. Kimmel (2007): “Maximum likelihood estimation of stochastic volatility models,” Journal of Financial Economics, 83, 413–452. Alexander, C., and D. Korovilas (2012): “Understanding ETNs on VIX futures,” Working paper, University of Reading. Andersen, T. G., T. Bollerslev, F. X. Diebold, and C. Vega(2007): “Realtime price discovery in global stock, bond and foreign exchange markets,” Journal of International Economics, 73(2), 251–277. Bakshi, G., C. Cao, and Z. Chen (1997): “Empirical performance of alternative option pricing models,” Journal of Finance, 52, 527–566. Bakshi, G., P. Carr, and L. Wu (2008): “Stochastic risk premiums, stochastic skewness in currency options, and stochastic discount factors in international economies,” Journal of Financial Economics, 87(1), 132–156. Bakshi, G., N. Kapadia, and D. Madan (2003): “Stock return characteristics, skewlaws,anddifferentialpricingofindividualequityoptions,”Review of Financial Studies, 16, 101–143. Barndorff-Nielsen, O. E., and N. Shephard(2001): “Non-GaussianOrnstein– Uhlenbeck-based models and some of their uses in financial economics,” Journal of the Royal Statistical Society: Series B (Statistical Methodology), 63(2), 167–241. Bates, D. (2000): “Post-’87 crash fears in S&P500 futures options,” Journal of Econometrics, 94, 181–238. Berndt, E., B. Hall, R. Hall, and J. Hausman (1974): “Estimation and inference in nonlinear structural models,” Annals of Economic and Social Measurement, 3, 653–665. Bikbov, R., and M. Chernov (2009): “Unspanned stochastic volatility in affine models: Evidence from Eurodollar futures and options,” Management Science, 55(8), 1292–1305. 26

Broadie, M., M. Chernov, and M. Johannes (2007): “Model specification and risk premia: Evidence from futures options,” Journal of Finance, 62, 1453–1490. Carr, P., and L. Wu (2003): “The finite moment log stable process and option pricing,” Journal of Finance, 58, 753–777. (2007): “Stochastic skew in currency options,” Journal of Financial Economics, 86, 213–247. (2009): “Variance risk premiums,” Review of Financial Studies, 22, 1311– 1341. Christoffersen, P., C. Dorion, K. Jacobs, and L. Karoui (2014): “Nonlinear Kalman filtering in affine term structure models,” Management Science, 60, 2248–2268. Christoffersen, P., S. Heston, and K. Jacobs (2009): “The shape and term structure of the index option smirk: Why multifactor stochastic volatility models work so well,” Management Science, 55, 1914–1932. Collin-Dufresne, P., and R. S. Goldstein (2002): “Do bonds span the fixed incomemarkets? Theoryandevidenceforunspannedstochasticvolatility,” Journal of Finance, 57(4), 1685–1730. Dai, Q., and K. J. Singleton (2000): “Specification analysis of affine term structure models,” Journal of Finance, 55(5), 1943–1978. Detemple, J., and C. Osakwe (2000): “The valuation of volatility options,” European Finance Review, 4(1), 21–50. Diebold, F. X., and R. S. Mariano (2002): “Comparing predictive accuracy,” Journal of Business and Economic Statistics, 20(1), 134–144. Duffie, D., J. Pan, and K. Singleton (2000): “Transform analysis and asset pricing for affine jump-diffusions,” Econometrica, 68, 1343–1376. Durham, G. B. (2013): “Risk-neutral modelling with affine and nonaffine models,” Journal of Financial Econometrics, 11, 650–681. 27

Eraker, B. (2005): “Do stock prices and volatility jump? Reconciling evidence from spot and option prices,” Journal of Finance, 59(3), 1367–1404. Gallant, A. R., C. Hsu, and G. Tauchen (1999): “Using daily range data to calibrate volatility diffusions and extract the forward integrated variance,” Review of Economics and Statistics, 81, 617–631. Goard, J., and M. Mazur (2013): “Stochastic volatility models and the pricing of VIX options,” Mathematical Finance, 23, 439–458. Gru¨nbichler, A., and F. A. Longstaff (1996): “Valuing futures and options on volatility,” Journal of Banking and Finance, 20(6), 985–1001. Heston, S. L. (1993): “A closed-form solution for options with stochastic volatility with applications to bond and currency options,” Review of Financial Studies, 6, 327–343. Huang, S., and J. Yu (2007): “On stiffness in affine asset pricing models,” Journal of Computational Finance, 10(3), 99–123. Jiang, G., and Y. Tian (2007): “Extracting model-free volatility from option prices: An examination of the VIX index,” Journal of Derivatives, 14, 35–60. Jones, C. (2003): “The dynamics of stochastic volatility: Evidence from underlying and options markets,” Journal of Econometrics, 116, 181–224. Julier, S. J. (2002): “The scaled unscented transformation,” in American Control Conference. Proceedings of the 2002, vol. 6, pp. 4555–4559. Kou, S. G. (2002): “A jump-diffusion model for option pricing,” Management Science, 48(8), 1086–1101. Kou, S. G., and H. Wang (2004): “Option pricing under a double exponential jump diffusion model,” Management Science, 50(9), 1178–1192. Li, H., and F. Zhao (2006): “Unspanned stochastic volatility: Evidence from hedging interest rate derivatives,” Journal of Finance, 61(1), 341–378. Menc´ıa, J., and E. Sentana (2012): “Valuation of VIX derivatives,” Journal of Financial Economics, 108, 367–391. 28

Newey, W. K., and K. D. West (1994): “Automatic lag selection in covariance matrix estimation,” Review of Economic Studies, 61(4), 631–653. Park, Y.-H. (2015): “Volatility of volatility and tail risk hedging returns,” Forthcoming in Journal of Financial Markets, Federal Reserve Board. Song, Z. (2012): “Expected VIX option returns,” Working paper, Federal Reserve Board. Todorov, V., and G. Tauchen (2011): “Volatility jumps,” Journal of Business and Economic Statistics, 29(3), 356–371. Todorov, V., G. Tauchen, and I. Grynkiv (2014): “Volatility activity: Specification and estimation,” Journal of Econometrics, 178, 180–193. Trolle, A. B., and E. S. Schwartz (2009a): “A general stochastic volatility model for the pricing of interest rate derivatives,” Review of Financial Studies, 22(5), 2007–2057. (2009b): “Unspanned stochastic volatility and the pricing of commodity derivatives,” Review of Financial Studies, 22(11), 4423–4461. Whaley, R. E. (2013): “Trading volatility: At what cost,” Journal of Portfolio Management, 40(1), 95–108. Wu, L. (2011): “Variance dynamics: Joint evidence from options and high-frequency returns,” Journal of Econometrics, 160, 280–287. 29

A Derivation of the VIX derivative prices Q We start by deriving the characteristic function, g(v ,u ,w ,t,T;φ) = E [exp(iφv )], t t t t T as in Heston (1993). Given the model in Equation (1), Ito’s lemma leads to (cid:104) (cid:105) dg = g +g (κ (u−v)−λ δ −λ δ )+g κ (u−u)+g κ (w−w) dt t v v + + − − u u w w (cid:104) (cid:105) √ √ + 1g w+ 1g σ2 + 1g σ2w+g ρσ w dt+g wdB Q +g σ dB Q +g σ wdB Q 2 vv 2 uu u 2 ww w vw w v 1t u u 2t w w 3t Q Q +[g(v+J ,u,w,t,T;φ)−g(v,u,w,t,T;φ)]dN 1 1t Q Q +[g(v+J ,u,w,t,T;φ)−g(v,u,w,t,T;φ)]dN . 2 2t (A.1) If a Markov process is affine, its cumulant generating function has affine dependence on the state vector: g(v ,u ,w ,t,T;φ) = exp[α(s)+β (s)v +β (s)u +β (s)w ]. (A.2) t t t v t u t w t Substituting Equation (A.2) into Equation (A.1), we obtain E [dg/g] t = −[α˙(s)+β˙ (s)v+β˙ (s)u+β˙ (s)w]+[κ (u−v)−λ δ −λ δ ]β (s) v u w v + + − − v dt +κ (u−u)β (s)+κ (w−w)β (s) u u w w + 1wβ (s)2+ 1σ2β (s)2+ 1σ2wβ (s)2+ρσ wβ (s)β (s) 2 v 2 u u 2 w w w v w (cid:16) (cid:17) (cid:16) (cid:17) Q Q +λ E[exp(β (s)J )]−1 +λ E[exp(β (s)J )]−1 , + v 1 − v 2 (A.3) where E[exp(β (s)J Q )] = 1 v 1 1−δ+βv(s) (A.4) E[exp(β (s)J Q )] = 1 . v 2 1−δ−βv(s) Note that g is a martingale by the law of iterated expectations. Rearranging Equation (A.3) as an affine form of (v,u,w) and using the fact that E [dg] = 0, we t 30

obtain the system of the following ODEs: 1 α˙(s) = κ uβ (s)+κ wβ (s)+ σ2β (s)2 u u w w 2 u u (cid:20) (cid:21) (cid:20) (cid:21) 1 1 −[λ δ +λ δ ]β (s)+λ −1 +λ −1 + + − − v + − 1−δ β (s) 1−δ β (s) + v − v β˙ (s) = −κ β (s) (A.5) v v v β˙ (s) = κ β (s)−κ β (s) u v v u u 1 1 β˙ (s) = −κ β (s)+ β (s)2+ σ2β (s)2+ρσ β (s)β (s), w w w 2 v 2 w w w v w where the boundary conditions are given as α(0) = 0, β (0) = iφ, β (0) = 0, and v u β (0) = 0. w Let F(t,T) denote the time-t price of a futures contract with a maturity of T. F(t,T) is equal to the characteristic function evaluated at φ = −i: Q F(t,T) = E [VIX ] t T Q = E [exp(v )] (A.6) t T = g(v ,u ,w ,t,T;φ = −i). t t t ToderivethepricingformulaforC(t,T,K), letusintroduceachangeofmeasurefrom Q to Q˜ by a Radon-Nikodym derivative dQ˜ = exp(vT) . Let us denote the characdQ E t Q [exp(vT)] teristic function of v under the Q˜ measure by h(v ,u ,w ,t,T;φ). h(v ,u ,w ,t,T;φ) T t t t t t t is given by Q˜ h(v ,u ,w ,t,T;φ) = E [exp(iφv )] t t t t T (cid:34) (cid:35) exp(v ) Q T = E exp(iφv ) t Q T E [exp(v )] t T (cid:34) (cid:35) (A.7) exp(i(φ−i)v ) Q T = E t Q E [exp(v )] t T g(v ,u ,w ,t,T;φ−i) t t t = . g(v ,u ,w ,t,T;−i) t t t 31

C(t,T,K) is calculated using g(v ,u ,w ,t,T;φ) and h(v ,u ,w ,t,T;φ): t t t t t t Q C(t,T,K) = exp(−r τ)E [max(VIX −K,0)] t t T = exp(−r τ)E Q [exp(v )1 −K1 ] t t T (vT≥log(K)) (vT≥log(K)) = exp(−r τ) (cid:104) E Q [exp(v )]E Q˜ [1 ]−KE Q [1 ] (cid:105) (A.8) t t T t (vT≥log(K)) t (vT≥log(K)) = exp(−r τ)[g(v ,u ,w ,t,T;φ = −i)Π (t,T,K)−KΠ (t,T,K)], t t t t 1 2 where Π (t,T,K) and Π (t,T,K) are given by the Levy inversion formula (see Ap- 1 2 pendix B): 1 1 (cid:90) ∞ (cid:20) exp(−iφlog(K))h(v ,u ,w ,t,T;φ) (cid:21) t t t Π (t,T,K) = + (cid:60) dφ 1 2 π iφ 0 (A.9) 1 1 (cid:90) ∞ (cid:20) exp(−iφlog(K))g(v ,u ,w ,t,T;φ) (cid:21) t t t Π (t,T,K) = + (cid:60) dφ, 2 2 π iφ 0 where (cid:60) indicates the real part of a complex number. A similar procedure can be applied to derive the price of a put option. B Levy inversion formula Let F(x) be the distribution function of a random variable. The Levy inversion for- (cid:82)∞ mulainvolvescomputingF(x)giventhecharacteristicfunction,g(φ) = exp(iφx)dF(x). −∞ To begin with, let us bring the following mathematical results: exp(ix)−exp(−ix) sin(x) = 2i g(−φ) = g(φ) (B.1) 2 (cid:90) ∞ sin(ux) sgn(x) = du, π u 0 32

where sgn(x) = −1 if x < 0; 0 if x = 0; and 1 if x > 0. Let us derive the following relation: 1−2F(x) = −F(x)+(1−F(x)) (cid:90) x (cid:90) ∞ = − dF(u)+ dF(u) −∞ x (cid:90) ∞ = sgn(u−x)dF(u) −∞ 2 (cid:90) ∞ (cid:90) ∞ sin((u−x)φ) = dφdF(u) π φ −∞ 0 i (cid:90) ∞(cid:90) ∞ exp(i(u−x)φ)−exp(−i(u−x)φ) = − dF(u)dφ π φ (B.2) 0 −∞ i (cid:90) ∞ exp(−ixφ)g(φ)−exp(ixφ)g(−φ) = − dφ π φ 0 i (cid:90) ∞ exp(−ixφ)g(φ)−exp(−ixφ)g(φ) = − dφ π φ 0 2 (cid:90) ∞ (cid:20) exp(−ixφ)g(φ) (cid:21) = (cid:61) dφ π φ 0 2 (cid:90) ∞ (cid:20) exp(−ixφ)g(φ) (cid:21) = (cid:60) dφ, π iφ 0 where(cid:60)and(cid:61)denotetherealandimaginarypartsofacomplexnumber, respectively. The distribution function is thus given by 1 1 (cid:90) ∞ (cid:20) exp(−ixφ)g(φ) (cid:21) F(x) = − (cid:60) dφ. (B.3) 2 π iφ 0 The option pricing formula calls for the complementary distribution function, Π(x): Π(x) = 1−F(x) 1 1 (cid:90) ∞ (cid:20) exp(−ixφ)g(φ) (cid:21) (B.4) = + (cid:60) dφ. 2 π iφ 0 C Unscented Kalman filtering Let X = (v ,u ) be the latent state vector. Suppose that the latent state vector is of t t t the form: X = Φ +Φ X +(cid:15) , (C.1) t+1 0 x t t+1 33

where var [(cid:15) ] = Q(X ). Kalman filtering comprises two stages: the prediction and t t+1 t theupdatestages. Thepredictionstageinvolvescomputingone-step-aheadpredictive ˆ mean and variance of X given X = E [X ] and P = var [X ]: t t t t t t t Xˆ = Φ +Φ Xˆ t+1|t 0 x t (C.2) (cid:48) P = Φ P Φ +Q(X ) t+1|t x t x t The update stage starts by defining a set of sigma points that match the predictive mean and variance of the latent vector. To be specific, a set of 2L+1 sigma points and weights are selected based on the scaled unscented transformation of Julier (2002), where L is the dimension of the latent vector: X = Xˆ 0 t+1|t (cid:16)(cid:113) (cid:17) X i = Xˆ t+1|t + (L+ξ)P t+1|t , i = 1,...,L (C.3) i (cid:16)(cid:113) (cid:17) X = Xˆ − (L+ξ)P , i = L+1,...,2L, i t+1|t t+1|t i−L whose weights are given by ξ Wm = 0 L+ξ ξ Wc = +(1−α2+β) 0 L+ξ (C.4) 1 Wm = , i = 1,...,2L i 2(L+ξ) 1 Wc = , i = 1,...,2L, i 2(L+ξ) (cid:0)(cid:112) (cid:1) where ξ = α2(L+κ)−L and (L+ξ)P is the ith column of the matrix square t+1|t i root. In our applications, we set α = 0.01, β = 2, and κ = 0. Suppose that the observation equations are of the form: Y = H(X )+Ω, (C.5) t t where H(X ) represents the pricing formula for futures and options and Ω denotes t the measurement errors. The predictive mean and variance of Y are computed using t 34

the sigma points and weights: 2L (cid:88) Yˆ = WmH(X ) t+1|t i i i=0 (C.6) 2L Py = (cid:88) Wc(H(X )−Yˆ )(H(X )−Yˆ )(cid:48)+Ω, t+1|t i i t+1|t i t+1|t i=0 where Y ˆ = E [Y ] and Py = var [Y ]. t+1|t t t+1 t+1|t t t+1 Finally, we are able to update the conditional mean and variance of X : t Xˆ = Xˆ +K (Y −Yˆ ) t+1 t+1|t t+1 t+1 t+1|t (C.7) P = P −K Py K (cid:48) , t+1 t+1|t t+1 t+1|t t+1 where 2L K = (cid:88) Wc(X −Xˆ )(H(X )−Yˆ )(cid:48)(Py )−1. (C.8) t+1 i i t+1|t i t+1|t t+1|t i=0 The log likelihood function is then given by T log(L) = − 1 (cid:88)(cid:104) log(2π)N +log (cid:16) |det(Py )| (cid:17) +(Y −Yˆ )(cid:48)(Py )−1(Y −Yˆ ) (cid:105) , 2 t t+1|t t+1 t+1|t t+1|t t+1 t+1|t t=1 (C.9) where N is the dimension of the observation equations at time t. The log likelihood t function is optimized via the Berndt, Hall, Hall, and Hausman (1974) algorithm. The standarderrors are computed byinvertingthe Hessianmatrix atthe optimalsolution. D Computation of the option-implied volatility and skewness The model-free, option-implied moments are obtained with the approach of Bakshi, Kapadia, and Madan (2003) with light modification. The original formula is based on the no-arbitrage relation that the underlying asset is tradable and so grows at a Q risk-free rate under the risk-neutral measure. That is, E (S ) = S exp(r (T − t)), t T t t where S is the price of an underlying asset at time t. However, this relation no t longer holds in the VIX derivatives market because the underlying VIX index is not tradable. Instead, we use the fact that a VIX futures price reflects the risk-neutral 35

Q expectation of the VIX, F(t,T) = E (VIX ), where F(t,T) is the time-t price of a t T VIX future with a maturity of T. The computation proceeds as follows. Let H (t,T), H (t,T), and H (t,T) denote the prices of three hypothetical se- 1 2 3 curities that pay quadratic, cubic, and quartic payoffs, respectively. These prices are given by (cid:16) (cid:16) (cid:17)(cid:17) (cid:90) ∞ 2 1−ln K H 1 (t,T) = K2 VIXt C(cid:98)(t,T,K)dK VIXt (cid:90) VIXt 2 (cid:0) 1+ln (cid:0)VIXt (cid:1)(cid:1) + K P(cid:98)(t,T,K)dK, (D.1) K2 0 (cid:16) (cid:17) (cid:16) (cid:16) (cid:17)(cid:17)2 (cid:90) ∞ 6ln K −3 ln K H 2 (t,T) = VIXt K2 VIXt C(cid:98)(t,T,K)dK VIXt (cid:90) VIXt 6ln (cid:0)VIXt (cid:1) +3 (cid:0) ln (cid:0)VIXt (cid:1)(cid:1)2 − K K P(cid:98)(t,T,K)dK, (D.2) K2 0 and (cid:16) (cid:16) (cid:17)(cid:17)2 (cid:16) (cid:16) (cid:17)(cid:17)3 (cid:90) ∞ 12 ln K −4 ln K H 3 (t,T) = VIXt K2 VIXt C(cid:98)(t,T,K)dK VIXt (cid:90) VIXt 12 (cid:0) ln (cid:0)VIXt (cid:1)(cid:1)2 +4 (cid:0) ln (cid:0)VIXt (cid:1)(cid:1)3 + K K P(cid:98)(t,T,K)dK, (D.3) K2 0 where C(cid:98)(t,T,K) and P(cid:98)(t,T,K) are the market prices of OTM VIX call and put options with a maturity of T and a strike price of K. The implied volatility and skewness are then computed by (cid:114) exp(r (T −t))H (t,T)−µ(t,T)2 t 1 IV(t,T) = T −t exp(r (T −t))H (t,T)−3µ(t,T)exp(r (T −t))H (t,T)+2µ(t,T)3 t 2 t 1 SKEW(t,T) = , (T −t)3/2IV(t,T)3 (D.4) where IV(t,T) is annualized and F(t,T) exp(r (T −t)) exp(r (T −t)) exp(r (T −t)) t t t µ(t,T) = −1− H (t,T)− H (t,T)− H (t,T). 1 2 3 VIX 2 6 24 t (D.5) In practice, the computation of the implied volatility and skewness is prone to 36

truncation and discretization errors, as reported by Jiang and Tian (2007). To reduce such errors, we interpolate or extrapolate option prices on a fine grid of varying strike prices following the approach of Carr and Wu (2009). First, for a given date and a given maturity, we require at least two observations for both puts and calls. We then generate a grid of strike prices with one-point increments, interpolate Black-Scholes implied volatility at each point, and translate the interpolated implied volatility into the option price. In addition, an extrapolation is used for grid points beyond the last observed strike price. E AVV model with double exponential jumps Kou (2002) and Kou and Wang (2004) develop a double exponential jump diffusion model in which both upward and downward jumps are driven by a single compound Poisson process. To be specific, we introduce the dynamics for v as follows: t √ Q Q Q dv = κ (u −v )dt+ w dB +J dN −λ[pδ +(1−p)δ ]dt t v t t t 1t t + − Q du = κ (u−u )dt+σ dB (E.1) t u t u 2t √ Q dw = κ (w−w )dt+σ w dB , t w t w t 3t Q where N denotes a risk-neutral Poisson process driving jumps with the intensity λ t and JQ has a probability density function that takes p exp(−x/δ ) if x > 0 and δ+ + 1−p exp(−x/δ ) if x < 0. The coefficients of the cumulant generating function of this |δ−| − model satisfy the following system of ODEs: 1 α˙(s) = κ uβ (s)+κ wβ (s)+ σ2β (s)2 u u w w 2 u u (cid:20) (cid:21) p 1−p −λ[pδ +(1−p)δ ]β (s)+λ + −1 + − v 1−δ β (s) 1−δ β (s) + v − v β˙ (s) = −κ β (s) (E.2) v v v β˙ (s) = κ β (s)−κ β (s) u v v u u 1 1 β˙ (s) = −κ β (s)+ β (s)2+ σ2β (s)2+ρσ β (s)β (s), w w w 2 v 2 w w w v w where the boundary conditions are given as α(0) = 0, β (0) = iφ, β (0) = 0, and v u β (0) = 0. w Note that the AVV model with double exponential jumps is different from the 37

AVV-AJ model where upward and downward jumps are driven by independent compoundPoissonprocesseswitheachhavingitsownjumpintensityandjump-sizedistribution. In the AVV-AJ model, upward and downward jumps have distinct jump-size distributions so a combination of upward and downward jumps cannot collapse to a single compound Poisson process as in the double exponential jumps. 38

Table 1: Summary of model specifications Model Description Constraints SVV Symmetric volatility, no jumps ρ = 0, λ = 0, and λ = 0 + − AVV Asymmetric volatility, no jumps λ = 0 and λ = 0 + − AVV-UJ Asymmetric volatility, upward jumps only λ = 0 − AVV-AJ Asymmetric volatility, asymmetric jumps Not applicable 39

Table 2: Summary statistics for VIX futures and options The moneyness is defined as d = K/F(t,T), where K is the strike price and F(t,T) is the futures price with a strike price of K and a maturity of T. Short-term contracts are those with no more than three months to maturity, and long-term contracts are those with more than three months to maturity. BSIV stands for Black-Scholes implied volatility. The futures data come from Thomson Reuters, Datastream, while the options data are from OptionMetrics, Ivy DB. Months to maturity Short Long Total All futures No. of futures 4,578 4,752 9,330 Average price 24.55 25.21 24.89 All OTM options No. of options 49,992 52,533 102,525 Average price 1.25 1.86 1.56 Average BSIV 1.05 0.84 0.94 Average vega 2.58 4.60 3.61 OTM puts No. of options 13,758 13,949 27,707 Average price 1.33 1.90 1.62 Average BSIV 0.68 0.52 0.60 Average vega 2.86 5.01 3.94 OTM calls No. of options 36,403 38,696 75,099 Average price 1.23 1.85 1.55 Average BSIV 1.20 0.95 1.07 Average vega 2.47 4.45 3.49 Deep OTM puts No. of options 6,382 7,639 14,021 (d<0.85) Average price 0.75 1.14 0.96 Average BSIV 0.65 0.48 0.56 Average vega 2.61 4.38 3.58 Moderate OTM puts No. of options 7,384 6,312 13,696 (0.85<d<1.0) Average price 1.83 2.82 2.29 Average BSIV 0.70 0.57 0.64 Average vega 3.08 5.76 4.31 Moderate OTM calls No. of options 17,424 17,433 34,857 (1.0<d<1.3) Average price 1.77 2.68 2.23 Average BSIV 1.16 0.94 1.05 Average vega 2.85 5.14 4.00 Deep OTM calls No. of options 18,987 21,275 40,262 (d>1.3) Average price 0.73 1.16 0.96 Average BSIV 1.23 0.96 1.09 Average vega 2.12 3.89 3.06 40

Table 3: Parameter estimates: In-sample This table shows the parameter estimates that are obtained using the full sample period from July 2006 through January 2013. AIC and SIC stand for Akaike and Schwarz information criteria, respectively. Standard errors are in parentheses. MS SVV AVV AVV-UJ AVV-AJ κ 8.956 6.384 7.393 6.488 6.576 v (0.008) (0.006) (0.006) (0.005) (0.006) ρ 0.870 0.422 0.794 (0.004) (0.005) (0.009) κ 0.386 0.324 0.319 0.256 0.258 u (0.001) (0.002) (0.001) (0.001) (0.001) u 2.979 2.877 2.997 3.073 3.106 (0.001) (0.002) (0.002) (0.002) (0.003) σ 0.088 0.442 0.419 0.282 0.293 u (0.004) (0.001) (0.002) (0.003) (0.003) η -0.039 -0.003 -0.022 -0.019 -0.024 u (0.013) (0.013) (0.027) (0.016) (0.017) κ 5.506 1.553 1.050 0.847 w (0.015) (0.008) (0.008) (0.008) w 1.613 1.629 2.137 1.956 (0.003) (0.012) (0.019) (0.027) σ 7.271 2.610 2.379 1.976 w (0.011) (0.009) (0.013) (0.012) η -28.429 -2.184 -2.926 -2.150 w (1.767) (0.642) (1.036) (0.869) λ 3.358 z (0.013) δ 3.391 z (0.007) λ 2.664 2.682 + (0.039) (0.036) δ 0.267 0.266 + (0.001) (0.001) λ 2.042 − (0.152) δ -0.217 − (0.009) σ 0.080 0.082 0.078 0.072 0.071 e (0.000) (0.000) (0.000) (0.000) (0.000) σ 0.031 0.031 0.030 0.030 0.029 f (0.000) (0.000) (0.000) (0.000) (0.000) log(L) 8,071 8,600 15,803 24,642 24,826 AIC -16,124 -17,177 -31,582 -49,255 -49,620 SIC -16,075 -17,118 -31,517 -49,179 -49,534 41

Table 4: Parameter estimates: Out-of-sample ThistableshowstheparameterestimatesthatareobtainedusingasubsampleoffouryearsfromJuly 2006 through June 2010. This parameter set is used for an out-of-sample test spanning July 2010 throughJanuary2013. AICandSICstandforAkaikeandSchwarzinformationcriteria,respectively. Standard errors are in parentheses. MS SVV AVV AVV-UJ AVV-AJ κ 11.512 9.602 10.074 9.458 9.652 v (0.014) (0.013) (0.011) (0.010) (0.011) ρ 0.606 0.381 0.683 (0.006) (0.007) (0.012) κ 0.495 0.477 0.427 0.399 0.403 u (0.001) (0.002) (0.001) (0.001) (0.001) u 2.751 2.714 2.782 2.785 2.796 (0.001) (0.002) (0.002) (0.002) (0.002) σ 0.236 0.414 0.402 0.303 0.309 u (0.002) (0.002) (0.003) (0.004) (0.004) η 0.025 0.019 0.017 0.016 0.014 u (0.021) (0.025) (0.050) (0.014) (0.017) κ 2.142 1.440 1.087 0.991 w (0.014) (0.011) (0.011) (0.011) w 2.709 2.565 3.464 3.146 (0.021) (0.027) (0.042) (0.047) σ 6.317 3.574 3.067 2.861 w (0.016) (0.022) (0.032) (0.035) η -10.625 -2.589 -3.395 -3.023 w (2.050) (1.485) (1.734) (1.721) λ 2.022 z (0.015) δ 4.482 z (0.019) λ 1.759 2.216 + (0.076) (0.082) δ 0.283 0.265 + (0.005) (0.004) λ 9.080 − (0.879) δ -0.120 − (0.006) σ 0.070 0.073 0.069 0.067 0.067 e (0.000) (0.000) (0.000) (0.000) (0.000) σ 0.026 0.025 0.024 0.024 0.024 f (0.000) (0.000) (0.000) (0.000) (0.000) log(L) 7,086 7,207 10,430 12,129 12,323 AIC -14,155 -14,392 -20,836 -24,231 -24,613 SIC -14,110 -14,338 -20,777 -24,162 -24,535 42

Table 5: Pricing performance across different models: In-sample Thistableshowsthein-sampleperformancemetricsacrossthedifferentmodels. ThemetricRMSE i standsfortherootmeansquarederrorofamodeliasgivenbyEquation(16). Themetric∆RMSE i|j measures the pricing performance difference of a model i relative to a model j, which is defined as ∆RMSE =100×log(RMSE /RMSE ). A negative (positive) value of ∆RMSE means that the i|j i j i|j pricing performance of the model i is better (worse) than that of the model j by a percentage of that value. The moneyness is defined as d = K/F(t,T), where K is the strike price and F(t,T) is the futures price with a strike price of K and a maturity of T. Short-term contracts are those with no more than three months to maturity, and long-term contracts are those with more than three months to maturity. RMSEi ∆RMSEi|SVV ∆RMSEi|AVV ∆RMSEi|AVV-UJ Short long Total Short Long Total Short Long Total Short Long Total PanelA:Allfutures MS 0.1867 0.1839 0.1853 -5.22 8.71 1.18 6.02 4.76 5.39 1.49 10.91 5.93 SVV 0.1916 0.1761 0.1842 0.00 0.00 0.00 11.24 -3.95 4.22 6.71 2.21 4.75 AVV 0.1812 0.1796 0.1804 -11.24 3.95 -4.22 0.00 0.00 0.00 -4.53 6.15 0.53 AVV-UJ 0.1853 0.1741 0.1799 -6.71 -2.21 -4.75 4.53 -6.15 -0.53 0.00 0.00 0.00 AVV-AJ 0.1836 0.1744 0.1791 -8.55 -1.88 -5.60 2.70 -5.83 -1.39 -1.83 0.32 -0.86 PanelB:Alloptions MS 0.1218 0.0829 0.1037 8.25 13.81 10.00 13.43 21.30 15.87 20.83 30.61 23.83 SVV 0.1122 0.0722 0.0939 0.00 0.00 0.00 5.18 7.48 5.87 12.59 16.80 13.83 AVV 0.1065 0.0670 0.0885 -5.18 -7.48 -5.87 0.00 0.00 0.00 7.41 9.32 7.96 AVV-UJ 0.0989 0.0611 0.0817 -12.59 -16.80 -13.83 -7.41 -9.32 -7.96 0.00 0.00 0.00 AVV-AJ 0.0985 0.0609 0.0814 -13.01 -17.10 -14.21 -7.82 -9.62 -8.35 -0.42 -0.30 -0.38 PanelC:OTMputs MS 0.1331 0.0785 0.1091 -7.91 0.35 -5.88 7.87 11.20 8.72 5.21 12.98 7.12 SVV 0.1440 0.0782 0.1157 0.00 0.00 0.00 15.78 10.85 14.60 13.12 12.63 13.01 AVV 0.1230 0.0702 0.1000 -15.78 -10.85 -14.60 0.00 0.00 0.00 -2.66 1.78 -1.60 AVV-UJ 0.1263 0.0689 0.1016 -13.12 -12.63 -13.01 2.66 -1.78 1.60 0.00 0.00 0.00 AVV-AJ 0.1245 0.0685 0.1003 -14.61 -13.23 -14.29 1.17 -2.37 0.32 -1.49 -0.60 -1.28 PanelD:OTMcalls MS 0.1183 0.0846 0.1023 18.00 18.93 18.33 16.56 24.92 19.35 30.59 37.72 32.98 SVV 0.0988 0.0700 0.0852 0.00 0.00 0.00 -1.44 5.99 1.02 12.59 18.79 14.66 AVV 0.1002 0.0659 0.0843 1.44 -5.99 -1.02 0.00 0.00 0.00 14.03 12.80 13.64 AVV-UJ 0.0871 0.0580 0.0736 -12.59 -18.79 -14.66 -14.03 -12.80 -13.64 0.00 0.00 0.00 AVV-AJ 0.0874 0.0579 0.0737 -12.22 -18.94 -14.45 -13.66 -12.94 -13.43 0.37 -0.15 0.21 PanelE:DeepOTMputs(d<0.85) MS 0.1152 0.0695 0.0932 -7.38 -4.78 -6.61 19.07 9.68 16.03 18.07 10.69 15.72 SVV 0.1241 0.0729 0.0995 0.00 0.00 0.00 26.45 14.46 22.64 25.46 15.48 22.33 AVV 0.0952 0.0631 0.0794 -26.45 -14.46 -22.64 0.00 0.00 0.00 -0.99 1.01 -0.31 AVV-UJ 0.0962 0.0625 0.0796 -25.46 -15.48 -22.33 0.99 -1.01 0.31 0.00 0.00 0.00 AVV-AJ 0.0923 0.0617 0.0772 -29.54 -16.69 -25.43 -3.09 -2.23 -2.79 -4.09 -1.21 -3.10 PanelF:ModerateOTMputs(0.85<d<1.0) MS 0.1467 0.0881 0.1232 -8.19 4.59 -5.46 2.79 12.38 4.89 -0.50 14.77 2.70 SVV 0.1592 0.0842 0.1301 0.00 0.00 0.00 10.98 7.78 10.35 7.69 10.18 8.16 AVV 0.1427 0.0779 0.1173 -10.98 -7.78 -10.35 0.00 0.00 0.00 -3.29 2.39 -2.19 AVV-UJ 0.1475 0.0760 0.1199 -7.69 -10.18 -8.16 3.29 -2.39 2.19 0.00 0.00 0.00 AVV-AJ 0.1466 0.0760 0.1194 -8.25 -10.28 -8.64 2.72 -2.49 1.71 -0.57 -0.10 -0.48 PanelG:ModerateOTMcalls(1.0<d<1.3) MS 0.1260 0.1017 0.1145 10.46 24.75 15.62 8.73 30.78 16.32 17.27 45.14 26.50 SVV 0.1135 0.0794 0.0979 0.00 0.00 0.00 -1.73 6.03 0.69 6.81 20.39 10.88 AVV 0.1154 0.0748 0.0972 1.73 -6.03 -0.69 0.00 0.00 0.00 8.54 14.36 10.19 AVV-UJ 0.1060 0.0648 0.0878 -6.81 -20.39 -10.88 -8.54 -14.36 -10.19 0.00 0.00 0.00 AVV-AJ 0.1066 0.0647 0.0882 -6.27 -20.42 -10.50 -8.00 -14.39 -9.81 0.53 -0.03 0.38 PanelH:DeepOTMcalls(d>1.3) MS 0.1107 0.0674 0.0904 28.76 9.60 22.34 27.82 15.54 23.90 53.14 26.26 43.67 SVV 0.0831 0.0612 0.0723 0.00 0.00 0.00 -0.93 5.94 1.56 24.39 16.66 21.32 AVV 0.0838 0.0577 0.0712 0.93 -5.94 -1.56 0.00 0.00 0.00 25.32 10.72 19.77 AVV-UJ 0.0651 0.0518 0.0584 -24.39 -16.66 -21.32 -25.32 -10.72 -19.77 0.00 0.00 0.00 AVV-AJ 0.0651 0.0516 0.0584 -24.42 -16.96 -21.47 -25.36 -11.02 -19.91 -0.03 -0.30 -0.14 43

Table 6: Pricing performance across different models: Out-of-sample This table shows the out-of-sample performance metrics across the different models. The metric RMSE stands for the root mean squared error of a model i as given by Equation (16). The metric i ∆RMSE measures the pricing performance difference of a model i relative to a model j, which i|j is defined as ∆RMSE = 100×log(RMSE /RMSE ). A negative (positive) value of ∆RMSE i|j i j i|j means that the pricing performance of the model i is better (worse) than that of the model j by a percentage of that value. The moneyness is defined as d = K/F(t,T), where K is the strike price and F(t,T) is the futures price with a strike price of K and a maturity of T. Short-term contracts arethosewithnomorethanthreemonthstomaturity,andlong-termcontractsarethosewithmore than three months to maturity. RMSEi ∆RMSEi|SVV ∆RMSEi|AVV ∆RMSEi|AVV-UJ Short long Total Short Long Total Short Long Total Short Long Total PanelA:Allfutures MS 0.2415 0.2242 0.2332 1.12 16.03 7.02 14.62 7.88 11.60 13.26 17.64 15.10 SVV 0.2402 0.2069 0.2252 0.00 0.00 0.00 13.50 -8.15 4.58 12.14 1.61 8.08 AVV 0.2245 0.2155 0.2201 -13.50 8.15 -4.58 0.00 0.00 0.00 -1.36 9.76 3.51 AVV-UJ 0.2261 0.2052 0.2163 -12.14 -1.61 -8.08 1.36 -9.76 -3.51 0.00 0.00 0.00 AVV-AJ 0.2251 0.2073 0.2166 -13.02 0.34 -7.78 0.48 -7.81 -3.21 -0.89 1.95 0.30 PanelB:Alloptions MS 0.1625 0.1320 0.1475 15.87 28.46 20.71 19.95 31.97 24.58 24.71 38.34 29.91 SVV 0.1386 0.0993 0.1199 0.00 0.00 0.00 4.07 3.51 3.87 8.84 9.88 9.20 AVV 0.1331 0.0959 0.1154 -4.07 -3.51 -3.87 0.00 0.00 0.00 4.76 6.37 5.33 AVV-UJ 0.1269 0.0900 0.1094 -8.84 -9.88 -9.20 -4.76 -6.37 -5.33 0.00 0.00 0.00 AVV-AJ 0.1263 0.0911 0.1095 -9.30 -8.56 -9.04 -5.23 -5.05 -5.16 -0.47 1.32 0.16 PanelC:OTMputs MS 0.1728 0.0794 0.1342 8.05 -1.02 6.33 20.56 6.49 17.78 21.87 10.24 19.62 SVV 0.1594 0.0802 0.1260 0.00 0.00 0.00 12.51 7.51 11.45 13.82 11.25 13.29 AVV 0.1407 0.0744 0.1123 -12.51 -7.51 -11.45 0.00 0.00 0.00 1.31 3.74 1.84 AVV-UJ 0.1388 0.0716 0.1103 -13.82 -11.25 -13.29 -1.31 -3.74 -1.84 0.00 0.00 0.00 AVV-AJ 0.1361 0.0719 0.1087 -15.78 -10.93 -14.76 -3.27 -3.42 -3.30 -1.96 0.32 -1.47 PanelD:OTMcalls MS 0.1612 0.1440 0.1525 18.53 32.18 24.41 19.77 35.01 26.27 25.69 41.84 32.55 SVV 0.1339 0.1044 0.1194 0.00 0.00 0.00 1.24 2.83 1.86 7.16 9.66 8.14 AVV 0.1323 0.1015 0.1172 -1.24 -2.83 -1.86 0.00 0.00 0.00 5.93 6.83 6.28 AVV-UJ 0.1246 0.0948 0.1101 -7.16 -9.66 -8.14 -5.93 -6.83 -6.28 0.00 0.00 0.00 AVV-AJ 0.1247 0.0962 0.1108 -7.10 -8.17 -7.52 -5.86 -5.35 -5.66 0.06 1.49 0.62 PanelE:DeepOTMputs(d<0.85) MS 0.0896 0.0628 0.0754 -13.83 -7.55 -11.41 3.23 0.46 2.11 7.15 2.71 5.33 SVV 0.1029 0.0678 0.0845 0.00 0.00 0.00 17.05 8.01 13.52 20.98 10.25 16.74 AVV 0.0867 0.0626 0.0738 -17.05 -8.01 -13.52 0.00 0.00 0.00 3.92 2.24 3.22 AVV-UJ 0.0834 0.0612 0.0715 -20.98 -10.25 -16.74 -3.92 -2.24 -3.22 0.00 0.00 0.00 AVV-AJ 0.0803 0.0613 0.0700 -24.78 -10.12 -18.84 -7.72 -2.10 -5.33 -3.80 0.14 -2.10 PanelF:ModerateOTMputs(0.85<d<1.0) MS 0.2309 0.1079 0.1924 12.72 4.69 11.66 24.05 11.70 22.36 24.74 16.94 23.71 SVV 0.2033 0.1030 0.1712 0.00 0.00 0.00 11.33 7.01 10.69 12.02 12.25 12.05 AVV 0.1815 0.0960 0.1539 -11.33 -7.01 -10.69 0.00 0.00 0.00 0.69 5.24 1.36 AVV-UJ 0.1803 0.0911 0.1518 -12.02 -12.25 -12.05 -0.69 -5.24 -1.36 0.00 0.00 0.00 AVV-AJ 0.1775 0.0916 0.1499 -13.56 -11.74 -13.30 -2.24 -4.73 -2.61 -1.55 0.51 -1.25 PanelG:ModerateOTMcalls(1.0<d<1.3) MS 0.1817 0.1976 0.1897 -0.07 32.47 14.69 0.10 35.70 15.98 4.78 46.37 22.74 SVV 0.1819 0.1428 0.1638 0.00 0.00 0.00 0.16 3.23 1.29 4.85 13.90 8.05 AVV 0.1816 0.1383 0.1617 -0.16 -3.23 -1.29 0.00 0.00 0.00 4.69 10.67 6.76 AVV-UJ 0.1732 0.1243 0.1511 -4.85 -13.90 -8.05 -4.69 -10.67 -6.76 0.00 0.00 0.00 AVV-AJ 0.1733 0.1266 0.1521 -4.85 -12.07 -7.43 -4.68 -8.83 -6.14 0.01 1.84 0.62 PanelH:DeepOTMcalls(d>1.3) MS 0.1448 0.0996 0.1225 53.73 31.53 44.65 58.67 33.44 48.17 69.36 32.68 53.06 SVV 0.0846 0.0727 0.0784 0.00 0.00 0.00 4.94 1.92 3.52 15.63 1.15 8.40 AVV 0.0805 0.0713 0.0757 -4.94 -1.92 -3.52 0.00 0.00 0.00 10.69 -0.77 4.88 AVV-UJ 0.0724 0.0718 0.0721 -15.63 -1.15 -8.40 -10.69 0.77 -4.88 0.00 0.00 0.00 AVV-AJ 0.0726 0.0724 0.0725 -15.33 -0.28 -7.80 -10.39 1.63 -4.28 0.29 0.86 0.60 44

Table 7: Pairwise model comparisons: In-sample This table shows the Diebold and Mariano (2002) test statistics of the in-sample performance differences. A negative (positive) statistic in a cell (i,j) indicates that the model i outperforms (underperforms) the model j. The symbols *, **, and *** represent the statistical significance of the performance difference at the 10, 5, and 1 percent levels, respectively. MS SVV AVV AVV-UJ AVV-AJ Panel A: Futures MS 0.00 0.35 3.20∗∗∗ 1.96∗ 2.56∗∗ SVV -0.35 0.00 2.52∗∗ 2.18∗∗ 2.79∗∗∗ AVV -3.20∗∗∗ -2.52∗∗ 0.00 -0.50 0.60 AVV-UJ -1.96∗ -2.18∗∗ 0.50 0.00 4.93∗∗∗ AVV-AJ -2.56∗∗ -2.79∗∗∗ -0.60 -4.93∗∗∗ 0.00 Panel B: Options MS 0.00 6.44∗∗∗ 11.50∗∗∗ 11.82∗∗∗ 11.76∗∗∗ SVV -6.44∗∗∗ 0.00 7.08∗∗∗ 11.22∗∗∗ 10.72∗∗∗ AVV -11.50∗∗∗ -7.08∗∗∗ 0.00 8.16∗∗∗ 8.44∗∗∗ AVV-UJ -11.82∗∗∗ -11.22∗∗∗ -8.16∗∗∗ 0.00 1.45 AVV-AJ -11.76∗∗∗ -10.72∗∗∗ -8.44∗∗∗ -1.45 0.00 45

Table 8: Pairwise model comparisons: Out-of-sample This table shows the Diebold and Mariano (2002) test statistics of the out-of-sample performance differences. A negative (positive) statistic in a cell (i,j) indicates that the model i outperforms (underperforms) the model j. The symbols *, **, and *** represent the statistical significance of the performance difference at the 10, 5, and 1 percent levels, respectively. MS SVV AVV AVV-UJ AVV-AJ Panel A: Futures MS 0.00 7.58∗∗∗ 14.25∗∗∗ 17.46∗∗∗ 17.15∗∗∗ SVV -7.58∗∗∗ 0.00 8.17∗∗∗ 15.77∗∗∗ 15.51∗∗∗ AVV -14.25∗∗∗ -8.17∗∗∗ 0.00 6.50∗∗∗ 8.27∗∗∗ AVV-UJ -17.46∗∗∗ -15.77∗∗∗ -6.50∗∗∗ 0.00 -1.16 AVV-AJ -17.15∗∗∗ -15.51∗∗∗ -8.27∗∗∗ 1.16 0.00 Panel B: Options MS 0.00 12.26∗∗∗ 14.01∗∗∗ 15.07∗∗∗ 14.96∗∗∗ SVV -12.26∗∗∗ 0.00 6.49∗∗∗ 11.87∗∗∗ 10.79∗∗∗ AVV -14.01∗∗∗ -6.49∗∗∗ 0.00 11.71∗∗∗ 12.10∗∗∗ AVV-UJ -15.07∗∗∗ -11.87∗∗∗ -11.71∗∗∗ 0.00 -1.13 AVV-AJ -14.96∗∗∗ -10.79∗∗∗ -12.10∗∗∗ 1.13 0.00 46

0.3 0.2 0.1 0 −0.1 −0.2 −0.2 −0.1 0 0.1 0.2 0.3 0.4 VIX returns snruter XIVV ∆log(VVIX)=−0.0001+0.46∆log(VIX)+ε, R 2 =0.51 Figure 1: Scatter plot of VVIX log returns against VIX log returns. 47

VIX futures trading volume 25 20 15 10 5 0 2005 2006 2007 2008 2009 2010 2011 2012 )snoillim( stcartnoc fo rebmuN VIX options trading volume 18 16 14 12 10 8 6 4 2 0 2006 2007 2008 2009 2010 2011 2012 )snoillib( emulov ralloD Calls Puts Total Figure 2: Trading volume of VIX futures and options. The top panel shows the numbers of futures contracts traded over time, and the bottom panel presents the dollar trading volumes for options contracts over time. 48

VIX 80 60 40 20 0 Jan07 Jan08 Jan09 Jan10 Jan11 Jan12 Jan13 VVIX 150 100 50 Jan07 Jan08 Jan09 Jan10 Jan11 Jan12 Jan13 Option−implied skewness 4 3 2 1 0 −1 Jan07 Jan08 Jan09 Jan10 Jan11 Jan12 Jan13 Figure 3: Time series of the VIX, the VVIX, and the option-implied skewness. The VIX and the VVIX are provided by the CBOE, and the skewness is computed using the model-free approach of Bakshi, Kapadia, and Madan (2003) with a modification. The options data are from OptionMetrics, Ivy DB. 49

SVV AVV 4.5 4.5 4 4 3.5 3.5 3 3 2.5 2.5 2 2 1.5 1.5 1 1 0.5 0.5 0 0 Jan08 Jan10 Jan12 Jan08 Jan10 Jan12 AVV−UJ AVV−AJ 4.5 4.5 4 4 3.5 3.5 3 3 2.5 2.5 2 2 1.5 1.5 1 1 0.5 0.5 0 0 Jan08 Jan10 Jan12 Jan08 Jan10 Jan12 Figure 4: Filtered volatility states across different models. Eachpaneldisplaysatimeseriesofthefilteredvolatilitystatesthatareobtainedbyapplying an extended Kalman filter to the sample data from July 2006 through January 2013. 50

Market−implied term structure 2 1.5 1 0.5 0 2 4 6 8 10 12 Months to maturity ytilitalov deilpmi−noitpO 1 0.9 0.8 0.7 0.6 0.5 0.4 0 2 4 6 8 10 12 ytilitalov deilpmi−ledoM Model−implied term structure Data SVV AVV AVV−UJ AVV−AJ Figure 5: Term structure of volatility. The top panel shows the scatter plot and the average term structure of the market-implied volatility, and the bottom panel shows the average term structures of the model-implied volatility. 51

Market−implied term structure 3 2 1 0 −1 0 2 4 6 8 10 12 Months to maturity ssenweks deilpmi−noitpO 2 1.5 1 0.5 0 0 2 4 6 8 10 12 ssenweks deilpmi−ledoM Model−implied term structure Data SVV AVV AVV−UJ AVV−AJ Figure 6: Term structure of skewness. The top panel shows the scatter plot and the average term structure of the market-implied skewness, and the bottom panel shows the average term structures of the model-implied skewness. 52

Cite this document
APA
Yang-Ho Park (2015). The Effects of Asymmetric Volatility and Jumps on the Pricing of VIX Derivatives (FEDS 2015-071). Board of Governors of the Federal Reserve System, Finance and Economics Discussion Series. https://whenthefedspeaks.com/doc/feds_2015-071
BibTeX
@techreport{wtfs_feds_2015_071,
  author = {Yang-Ho Park},
  title = {The Effects of Asymmetric Volatility and Jumps on the Pricing of VIX Derivatives},
  type = {Finance and Economics Discussion Series},
  number = {2015-071},
  institution = {Board of Governors of the Federal Reserve System},
  year = {2015},
  url = {https://whenthefedspeaks.com/doc/feds_2015-071},
  abstract = {This paper proposes a new collection of affine jump-diffusion models for the valuation of VIX derivatives. The models have two distinctive features. First, we allow for a positive correlation between changes in the VIX and in its stochastic volatility to accommodate asymmetric volatility. Second, upward and downward jumps in the VIX are separately modeled to accommodate the possibility that investors react differently to good and bad surprises. Using the VIX futures and options data from July 2006 through January 2013, we find conclusive evidence for the benefits of including both asymmetric volatility and upward jumps in models of VIX derivatives pricing. We do not, however, find evidence supporting downward jumps.},
}