JEPI and the…Atlanta Falcons?

This is a follow-up to Derivative “Income” Bumhunting where I shared skeptical takes on derivative income ETFs from:

  • quants (Roni Israelov, David Nze Ndong, the Alpha Architect)
  • a former options market-maker who runs an RIA that uses options (Mark Phillips)
  • a relative value vol manager (QVR)
  • a tax specialist (Brent Sullivan)

I also gave the moontower take which was more of an eye test that rhymed with Mark’s take — an alpha approach to vol is simply not going to be rules-based. Option alpha is built on quicksand because the direction, size, and persistence of opportunity depends on the opponent (the market’s bias). When fear is dominant, you get paid for selling volatility but when the adversary is complacency the job to be done is to pay theta and warehouse options. You get paid for taking what the market gives you.

An ETF with an options mandate is a point guard than can only go right. So the onus is on the investor to know when to put that point guard in the game. It’s not a set-it-and-forget-it choice — you have to be the coach.

No asset gatherer is going to tell you that. This isn’t index investing where you get paid for time and tolerating swings.

Let me revise that.

Part of it is — you are effectively selling in-the-money puts so you are getting some exposure to equity risk premium and then some exposure to a risk premium that is sometimes consciously taking away your right hand dribble. Algebraically you are paying that point guard a lot in fees and tax treatment considering the pure equity risk premia piece can be had for zero.

You can make a valid argument that options can be used to change the shape of your payoff instead of alpha. But this works because options are surgical tools priced for specificity. The logic of passive call-selling is anything but that. Even if I granted such logic, you still need to show why you aren’t just better off buying less shares to match the delta (the ETF’s are less volatile because selling calls means they aren’t 100 delta like SPY) of the ETFs you’re considering hence saving on explicit and hidden costs for a payoff shape that does significantly better in every case except where the market sits relatively still.

[Personal bias warning — the utility preference to outperform when nothing is happening is bizarre. I want to outperform on the downside because I get the double whammy of my wealth holding up better PLUS all the things I want to buy getting cheaper. If you think of your future purchases as a liabilities, ie maybe you want to buy a condo in Miami, your assets are not only holding up but your liabilities are shrinking on the downside. Likewise — there some component of the upside distribution that is an inflationary, blow-off top. This is a disaster to underperform in — or more directly, the stupidest call option you ever sold. I’m being dramatic to mention this in the admittedly narrow context of covered call ETFs but I’m just sharing why I think shifting payoffs from large moves to small moves has never really appealed to me. And just to get ahead of the “the real distribution has more small moves than the Gaussian assumption” — yea I know. So do the vol markets which use surfaces to “correct” the bell-curve. If you want to parse the price beyond that you are now in the club of “one’s life’s work”. That speakeasy’s password changes monthly and won’t be found in the VRP article you read one time in 2011.]

So far, this is all high-level take based on why alpha can exist, how options work, and their adversarial zero-sum nature. Let’s look closer shall we. I grabbed returns for the largest derivative income ETFs to do my own up/down capture analysis. Up/down capture is the metric QVR used — it’s a solid choice. I’m familiar with it from my days at Parallax. You’ll see why it lends itself well to hedged funds.

Learn:

  • how I did the analysis (including normalizations)
  • why I think the up/down capture formula you find all over the internet deserves adjusting
  • how to interpret the results

Draw inferences about:

  • the actual performance
  • what performance depends on

I’ll focus on JEPI because the findings are interesting and as you’ll see — there’s more to the story than QVR’s picture:

Onwards…

What is up/down capture?

The up/down capture ratio measures how an investment performs relative to a benchmark index during periods of market gains (up capture) and market losses (down capture).

An excerpt and example from YCharts:

An upside/downside ratio of 100 means that the investment typically performs the same as the benchmark regardless of if it is rising or falling. If the benchmark increases by 10%, the investment increases by 10%. If the benchmark decreases by 5%, the investment decreases by 5%.

Investments usually don’t have upside/downside ratios of 100. Sometimes, an investment may rise 15% when their benchmark rises by 10% but falls 12% when the market falls 10%.

In this case, we calculate the upside/downside capture ratio by dividing the investment’s upside return and dividing by the downside return:

(.15/.10)/(.12/.10) = 1.25

Multiplying this by 100 gives us an upside/downside capture ratio of 125 for this investment.

Formula

Upside/Downside Capture Ratio = (Investment’s Upside / Benchmark’s Upside) / (Investment’s Downside / Benchmark’s Downside) *100

YCharts uses monthly returns data to determine Upside/Downside Capture Ratio

QVR contends that up/down capture is indicative of manager skill…up/down capture is important not to only produce superior performance but to have any potential of outperforming relevant benchmarks. Or said differently, knowing your manager is not destroying value.

I’d simply put it as — knowing your manager is better than levered beta.

If your manager is up 15% when the market is up 10% and down 15% when the market is down 10% the ratio is:

150% up capture / 150% down capture = 1

Like any single measure, up/down capture could never be a final word. T-bills have low up capture but amazing down capture. If SPY is down 5% and T-bills return 5%, is that negative -100% down capture? The whole up/down capture measure breaks (it’s also a dumb example to benchmark T-bills to SPY).

Investopedia also uses the YCharts method.

When you google up/down capture the first 3 results are Investopedia, Morningstar (there method is ambiguous despite publishing the stat for its funds), and YCharts. In fact most results on Google’s first page use the Investopedia/YCharts method.

Eh.

We already saw one problem with that computation — when the sign of the benchmark and fund return differ the ratio is negative.

The second problem is familiar to anyone who looks at ratios frequently — dividing by small numbers. If the benchmark return .05% and the fund returns .50% you get an up ratio of 10 for the period. It sticks out like a pimple on a chart even though it’s not an especially notable data point.

There are 2 superior methods to computing up/down capture.

  1. Up/down spread

    If you scroll down to the BTS Funds doc you learn that’s what Morningstar actually does. They simply take the difference between the fund and benchmark returns. This is also QVR’s method.

The logreturn method

This is what I use:

[Apparently some place on the google search results called Longs Peak Advisory does a geometric return method which is basically the same]

JEPI

etfdb.com describes JEPI:

The JPMorgan Equity Premium Income ETF (JEPI) is an actively managed fund that generates income by selling options on U.S. large cap stocks. The fund invests in S&P 500 stocks that exhibit low-volatility and value characteristics, and sells options on those stocks to generate additional income. JEPI was launched in May 2020.

At $35b, JEPI is the largest derivative income fund.

When we compare the performance to SPY since July 2020 (I picked July because it lined up with the lookback windows of the volatility calcs), we see that it underperforms, but doesn’t drawdown as hard as SPY in 2022 and doesn’t recover as quickly in the past 18 months.

Which is really just an indication of what we know — it’s less volatile than SPY. This is by constitution. It sells calls against a long position so JEPI is simply long less net delta. The vol ratio bounces around a range of 30-80% but the average is 61%. Said otherwise SPY is 2/3 more volatile than JEPI on average.

But we want a better look at the risk/reward of JEPI relative to SPY despite the vol difference. Sharpe ratio is a conventional choice. This is rolling 1 year Sharpe (I don’t subtract the RFR from the numerator though) computed from daily logreturns. The chart starts in summer 2021 because the lookback is 1-year.

Not that JEPI performs better for the first couple years before giving up ground and finally being overtaken about 1 year ago.

Let’s look at the up/down capture. Instead of using monthly data as its commonly reported, we will use more granular weekly returns with a rolling 1-year lookback.

The individual legs of the ratio (up and down respectively) are far less than 1.0. That makes sense. JEPI is less volatile so it’s not going to go up as much as SPY or down as much. But the up/down ratio remains coherent since the lower volatility fact cancels out. You can see that the ratio starts off amazing, far greater than 1.0 but it starts to degrade and fall below 1.0 in July of 2022, about a year before the rolling Sharpe ratio succumbed to SPY.

Let’s look even closer.

We’ll scale the JEPI returns up to SPY returns by the inverse of the trailing vol ratio (so if JEPI is half the vol of SPY we’ll double its return to mimic a equal-vol weighted allocation).

That healthy up/down capture is being driven by outperformance:

  1. during small moves which we expect since JEPI is short options
  2. on the downside even for large moves despite being short options!

Make no mistake, JEPI was working quite well.

What does it look like as it starts to degrade?

The underperformance on the upside looks similar. If you squint maybe the underperformance seems to be a bit worse for the +4% weeks but it’s not hitting you over the head and the frequency of those weeks is still small.

However, the outperformance on the downside moves is a much thinner green envelope potentially indicating that the call premiums being collecting are not buffering the downside as much as they used to.

A Tale of 2 Halves

JEPI has been around for 4 years. Like the Falcons in Super Bowl LI, it got off to a strong start. I remember leaving the party at halftime and by the time I got home the Patriots were on the verge of finalizing their historic comeback. The JEPI underperformance in its past 2 years isn’t the same as Atlanta’s defensive collapse but an up/down capture of .80 isn’t gonna fill the seats.

Or is it?

This was a lot of AUM growth in one year (red box):

https://ycharts.com/companies/JEPI/total_assets_under_management
https://etfdb.com/etf/JEPI/#fund-flows

That’s a lot of new, easy to anticipate option premium for sale. This is not football. A big fanbase is not an advantage.

2024 was the first time any months saw an outflow and they were gnat-sized.

Wrapping up

This was a high level analysis. It demonstrates the value of up/down capture as another way to slice up performance. We examined what happened when a popular options strategy gained steam.

I don’t think this resolution makes crowding effects a conclusive cause of the performance degradation but my bias is definitely in that direction.

If you want to dig further, I left a lot uncovered:

  • I didn’t even try to relate implied vols or skew data. (If you do this at the index level, you get noise from implied correlation readings.)
  • You can reconstruct the holdings and do p/l attribution on options they sold if that’s available (or inferable).

I will run the same high level analysis for a few more derivative income ETFs: DIVO, JEPQ, NVDY, RYLD and XYLD. If their performance also followed the same gradient descent I’ll see how it lines up with their own respective AUM trajectories. Confirmation or divergence could both be interesting.

For now, I leave you with 2 final bits:

Covered Call ETFs: Shitburger With Extra Tax (9 min read)Brent Sullivan

The Tax Alpha Insider gets into the weeds of tax treatment of all kinds of investments. He spoke at CBOE’s RMC on the topic of taxes and derivative income ETFs.

Just to add a personal point. I have a rough heuristic of individuals being either markets people or structuring people. Traders vs lawyers. Contemporary folks like Matt Levine and Wes Gray are striking examples of people who understand both deeply and are actively sharing knowledge today. Henry Singleton would be on the Mount Rushmore for this breed of ambi-turner.

My brain isn’t on the same plane as these people but even on the rung I’m at I can’t build Ikea furniture or rotate a shape. I’d rather watch your 2-year-old than read a structured-product flowchart with the words “payor” and “receiver”. I hate thinking about taxes, entities, even real estate deals. Some people live for the words “generation-skipping trust”. My eyes glaze over.

I don’t use an FA, but accountants and lawyers are necessary balms. It’s good to see an account like Brent pop up since maybe I can at least get a glimpse of how much I don’t know.

Up/down capture that’s worth a lot

I made up the numbers below, but it’s not that different from what a well-run relative value vol strat will look like using monthly data. Before fees at least.

But they are expensive because who wouldn’t want a positive carry put option. That green outperformance region buys a lot of fallen assets at depressed prices. Your assets are up, while liabilities are down. Double win.

A Cockpit View Of Q3

moontower cockpit

I recently built this cockpit view to see what’s going on in markets. I’ll be iterating on it as well as creating a page to incorporate my portfolio so I can do some high level bucketing by asset class, vol weights, and portfolio correlation. It won’t take much to get it to a suitable template for the personal account.

I’ll probably add proxy benchmarks to mimic private fund holdings that hold public securities. However, there won’t be accounting for angel investments. I hold them at cost on the spreadsheet and at 0 in my brain regardless of their “valuation”. If anything hits, I figure my kids will thank me one day. If not, and I trained them well, they’ll drag me over the foregone beta return. They own a lookback option on our sense of guilt. That goes beyond finances I’m sure.

We’ll talk about the cockpit view briefly before using it to describe Q3.

1) Columns Worth Clarifying

  1. Change: Logreturn (%) from the previous close (current data).
  2. Closing Vol Snapshot: Implied volatility as of T-1, includes:
    • 30d IV: Implied volatility for the next 30 days.
    • 90d IV: Implied volatility for the next 90 days.
    • 30d VRP: (Forward-looking 30d implied volatility) / (1 month realized vol) – 1
  3. Return Z-Score: Normalized asset return based on implied volatility from a week, month, or quarter ago.
  4. 30d IV Change: Change in 1-month constant maturity IV over the past week, month, quarter
  5. Returns (weekly, monthly, quarterly): Asset returns for the specified periods,
  6. RV (Realized Volatility): Annualized historical volatility calculated using daily log returns. Monthly RV is computed using 21 days of data for example.
  7. Lagged VRP: Ratio of realized volatility that transpired compared to implied volatility from a month or quarter ago for example.
  8. Correlation to SPY: return correlation for specified lookback
  9. Beta to SPY: computed as realized vol ratio x correlation for lookback

2) Some insights from the cockpit view

 

  • 30d IV is up across the board on weekly, monthly, and quarterly time frames despite asset return also being up everywhere except oil.

    I bought oil and sold stocks back on 9/11/24 when it seemed like a nice rebalance entry. I was wrong on the oil vol sale, but right on direction…I synthetically sold puts in oil vs selling railroad shares (CP)…see commodity kamikaze

  • Short vol did well last week (see lagged VRP) in US index and sector index. It got WRECKED in China ETFs with weekly return Z-scores greater than 3 standard deviations (Return Z-Score column).
  • Quarterly returns in SPY and IWM were close to 1 standard deviation (return / implied vol from 3 months ago). QQQ flattish as gains in TSLA, AAPL, META were washed by flat or losing results in semis, MSFT, AMZN.
  • Bonds unsurprisingly had a strong quarter with rates falling but the lagged VRP showed vol was overpriced.
  • The euro, yen, and gold all rallied strongly against the dollar this past quarter. BTC was flat.
  • XLU (utilities) were the strongest SPX sector but utilities have bond-like properties so they seemed to inherit both the stock and bond rally. Their trailing correlation is low being pulled in opposite directions by SPY and TLT.
  • The trailing 3 month realized vol performed well compared to how implieds were priced in late June. The August 5th chaos made its mark.

While I got the USO vol trade wrong, the GLD short vol idea carried well.

I threw this on Twitter yesterday:

The discomfort of short vol (this might have something to do with why it often pays):

On 8/16, I wrote about the GLD Nov 232 ATM call looking fat over 17% IV

The call was ~$9.40

Shorts have made nearly $2 on delta-hedged basis

But notice what’s uncomfortable…

Image

GLD is up nearly 5% since then (as of 9/30)

Strike vol did come in hard and stay subdued for about a month after I wrote the post.

It’s recovered recently but the strike vega is much smaller with over a month elapsing & the call being .81d (not ATM anymore)

 

3) Using Excel to pull stock data

A 1-minute video:

Lots of fields:

Click Insert Data to View a List
via Howtogeek.com

There’s even a stockhistory() function.


I hope the cockpit view inspires your own ideas. We will be bringing a very similar version to moontower.ai

If you use options already, definitely check it out. It’s option trader goggles. Vol traders already understand the lens but if you are a directional or fundamental trader who is even curious about options the Primer and Mission Plan docs offer surgical ways to expressing your views. And if the cockpit is any indication, option metrics alone are useful for thinking about risk and opportunity even if you don’t trade options.

That’s the premise of Option Analytics For All.

Some headings from that post:

Option Surfaces As A Unique Source of Market Intel

  • The typical use of option analytics
  • Intelligence from options markets
  • A Brief Detour into the World of Indicators
  • Option analytics as indicators for non-option users

Dragonfly Eyes

Hypocrisy is overrated as a vice.

Don’t get me wrong, a homophobe politician caught in a bathroom stall with his “friend” can go crawl in traffic.

But the cost of being rigidly consistent in your worldview is much higher because it’s dried in a coat of virtue upon which edifices of horror are constructed. It’s the idealogue who twists into a moral pretzel. Better to protect the ego than consider more plausible models of the world I guess.

Hypocrisy comes with built-in birth control. It’s easier to resist because it’s harder to rationalize. Its ability to recruit is smaller because the dissonance is out in the open. It comes off as self-serving which eventually tires the audience out. Ideology, on the other hand, is seductive. It makes sense once you accept the assumptions.

We are all shaped by experience. A career in trading has been a large influence on my belief that the need for coherence drives us mad. I was a far more dogmatic thinker when I was young. It’s not that being cognitively flexible enabled a career in trading so much as the requirement to survive ripped my trust in lazily examined assumptions right out of me.

[A nice example of why trading demands curiosity, suspicion, and honesty about assumptions comes from Risk of Ruin’s critique of Taleb’s so-called “ludic fallacy” which he convincingly argues is a strawman.]

The universal quality of fear — you must respond to it. But you can choose to respond honestly at the cost of ego. The flexibility that trading teaches flows from paranoia. The fear of making a faulty assumption load-bearing.

The need for coherence also comes from fear but animates a different response. Digging your heels into some playhouse fantasy that is continuously reinforced by overfitting every action or utterance as “evidence”.

[As a matter of spotting talent in trading, you can’t rule someone in based on cognitive flexibility (or any other single quality) but clinging to ego is a red flag. Some people are institutionalized by their fear. The signs can be subtle but defensiveness is a dead giveaway.]


You can train yourself to be more flexible with a scout’s mindset. This sits at the heart of trading. Hacking together charts or tables to find abnormalities. Testing the ideas under live fire while having the right sense of proportion around what the experiments can teach you (said otherwise — neither backtesting nor “resulting” is sufficient. One of these non-dogmatisms that some find repellent. I’d urge those people to consider other jobs. On second thought, come right in, the water’s fine.)

I’m currently reading a draft of a friend’s upcoming option book. There’s an outstanding section on what I’d call prospecting. Where’s the fertile ground to look for trade ideas? It’s an underdiscussed meta topic. There’s no shortage of strategies to be pitched but how many of them originate from a game-theoretic perspective of “why am I even able to find this?”

The section echoes in my head because it’s ultimately about how to see things differently. While that isn’t a guarantee to discover, failing to do this is a guarantee you will not discover.

When transitioning from equities to commodities in the 2000s, I had to unlearn lots of principles. Some were simply mechanical — there are different arbitrage bounds in equity vs futures options. Some related to the zero sum nature of commodities vs the perpetual nature of a stock. Futures are derivative underlyings themselves in ways that an equity is not. Equity is an entry on a cap table. A commodity is both an input and an output to be stored, exhausted, mined, or grown. Measuring vols and correlations will help you relate to an asset you are learning about. But thinking by analogy has its limits. The edge cases demand first principles context. The fine print so-to-speak.

In my chat with Corey Hoffstein, I hit one of these points in reference to shifting gears from market-making to position-taking:

You mentioned to me that early in your transition into Parallax, you had a mentor who said to you, quote, do you want to optimize your p&l on a daily basis or something else? And that this question really unlocked something for you. What did this question mean to you? And how did it ultimately impact your behavior in your new seat?

This was a sort of a profound one for me. It was something I needed to hear. So the context here — I was in my first or second year at Parallax and I was trying to dial in my spot-vol correlation parameter in oil.

Without getting too far into the weeds, your spot-vol correlation parameter will have a large impact on your model deltas. So, for example, if you run a spot ball correlation parameter of say 1% or 100 or negative 100, what you’re saying is every time spot goes up 1% I think vol comes in 1%, which is what I would say is that’s a constant straddle regime. Like saying “When the futures move, I believe that the straddle is always around the same price [net of theta ofc]”

So stocks up is 1%, vol is down 1% — constant straddle. Now, obviously, that’s a slope. And that’s a very local slope. If the stock doubles, I don’t think vol halves. So it’s clear that that’s not a constant parameter.

But I was being very locally minded. And the reason for that was I very concerned with my daily p&l, which is a bad habit I picked up from spending those couple of years as an indie market maker.

You don’t want to be too dogmatic about your spot-vol correlation, because it does change. And so what the mentor was doing as he was encouraging me to zoom out and think about the expected value of the spot-vol correlation rather than overweighting it to like the recent observations.

There’s an additional lesson in this that is presumes an assumption that lives in oil that is far less coherent for stocks — the absolute price level matters. Because oil is an input that is refined it is both a raw material and source of revenue depending on its end user. That means its price is represented as a margin or cost on some CFO’s spreadsheet. Its absolute price has many dependency arrows in that spreadsheet. Dependencies that are managed by flipping switches or placing hedges. Sure if GME stock price goes up enough you can expect a shelf — but the degree of freedom here is much wider than what is managed in oil.

Option order flows are therefore more conditional on absolute prices in oil than in a stock. In fact, the price of a stock doesn’t mean much without considering its ratio to earnings.*

In practice, this means I care about underlying price levels in commodities when considering spot-vol correlation or skew in ways that I would simply ignore in equities. This calls for additional tools to look at the market from other angles.

In tomorrow’s paid post, we’re going to expand our thinking about volatility term structure to see why it’s a diamond with several facets — and most interestingly — why multiple ways of looking at it are not all correlated.


The title of this post is a tribute to Phil Tetlock’s phrase in Superforecasting (my notes):

Dragonflies have compound eyes with thousands of lenses “synthesized into vision so superb that the dragonfly can see in almost every direction simultaneously, with the clarity and precision it needs to pick off flying insects at high speed.”

 

Stay Groovy

☮️

 

*This idea harkens back to my post Markets Will Permanently Reset Higher (My Sacrifice to the Delta Gods) about the option contracts people should really want:

While being long the index outright is a blunt hedge, call options, for all their extra hassle, are still not a surgically precise hedge. The right tail we are concerned with is risk premiums shrinking. This can still happen if earnings fall while multiples expand. Imagine earnings falling by 20% and the index only dropping 10%. Multiples will have actually expanded by 12.5%. I admit this sounds unlikely. But we are talking about this as a right tail event. In that context, the forces which are driving the price of capital lower may even accelerate in a recession. The financial option you actually want to buy needs to be struck on the index multiple, not the index level.

So unless a liquid market develops for the SPX 10yr 40 P/E Strike Call, I don’t see a simple financial options hedge.

A few derivative “income” ETF comparisons

In JEPI and the…Atlanta Falcons? I promised to follow up on a few other derivative income ETFs. In that post you get a dose of evergreen education on methodology. Today I’ll just present the charts with quick observations.

XYLD

This is the Global X SP500 covered call ETF. While JEPI manages $35B and has been around close to 5 years, XYLD was born in 2013 and manages $2.8B. It’s one of the largest derivative ETFs which shows just how big JEPI is!

Like JEPI, it’s perfomance degraded by mid July 2022. The second chart shows XYLD weekly returns on the y-axis vs SPY returns.

In this chart I note 2 things:

  • A general observation: the rolling 1 year SPY sharpe has gotten over 2.0 4x in the past 6 years It spent all of 2021 up there and it touched 3.5 in late 2017!
  • XYLD sharpe has been underperforming recently and massively underperformed during COVID. I do wonder to what extent it’s outperformance captured in the lookback thru early 2022 might have been due to call skew being strongly bid (overbid?) in 2021 — the year of mania, SPACs, NFTs, etc.

JEPQ

JEPQ is JP Morgan’s ETF that overwrites Nasdaq calls. It currently manages over $16B. It has a shorter history than JEPI and it’s performed quite well. But I would pause before inferring this to being Nasdaq specific before reading below.

QYLD

This is the Global X version of a Nasdaq call-overwrite ETF. It manages $8B.

Growth of $1 is not great relative to QQQ, but QYLD is also lower vol. So we look at the rolling sharpe. This has a longer history than JEPQ.

The sharp is often in line with QQQ but again has this skewed left tail to it where it can fall apart relatively. Zooming in on that Covid period, you see an extreme dip in the up/down capture driven by the up capture getting shelled plus the down capture spiking on that first Covid sell-off.

The behavior of the second chart is a bit strange. There is steeper underperformance than I’d expect on the downside (after all, the delta should roughly go to 1 and simply mimic the index) and the put performance is a stronger on some of the larger up moves than I’d expect…heck, I’d expect underperformance.

I have a subtle hypothesis for why we observe this. Perhaps the call skew and vols are sticker on the downside — the market drops but the call vols blow out so much that their realized delta is small. So it feels like you are riding stocks down with little offset from the calls depreciating.

On the up moves the calls which might have been too high to begin with massively underperform causing the ETF to do quite well. In other words, Nasdaq call deltas are much lower than you think!

This is just a guess and it’s a muddy one at that since we don’t see the same behavior in JEPQ and overall JEPQ has performed much better than QYLD and quite well on absolutely.

ETF Central as of 9/18/24

Could just come down to a skill issue? Hard to know without getting more into the details of the strategies for a finer attribution.

“How did you solve that math problem?”

The last few issues I’ve talked about mathacademy.com (no less than 7 readers are now doing it for themselves and/or for their kids!).

My mother was visiting this week and was doing the diagnostic over my shoulder while I was working on it. It really bugged her to realize how out of practice she was in elementary math so we went through some refreshers.

We reviewed a bunch of exponents stuff, for example, why 1/2 of 2²⁰ is 2¹⁹.

This is apparent when you think about it. But one of the things I noticed about how she and I do math is how methodical she is with trying to find the formula and how that’s not my first instinct at all. My first reach is always “what’s a simpler analogy and then extrapolate”. If that doesn’t work then get the pencil. I mean a lot of my motivation for retaking math ed is because my only mode is ‘trader math’. Formulaically, I reminded her that multiplying by 1/2 is the same as 2⁻¹ which is how she relates to the problem — she knows the rule for multiplying exponents with the same base is to add the exponents.

[My mom reads moontower believe it or not so it’s nice to share this in print even if a bit corny— we’ve always bonded over math. She went back to school in her 50s to get a college degree. She even took Java and C++. She is a determined learner at heart even if formal education took a backseat to more urgent pragmatism. She cut her college days short to work and get married back in the 70s. I was born the week she turned 24. Meanwhile, my eldest was born hours after my 35th birthday. Just acknowledging the change in norms in a single generation makes me feel like a flea in the sweep of time — no need to invoke cosmic proportion or even geographic birth lottery to think of how lucky I am to feel even remotely resourced while my kids are still kids.]


 

If you want a similar math problem to practice I shared Barclays quant question back in July:

Lily pad

You start with a single lily pad sitting on an otherwise empty pond. You are told that the surface area of the lily pad doubles every day and that it will take 30 days for the single lily pad to cover the surface of the pond.

If instead of one lily pad you start with eight lily pads (each identical in characteristics to the original single lily pad), how many days will it take for the surface of the pond to become covered?


A thought on the Lily Pad question and more:

[My son Zak solved it just like I did — by realizing the answer is the same as if you started after Day 3. My mother preferred the 2³⁰ / 8 = 2³⁰ / 2³= 2²⁷. The different ways we reason through a problem show up yet again.

I suspect my son is railroaded into my method because it wasn’t natural for him to see that representing 8 as 2³ was desirable for the purpose of doing exponent division (which follows a mechanical rule of subtracting exponents).

But getting to the formulaic version is what my mom searches for first.

Even when I was on the trading floor where you had to do mental math quickly to make markets, I enjoyed asking the people standing next to me how they priced the structure. There was a lot of variation. It’s a fun thing to ask others and, as I discovered, people usually like explaining how they mental math so it’s an all-around feel-good exercise.

One of the things I like about common core math is the emphasis on seeing numbers in different ways. My 8-year-old reflexively turns numbers into “friendly numbers” ie ending in 0s before doing operations, then undoing the adjustments before finalizing his answer. They are taught to do this. People my age usually landed on this method organically. But it’s good to teach it.

That said, Nate Bargatze owns the best common core bit:

 


 

Money Angle

Here’s a question I made for my mother to drill the exponent stuff that doubles as an investment problem.

For a fixed tax rate and rate of return is it better to have your return taxed every year or wait to be taxed on the gains all at once at the end?

Knowing the answer to the question is useful in itself but I also want to mention a collateral benefit. The meta-process for approaching the question can help organize your numerical intuition.

Think of what is required to answer:

1) recognition

What kind of problem is this?

Well, it’s a compounding problem.

What does that tell us about the function?

It’s exponential. It takes the form y = abˣ

2) ask yourself where the variable in question (in this case the tax rate) makes the largest impact

Is it as part of the a or the b?

Since the b gets exponentiated (the historical term for this is “involution” or “involuted”) the tax term will have its largest impact there.


I gotta run — I only have hours to secure my spot in mathacademy’s Iron League. I can’t not be gamified.

☮️

derivative “income” bumhunting

I use the following example all the time because it makes it makes the distinction between premium and income plain.

You’re long a $100 stock.

  • It’s fairly priced because it’s 90% to be 0 and 10% to be $1000.
  • You overwrite by selling the 500 strike call at $45.

Did you earn income?

A courageous response to my question on Twitter:

There is no problem here. You take your $45 and move on with your life. If you get called away you make 5x, and if your stock goes to $0 you came out with only a 55% loss.

Umm, incinerating money when you think you are investing is actually what I would call a “problem”.

You make $445 10% of the time and lose $55 90% of the time. You are literally better off betting on roulette.

If you overwrite a call that’s actually worth $1 at a price of $.95 because call markets are faded low for sellers, you are stuck with roulette odds. Factor in your brokerage costs (implicitly or explicitly) and effort.

I’d rather get a free hotel room.

As a market-maker when you sell a call you might book the difference between the trade price and what you think the option is worth as “theo p/l”. And even in that case marking-to-model is only going to show a few cents of edge.

Derivative “income” ETFs treat the entire option premium as yield. Guessing disclosure rules prohibit them from selling ITM calls and labeling the intrinsic as yield. Is the distinction that a hard arbitrage bound can’t be marketed as income but a soft one can be? After all, if you buy a .25d call in a random stock for 0, you’re like 99% to make an arbitrage profit with a series of delta hedges. If I sold the call for 1 cent, I can call that yield? Ok. Well, I call that “semantic arbitrage”.

Looking at the AUM growth of these funds, their cheerleading has worked. They don’t need any boosting. Turns out I’ve been collecting the critical takes on these ETFs for a few months. Are they biased? Of course. But I’ll excerpt and weave the arguments so you can form an impression to weigh against what “income” marketers claim.

You will hear the arguments from:

  • quants
  • a former options market-maker who runs an RIA that uses options
  • a very familiar vol manager (who allowed me to re-publish their firm’s take which is nothing short of violence)
  • a tax specialist
  • a bit from me (and more next week, as I plan to get my hands dirty with some data)

Onwards…

The quants

Roni Israelov and David Nze Ndong’s paper A Devil’s Bargain: When Generating Income Undermines Investment Returns was published in the Spring 2024 issue of The Journal of Alternative Investments.

🔗free version of the copy that preceded it on SSRN

These are the key points remixed between me and ChatGPT (emphasis mine):

Passive Income Strategies and Covered Calls

The paper notes that many investors, particularly in the retail sector, are attracted to strategies that generate income, such as covered calls. These strategies have gained popularity due to their ability to provide income through derivative overlays, often being presented as ‘income-generating’ strategies.

Negative Relationship Between Derivative Income and Total Returns:

The authors demonstrate a strong negative mechanical relationship between the expected total return and derivative income for covered call strategies. Empirical evidence from a nearly 25-year analysis of S&P 500 Index covered call strategies supports this finding. Essentially, the higher the derivative income generated by these strategies, the greater the losses. This outcome contradicts the common assumption that high derivative income leads to higher total returns.

[Kris: This is framed as surprising, but moontower readers know better. Remember Distributional Edge vs Carry?]

Impact of High-Yielding Strategies:

High-yielding call selling strategies, by design, have larger short equity exposure, leading to worse returns. This is a mechanical relationship and is highly predictable. Additionally, selling call options introduces short volatility exposure, which can be profitable but typically does not offset the losses from short equity exposure.

[Kris: Hard to benchmark because the beta is less than the broader index. I’m exploring an adjustment as I want to run my own comparisons. More on that next week inshallah.]

Misconceptions About Covered Calls and “Income”

The authors challenge the idea of treating the initial inflow from selling a call option as income, as this overlooks the associated liability and potential for loss upon settlement of the option. They argue that viewing the initial cash inflow from selling an index call option as income, while ignoring the expected outflow at settlement, is misleading.

[Kris: Been on that train so long I’m slumped over a rocks glass in the bar car right now. And I don’t even drink. See what you made me do. By “you” I mean the committee who defined SEC yield and the asset managers who gave ’em the reach around.]

The quants over at Alpha Architect boosted the paper as well, putting a bow on the argument by comparing the sleight of hand to how dividend chasers are hunted:

Investor Takeaways

Israelov and Ndong’s findings demonstrated that at least some investors might be attracted to covered calls for the wrong reason, seeking income rather than equity and volatility risk premia. That attraction can lead to a misallocation to these risk premia versus a best-fit allocation when analyzed appropriately. They also demonstrated that these strategies might lead investors to have overly optimistic return assumptions guided by their derivative yield. Considering the yield as income could also lead investors to make misinformed choices in terms of spending.

The live returns of the two most popular covered call writing ETFs should cause investors to question the prudence of these strategies, which are also less tax-efficient than traditional long-only strategies.

The most important takeaway is that the call premium is not income. This is the same type of mistake investors make about dividends, leading many to overvalue them. By definition, income increases wealth. Dividends do not do that; when a dividend is paid, their investment is now worth less (by the amount of the dividend). In other words, a dividend is just a forced divestment of some of your investment—you are receiving cash but now have a lower equity allocation. It’s also not income, except for tax purposes, which makes dividend payments an inefficient way to return capital to shareholders. In both cases, the failure to understand this can lead to overallocation to “income” strategies.

Thoughts from former option market maker Mark Phillips

His post is a 5 min read and worth every second but here’s what I want to highlight from Show Me Your Edge (emphasis mine):

The edge [in passive stock index investing] is time. Patience is an expensive virtue – but it pays handsomely in the long run. $1000 a month in an 80/20 stock/bond portfolio for a 40-year career returns about 9.7% a year and turns into $5.3 million. And that’s only half of the monthly 401(k) max limit.

Equity risk premium pays you for your patience. That 9.7% a year is several percentage points higher than risk-free treasury bills because of volatility. Don’t stop buying the few times a decade when stocks drop by more than 20%.

The most naive investor can capture the equity risk premium. It’s available in any brokerage account and with roughly 15 clicks the entire process can be automated.

With equity options, there is no set it and forget it.

There are rapidly increasing number of ETFs that offer options structures in a simple wrapper. But this is grocery store sushi. I’m not so much concerned about them suppressing volatility or creating some sort of derivatives tinder box, as I am with what an undifferentiated approach does for returns.

It takes 15 clicks to set up a lifetime of ERP.

It takes 15 trades to collect a month’s worth of VRP. 

The net of the implied vol and historical vol is a very abstract number. Other than over-the-counter agreements in the tens of millions of dollars, that’s not a tradeable concept.

Covered calls are a very simple way to capture part of this. You’re long the equity realized path, and short some implied volatility. But positive VRP could still be a losing trade. Stock going down, even at a lower than expected rate is still negative PnL.

If you can’t trade variance swaps, you can still try and manage a short premium capture strategy with condors, straddles or strangles. Moving these around to stay short the right amount of volatility while stocks move and strikes go in and out of the money is introducing a lot of friction into a trading strategy.

The rebalancing for an index strategy only happens quarterly, and stock execution is cheaper and finer-grained than options execution. This weighting adjustment works very much in the favor of the investor. Outperforming companies are added to, while laggards slowly melt away. Options greeks not only evolve on a daily basis, but the friction and frequency of adjustments are a major performance drag.

Consistently applying the same trade, like a QQQY that sells ATM NDX puts every day, is also going to deliver sub-optimal outcomes. The directional aspect might be a tailwind, but there’s nothing particularly systematic about why that option should be a consistent sale.

Adjusting trades based on delta (i.e. implied volatility levels) is a useful adaptation, but options overlay strategies are more about shifts along the utility curve than adding alpha. Selling calls to buy put spreads transforms the risk, it doesn’t deliver performance.

There are plenty of edges in options, VRP is real…But passive can’t be an edge. Dumping your money in equity markets and closing your eyes will have you beating most active managers. Blind selling or buying options will almost certainly be a performance drag.

Vol manager QVR pulls no punches

As a vol manager themselves, they are biased. And to be clear, so am I. Most of my opinions here overlap (not shocking — with some LinkedIn sleuthing you can spot some personnel merry-go-round between my past firm and QVR although I had no involvement).

Excerpts from their latest letter, DERIVATIVE INCOME…? DEFENSIVE EQUITY…? HEDGED EQUITY…? ARE YOU BEING MISLED?:

The marketing departments at banks and asset managers have conjured up some savvy descriptors for options strategies over the years. Portfolio Insurance, Option Overlay, Hedged Equity, Overwriting, Derivative Income, Defensive Equity, Structured Outcome, Enhanced Equity, and Buffer. As sales departments had success selling these types of products, the assets under management (“AUM”) grew.

Strategies using options are now commonplace in portfolios. Investors will always look for ways to cheapen hedges, but unless very closely understood, persistently capping the upside on your equity allocation is not the answer. Historical techniques such as collars and put-spread collars employed by some of the largest equity providers have become so overcrowded, they will likely lead to nothing but extreme relative underperformance, simply value destruction. This statement is arguably also true for strategies now referred to as “Derivative Income” (aka call overwriting, defensive equity, covered call, etc.), which are all simply selling short calls against equity.

Historically, the go-to benchmark here has been the Cboe BXM Index. An accurate description of benchmarks such as the Cboe BXM or similarly PUT indexes historically would be “Equity-like returns with lower volatility,” but today should be replaced with the statement, “Equity-like risk with lower returns.” These volatility selling strategies cap upside. An option selling strategy is not inherently risk-reducing: a covered call or a cash-secured put selling strategy has effectively the same exposure as outright equities in a sharp market selloff.

Various competitors in Hedged Equity such as JP Morgan, SWAN, Gateway, Calamous, Parametric, and Innovator have had difficulty in recent years delivering their stated objectives. We believe their underperformance is the lack of price sensitivity on the options they use to deliver their stated investment mandate. These strategies have caused a combination of a simple imbalance of near-dated volatility supply and long-dated put demand. Over time, this has led to their products’ underperformance in both market rises and declines. Having simply overpaid for put exposure on the one side and having sold calls too cheap on average on the other side.

So how do investors improve risk-adjusted performance in hedged equity? The answer is: Combine an alpha strategy, that has negative correlation to broader equity. Moving alpha and porting that atop beta has been called portable alpha. Capturing positive returns in upside equity markets (positive up capture) and capturing positive performance or at least flattening downside beta returns in equity downside (flat to negative downside capture) is ideal.

The vast majority of managers are upside down on up-down capture. Hedge funds and various alternative strategies with a negative up-down capture spread, rely on persistent positive equity upside for returns. In other words, this is typically achieved by underperforming funds through a combination of participating in less of the upside (bad) and more of the downside (bad) equating to a negative up-down capture.

The term “outcome-oriented investing” started to be used many years ago, to allow managers to explain away poor performance relative to simple, suitable benchmarks and strategy substitutes. In Figure 1 we show how a simple strategy substitute of 50% S&P 500 TR + 50% Generic 1M T-Bill Return has recently outperformed notable JP Morgan strategies, tickers JHEQX, JP Morgan Hedged Equity and JEPI, JP Morgan Equity Premium Income ETF. Note the dashed blue and green lines underperforming the 50/50 strategy substitute

The market through option pricing is punishing investors of these products and giving opportunity for more skilled traders to achieve true alpha for investment programs. As a reminder, in decades past these types of strategies had the potential to outperform generic equity benchmarks. Today, however, not even a reduced benchmark hurdle of a 50/50 benchmark seems to be easily outperformed. The massive trade flow, supporting hundreds of billions in AUM growth without a doubt, is contributing to a large and still growing structural dislocation in options markets.

As with all risk premiums, the attractiveness ebbs and flows.

Pre-GFC, short-term options tended to be expensive. As a result, harvesting a volatility risk premium and hedged equity strategies were attractive and backtested well pre-Global Financial Crisis (“GFC”). Post-GFC, both retail and institutional demand for option-selling strategies and hedged equity started to grow rapidly. Large groups of investors are now viewing both as an “evergreen” asset class for income generation (“Derivative Income”) or a liquid alternative (“Hedged Equity”). Or at least that’s what we see marketed everywhere. Table 1 shows the degradation of the up-down capture for the HFRX Equity Hedge Index, +31.6% (excellent) ITD through GFC versus -8.7% Post-GFC (bad)

Call and put write strategies have roughly the same downside risk (down capture) as underlying equity market risk, with much less positive return in up markets. In a portfolio context, adding negative convexity strategies with low returns to the upside and high correlation to risk assets to the downside is not additive to portfolio performance. An investor should always think about the portfolio context and the opportunity cost of capital, the value potential.

Up-down capture is important, not only to produce superior performance but to have any potential of outperforming relevant benchmarks. Or said differently, knowing your manager is not destroying value potential.

So are you being misled, with all the marketing spin? Maybe. This is a market after all and we do encourage different points of view. So here is ours:

  • COVERED CALLS ARE NOT A FIXED INCOME ALTERNATIVE
  • COVRED CALL AND PUT WRITE STRATEGIES ARE NOT DEFENSIVE EQUITY OR HEDGED EQUITY
  • THE VOLATILITY RISK PREMIUM IS NOT PERSISTANTLY POSITIVE OR ATTRACTIVE
  • BEWARE OF THE TERM “DERIVATIVE INCOME”, NEW MARKETING SPIN

Opportunity for skilled traders? Unequivocally, yes. And that is the point of this analysis, creating better Hedged Equity and S&P 500 outperformance.

ALPHA MATTERS.

We are undeterred that, finding repeatable sources of alpha, attractive risk adjusted return is best found by providing liquidity to price insensitive end users of derivatives. This is a core belief of any skilled trader and a good starting point for any long-term investment strategy, so should come as no surprise.

A tax specialist’s perspective

Brent Sullivan raises concerns about the return of capital tax hack derivative income ETFs may employ to distribute tax-free income (LinkedIn)

A smattering of moontower thoughts

Options are always about vol. I co-sign both Mark and QVR’s message. No opinion on vol? Then you are just shifting around utility curves. Maybe that’s worth the fees, short-term tax treatment, and execution slippage*.

But “income” is a dead giveaway that someone’s bumhunting.

If the managers want to say they’re in the alpha game and not some poorly-defined risk premia sport then I agree with QVR we should see it in the up/down capture (and that’s what I want to explore more myself).

The broader asset-management and retail world lags the alpha world by a generation. Perhaps in 20 years, any fund focused on options would have to disclose how their vol p/l looks benchmarked to some naive beta-esque expressions of vol trading. Right now you can imagine an option relative value trader in a pod being forced to benchmark their vol buys and sells compared to if they had just bought or sold SPX vol or a basket of liquid singles’ vols in an attempt to measure their vol “idio” or skill. Then you can really study whether they are good at timing, sizing, or selection.

Further reading that drives home the “options are always about vol” message:


*Some option traders have mentioned that often the large predictable option fund rebalances will trade mid-market and not even disturb the surface. Well, let’s just say that 20 years ago the market would get leaned the day the makers expected the execution. Today, the table gets set earlier and earlier. The periodicity of maturity cycles and flows makes option traders feel like they’re riding a longboard during the dull times. Slide down a bit, flatten, slope back up the wave, re-catch the energy, make your move, snap back down.

That mid-market execution probably has a few days of slippage baked into it. Harder to detect. But nobody’s job is to stand there and slurp risk for free. Trading firms’ results are coming from someone (actually everyone, but some more than others).

a riddle related to American-style options

Friends,

I saw a fun riddle this week. To get you in the right mindset before sharing it I’ll introduce the so-called secretary problem. I first came across this concept when I was a trainee at SIG from John Allen Paulos’ Innumeracy in the context of choosing a mate.

From Wikipedia:

The basic form of the problem is the following: imagine an administrator who wants to hire the best secretary out of n rankable applicants for a position. The applicants are interviewed one by one in random order. A decision about each particular applicant is to be made immediately after the interview. Once rejected, an applicant cannot be recalled. During the interview, the administrator gains information sufficient to rank the applicant among all applicants interviewed so far, but is unaware of the quality of yet unseen applicants.

The question is about the optimal strategy (stopping rule) to maximize the probability of selecting the best applicant. If the decision can be deferred to the end, this can be solved by the simple maximum selection algorithm of tracking the running maximum (and who achieved it), and selecting the overall maximum at the end. The difficulty is that the decision must be made immediately.

The shortest rigorous proof known so far is provided by the odds algorithmIt implies that the optimal win probability is always at least 1/e or about 37%

The reason the secretary problem has received so much attention is that it’s the optimal policy for the problem, the stopping rule is simple and selects the single best candidate about 37% of the time, irrespective of whether there are 100 or 100 million applicants.

Key Insights

[These are a mix of my thoughts and Llama 3.1, the LLM you can chat with from Whatsapp]

  • 37% provides sufficient information about the distribution of quality.
  • Maximizes probability of selecting the best option.
  • Balances exploration and exploitation

    (this should remind you of the multi-armed bandit problem — a problem so diabolical that the Allies considered “dropping” it on German scientists as the ultimate nerdsnipe — to distract them from the more urgent matter of developing weapons. See my notes from Algorithms To Live By author Brian Christian)

Real-World Applications

  • Job Searching: Interview 37% of candidates before making an offer.
  • Dating: Meet 37% of potential partners before committing.
  • Shopping: Research 37% of options before purchasing.
  • Recruitment: Screen 37% of applicants before inviting for interviews.

Assumptions

  • Random arrival: Options arrive randomly and independently.
  • No recall: Previously rejected options cannot be revisited.
  • No additional information: No new information becomes available after observing an option.

Limitations

  • Small sample size: With few options, the 37% Rule may not provide accurate results.
  • Non-uniform distribution: If options are not uniformly distributed (e.g., clustered), the rule may fail.
  • Correlated options: If options are correlated (e.g., similar), the rule may not account for this.

Practical Considerations

  • Difficulty in estimating 37%: Real-world applications may make it challenging to determine the exact 37% mark.
  • Time constraints: The rule assumes unlimited time for observation and decision-making.
  • Multiple criteria: The rule focuses on a single criterion; real-world decisions often involve multiple factors.

Contextual Limitations

  • Irreversible decisions: The rule may not apply to irreversible decisions (e.g., marriage).
  • High-stakes decisions: The rule may not suffice for critical decisions (e.g., life-or-death).
  • Dynamic environments: The rule assumes a static environment; changing circumstances may require adjustments.

 

Application to financial options

With that background, you can see how American-style options are a specific instance of “optimal stopping time problems”. That’s because they can be exercised any time before expiration, unlike European options, which can only be exercised at expiration. The holder of the option must decide the best time to exercise, if at all, to maximize their payoff.

This is why American-style options are priced by simulations such as tree methods while European-style options have closed-form equations.

In the simulations, the value of the option is computed by looking at the value at the next time step (i.e., whether to exercise now or wait). A backward induction process unravels from the expiration date back to the present. The model calculates the optimal decision at each point in time based on the payoff of immediate exercise versus the expected value of holding the option.

 

With ALL that said, you are ready for the riddle!

Flip 100 coins, labeled 1 through 100.

Alice checks the coins in order (1, 2, 3, …) while Bob checks the odd-labeled coins, then the even-labeled ones (so 1, 3, 5, …, 99, 2, 4, 6, …)

Who is more likely to see two heads first?

  • Alice
  • Bob
  • Equally likely

The riddle is neat because it works the same muscles as pricing an option. In fact, the riddle doesn’t even require math!

🔓See my reasoning and the original thread


Welcome to 2024…

The cost to learn is COLLAPSING if your eyes are open. Culturally I can sense (and anticipate much more) hand-wringing over what this means for society but right now I cannot emphasize enough how you should at least be taking advantage of all the consumer surplus LLMs are dumping in your lap. Earlier this week I mentioned that I screenshotted a spreadsheet of futures data and asked ChatGPT to write the formulas I’d need to arrange it the way I wanted. It spelled out exactly what helper column I needed and where to place all the formulas. It even stepped through the method so I can understand why its solution works.

Now consider that riddle.

  • I prompted ChatGPT to give me the Python code to simulate the question many times to so I could validate my answer.
  • I ran the code in Google Colab (cloud-based Jupyter notebook)

This entire process takes seconds not minutes.

Here are the steps you follow upon seeing the riddle on twitter:

  1. open 2 tabs — ChatGPT & Google Colab
  2. ctrl-c from twitter
  3. ctrl-v into ChatGPT
  4. type “what code that simulates this”
  5. ctrl-c the response
  6. ctrl-v into Google Colab
  7. ctrl-enter to run the script

[And yes I use a PC…loving my new Surface laptop btw]

In 5 years, an AI agent implanted in your reading glasses will know you wanted to do that when you scrolled over the tweet and a tooltip will simply be projected over the tweet with the simulation results.

Just kidding.

Twitter will be gone by then.

If interested here’s my Google Colab link:

🔗coin-checking script.ipynb

[Not to get into the weeds but I had to have a couple back-and-forths with the LLM because it made the mistake of thinking that the position that the head is found in determines if Alice or Bob won but it’s actually which ordinal observation that determines the winner. The process took more like 5 minutes as I had to prompt a specific debug and explanation. It doesn’t take away from the point — we are talking about orders of magnitude decreases in the time to code up this simulation for someone whose coding skills are as soft as mine.]

crossing the commodity chasm

I was planning to publish a follow-up to last week’s derivative “income” bumhunting where I look closer at various ETF performances. I started data-wrangling and then got distracted with yesterday’s trade idea. I will circle back to the ETFs in an upcoming issue.

As far as the trade idea this is what I ended up doing:

  • Liquidated most of my railroad shares (economic activity play that has held up well in contrast to commods)
  • Shorted .54 delta calls in November expiry options in WTI. I used Z24 NYMEX crude options which are “Dec” options but expire in November. The nomenclature comes from the commodity delivery window as opposed to the last trading day of the options. Confusing if you are coming from equity land.
  • Bought Dec2025 WTI futures

On balance, I added overrall risk-on length via the total quantity of Dec25 futures.

The oil legs assumed a .50 beta between Z25 and Z24 so I bought 2x as much Z25 as I sold in Z24 oil delta, which brings me to…commodity analytics. Like where did I get .50 beta from?

For some background, I spent 2005-2021 managing commodity options businesses. Despite moontower.ai being equity focused and the start of my career being in equities, ETFs, and equity options, commodity futures and options are my binkie.

One of my favorite periods of my professional career was building out the analytics for commodities. It was also a clue that I enjoyed building as much as trading. The career is most rewarding when you can not only drive the racecar but drive the one you built.

The analytics were similar to moontower.ai in the sense that they are volatility-centric and ignore fundamental data. But there is a tremendous amount of translation required to think about vol and term structure in the commodity markets.

A non-exhaustive list of features (or bugs) in commodity futures:

  • Physical delivery and cash delivery nuances
  • A single order book instead of multiple exchanges
  • A large, opaque OTC market whose contracts are cleared by exchanges but not traded on them
  • No insider trading rules
  • Different rebate mechanisms for volume traders
  • Less restrictions on market makers taking a “show” in voice markets
  • Even the underlying is highly levered — and exchanges can change margin requirements on a whim
  • Cross margining of look-alike products (ICE vs NYMEX WTI)
  • Options on spreads, Asian options, “new crop”, short-dated ag options, and 0DTE has existed in these markets for almost 20 years already (but were not traded electronically)
  • Close to 24 hour trading
  • Early exercise of calls and puts in a world without dividends or corporate actions
  • CFTC not SEC
  • Different arbitrage bounds — a time spread can trade for a credit because each option expiry is tied to its own underlying. This gets very hairy in commodities that are hard to store (ie nat gas or electricity) or highly seasonal ones such as ags which have “new crop”/”old” crop” dynamics that can actually have negative correlations (the performance of an old crop will affect planting decisions for the next season — high prices one year can lead to oversupply the next. Forward vols in commodity markets are a fun topic.

Infrastructure-wise you need a totally different “security master” or database mapping between financial instruments.

  • Underlyings can be referenced by many types of options in various combinations. Crack spreads, crush spreads, spark spreads. Options on all of them.
  • Every commodity has different expiration schedules, trading hours, last trading time, settlement procedures, naming conventions.
  • The markets pool liquidity in bespoke ways — October options in cotton are listed but never traded. If you trade nat gas you care a lot about 2 spreads in particular — H/J (March/April) and V/F (Oct/Jan). In RBOB, the gasoline spec for delivery changes in the North American summer.

Your security master needs to be flexible enough to accommodate all this variation. Your front-end analytics are going in the other direction — tuned for the idiosyncrasies of the markets you’re focused on. And then all the risk needs to be reported to central command in a way that fits with the overall portfolio risk while not losing the details when they matter. It is essential for the risk management layer to understand the drivers in these markets to devise appropriate shock scenarios and aggregations.

My first home in commodity trading was the oil complex. WTI, Brent, heating oil and gasoline (RBOB but I started trading it when HU was listed in parallel as the market was migrating to the RBOB spec). In that world you have American-style options expiries spanning from 1 day to many years into the future. You have calendar spread options (CSOs), crack options (I stood next to an independent trader that was long so many gas crack calls when Katrina hit that it was a noble percentage of a day’s worth of refining capacity in the US), Asian options, and European look-alike options that settled to swaps.

There were active markets in American vs European “switches”. You could exercise an American option for an hour after the market officially closed while the Europeans were cash-settled. If you were long Europeans on a strike, and short the Americans going into a pin…well, the market was gonna help you discover what that switch is worth.

If you trade WTI-Brent “arb” options, you are trading options on the spread between the 2 oil benchmarks. WTI and Brent have their own supply/demand dynamics because they are in different locations and vary by spec. This means different buyers and suppliers. Refineries optimize their throughput based on what end products are demanded and where (diesel, gasoline, jet fuel, bunker oil, etc). Throw in transportation costs, technicalities, and legality/tariff/sanctions on exports and you have a pair of highly liquid individual markets only loosely tethered by the wire of “arbitrage”. The arb options are a way to directly trade the spread. And then nerds can then relative value trade the vanilla options on each commodity vs the arb options. There’s no boxed arbitrage embedded in the math but the concept of implied pairwise correlation is a tradeable, albeit, messy parameter. And with the different expiration dates and times for the “same” month you will find yourself long or short a pile of unmatched option greeks for a string of days in between expirations (which also suggests that getting your volatility calendar correct is paramount).

(This implied correlation idea also exists in the context of calendar spread options. In 2007/2008 there were giant discounts in the WTI option forwards. The time spreads were incredibly attractive. The risk was you were inherently short spread vol so if you believed that the spread vol was highest conditional on the front month futures collapsing, then CSO puts were a clever hedge. The trade set-up was caused by the SEM Group blowout. I feel like I owe their desk a thank you for effectively buying my first apartment for me.)


An aside

The natural gas futures and options world is (was?) the king of cruft. It felt like a racket for churning exchange fees and broker commissions.

Expirations were so cumbersome because the liquid underlying was a physical future but the options that referenced them barely traded — instead the cash-settled options were traded but the cash-settled underlying swaps were not. They weren’t even listed, they were only cleared by the exchange! I can still remember being worried that I’d have an error, outtrade, or unreconciled position contaminate the grid I used to project how many futures I needed to buy/sell above/below each strike to replace the cash-settled deltas that were “going away”.

Oh yeah, and the swaps themselves had a different expiry than the futures so there were highly active markets in TAS (“trade at settlement”) futures, pen/LD swaps (“penultimate mini vs last day” swap spread), “futures/pen” (physical futures vs full penultimate swap), and the EFS (“exchange for swap” — last day futures vs last day swap). All of these things are tied together by algebraic relationships so you can triangulate the implied market in one from the legs of the others. Except none of this trades on a screen so you were still doing mock trading math based on quotes you are seeing on AOL IM or YM…in the 2010s!

I don’t remember the full history of how the markets eased into these conventions but I believe it was the marriage of disjointed OTC, exchange-traded, and physical markets. I’m not even getting into how the multipliers on these contracts work — if you trade X mmBTUs in the physical market that’s a size per # of days in the month which is how pipeline operators think and to which the financial players need to adapt.

And just like the oil market, there’s a NYMEX version and ICE version and sometimes you didn’t find out what you got until after the deal was consummated.

Hence venting on my Lost Xtranormal Video (Fonz, remember when we scripted this?)

It remains true that the number of commodities you can trade is much smaller than the number of equities, but there are loads of futures expirations, futures option expirations, strikes, and instruments.

A basic infrastructure starts with the futures chain and term structure. While I don’t currently have a proper infra I did grab several years of simple end-of-day WTI futures prices prompted by my oil trade idea.

I got distracted from writing the ETF post into creating a bunch of charts that demonstrate things I like to look at in futures. I’ll share them below and the reason a vol trader should care.

A quick comment on infrastructures.

You should be able to examine futures data in at least 2 ways:

  1. Fixed expiration (ie a history of the Dec24 contract)This is especially useful for seasonal charts (ie X-axis is Jan thru Dec and the lines are contracts for various years)
  2. Relative expiration (ie M1 or CL1)The ordinal number 1 refers to the first contract listed. This is the basis for “continuous” contracts. It’s also important because of the Samuelson effect which I mention in the interview with Dean — a contract with 12 months until expiry is less volatile than the same contract with 1 month until expiry. Which is another way of saying the M1-M12 spread has volatility and that volatility has its own properties.

The importance of having both views compounds when we layer in option analysis.

We’ll keep to a tight format. A chart and its relevance. All data is WTI futures settlement on the NYMEX.

[LLM note: To organize the data there was one instance where I wasn’t sure the easiest way to do it in Excel. I figured I’d need a helper column but instead of trial-and-error I just gave ChatGPT a screenshot of the spreadsheet and a description of my desired output. Abracadabra. Even the explanation for how the formula worked was perfect. I don’t know how long until it’s here but I feel like sometime shortly, organizing the data, laying out the charts, and the blog post will all be done by autonomous parallel AI agents. Come to think of it, why would I even prompt it…the LLM will just prompt better questions than I’m bothering to answer just by knowing what data is in front of me and the history of my blog posts.

The movie Fight Club makes more sense to me now than it ever did before.

Nerds being the victim of their own success is the own goal the Colosseum has been waiting for]

Alright, let’s get to it.

We start with a couple high level charts. They are useful when inspecting a commodity for the first time to get a sense of its nature.

Term structure time series chart

  • To keep it legible, I just chose 3 ordinal months — M1, M8, M15. You can see how steep the contango was in April 2020 when the front month went negative!
  • You can also see how for the past 5 years, the oil price has been backwardated (descending term structure) whenever the prompt price has been at least $45. I didn’t load the data from the 2010s but the last few years have been regime. Perhaps reflecting the idea that oil-demand in the future is uncertain with the focus on alternative fuels. (Personally I keep my core oil length in the deferred futures incentivized by an implied positive roll return and because the mandate also comes with a lowered incentive to invest in supply. In other words, vibes. I don’t know anything about the future but energy is part of my asset allocation. Hedgers sell the back so that’s where there should be a compensation for providing liquidity which manifests in a roll return. If I have to pick a spot on the curve to fill that bucket, that’s where I’m going.)

Term structure cloud chart

  • I group term structures by month and average them. Looks like a vol cone doesn’t it.
  • The back end has a tighter range than the front. It’s less volatile. Supply and demand are more elastic with a year to go than with a month to go.

Volatility term structure is one of the most important tradable concepts for option traders. Time spreads, straddle swaps, implied forward vols. So here’s the kind of riddle I might give in an interview:

There’s 3 contracts listed — M1, M2, M3

They each have their own option chains and the ATM implied vols are 30% across the board.

1) Is the option term structure flat, ascending, or descending?

2) Give me a framework for computing the forward vol.

Instead of an answer, I’ll offer a clue:

The vol ratio cloud

  • This is a chart of 1 month realized volatility for each contract divided by M1’s realized volatility (grouped by month)
  • Notice how some periods of time the back months are far less volatile than the near months. I think of this as volatility in the futures spread absorbing the volatility from the back months as the near month is being driven by some current concern that is not propagating to the backs. May 2020 being the archetypical example as we were running out of near term storage during COVID.
  • How does this idea inform your thinking about the riddle above?

Zooming in on spread volatility

To reduce the noise from M1 a tad we focus on the M2/M12 futures behavior from Jan2021 until last week.

In the top panel we see:

  • Time series of both futures prices. This is not a continuous roll return display so at each expiry as M2 inherits M3’s price there’s a small bump in futures prices that you would adjust for if you were working in a returns context.
  • The green time series of the futures spread defined as M2 – M12 (front – back is a commodity convention but the equity market defines it the opposite way. Another source of confusion and “Texas” hedging for traders who cross the chasm from commods to equities or vice versa). Notice that the spread has been positive (a backwardated market) for almost the entire period.
  • The silver time series just normalizes the spread value by the M2 price into percent terms.

In the bottom panel:

  • The red line is the rolling 21d standard deviation of the spread price changes. The spike is the Ukraine invasion where the near-dated futures skyrocketed relative to the backs. Inelastic demand in the front meets supply concerns. We don’t compute percent volatility (what if a spread price is zero or negative) but instead measure the price volatility directly.
  • The white line is the MAD or “mean absolute deviation”. This measure of volatility tends to be lower since we don’t amplify large moves by squaring them as we do with standard deviation.
  • The army green line (right axis) is the MAD/St Dev ratio. An MAD less than .80 typically signifies a skewed or fat-tailed distribution of moves. See the [👿MAD Straddle for more color on this idea.]

Spread “delta”

This is a scatterplot of spread price vs M2 price.

The slope of the regression tells us the “spread delta”. For example in 2022 (yellow), a $1 change in M2, meant a $.44 change in the spread.

If you are long the futures spread (long M2 and short M12), you are inherently long the market. If you want to isolate your trade to just betting on the slope between the 2 prices you must weight the position by the spread delta. So for each M12 you sell, you buy only .44 M2.

This comes in handy if you are trading a large options book with deltas in each month. You normalize them all to M1 deltas as a quick, liquid hedge. The r2 of the regressions give you a sense of how volatile that delta estimate is. A lower r2, a worse fit. If the fit is relatively poor, you will likely delta hedge your spreads directly and more often to reduce the noise.

A few observations:

  • 2021 & 2022: the underlying had a wide range of M2 prices
  • 2021 and 2023 had the lowest r2 indicating more variation in the spread “delta”
  • Spread delta is highest in 2024 ($.57 change per $1 move in M2)

This observation brings us to the last chart. A high spread delta typically means a wider divergence between contracts as M2 moves around. The spread is volatile. If the front month goes up a dollar, the back month “lags” more than if the spread delta were lower.

Say it however you like:

  • More volatile spreads mean that the volatility ratio between the back and front is lower.
  • The beta is lower.
  • The back is not “keeping” up with the front.

We can observe this directly by looking at each of those years’ M12/M2 realized vol ratio.

Notice that the high spread delta of 2024 corresponds to a low M12/M2 vol ratio.

In 2021, when the spread delta is a mere .18, the vol ratio between the 2 months spends plenty of time above 80% and even goes above 100% as the futures curve moved in parallel rather than flattening/steepening.

Can you see how spread volatility and vol ratio would have profound influence on how to interpret the options term structure on these contracts?

If the spread volatility is high, meaning the realized vol ratio of M12 to M2 is low, then if you are long time spreads you are going to be short options on the thing that moving a lot, and long options on the thing that’s lagging. You want to make sure you are getting the appropriate vol discount to hold that! Measuring what that is will get you to a proper understanding of the vol term structure and implied forward vol.

If you are coming from equity vol land, where the options are struck on the same underlying this is a new frontier for you.

We close with the bottom panel which simply shows the rolling correlation and beta (beta is vol ratio * correlation). This is the correct number to use for weighting your future positions not just the aforementioned vol ratio. The beta can change due to the vol ratio or the correlation and it’s worth decomposing it to see what’s driving the inevitable mismatch between your hedge ratios and realized p/l.


Trading commodity vol after trading equity vol can feel like a foreign world at first. But trading equity vol from trading delta one is an even bigger leap. Once you get used to commodities they actually feel cleaner. I’m super rusty on thinking about early exercise, dividends, rev/cons, and merger-arbish math because all those muscles atrophied from a life in commods.

Even the strange commodity trades I talk about with Dean in the interview revolve around USO and UNG — commodity trades that got tangled with SEC wrappers.

Commodities have their own language and their own grammar. But they are globally pertinent and tell their own gripping stories of history. I’ve recommended it before, but Javier Blas and Jack Farchy’s book World For Sale is an absolute banger. The best book I’ve read in my last 20.

Let’s leave it there.

a winner that’s really a loser

The Earth rotates around the Sun at a speed of 67,000mph. When I go out for my occasional run, my own speed is in the tens of thousands of miles per hour. Should I take credit for this amazing performance? I wouldn’t be completely lying if I bragged about this with friends (which I do); but it would be more transparent if I mentioned that my speed record is in the frame reference of the Sun. In this frame of reference, my speed is indistinguishable from Usain Bolt’s. This factoid obscures the vast difference in skill between the two of us. To really understand the difference, we need to change the frame of reference. Another way to interpret the decomposition of returns is a method to change the frame of reference in investing. Total returns – and a portfolio’s total PnL – live in the Sun’s frame of reference. It is easy to fool ourselves with the belief that we beat birds, airplanes and supermen at their own game. Idiosyncratic returns and PnL live in the Earth’s frame of reference. If we want to compare our performance to that of our peers, or to our very own past performance, we need to move to this frame. Factor-based performance attribution makes it possible.

– Giuseppe Paleologo (via Advanced Portfolio Management)

When I think of the “heyday” of hedge funds in the early 2000s I picture a bunch of cowboys who became generationally rich riding a steed named Greenspan. “Hedge” funds? How about beta boys?

I’m writing in between sips of hatorade because I remember the contrast of working in trading vs seeing peers in asset management. They had “points” in the bonus pool. Meanwhile traders had “I know you made all this extra money but we think your pit was more lucrative then expected. We don’t pay extra for good luck”. And to be fair, I also saw the flipside where traders got paid well despite having a tough year because they made good decisions that had a lot of noise (an amount of noise the firm was willing to underwrite and not penalize you for — the whole “not resulting” thing isn’t just marketing — it’s load-bearing).

[Side note: cultures like that are hard to build and rest on constant communication and buy-in for situations where the opportunities might outsize your specific trading assignment and you need to “recruit” the mothership’s approval either implicitly or literally by storing some of the trade in an account whose p/l you don’t need to stare at every day. The old-school “back book” except you know about it because you were the one who alerted management to the situation.

The other thing such cultures rest on is “permanent capital”.]

The triumph of pod shops has been to operationalize trading firm epistemology, then maximize the fee they charge for blue meth. The job of a PM is now as hard as Walter White’s life for the same $10mm in comp you could have gotten 20 years ago.

When it comes to the retail investor, the conventional wisdom gets it pretty correct — almost everyone should be using low cost indices to construct diversified portfolios and get back to the competitive advantages in their daycrafts.

The retail world can be further divided into retail traders (the more sophisticated ones are sometimes called “prosumers”) and retail investors who “sin” by trading in haphazard ways as if NVDA earnings was the point-spread of the big game on Sunday. In both categories, there are no external demands for retail to be honest with themselves about their returns on effort in trading.

Which is basically, well, fine.

I don’t want my tone to be misconstrued — I’m not here to wag a finger at anyone. Serious traders, the ones who depend on edge to eat, get this. If you trade actively but less seriously then presumably you have some other means to knock out the rent so it’s hard to muster solemn concern for you.

But you can still upgrade your thinking massively by stepping through attribution.

On Sunday, I published a slutty post-mortem on a GLD trade that worked (and continues to work this week). I also promised to step through a trade that was not flattering because I think it’s loaded with lessons.

Let’s go through an option trade I did in IWM back on 7/17/24, a few days after the “small cap rotation” pushed the Russell up 10%.

a winner that’s really a loser

The setup

On July 17th, with IWM trading about $222.50 after a 10% rally over the prior days, I got a hankering to rebalance out of my shares. I consulted the mowing lines the gardener left in the grass and their pattern said “no need to buy something just trim net portfolio length.”

I pulled up moontower.ai and several views told me the vol seemed high.

Cross-sectionally:

The 30-day IV was elevated and with 1m-6m term slope “in line” other names with high implied vols I simply zero’d in on the 3m or October expiry.

Now I wanted to look cross-sectionally at how the options were “carrying”. I was looking for high VRP.

Cross-sectionally or relative to other names, the VRP didn’t stand out as excessive. It’s smack dab in the center of the chart. But there are 2 pieces of context swirling in my head:

  1. The entire VRP map is elevated. The mean VRP shows that on average IVs are trading at a 25% premium to realized vols. If you look at a chart like this every day, you know it’s more typical for that premium to be closer to 10%. If IVs were clustered towards the low end then a fatter VRP for the market could be expected but this isn’t that kind of clustering (toggle summer 2023 vols to see what low vol world looks like).
  2. This IWM VRP already incorporates the large move from the prior week in the denominator

On a relative basis I’m satisfied that IWM vol looks rich.

Let’s now examine IWM compared to its own history:

Getting confirmation.

That VRP is on the high end not just absolutely but also relative to the RV percentile.

Let’s look at the time series:

The 90d vol was a tad high relative (the October options were about 21% vol even a bit higher than what you see in this snapshot) to the realized vol sustained over rolling 3-month periods over the past year. This is not strongly confirming but it definitely doesn’t detract from the case we’ve built thus far.

One last check…what about if we zoom out to a 3-year lookback:

Selling vol at 21% is not attractive from this view with the IV being below the median rolling 3-month realized vol since 2021.

The trade looks good on 1 year lookbacks but not on a 3 year look. Insert the Larry David indecision meme.

Well, I never buried the lede…I told you I sold October IWM options. This tells you my bias is to discount old info heavily. 2021 might as well have been a different decade. I’m more inclined to look further back if I’m considering a high duration trade but we’re talking a 3-month option here.

Pulling the trigger

Instead of selling my IWM shares, I fully covered them by shorting the Oct24 220 strike calls at $11.82 on 7/17/24 (94 DTE). The stock was trading $222.22

On 8/23/24, 37 days later, I revisited the trade and that’s what inspired this post.

The calls were marked at $7.94 and the stock was at $220.38.

A walk in the park, right? $3.88 of profit and the stock is only down $1.84

Let’s really understand how to think about this.

Reviewing the trade

First, looking at what I call the unhedged version of the trade — sell the calls, go on vacation.

$1.07 of the p/l is due to delta. It’s not pertinent to evaluating an option trade because you could have generated directional or delta p/l by selling shares.

$2.81 of the p/l comes from the fact that 37 days have elapsed and the option is worth less. Implied vol was basically unchanged point-to-point from 7/17 to 8/23.

So this $2.81 gain is due to theta purely. But this is naive way to look at the trade because are inspecting the result when the stock happens to close to the price it was back at the start. If we chose a different time, and we will, you can see the folly in resulting from point-to-point outcome. And even the “different” snapshot we choose will be arbitrary, it will just show a totally different result. The point is this entire approach of opening our account and looking at the p/l teaches us nothing about whether an option trade was “good” or not.

The chassis for properly understanding vol p/l is laid out by platonic example in:

Dynamic Hedging & Option P/L Decomposition

and by simulation in:

Simulating Dynamically Hedged Option Positions.

But here we will simulate daily delta-hedged p/l for the trade IWM calls I actually sold and the daily prices that ensued.

This table is straightforward.

  • stock and option marks come from the market
  • option greeks and vols come from moontower.ai
  • p/l data assumes you hedge daily based on the marks and greeks

General observations before we discuss p/l

  • The IV on the 220 strike started and ended very close to 21%
  • The sample realized vol is 1.82% per day or 28.9% annualized (nearly 50% higher than the implied vol that was sold). This is based on 27 daily return observations.
  • Based on the realized vols, a 2 standard deviation move larger than 3.64% (ie 2 * 1.82%) would be expected at least once. There were zero. It’s therefore not surprising that the MAD or mean absolute deviation was greater than 80% of the standard deviation. (When the ratio is less than 80% it indicates a fat-tailed or skewed distribution). This observation isn’t central to this post but just thought I’d point it out. The topic is covered further in 👿The MAD Straddle

     

P/L

On a daily delta-hedged basis, this trade lost $1.60 per contract. Considering that the extrinsic portion of the call option was $9.60 when I sold it, this was a terrible trade.

Charted:

From the simplest perspective — I sold vol at 21% and it realized close to 29%. There’s no redeeming aspect to this. Was I unlucky? Sure, being short almost any option going into the August 5th volquake hurts.

But I’m less interested in the reason for the p/l versus the ridiculous framing where one looks at the result of making nearly $4 on shorting this call and thinking they were on the right side of vol history.

For people still early in their options learning, I can sympathize with what they are thinking — “Kris, this kinda makes sense, but you wanted to trim IWM length and selling calls worked…aren’t you just being pedantic? Does this even matter?”

My view is it not only matters, it’s all that matters. You chose an option as your weapon and the market wore exactly the right armor against your assault. That you happened strike a blow at all, based on when you looked at the outcome, is to not understand how badly you lost the fight. And without understanding that you can’t learn about the only thing that options depend on — they are always vol trades.

I’ll rattle off a more observations in hopes that one of them is the right key to turn:

1) The short call made $3.88 but $1.07 of that was delta p/l. You can think of the $1.07 as the counterfactual p/l if I just sold 60% of my shares.

2) Imagine how I felt on the 8/7/24 snapshot… my short option p/l is +$8.36 but I’ve lost $20.34 on my 100% of my IWM shares that I didn’t sell instead for a total loss of $11.98 (the RV would have been 29.6% from 7/17 to 8/7).

To measure more fairly in this 8/7/24 scenario, let’s assume instead of selling the calls I would have sold only 60% of my shares. I would have rode 40% of my shares back down to $201.88 for a net loss of $8.14

3) The cumulative delta-neutral p/l chart bottoms out in sync with the largest move (3 consecutive down days with a magnitude greater than 3% each). If you would have sold the 220 call on 8/5 when it was a 30d and hedged daily you would have made money for the next 2 weeks even though the stock rallied back nearly 10% back to the strike!

This is because the down leg saw the 220 call IV go from 21% to 29.6%, while the upleg saw the IV roundtrip back from 29.6% to 21% again.

The downleg option position suffered both due to IV expansion (vega p/l) and high realized vol, the upleg won to vega and realized vol (rv from 8/5 to 8/23 was 23.8%, but remember if you sold vol on 8/5 you sold IV at 29.6!)


Wrapping up

I’m a broken record —> options are always about vol.

If the implied is the main driver, it’s vega p/l.

If the realized is the main driver it’s the tug-of-war between gamma and its cost, theta. The cost, in turn, is driven by the implied. IV sets the hurdle on whether you win or lose to an option trade.

In today’s example, I sold an IWM call. I happened to win to it even though it was a bad trade (not in the sense that it was bad when I put it on, just that it realized a bad outcome that was masked by when I happened to look at it).

To be less fooled by randomness you can compute a delta-hedged p/l stream to get a clearer picture of a trade’s results.

If you don’t have that, you can compare the IV you traded vs the RV that was realized. This will be lower resolution that a delta-hedged p/l stream because vanilla option p/ls are sensitive to path but it’s better than just resulting.

In moontower.ai we now have a toggle that lets you see the time series of IV vs RV but the IV is lagged (so for example I can compare the 1 month trailing RV today vs the 1 month IV that prevailed a month ago).

This uses 50d IV. You can still see how the RV ramped up above the IV (this is using 90d RV to compare to 90d IV so its not as dramatic as the analysis above which showed the impact of the shorter window 29% RV on a 3-month 21% vol option. Plus the option I sold was not “constant maturity” but a regular option that becomes a shorter maturity with time).

We have the ability in our backend to simulate a delta-hedged option through time using actual market prices. It’s on the roadmap to create a front-end GUI for everyone.

how arbitrage pricing creates opportunities for directional investors

This post starts with a response to a reader question but leads to a deeper question — do arbitrage-free prices present opportunities to fundamental or directional investors?


 

One thing I’ll be doing more of is sharing my answers to reader questions. Here’s one that channels a topic that eternally confuses option investors when they learn about skew (emphasis mine):

In your excellent article Lessons from the .50 Delta option, you wrote that by bidding up the put skew, the market makes call spreads more expensive. Combining this with your insights on the deeper understanding of vertical spreads, implied distributions, and thinking of spreads as odds, I was left wondering how the steepness of the skew relates to outright buying or selling options.

Here’s my question: Let’s say the SPX Dec24 call skew (100-105 moneyness) is descending. If I were to buy a call spread with 100/105 moneyness, I’d probably measure it as slightly more expensive than a 100/95 put spread. This aligns with what you’ve been writing. However, what I don’t understand is: if call spreads are more expensive (because the 105 vol is lower than ATMF), wouldn’t it be better to buy 105 calls outright due to the low vol? In my mind, this contradicts the perception we get from the spreads. The market seems to favor upside scenarios and bid up call spreads, yet at the same time, outright calls get cheaper?

Could this have something to do with IVs, deltas, or VRP?

It’s such a great and common question.

My response:

I know it feels like a contradiction. I’ve even framed it that way because it grabs your attention…it really feels like one!

But ultimately it’s not.

Think of option skew creating vertical spread prices that imply a distribution which “corrects” the Brownian motion assumption — namely, that the market is positively skewed but more likely to go down than up.

This is exactly the opposite of empirical results…the market is negatively skewed and a favorite to go up!

The upside call is cheaper because there’s less positive long-tail skew than Black Scholes assumes but the call spread is more expensive because the stock is more likely to go up.

We shift probability to the right but truncate the magnitude of the upside. It’s like betting on a favorite in sports…you’ll probably be right but the upside isn’t as large.

The main point is that the distribution is like a sculpture — the volatility skew moves some clay from the right tail to the left tail, and shifts the whole sculpture rightwards a smidge.


The paywalled Thursday Moontower posts generally dig deeper into investing concepts practically. They’ve been very option-centric lately. However, the most popular one I’ve written recently is not directly about options. It drove 20 paid subs which is an unusually high amount.

[Incidentally, the topic of paywalls, monetization, and so forth is something I’m happy to discuss. In fact, I have a backlog item to come up with a pricing model for “Should I paywall content or not” mostly to itemize most of the major inputs into such a model and their sensitivities or greeks. In any case, the topic of monetization feels under-discussed at least from my aperture. I wonder if that’s because people are hesitant to share. In any case, I see no advantage in being opaque on how to think about monetization. At the end of the day, some amount of what I write is paywalled. A lot of effort goes into but I don’t subscribe to Marx’s Labor Theory of Value — if it’s worth paying for, it’ll get paid for, and if it’s not it won’t.]

Here’s the post, with the paywall removed 🙌

🔎Volatility Depends On The Resolution

Despite the word “volatility” in the title, it’s more germane to the topic of risk broadly than options. It’s not a long post, but it’s a lot to ponder for any investor. The heart of it can be summarized in a line:

Realized volatility depends on sampling frequency

This concept has such a broad appeal and because of my bit above in Money Angle about realized distributions vs theoretical lognormal distributions, it seemed like a good one to unlock.

The substack post is a condensed version of the whole post which includes lots of charts.

For example, charts that compare realized daily returns vs what is expected theoretically:

The real-world distribution includes:

  1. Fatter tails (note that greater than 3 st devs is one giant bucket so that’s why even the Gaussian expectation is slopes back up a touch)
  2. A higher peak (more small moves than theory predicts)
  3. Positive drift (in theory the drift is approximately [RFR-div yield]/365…a much smaller number than the actual drift has been. Incidentally, people really struggle with the Black-Scholes assumption that a stock has no expected return premium — but that’s actually not the assumption UNLESS you are talking about a specific context — pricing options as a replicating portfolio. Financial Hacking has a great clarification on this.)

 

The options market adjusts to this empirical reality by:

  1. Increasing downside put IVs or put skew. This extends the distribution of the left tail but also increases the implied probability of a positive stock return by pushing up the value of ITM calls and therefore call spreads. This shifts the center of the distribution to the right.
  2. The wing options on both the calls and puts can smirk up increasing the mass in the tails. In price space, this pushes the value of the OTM options relative to the ATM straddle which increases the value of the butterfly (the wings up, relative to the “meat” of the fly). If you think a stock is going to have more small moves than the Gaussian distribution of logreturns predicts then you want to buy a butterfly that’s short the ATM straddle — that is exactly the structure whose value increases. 

The volatility skew creates option spread prices that conform to empirical actuarial odds. They “correct” the assumptions baked into the underlying Black-Scholes distribution.

One of the most provocative bunny trails from this conversation is how a long-term call option price that is priced using the RFR for drift will look wildly cheap (and the put expensive) to an investor who assumes higher rates of return because the stock is risky. But if they bid the calls and offered the puts with their return assumptions in the model they would offer risk-less arbitrage profits to option traders would simply do “conversion” strategies.*

If this intrigues you, check out the Warren Buffet section in Real World vs Risk-Neutral WorldsAn option that looks cheap to a fundamental investor focused on the near-term can simultaneously look expensive to an arbitrageur who has no opinion about the underlying but is in the business of squeezing profits from replicating that option for less. Both the directional investor and arbitrage trader can win.

(The loser in a zero-sum paradigm is all the noise traders flipping the stock back and forth in the interim to the delta-rebalancing option trader).

 

*conversion example

Stock = $100

Interest rate =3%

Theo value of synthetic future (ie long 100 strike call/short 100 strike put) = $103

The call will be worth $3 more than the put. C – P = $3.

Buying the call and selling the put is the same as “buying the 100-strike combo for $3 which is equivalent to buying the synthetic future for $103 — why? Whether you exercise the call in a year or get assigned on the put, you will have paid $100 for the stock. Plus you paid $3 today for that structure, which is why you can say you paid $103 for the stock to be settled in one year.

Imagine a fundamental investor who thinks the stock will be worth $108 in a year (8% RoR instead of RFR) and is willing to pay $5 for the combo ie $105 for a 1-year future on the stock.

They would be buried by arbitrage traders who would sell the combo at $5 and buy the stock for $100 to hedge. At expiry, the arbs will either be assigned on their short call or exercise their long put, guaranteeing they sell the stock at $100. They already bought the stock today for $100 and will have to pay $3 interest to hold it for a year. So at expiry they will deliver the stock to the fundamental investor (selling the stock they bought for $100 at $100…a wash).

The arbitrageur will have paid $3 in interest…but remember they collected $5 in option premium via the combo. Net result: $2 riskless profit

The market hoovers up riskless profits so the arbs will compete to sell the combo to the fundamental investor all the way down to a price of $3 where the combo is fairly priced compared to the RFR.

It doesn’t matter what the fundamental investor or Warren Buffet thinks. If they pay more for the calls relative to the puts than the cost of carry warrants, they get arbed. This has nothing to do with stock distributions, volatility, or any theoretical nerd stuff. A 10-year-old can draw the cash flows on a “conversion” arbitrage.

A “reversal” arbitrage is the opposite…if the combo is too cheap, say $1, you can buy the synthetic future (buy call/sell put), short the stock and at expiration your short stock will be covered by the stock you buy via the future (ie exercise the long call, or get assigned on the put) and the $3 interest you earn on the short stock proceeds covers the $1 premium you paid for the combo leaving $2 of arbitrage profit.

Practically speaking, reversals and conversions, the first arbitrage trades you learn about in options, still have risk.

  • There’s pin risk if the stock expires on the strike and you have to guess if you will be assigned on an option or not.
  • Interest rates can change from your initial assumption — the overnight interest rates that you pay for long stock or collect as a short rebate change. They can also change dramatically if a stock becomes hard-to-borrow.
  • Dividends can be announced, increased, or cut. Higher divs increase put prices relative to calls so if you have a reversal position and a stock announces a dividend your long combo will drop by the dividend
  • In other words, a reversal position is long rho and a conversion is short rho (it’s rooting for either interest rates to drop, the stock to become hard-to-borrow, or the stock to pay a larger div)