Order-book deep dive · 2017–2023 · 13TB Kaiko L2 reconstruction

The crypto liquidity revolution: BTC spreads compressed 99% in six years

Six years of L2 order-book snapshots across Binance, Bybit, Huobi, and OkEX for the six largest USDT pairs — reconstructed from multi-terabyte tick data on our research cluster (112 cores). What follows is the most empirically grounded picture available of how crypto market quality matured between the 2017 ICO boom and the post-FTX cleanup. All numbers re-derived from a 3,082-row venue-day panel; statistical tests use Welch t and block-bootstrap to handle serial correlation; correlations are reported on both raw and log-differenced spreads to separate trend co-movement from real co-variation.

−99.0%
BTC spread compression
4.72×
ETH depth growth
3,082
Venue-day observations
6 / 4
USDT pairs / venues

Contents

  1. The dataset — what 13TB of order books actually contains
  2. Spread compression — a 99% collapse
  3. Depth explosion — capital arrived alongside tight spreads
  4. Exchange head-to-head — one BTC, four liquidity regimes
  5. The volatility-spread nexus — stress days and what causes what
  6. Cross-asset correlation — trend bias vs real co-movement
  7. Market maturation — book symmetry and quote intensity
  8. What this means for execution today
  9. Methodology & limitations

1. The dataset — what 13TB of order books actually contains

We pulled L2 order-book snapshots from a historical crypto-market archive — every venue's full bid/ask ladder, sampled monthly at first-of-day across 4 venues × 6 USDT pairs. Coverage is uneven: Binance and OkEX dominate the panel; Bybit only entered with meaningful breadth in 2021. The asymmetry matters — any cross-venue ranking has to weight observation counts; within-venue time-series comparisons are clean.

ExchangeADABTCDOGEETHSOLXRP
Binance18119813719898179
Bybit292121361924
Huobi18119114619186191
OkEX16218813618893188
Coverage matrix: pairs per exchange per year
Figure 1. Venue coverage by year. Only 2021 has all four exchanges trading all six pairs simultaneously — the only "fair" cross-venue comparison year. Pre-2021 rankings exclude Bybit; 2022-2023 exclude Bybit again because Bybit's data feed exited the panel after their 2021 BTC spike. We report headline numbers on the full panel and provide the 2021-only restricted estimates as the robustness anchor.

The full panel is 3,082 venue-day observations. Spread distribution: median 2.07 bps, mean 5.89 bps, max 312 bps — a log-normal right tail that the next section dissects.

2. Spread compression — a 99% collapse

BTC/USDT on Binance traded at 21.77 bps average spread in 2017. By 2023 that number was 0.19 bps — a 99.0% reduction. ETH/USDT compressed from 32.42 to 0.06 bps, a 99.8% reduction. This is the empirical signature of professional market-making arriving: the Jane Street / Jump / Wintermute build-out of crypto desks tracks the same calendar.

Spread evolution Binance
Figure 2. 60-day rolling spread on Binance for the six pairs (log scale). BTC and ETH cross the 1 bp line in mid-2020 and stay there. Alt-coins (SOL, ADA, DOGE, XRP) settle in the 2–5 bp range — tight by any standard but reflecting realised volatility differences (see Section 5).
PairFirst-year mean (bps)Last-year mean (bps)Change
BTC/USDT (2017→2023)21.770.19−99.0%
ETH/USDT (2017→2023)32.420.06−99.8%
DOGE/USDT (2019→2023)32.691.25−96.2%
SOL/USDT (2020→2023)23.374.81−79.4%
ADA/USDT (2018→2023)12.942.80−78.4%
XRP/USDT (2018→2023)~242.44−~90%
Spread reduction percentages
Figure 3. Spread compression sorted by magnitude. All six pairs lost between 78% and 100% of their initial spread. ETH at 0.06 bps in 2023 is tighter than SPY at most retail venues — a result that surprises every traditional-finance reader.
Methodology note — what compressed and what didn't The compression is not gradual. Most of the move happened between 2017 and 2020. By 2020, BTC was already below 1 bp on Binance and never recovered. The remaining 5–6 bp ceiling on alt-coins reflects realised volatility differences, not market-maker absence — SOL trades 4.8 bps because SOL moves 0.92 annualised vs BTC's 0.42. Section 5 quantifies this.

3. Depth explosion — capital arrived alongside tight spreads

Spread compression with shallow books is brittle. The 2017→2023 series shows the opposite. ETH/USDT on Binance grew from $51,747 (2017) to $269,857 (2023) of resting depth in the top-10 levels — a 4.72× expansion. The $10K-trade impact for ETH fell from 42.72 bps to 0.07 bps — a 610× reduction in execution cost.

Depth evolution
Figure 4. Top-10 USD notional depth, 90-day rolling on Binance. SOL emerges as the deepest book by 2023 ($1.56M average) — post-FTX rotation into Solana made it the “high-beta L1 trade” and market makers responded by committing serious capital.
Pair (Binance, 2023)Spread (bps)Depth ($)$10K impact (bps)
ETH/USDT0.06269,8570.07
BTC/USDT0.19180,6230.20
SOL/USDT4.811,560,8562.92
XRP/USDT2.44962,7711.48
ADA/USDT2.80768,7082.34
DOGE/USDT1.25277,7241.26
Liquidity cost $10K and $100K
Figure 5. Execution cost over time. Left: $10K market order impact, 60-day rolling. Right: $100K market order impact (10× larger). The $100K curve sits only 0.03 bps above the $10K curve for BTC by 2023 — the books are so deep that 10× the size barely moves the price.
Note on the "5.2×" earlier figure An earlier report version cited 5.2× ETH depth growth. Recomputing on the full daily panel using year-average depths gives 4.72×. The 5.2× came from peak/trough rather than year-mean. We use the year-mean figure as the conservative estimate.

4. Exchange head-to-head — one BTC, four liquidity regimes

Across 245 metric-venue-pair rankings on the full panel, Binance takes #1 in 100 of them (40.8%). But this number is contaminated by coverage asymmetry — Bybit only enters in 2021, so pre-2021 rankings exclude it by construction.

Coverage-restricted ranking (2021 only, all four venues present) When we restrict to 2021 — the only year with all four venues trading all six pairs — the rankings invert sharply. Bybit takes 25 of 42 #1 rankings (59.5%) in 2021 alone, driven entirely by their then-massive BTC perpetual book ($6.27M average resting depth, 34× Binance's BTC depth that year). Binance takes only 5 of 42. Both numbers are real; both depend on which question you're asking.
ExchangeFull panel #1 (n=245)2021-restricted #1 (n=42)Strongest dimension
Binance100 (40.8%)5 (11.9%)Total depth, $100K impact
OkEX71 (29.0%)4 (9.5%)Quote frequency (23 of 24)
Huobi49 (20.0%)8 (19.0%)Tied on spread (9)
Bybit25 (10.2%)25 (59.5%)BTC perp depth in 2021 outlier window
BTC across four venues
Figure 6. BTC/USDT across the four venues. Top: spread (log scale). Bottom: top-10 depth (log scale). Bybit's 2021 BTC book is the outlier — deeper than Binance for a brief window. By 2023, Binance reasserts dominance on depth while spread converges across venues.

What this tells us. OkEX leads on quote frequency (re-quotes more aggressively, shallower books). Bybit's 25 #1 rankings are almost entirely from the 2021 BTC perp window — real but unrepresentative. The post-2022 picture bifurcates: Binance captures 93–96% of cross-venue depth for the long-tail pairs (ADA, SOL, XRP), but BTC and ETH remain competitively fragmented because the absolute market is large enough to support multiple deep books.

5. The volatility-spread nexus — what causes what

Spreads are not constant — they widen with realised volatility. On the raw daily panel BTC spread-vol Pearson r = +0.36 (p < 0.001). But raw-level correlation is biased by shared downward trend; we report both raw and log-difference correlations to separate the two effects.

Pair (Binance)Raw level rLog-diff rSlope (bps/vol-unit)
BTC/USDT+0.356+0.65~3.0
ETH/USDT+0.222+0.72~2.0
SOL/USDT+0.32+0.61~15.3
XRP/USDT+0.12+0.63~1.6
ADA/USDT−0.18+0.43~2.1
DOGE/USDT+0.18+0.55~5.3
The log-diff correlations are higher than raw Counter-intuitive but right: raw-level correlations are dragged down by the shared 6-year compression trend (both spreads and realised vol fell over the period). Detrending via log-difference isolates the day-to-day co-movement. BTC log-diff r = +0.65 is the conservative measure of how spreads respond to vol shocks at the daily frequency.
Spread vs volatility log-log scatter
Figure 7. Spread vs realised volatility on log-log axes (Binance, daily). The fitted power-law slopes are ~3.0 bps/vol-unit for BTC and ~2.0 for ETH — every additional unit of annualised volatility costs traders 2–3 bps of spread. SOL's slope of ~15 is the steepest in the panel, the classic illiquidity penalty for lower-cap assets.
Stress days analysis
Figure 8. Stress days flagged on the BTC time series (top 5% by spread, threshold ~7.7 bps). Point estimate: stress days carry 1.88× normal volatility; block-bootstrap 95% CI [1.57, 2.35] (B=5,000, block size = 5 to handle serial correlation). Welch t = +4.89, p = 2.1e-06. The relationship is statistically robust after correcting for both unequal variance (Welch) and autocorrelation (block bootstrap).
Causal-direction caveat We define stress days as the top 5% by spread, then ask whether their volatility is higher. That is a conditional comparison, not causal identification. High spreads and high vol could share a common cause (news shock, exchange outage, market-maker risk-off). What we can say: conditional on a wide spread, that day's realised vol is reliably higher. Not: vol caused the wider spread.

6. Cross-asset correlation — trend bias vs real co-movement

The six Binance pairs co-move at the spread level. Mean off-diagonal correlation: 0.578 on raw levels, 0.512 after log-difference. BTC-ETH spread correlation is the standout — 0.978 on levels, 0.704 after detrending. The detrended number is what matters — it isolates day-to-day co-variation from shared 6-year compression trend.

Cross-asset spread correlation matrices
Figure 9. Cross-asset spread correlation matrices. Left: raw level correlation (trend-biased — both spreads fell over time, inflating apparent co-movement). Right: log-difference correlation, the right answer. BTC-ETH at 0.70 is the still-strong signature of a shared global market-maker cohort quoting both books. SOL decouples on raw (−0.00) but couples back on log-diff (0.49) — the level-correlation hid the daily mechanics.
Implication for risk management A portfolio sized off historical level correlations will overstate BTC-ETH spread co-movement (0.98 vs the real 0.70) and understate SOL co-movement (0.00 vs the real 0.49). Both errors push toward under-hedging crypto liquidity exposure. The right reference matrix is the log-difference one.

7. Market maturation — book symmetry and quote intensity

Two orthogonal indicators of market maturity. Depth imbalance measures bid-side vs ask-side asymmetry — mature books are symmetric; immature books have one-sided pressure. Quote frequency measures how often the L1 quote updates — mature books saturate at a venue-dependent ceiling.

Depth imbalance over time
Figure 10. Depth imbalance (bid_depth − ask_depth) / total, 90-day rolling, Binance. Most pairs converge toward zero (symmetric) by 2021–2022. DOGE remains visibly bid-heavy through the meme-coin era — the retail buy-the-meme flow persists in the order book.
Quote frequency over time
Figure 11. Quote frequency (L1 updates per second in the first-day snapshot), 90-day rolling. Saturation at ~0.033 reflects the sampling convention, not the true tick rate — underlying tick streams sustain thousands of updates per second. The flat ceiling tells us the L1 quote is being maintained continuously by 2021+.

8. What this means for execution today

Execution-cost reality (Binance, 2023)
  • $10K BTC market order: 0.20 bps round-trip — cheaper than retail QQQ execution.
  • $100K BTC market order: 0.23 bps — only 0.03 bps premium over the $10K cost. The book is deep enough that 10× the size barely moves price.
  • $10K SOL market order: 2.92 bps — still institutional-grade for an alt-coin.
Volatility-conditional sizing matters more than venue selection BTC stress days cost ~40× the normal-day spread (7.68 vs 0.19 bps). A vol-aware execution policy that pauses during the worst 5% of regimes saves more than smart-routing across venues during the other 95%.
Venue selection by asset class
  • BTC / ETH: Bybit (when active) had the cheapest 2021 spreads (0.12 / 0.25 bps panel average). By 2023, Binance is the only consistently-available deep book for both.
  • ADA / SOL / XRP: post-2022 Binance is the only credible venue (> 93% cross-venue depth share). Huobi and OkEX collapsed under Asian regulatory pressure.
  • Cross-asset hedging: SOL spread risk does not co-move tightly with BTC. Standard beta-hedging breaks down at the microstructure level — use the log-diff correlation matrix (Figure 9 right panel).

The punchline: crypto execution is, in 2023, a solved problem for major pairs. Residual alpha is in when to execute, not where. For tail assets it's still a venue-selection problem with a clear answer (Binance) and a measurable cost (2–5 bps + impact). The whole hand-wringing literature about “crypto being too illiquid for institutions” died in this dataset somewhere around 2021.

9. Methodology & limitations

Limitations (a) Monthly sampling under-represents intraday regime changes; (b) Bybit only enters in 2021, so full-panel cross-venue rankings are coverage-biased (we report both full-panel and 2021-restricted figures); (c) We observe quotes, not fills — true effective spreads are slightly tighter than quoted; (d) Huobi and OkEX faced regulatory disruption mid-panel that may have biased their post-2021 depth measurements downward; (e) All metrics are spot USDT pairs — perpetuals dynamics are out of scope; see the Hyperliquid digest for on-chain perp microstructure.
Source data span 2017-10-28 → 2023-04-13. Replication: panel reconstructed from 13TB Kaiko historical L2 tick archive at Cornell research3 by upstream pipeline; redo analyses (this page) run from daily_metrics.csv + spread_full_history.csv with Welch t-test, moving-block bootstrap, and log-difference correlation diagnostics. Companion digest: Hyperliquid on-chain for the perp side. Snapshot date . Not investment advice.