Statistical Error Bounds for Weighted Mean and Median With Application to Robust Aggregation of Cryptocurrency Data

  • Michaël Allouche
  • , Mnacho Echenim
  • , Emmanuel Gobet
  • , Anne Claire Maurice

Research output: Contribution to journalArticlepeer-review

Abstract

We study price aggregation methodologies applied to crypto-currency prices with quotations fragmented on different platforms. An intrinsic difficulty is that the price returns and volumes are heavy-tailed, with many outliers, making averaging and aggregation challenging. While conventional methods rely on volume-weighted average prices (called VWAPs), or volume-weighted median prices (called VWMs), we develop a new robust weighted median (RWM) estimator that is robust to price and volume outliers. Our study is based on new probabilistic concentration inequalities for weighted means and weighted quantiles under different tail assumptions (heavy tails, sub-gamma tails, sub-Gaussian tails). This justifies that fluctuations of VWAP and VWM are statistically important given the heavy-tailed properties of volumes and/or prices. We show that our RWM estimator overcomes this problem and also satisfies all the desirable properties of a price aggregator. We illustrate the behavior of RWM on synthetic data (within a parametric model close to real data): Our estimator achieves a statistical accuracy twice as good as its competitors, and also allows to recover realized volatilities in a very accurate way. Tests on real data are also performed and confirm the good behavior of the estimator on various use cases.

Original languageEnglish
Pages (from-to)760-778
Number of pages19
JournalMathematical Finance
Volume35
Issue number4
DOIs
Publication statusPublished - 1 Oct 2025

Keywords

  • concentration inequalities
  • heavy tails
  • outliers
  • robust aggregation
  • weighted mean and quantile estimation

Fingerprint

Dive into the research topics of 'Statistical Error Bounds for Weighted Mean and Median With Application to Robust Aggregation of Cryptocurrency Data'. Together they form a unique fingerprint.

Cite this