Estimator Status¶

This table describes the implementation status of estimators available through the default registry. Status is not a ranking. It tells users how cautiously to interpret results.

Status meanings:

baseline: suitable as a simple comparison method in public examples and smoke benchmarks.
approximate: useful for research benchmarking, but implemented as a pragmatic approximation rather than a reference implementation.
experimental: available for exploratory comparison; needs stronger validation before it should anchor public claims.
reference-grade: validated enough to serve as an implementation reference. No estimator has this status yet.

Estimator	Family	Target estimand	Status	Assumptions	Expected regime	Known failure risks
`RS`	temporal	`hurst_scaling_proxy`	baseline	Self-similar scaling over selected block sizes.	Clean fGn/fBm-style synthetic checks and simple public smoke suites.	Finite-sample bias: the classical estimator systematically over-estimates `H` near `0.5` and for small `n`. An opt-in Anis-Lloyd correction is available via `params.use_anis_lloyd_correction`. Short signals, nonstationary trends, and contamination can also bias the slope; bootstrap intervals are approximate.
`DFA`	temporal	`hurst_scaling_proxy`	baseline	Power-law fluctuation scaling after polynomial detrending.	fGn/fBm-style signals with a defensible scale window.	Scale-window and detrending-order choices can dominate finite-sample results.
`DMA`	temporal	`hurst_scaling_proxy`	baseline	Moving-average fluctuation scaling over selected windows.	fGn/fBm-style signals where window range avoids edge effects.	Window choices, short records, and deterministic trends can distort estimates.
`AbsoluteMoment`	temporal	`hurst_scaling_proxy`	approximate	Absolute first moment of block-aggregated series follows power-law scaling.	Aggregated-variance/moment comparisons on stationary fGn-style records.	Scale-window choices, short records, and centring choices can materially affect the slope.
`Variance`	temporal	`hurst_scaling_proxy`	approximate	Variance of block-aggregated series follows power-law scaling.	Classical aggregated variance checks for stationary long-memory-like records.	Bias under trends, level shifts, and poor block-size support.
`VarianceResidual`	temporal	`hurst_scaling_proxy`	approximate	Average variance of within-block residuals follows power-law scaling after local detrending.	Residual-variance scaling comparisons on records with enough block support.	Detrending order and scale window can dominate finite-sample estimates.
`GPH`	spectral	`long_memory_parameter`	baseline	Low-frequency log-periodogram regression for ARFIMA-style memory.	ARFIMA(0,d,0)-like synthetic regimes with moderate sample sizes.	Bandwidth sensitivity, short-memory leakage, and low-frequency contamination. Optional cosine taper (`params.taper: cosine`) can reduce periodogram bias from spectral leakage.
`Periodogram`	spectral	`long_memory_parameter`	approximate	Log-periodogram slope approximates long-memory parameter.	ARFIMA-style long-memory comparisons.	Bandwidth choices materially affect results; periodogram noise is high. Optional cosine taper (`params.taper: cosine`) can reduce spectral leakage.
`WhittleMLE`	spectral	`long_memory_parameter`	approximate	Gaussian ARFIMA(0,d,0) spectral likelihood approximation.	Clean ARFIMA-style records near the fitted model family.	Model misspecification, numerical bounds, and short samples.
`ModifiedLocalWhittle`	spectral	`long_memory_parameter`	approximate	Local low-frequency Whittle objective captures fractional memory.	ARFIMA-style long-memory regimes with adequate low-frequency support.	Bandwidth sensitivity and finite-sample instability.
`Higuchi`	geometric	`hurst_scaling_proxy`	approximate	Graph dimension maps to a Hurst-style proxy.	Geometry-based comparator on sufficiently long records.	It is indirect LRD evidence and can fail under short or highly noisy signals.
`GHE`	geometric	`hurst_scaling_proxy`	approximate	Increment moments scale across selected lags.	Multiscale increment-variance comparisons.	Arbitrary heuristic: when the absolute log-log slope is below `flat_slope_tol` (default `0.08`), the estimate is clamped to `0.5`. This guard is empiric, not theoretically derived; set `flat_slope_tol: 0.0` to disable it. Also sensitive to lag-grid choices and short samples.
`WaveletOLS`	wavelet	`hurst_scaling_proxy`	approximate	Wavelet detail variances scale linearly across selected levels.	fGn/fBm-style records with adequate wavelet levels.	Scale-band selection, boundary effects, and short signals.
`WaveletAbryVeitch`	wavelet	`hurst_scaling_proxy`	experimental	Abry-Veitch-style wavelet log-scale regression approximation.	Exploratory wavelet comparison on long enough records.	Needs stronger validation; sensitive to wavelet family and usable levels.
`WaveletBardet`	wavelet	`hurst_scaling_proxy`	experimental	Weighted wavelet log-scale regression approximation.	Exploratory wavelet comparison on long enough records.	Weighting and level selection can drive results; short signals are invalid.
`WaveletJensen`	wavelet	`hurst_scaling_proxy`	experimental	Two-band wavelet slope extrapolation.	Exploratory comparison where fine and coarse bands are both populated.	Band definitions can be fragile; short or narrow-band records fail.
`WaveletWhittle`	wavelet	`hurst_scaling_proxy`	experimental	Wavelet-domain Whittle-type fit to detail variances.	Exploratory wavelet comparison on long enough records.	Numerical fit and level support remain experimental.
`MLRandomForest`	data-driven	`hurst_scaling_proxy`	experimental	Supervised synthetic training distribution declared in `ml_training`.	Contaminated synthetic Hurst-proxy comparisons where train/eval provenance is explicit.	Distribution shift, training-grid leakage, and optional dependency availability.
`MLSVR`	data-driven	`hurst_scaling_proxy`	experimental	Supervised synthetic training distribution declared in `ml_training`.	Feature-based nonlinear baseline against classical estimators.	Feature design and hyperparameters can dominate results; no estimator-level CI.
`MLCNN`	data-driven	`hurst_scaling_proxy`	experimental	Supervised synthetic training distribution declared in `ml_training`; requires `lrdbench[nn]`.	Sequence baseline for exploratory neural comparisons.	Small training grids can overfit; distribution shift and stochastic training affect results.
`MLLSTM`	data-driven	`hurst_scaling_proxy`	experimental	Supervised synthetic training distribution declared in `ml_training`; requires `lrdbench[nn]`.	Sequential neural baseline for exploratory comparisons.	Slow training, overfitting, and weak extrapolation outside the manifest training grid.

Interpretation Rules¶

Do not mix estimator targets casually. hurst_scaling_proxy and long_memory_parameter are related in some model families but are not identical public-contract quantities.

Leaderboard results are summaries of declared component metrics. They are not universal estimator rankings and should always be reported with the underlying metrics.

For publication-facing analysis, prefer reporting estimator families, target estimands, parameter settings, and failure/missing-uncertainty rates alongside any accuracy or robustness summaries.

Data-driven estimators are run-local supervised baselines. Interpret them relative to the manifest-declared ml_training distribution, not as distribution-free LRD estimators.