
Trimmed sample means for robust uniform mean estimation and regression

Roberto Imbuzeiro Oliveira

Columbia University
Oct 6th 2023
Estimating expected values

Quantities of interest in Statistics often are expected values or can be defined in terms of them.

Example of the latter: M-estimators.

$$\theta_{\mathrm{best}} = \arg\min_{\theta\in\Theta}\ \mathbb{E}_{Z\sim P}\,\ell(\theta, Z).$$

How well can one estimate expected values?


1d data and sample means

Scalar-valued data
$X_1, X_2, \dots, X_n$: i.i.d. with mean $\mu\in\mathbb{R}$ and variance $\sigma^2\in\mathbb{R}_+$.

$$\bar X_n := \frac{1}{n}\sum_{i=1}^{n} X_i$$

Mean squared error
$$\mathbb{E}\big(\bar X_n - \mu\big)^2 = \frac{\sigma^2}{n}$$
under no further assumptions.
High-confidence finite sample bounds

$$\mathbb{P}\big(|\bar X_n - \mu| \le r(\alpha,n)\big) \ge 1-\alpha$$

Asymptotic (CLT): $r(\alpha,n) \sim \sigma\sqrt{\tfrac{2\log(2/\alpha)}{n}}$.
Worst case (Chebyshev): $r(\alpha,n) \approx \tfrac{\sigma}{\sqrt{\alpha n}}$.

Catoni (AnIHP’12) + Lee/Valiant (STOC’21)

$$\mathbb{P}\big(|\hat\mu_n - \mu| \le r(\alpha,n)\big) \ge 1-\alpha \quad\text{with the best possible estimator.}$$

Asymptotic (CLT): $r(\alpha,n) \sim \sigma\sqrt{\tfrac{2\log(2/\alpha)}{n}}$.
Worst distribution, best estimator: $r(\alpha,n) \sim \sigma\sqrt{\tfrac{2\log(2/\alpha)}{n}}$.
The main point

When it comes to finite-sample bounds for deviations, the sample mean is exponentially worse in $\alpha$ than the best possible estimator!

See also experiments by Catoni.


Adversarial sample contamination

Uncontaminated sample
$X_1, \dots, X_n \sim P$ i.i.d.

Contaminated sample
$X'_1, \dots, X'_n$ such that $\#\{i \le n : X'_i \ne X_i\} \le \varepsilon n$.

The sample mean cannot tolerate contamination for any $\varepsilon > 0$.
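A tiny illustration of the last point (my own example, not from the talk): editing a single point out of $n$, i.e. $\varepsilon = 1/n$, already moves the sample mean arbitrarily far.

```python
# Illustration (not from the talk): one corrupted point out of n = 1000
# drags the sample mean arbitrarily far from the true mean 0.
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=1000)           # clean i.i.d. N(0, 1) sample
x_corrupt = x.copy()
x_corrupt[0] = 1e9                  # adversary edits a single point
print(x.mean())                     # close to 0
print(x_corrupt.mean())             # about 1e6: ruined by one outlier
```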


General problem

Given a statistical task, find the estimator with smallest possible error in terms of the sample size $n$, the desired confidence $1-\alpha$, and the contamination level $\varepsilon$.

“Robustness” against heavy tails and contamination.




Lots of other work with this perspective

Study of 1d mean estimation: Devroye, Lerasle, Lugosi, O. (AnnStat’16).

Higher dimensions/covariances: Minsker (Bernoulli’15, AnnStat’18); Lugosi & Mendelson (AnnStat’19&’21, FoCM’19, PTRF’19); Hopkins (AnnStat’20); Depersin & Lecué (AnnStat’21, PTRF’21); Diakonikolas, Kamath, Kane, Li, Moitra, Stewart (FOCS’16); Cherapanamjeri, Flammarion, Bartlett (COLT’19); Lei, Lu, Venkat, Zhang (COLT’20); Hopkins, Li, Zhang (NeurIPS’20); Abdalla & Zhivotovskiy (arXiv’22); Rico & O. (arXiv’22)...

Regression/M-estimation: Lerasle & O. (arXiv’11); Brownlees, Joly & Lugosi (AnnStat’15); Diakonikolas, Kamath, Kane, Li, Steinhardt, Stewart (ICML’19); Jambulapati, Li, Schramm, Tian (NeurIPS’21); book by Diakonikolas and Kane (Cambridge; forthcoming)...
Trimmed means

Trimmed mean

Trimmed mean for scalar data
$X_1, X_2, \dots, X_n$: i.i.d. random variables
$X_{(1)} \le X_{(2)} \le \dots \le X_{(n)}$: order statistics

$$\bar X_{n,k} := \frac{1}{n-2k}\sum_{i=k+1}^{n-k} X_{(i)}$$
How to compute a trimmed mean

Example: $n = 8$ and $k = 2$.

[Figure: the eight points $X_1,\dots,X_8$ are sorted into the order statistics $X_{(1)} \le \dots \le X_{(8)}$; the trimmed mean averages the middle four points $X_{(3)},\dots,X_{(6)}$, discarding the $k = 2$ smallest and $k = 2$ largest.]

Minimax optimal bounds

Theorem (Z. F. Rico’s thesis, 2022 – assuming $\varepsilon < 1/4$)
Set $\nu_p := \big(\mathbb{E}_{X\sim P}|X-\mu|^p\big)^{1/p}$, $k = \varepsilon n + \lceil \varepsilon n \vee 8\ln(2/\alpha)\rceil$, and

$$r(\alpha,n,\varepsilon) := C\inf_{1<p\le 2}\nu_p\left(\frac{\log(2/\alpha)}{n}\right)^{1-\frac1p} + C\inf_{q>1}\nu_q\,\varepsilon^{1-\frac1q}.$$

Then $\mathbb{P}\big(|\bar X_{n,k} - \mu| \le r(\alpha,n,\varepsilon)\big) \ge 1-\alpha$.
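For instance, taking $p = q = 2$ in both infima (finite variance, $\nu_2 = \sigma$) recovers the familiar sub-Gaussian-plus-contamination rate:

$$r(\alpha,n,\varepsilon) \le C\,\sigma\left(\sqrt{\frac{\log(2/\alpha)}{n}} + \sqrt{\varepsilon}\right).$$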
Better than your MoM

Median of Means for 1d data
Break data into $k$ blocks, take averages of blocks, and then take the median of these averages.

Requires $k \gtrsim \varepsilon n \vee \log(1/\alpha)$ for robustness + high probability, and achieves

$$r_{\mathrm{MoM}}(\alpha,n,\varepsilon) := C\inf_{1<p\le 2}\nu_p\left(\varepsilon + \frac{\log(2/\alpha)}{n}\right)^{1-\frac1p}.$$
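For comparison, a minimal median-of-means sketch (again my own illustration; the talk does not prescribe an implementation). Points beyond a whole number of blocks are simply dropped:

```python
import numpy as np

def median_of_means(x, k):
    """Split x into k equal blocks, average each, return the median."""
    x = np.asarray(x, dtype=float)
    m = len(x) // k                       # block size
    blocks = x[:m * k].reshape(k, m)      # k blocks of m points each
    return np.median(blocks.mean(axis=1))

rng = np.random.default_rng(0)
sample = rng.standard_t(df=2, size=1000)
print(median_of_means(sample, k=20))      # k of order eps*n or log(1/alpha)
```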
Can trimmed means lead to improvements more generally?

Joint with Lucas Resende – IMPA
arXiv:2302.06710
What we did

Trimmed means give nearly optimal results for 2 problems: uniform mean estimation + regression with mean squared error.

Experiments and heuristics for linear regression that improve previous results in a number of settings.

Zoraida’s talk: covariance estimation via trimmed means.

Uniform mean estimation (Minsker’18)

Given
I.i.d. (possibly corrupted) sample from $P$ over some set $\mathbb{X}$
Family $\mathcal{F}$ of functions from $\mathbb{X}$ to $\mathbb{R}$.

Goal
Estimate $Pf = \mathbb{E}_{X\sim P}\,f(X)$ for each $f\in\mathcal{F}$ with small worst-case error:

$$\mathrm{Loss}\big((\widehat{Ef})_{f\in\mathcal{F}}\big) := \sup_{f\in\mathcal{F}}\big|Pf - \widehat{Ef}\big|.$$
Applications

M-estimation/regression
We’ll see an example soon.

Vector mean estimation under general norms
(Lugosi & Mendelson PTRF’19, Depersin & Lecué PTRF’21)
If $X\sim P$ takes values in $\mathbb{R}^d$, estimating the mean $\mu := \mathbb{E}_{X\sim P} X$ with error measured by a norm $\|\cdot\|$ is equivalent to estimating $Pf$ over $\mathcal{F} :=$ the dual unit ball of $\|\cdot\|$.
Towards uniform mean estimation

Setup
$(\mathbb{X}, \mathcal{X}, P)$ a probability space
$X_{1:n} := (X_1,\dots,X_n) \sim P$ i.i.d.; contaminated $X'_{1:n}$ with $\#\{i\in[n] : X'_i \ne X_i\} \le \varepsilon n$.
$\mathcal{F} :=$ a family of $P$-integrable functions $f:\mathbb{X}\to\mathbb{R}$.

$$\widehat T'_{n,k} f := \frac{1}{n-2k}\sum_{i=k+1}^{n-k} f\big(X'_{(i),f}\big)$$

Here $f(X'_{(1),f}) \le f(X'_{(2),f}) \le \dots \le f(X'_{(n),f})$: for each $f$, the sample is reordered by its $f$-values before trimming.
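A minimal sketch of $\widehat T'_{n,k}$ over a finite family, here the linear functionals $f_u(x) = \langle u, x\rangle$ from the dual-ball application above (all names are mine). The point the code makes explicit is that the trimming order is recomputed separately for each $f$:

```python
import numpy as np

def trimmed_process(X, directions, k):
    """Trimmed mean of <u, X_i> for each direction u, trimming per u."""
    n = len(X)
    vals = X @ directions.T               # value of each f_u at each point
    vals = np.sort(vals, axis=0)          # order statistics, per direction
    return vals[k:n - k].mean(axis=0)     # one trimmed mean per f_u

rng = np.random.default_rng(0)
X = rng.standard_t(df=2, size=(500, 3))   # heavy-tailed sample in R^3
U = np.eye(3)                             # toy family: coordinate directions
print(trimmed_process(X, U, k=10))        # estimates of P f_u for each u
```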


Relevant parameters of function class

Global “complexity” parameter
$$\mathrm{Emp}(\mathcal{F}, P) := \mathbb{E}_{X_{1:n}\sim P \text{ i.i.d.}}\ \sup_{f\in\mathcal{F}}\Big|\frac{1}{n}\sum_{i=1}^{n}\big(f(X_i) - Pf\big)\Big|$$

Minimax error for worst-case function
$$r_{\mathcal{F}}(\alpha,n,\varepsilon) := C\inf_{1<p\le 2}\nu_p(\mathcal{F})\left(\frac{\log(2/\alpha)}{n}\right)^{1-\frac1p} + C\inf_{q>1}\nu_q(\mathcal{F})\,\varepsilon^{1-\frac1q},$$
where $\nu_p(\mathcal{F}) := \sup\big\{\big(\mathbb{E}_{X\sim P}|f(X) - Pf|^p\big)^{1/p} : f\in\mathcal{F}\big\}$.
Uniform performance of the trimmed mean

Theorem (O. & Resende 2023)
With a choice of $k \approx \varepsilon n + \log\frac{1}{\alpha}$,

$$\mathbb{P}\Big(\sup\big\{\big|\widehat T'_{n,k} f - Pf\big| : f\in\mathcal{F}\big\} \le R(\alpha,n,\varepsilon)\Big) \ge 1-\alpha,$$
where
$$R(\alpha,n,\varepsilon) := C\,\mathrm{Emp}(\mathcal{F}, P) + C\,r_{\mathcal{F}}(\alpha,n,\varepsilon).$$

Both terms in $R$ are needed in general.

Main proof ideas

Counting Lemma (simplified)
$\mathcal{F}$ a family of functions $f:\mathbb{X}\to\mathbb{R}$, $M_0, M_1 > 0$ such that:
$$\sup_{f\in\mathcal{F}} P\big(|f(X) - Pf| > M_0\big) \le \frac{t}{100n}$$
and
$$\mathbb{E}\,\sup_{f\in\mathcal{F}}\big|(P_n - P)f\big| \le \frac{tM_1}{n}.$$
Then with probability $\ge 1 - e^{-t}$,
$$\forall f\in\mathcal{F} : \#\{i\le n : |f(X_i) - Pf| > M_0\vee M_1\} \le t.$$

Generalization of Lugosi and Mendelson (AnnStat’21).

Main proof ideas

Bounding Lemma (simplified)
$\mathcal{F}$ a family of functions $f:\mathbb{X}\to\mathbb{R}$, $M > 0$ such that:

$$\forall f\in\mathcal{F} : \#\{i\le n : |f(X_i) - Pf| > M\} \le k - \varepsilon n.$$

Let $\tau_M(f - Pf) := \max\{-M, \min\{f - Pf, M\}\}$. Then $\forall f\in\mathcal{F}$:

$$\big|\widehat T'_{n,k} f - Pf - P_n\,\tau_M(f - Pf)\big| \le \frac{CMk}{n}.$$
Improved vector mean estimation

Theorem (O. & Resende 2023)
If $\mathbb{X} = \mathbb{R}^d$, there exists an estimator $\hat\mu_{n,k}$ of the mean $\mu$ such that, with probability $\ge 1-\alpha$:

$$\|\hat\mu_{n,k} - \mu\| \le C\,\mathbb{E}_{X_{1:n}\sim P \text{ i.i.d.}}\|\bar X_n - \mu\| + C\inf_{1<p\le 2}\nu_p\left(\frac{\log(2/\alpha)}{n}\right)^{1-\frac1p} + C\inf_{q>1}\nu_q\,\varepsilon^{1-\frac1q},$$

where $\nu_p := \sup\big\{\big(\mathbb{E}_{X\sim P}|\langle X-\mu, f\rangle|^p\big)^{1/p} : f\in\mathbb{R}^d,\ \|f\|_*\le 1\big\}$.
Regression with squared loss

Given
I.i.d. (corrupted) sample of pairs $(X,Y)\in\mathbb{X}\times\mathbb{R}$ with law $P$
Family $\mathcal{F}$ of functions from $\mathbb{X}$ to $\mathbb{R}$.

Goal
Estimate the best fit of $Y$ from $f(X)$,
$$f_{\mathrm{best}} := \arg\min_{f\in\mathcal{F}}\ \mathbb{E}_{(X,Y)\sim P}\big(Y - f(X)\big)^2,$$
by some $\hat f\in\mathcal{F}$, with
$$\mathrm{Loss}(f_{\mathrm{best}}, \hat f) = \mathbb{E}_{(X,Y)\sim P}\big(\hat f(X) - f_{\mathrm{best}}(X)\big)^2.$$
Results on regression

Setup
$(\mathbb{X}\times\mathbb{R},\ \mathcal{X}\otimes\mathcal{B}(\mathbb{R}),\ P)$ a probability space
$Z_{1:n} := (Z_1,\dots,Z_n)\sim P$ i.i.d. with each $Z_i = (X_i, Y_i)\in\mathbb{X}\times\mathbb{R}$.
Contaminated $Z'_{1:n}$ satisfying $\#\{i\in[n] : Z'_i \ne Z_i\}\le\varepsilon n$.
$\mathcal{F} :=$ some functions $f:\mathbb{X}\to\mathbb{R}$; set $\ell_f(x,y) := (y - f(x))^2$.

$$\widehat T'_{n,k}(\ell_f - \ell_g) := \frac{1}{n-2k}\sum_{i=k+1}^{n-k}(\ell_f - \ell_g)\big(X'_{(i),\ell_f-\ell_g},\ Y'_{(i),\ell_f-\ell_g}\big)$$

Here $(\ell_f-\ell_g)\big(X'_{(1),\ell_f-\ell_g}, Y'_{(1),\ell_f-\ell_g}\big) \le \dots \le (\ell_f-\ell_g)\big(X'_{(n),\ell_f-\ell_g}, Y'_{(n),\ell_f-\ell_g}\big)$: the pairs are reordered by the values of $\ell_f - \ell_g$ before trimming.


Regression with squared loss

$$f_{\mathrm{best}} := \arg\min_{f\in\mathcal{F}} P\ell_f = \arg\min_{f\in\mathcal{F}}\big(\sup\{P(\ell_f - \ell_g) : g\in\mathcal{F}\}\big)$$

$$\hat f := \arg\min_{f\in\mathcal{F}}\big(\sup\{\widehat T'_{n,k}(\ell_f - \ell_g) : g\in\mathcal{F}\}\big)$$

Theorem (O. & Resende 2023)
If $\mathcal{F}\subset L^2(P)$ is closed and convex, and a small-ball condition is satisfied, then
$$\mathbb{P}\big(\|\hat f - f_{\mathrm{best}}\|_{L^2(P)} \le r^\star(n,\alpha,\varepsilon)\big) \ge 1-\alpha, \quad\text{where...}$$
Localized bound – informal!

For each $r > 0$,

$$\mathcal{F}_-(r) := \{f - f_{\mathrm{best}} : f\in\mathcal{F},\ \|f - f_{\mathrm{best}}\|_{L^2(P)} = r\},$$

$$\mathcal{F}_\times(r) := \{(Y - f_{\mathrm{best}})\,(f - f_{\mathrm{best}}) : f\in\mathcal{F},\ \|f - f_{\mathrm{best}}\|_{L^2(P)} \le r\}.$$

$r^\star = r^\star(n,\alpha,\varepsilon)$ basically solves an equation of the form

$$\mathrm{Emp}\big(\mathcal{F}_-(r^\star), P\big) + \mathrm{Emp}\big(\mathcal{F}_\times(r^\star), P\big) + \text{noise} \le c\,(r^\star)^2$$

(à la Mendelson JACM 2014, Lecué & Lerasle AnnStat 2020).
Linear regression with random design

Model
Covariates in $\mathbb{R}^d$, linear model with mean-0 noise:
$$Y_i = \langle\beta_{\mathrm{true}}, X_i\rangle + \xi_i$$

Assumptions
2nd moment + small-ball condition on $X$
$p$-th moment bound on $\xi_i$ ($1 < p \le 2$)

$$\|\hat\beta_{n,k} - \beta_{\mathrm{true}}\|_{L^2(P)}^2 \le C_p\left(\frac{d + \log(2/\alpha)}{n}\right)^{2-\frac2p}.$$
Linear regression with random design

Heuristic – alternating minimization/maximization
Performs quite well in experiments; a code sketch follows below.

Set initial $\hat\beta_0, \hat\beta_1 \in\mathbb{R}^d$ arbitrarily.
Repeat until convergence:
Trim $\ell_{\hat\beta_0}(X'_i, Y'_i) - \ell_{\hat\beta_1}(X'_i, Y'_i)$.
Choose one of $\hat\beta_0$ or $\hat\beta_1$ to update.
Perform OLS on the trimmed sample to obtain the new $\hat\beta_0$ or $\hat\beta_1$.
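A minimal sketch of this heuristic, filling in details the slide leaves open (which iterate to refit at each step, OLS via least squares, a simple stopping rule); an illustration under my own assumptions, not the authors' code:

```python
import numpy as np

def sq_loss(beta, X, y):
    """Pointwise squared losses l_beta(X_i, Y_i)."""
    return (y - X @ beta) ** 2

def alternating_trimmed_ols(X, y, k, n_iter=50, tol=1e-8):
    """Alternately trim loss differences and refit one iterate by OLS."""
    n, d = X.shape
    beta = [np.zeros(d), np.linalg.lstsq(X, y, rcond=None)[0]]  # initial pair
    for it in range(n_iter):
        # Trim the k smallest and k largest values of l_beta0 - l_beta1.
        diff = sq_loss(beta[0], X, y) - sq_loss(beta[1], X, y)
        keep = np.argsort(diff)[k:n - k]
        # Alternate which of the two iterates is updated.
        j = it % 2
        new = np.linalg.lstsq(X[keep], y[keep], rcond=None)[0]
        done = np.linalg.norm(new - beta[j]) < tol
        beta[j] = new
        if done:
            break
    return beta[0]

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 3))
beta_true = np.array([1.0, -2.0, 0.5])
y = X @ beta_true + rng.standard_t(df=1, size=500)  # heavy-tailed noise
print(alternating_trimmed_ols(X, y, k=25))          # should land near beta_true
```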
Experiments vs. Median-of-means

[Figure: $\|\hat\beta_n - \beta_{\mathrm{true}}\|_2$ on a log scale vs. contamination level, for the trimmed-mean heuristic (TM), median-of-means (MOM), and OLS. Linear regression with normal errors and contamination.]
Experiments vs. Median-of-means

[Figure: $\|\hat\beta_n - \beta_{\mathrm{true}}\|_2$ on a log scale vs. contamination level, for TM, MOM, and OLS. Linear regression with Student(1) errors and contamination.]
Conclusion

Theory says trimmed means give the best-known estimators for the problems we consider. The dependence on $\varepsilon$ is optimal.
Gaussian approximation of the trimmed process: upcoming work by L. Resende.

The heuristic seems to perform very well in practice, but there is no theory to back this up yet.
Work in progress with Philip Thompson, Zoraida Fernández-Rico, Damien Vilcocq...
Thank you!
