The general framwork

A computer program is said to learn from experience $E$ with respect to some class of tasks $T$ and performance measure $P$ , if its performance at tasks $T$ , as measured by $P$ , improves with experience $E$ .

Classification task

The objective of the model is to learn the mapping function $T : X \to Y$ that maps inputs $X$ to a set of labels $Y$ . The expirience here is a collection of data, namly the pairs $E = {(x_{i}, y_{i})}_{i = 1, \dots, M}$ . A performance measure is some kind of distance $P = d i s t (\overset{y}{^}, y)$ .

The classical statistical perspective

The main assumption is that our data follows a (true) probability distribution. Let’s imagine a family of statistical models, where each member is specified by a set of real parameters $Λ \subseteq R^{p}$

{P_{λ} : λ \in Λ}

Our assumption is that our data is i.i.d like

D = {(x_{i}, y_{i})} \sim P_{λ_{0}}, λ_{0} \in Λ

our task is then to find the (true) $λ_{0}$ , that is find the correct set of parameters. To do this, we define an estimator $\hat{λ}$ based on the data

\hat{λ} (D)

given a distribution with parameters $λ$ , we can of course compute the expected value conditioned on $x$ (imagine $x$ is a picture, $y$ a label)

f_{λ} (x) := E [y ∣ x]

then we can define an empirical cost to measure the performance of our model

\hat{R}_{M} (λ) := \frac{1}{M} i = 1 \sum M d i s t (y_{i}, f_{λ} (x_{i}))

so we the average distance (averaging on all our dataset of $M$ samples) a choice of distance. It’s clear that using the true parameters $λ_{0}$ , this empirical cost would tend to zero in the limi $M \to \infty$ .

It makes sense to choose as our best guess for the parametes

\hat{λ} = a r g mi n_{λ} \hat{R}_{M} (λ)

Technical difficulties of this approach

Family of model not known!
The minimization task is usually hard ( $\hat{R}_{M}$ non convex)
We need $M$ big enough to be in the SLLN, i.e empirical cost need to be close to the (true) population cost.

Lorenzo Gregoris

Explorer

The general framwork

Classification task

The classical statistical perspective

Graph View

Table of Contents

Backlinks

Lorenzo Gregoris

Explorer

The general framwork

Classification task §

The classical statistical perspective §

Graph View

Table of Contents

Backlinks

Classification task

The classical statistical perspective