
No Free Lunch

Let's start by thinking about search algorithms.

Notation:

  • $X$ is a countable search space (e.g. a set of parameters)
  • objective function $f: X \to Y$ that evaluates a particular choice
  • dataset $d^m = \{ d_X^m, d_Y^m \}$ is a set of $m$ pairs $(x, f(x))$ (i.e. what you've tried so far)
  • search algorithm $\mathcal{A}$ simply appends to this dataset (i.e. continues the search, ideally finding $x$'s that lead to smaller $f(x)$); see the sketch after this list
  • arbitrary performance measure $\Phi: d_Y^m \to \mathbb{R}$ (i.e. a way to evaluate how good a search algorithm is)

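To make the notation concrete, here is a minimal sketch in Python (the toy space, objective, and the names `random_search` and `phi` are illustrative assumptions, not part of the original notes): random search as an algorithm $\mathcal{A}$ that appends to $d^m$, evaluated by a $\Phi$ that reports the best value seen so far.

```python
import random

X = range(100)                       # countable search space X

def f(x):
    return (x - 37) ** 2             # objective function f: X -> Y

def random_search(dataset):
    """A search algorithm A: inspect d^m so far, propose an unvisited x."""
    tried = {x for x, _ in dataset}
    return random.choice([x for x in X if x not in tried])

def phi(dataset):
    """Performance measure Phi: the smallest f(x) observed so far."""
    return min(y for _, y in dataset)

dataset = []                         # d^m: the pairs (x, f(x)) tried so far
for _ in range(10):                  # run the search for m = 10 steps
    x = random_search(dataset)       # A proposes the next point...
    dataset.append((x, f(x)))        # ...which gets appended to the dataset

print(phi(dataset))                  # evaluate the run with Phi
```
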
Suppose we want to find the minimum of $f$. Then gradient descent is one such search algorithm: given the dataset so far, it proposes the next $x$ to try, as sketched below.
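
Under that framing, gradient descent is just a different proposal rule, one that only looks at the latest pair. Here is a minimal continuation of the sketch above (reusing `f` and the loop, treating $X$ as continuous here; the analytic gradient and the step size 0.1 are illustrative assumptions):

```python
def gradient_descent(dataset, lr=0.1):
    """Propose the next x by stepping the latest x against the gradient."""
    x_latest, _ = dataset[-1]
    return x_latest - lr * 2 * (x_latest - 37.0)   # f'(x) = 2 (x - 37)

dataset = [(0.0, f(0.0))]            # seed the search at x = 0
for _ in range(50):
    x = gradient_descent(dataset)    # propose the next step...
    dataset.append((x, f(x)))        # ...and append (x, f(x)) as before

print(dataset[-1][0])                # approaches the minimizer x = 37
```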

Key Points:

  • No Free Lunch (Wolpert & Macready): summed over all possible objective functions $f$, every search algorithm induces the same distribution over observed values, i.e. for any two algorithms $\mathcal{A}_1$ and $\mathcal{A}_2$, $\sum_f P(d_Y^m \mid f, m, \mathcal{A}_1) = \sum_f P(d_Y^m \mid f, m, \mathcal{A}_2)$.
  • Hence no algorithm, gradient descent included, beats random search under every $\Phi$ without assumptions about the structure of $f$.
