AI and ML


Reinforcement learning

Bellmann equation for utility

U(s) = R(s) + γ max [P(s'|s, a) * U(s')]
                a∈A

This essentially says this:

Utility of a state = Reward of that state + Utility of next state

Sketching

From here:

A sketch C(X) of some data set X with respect to some function f is a compression of X that allows us to compute, or approximately compute, f (X) given access only to C(X).