OCO
\[\text{Regret}_{T}(A) = \text{sup}\left[\sum_{t=1}^{T}f_{t}(x_{t}^{A}) - \text{min}_{x}\sum_{t=1}^{T}f_{t}(x)\right]\]
Theorem 1.2 Let \(\epsilon\in(0,0.5)\). Suppose that the best expert makes \(L\) mistakes. Then:
\[a_{t} = \begin{cases} A, & W_{t}(A) \geq W_{t}(B) \\ B, & \text{otherwise}\end{cases}\]
\[W_{t+1}(i) = \begin{cases}W_{t}(i), & \text{if expert i was correct} \\ W_{t}(i)(1-\epsilon), & \text{if expert i was wrong}\end{cases}\]
\[W_{t+1}(i) = W_{t}(i)e^{-\epsilon \ell_{t}(i)}\]