sabo

分类

c++(1) 因果推断(2) 技术栈(1) 机器学习(27) 程序静态分析(1) 算法实践(1) 算法总结(15) 统计(1) 自然语言处理(9) 论文笔记(13) 读书笔记(1) 面经(1) 项目学习(1) 题解(28)

/ 注册

TA的专栏 48篇文章 0人订阅

45篇文章 2424人学习

2篇文章 786人学习

1篇文章 194人学习

全部文章（共4篇）

stochastic multi-armed bandits, regret minimization

chernoff bounds 运用Markov inequality 最后根据可以得到 Hoeffding's equality Stochastic mult-armed bandits 休息一下

在线学习

2020-02-16

0 562

Introduction to online optimization: online gradient descent

online gradient descent: Theorem:For any closed convex action set A such that ,for any subdifferentiable loss with bounded subgradient , the OGD strat...

在线学习

2020-02-16

0 609

Introduction to online optimization: continuous exp strategy

objective: extend exp strategy to continuous exp strategy where convex and bounded For any convex loss taking values in [0,1], the continuous exp st...

在线学习

2020-02-16

0 535

Introduction to online optimization: introduction

online learning protocol: characteristic:limited feedback Exponentially weighed average forecaster Bounded convex loss and expert regret Hoeffding’s'...

在线学习

2020-02-15

0 743