sabo
sabo
全部文章
分类
c++(1)
因果推断(2)
技术栈(1)
机器学习(27)
程序静态分析(1)
算法实践(1)
算法总结(15)
统计(1)
自然语言处理(9)
论文笔记(13)
读书笔记(1)
面经(1)
项目学习(1)
题解(28)
归档
标签
去牛客网
登录
/
注册
Stay Forever Young
Act like a woman, think like a man
TA的专栏
48篇文章
0人订阅
算法小屋
45篇文章
2341人学习
c++
2篇文章
722人学习
大数据
1篇文章
99人学习
全部文章
(共4篇)
stochastic multi-armed bandits, regret minimization
chernoff bounds 运用Markov inequality 最后根据可以得到 Hoeffding's equality Stochastic mult-armed bandits 休息一下
在线学习
2020-02-16
0
531
Introduction to online optimization: online gradient descent
online gradient descent: Theorem:For any closed convex action set A such that ,for any subdifferentiable loss with bounded subgradient , the OGD strat...
在线学习
2020-02-16
0
577
Introduction to online optimization: continuous exp strategy
objective: extend exp strategy to continuous exp strategy where convex and bounded For any convex loss taking values in [0,1], the continuous exp st...
在线学习
2020-02-16
0
499
Introduction to online optimization: introduction
online learning protocol: characteristic:limited feedback Exponentially weighed average forecaster Bounded convex loss and expert regret Hoeffding’s'...
在线学习
2020-02-15
0
663