October 10, Chengchun Shi (史成春): Combining Experimental and Historical Data for Policy Evaluation


2024-10-10 15:00:00
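Topic: Combining Experimental and Historical Data for Policy Evaluation
Speaker: Chengchun Shi (史成春)
Start time: 2024-10-10 15:00:00
Venue: Room A1514, Science Building, Putuo Campus
Organizers: School of Statistics; Academy of Statistics and Interdisciplinary Sciences
About the Speaker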

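Dr. Chengchun Shi is an Associate Professor in the Department of Statistics at the London School of Economics and Political Science. He received his PhD in statistics from North Carolina State University. His research focuses on reinforcement learning, in particular its application and optimization in policy evaluation, causal inference, and semi-supervised learning. Dr. Shi has received awards including the Institute of Mathematical Statistics (IMS) Tweedie Award and the Royal Statistical Society (RSS) Research Prize.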


Abstract

This talk considers policy evaluation with multiple data sources, especially in scenarios that involve one experimental dataset with two arms, complemented by a historical dataset generated under a single control arm. We propose novel data integration methods that linearly integrate base policy value estimators constructed from the experimental and historical data, with weights optimized to minimize the mean squared error (MSE) of the resulting combined estimator. We further apply the pessimistic principle to obtain more robust estimators, and extend these developments to sequential decision making. Theoretically, we establish non-asymptotic error bounds for the MSEs of our proposed estimators, and derive their oracle, efficiency, and robustness properties across a broad spectrum of reward shift scenarios. Numerical experiments and real-data-based analyses from a ridesharing company demonstrate the superior performance of the proposed estimators.
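
As a rough illustration of the linear-combination idea in the abstract, the sketch below blends a control-arm mean from the experiment with a control-arm mean from historical data, choosing the weight that minimizes a plug-in MSE. It is not the speaker's estimator: the function names (`combine_control_means`, `estimate_ate`), the independence assumption between the two samples, and the simple plug-in estimate of the squared reward shift are all assumptions made for this example, and the pessimistic and sequential extensions are not covered. Under these assumptions the MSE of the blend is (1 - w)^2 * var_e + w^2 * (var_h + shift^2), which is minimized at w = var_e / (var_e + var_h + shift^2).

```python
import numpy as np


def combine_control_means(exp_control, historical):
    """Blend two control-arm mean estimates with a plug-in MSE-minimizing weight.

    Hypothetical helper: assumes the two samples are independent, each has at
    least two observations, and the historical mean may be biased by an
    unknown reward shift.
    """
    exp_control = np.asarray(exp_control, dtype=float)
    historical = np.asarray(historical, dtype=float)

    mean_e = exp_control.mean()
    mean_h = historical.mean()

    # Estimated variances of the two sample means.
    var_e = exp_control.var(ddof=1) / exp_control.size
    var_h = historical.var(ddof=1) / historical.size

    # Naive plug-in estimate of the squared reward shift (squared bias).
    shift_sq = (mean_h - mean_e) ** 2

    # Minimizer of (1 - w)^2 * var_e + w^2 * (var_h + shift_sq) over w.
    denom = var_e + var_h + shift_sq
    w = var_e / denom if denom > 0 else 0.5

    combined = (1.0 - w) * mean_e + w * mean_h
    return float(combined), float(w)


def estimate_ate(exp_treatment, exp_control, historical):
    """Treatment-arm mean minus the blended control-arm mean (illustrative)."""
    combined_control, w = combine_control_means(exp_control, historical)
    return float(np.mean(exp_treatment)) - combined_control, w
```

With this weighting, a large apparent shift between the historical and experimental control means pushes `w` toward 0, so the blend falls back on the experimental data alone; when the two means agree, the historical data receives more weight.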
