当前位置:网站首页>Paper notes: lbcf: a large scale budget constrained causal forest algorithm
Paper notes: lbcf: a large scale budget constrained causal forest algorithm
2022-06-25 16:37:00 【#Super Pig】
Original paper :M. Ai et al., “LBCF: A Large-Scale Budget-Constrained Causal Forest Algorithm,” in Proceedings of the ACM Web Conference 2022, New York, NY, USA, 2022, pp. 2310–2319. doi: 10.1145/3485447.3512103.
Motivation
This work takes the gold coin distribution scene as the background , Evaluate the causal effect of the number of gold coins distributed on the viewing duration of users , Based on the evaluation results, a personalized distribution mechanism is designed .
challenge:
- Large scale users ;
- multi-treatment;
contribution:
- Put forward LBCF Method solves the above two problems ;
- A new model effect evaluation index is proposed ;
- Experiments were carried out on real data ;
related work:
This part is mainly about myself challenge The shortcomings of the existing work are described , There are two types :
- The first category :LBCF Criticize them for using greedy tactics in making decisions , And through examples, this kind of greedy strategy can not achieve treament effect The optimal value 【 surface 1】
- The second category :LBCF It is pointed out that although the second kind uses the optimization algorithm to make decisions , However, the performance is insufficient in the face of large-scale data , Mainly :1. They can only do cohort-level The optimization of the , It can't be done member-level The optimization of the ;2. They are solving multi-treatment Many trees were constructed causal forest, In this case , Different forest The feature space is different , So the result of the evaluation treatment effect Also incomparable .
Methodology
in the light of related work The problem in ,LBCF The corresponding solutions are put forward , Methodology is mainly divided into two parts :
- The first part is causal forest The construction of , The main contribution of this part is to put forward a new split standard , And named their forest UDCF;
- The second part is the construction of the optimization problem , In this part, an optimization algorithm for large-scale data is proposed , The name for DGB;
Problem definition :
It is worth mentioning that LBCF Yes multi-treatment Next CATE The definition of :
They are in Y(T=0) For the baseline , Only consider Y(T=j) To Y(T=0) The difference between the .
UDCF
Author for multi-treatment problem , Think the forest they built ( namely UDCF) There are two requirements that need to be met :
- Unified: Refer to ,UDCF Need to make each treatment Of effect The evaluation is in the same feature space ;
- Discriminative : Refer to ,UDCF Need to make a difference treatment Corresponding effect Sufficiently heterogeneous ;
therefore ,UDCF The full name of is Unified Discriminative Causal Forest.
In order to meet the above two requirements , The author puts forward two New split criteria :
- inter split: The standard is to GRF be used for CATE A little change has been made to the split criteria of , bring CF Can be applied to multi-treatment The circumstances of , To satisfy the Unified characteristic , Its formal expression is as follows :
Not familiar with GRF Step by step Paper notes :GRF
This split criterion continues to be used GRF Thought , You want to maximize the heterogeneity between child nodes ; - Intra split: The standard is aimed at Discriminative Characteristics proposed 【 Assumptions are also attached 1】, The idea is to maximize the difference treatment Heterogeneity between , The formal expression is as follows :
Based on the above two split criteria , The author puts forward the calculation steps of node splitting :- First of all, according to the inter split Criteria filter out m A candidate ;
- According to intra split The criteria screen out the optimal split ;
DGB
It is also an optimization problem :
The details are brief
边栏推荐
- Function and implementation of closures
- 炮打司令部,别让一个UI框架把你毁了
- Ncnn source code learning collection
- Vscode有什么好用的插件?
- 从TiDB上线阿里云的背后,如何看待云数据库的变革趋势
- [100 questions of Blue Bridge Cup intensive training] scratch command mobile Blue Bridge Cup scratch competition special prediction programming question intensive training simulation exercise question
- Day_ 04
- Xinlou: un voyage de sept ans de Huawei Sports Health
- Record learning of hystrix knowledge --20210929
- In the wechat environment, H5 jumps to the specified page of the applet
猜你喜欢
Blue Bridge Cup - practice system login
論文筆記:LBCF: A Large-Scale Budget-Constrained Causal Forest Algorithm
【机器学习】基于多元时间序列对高考预测分析案例
Stop "outsourcing" Ai models! The latest research finds that some "back doors" that undermine the security of machine learning models cannot be detected
3.条件概率与独立性
Why does golang's modification of slice data affect the data of other slices?
【NLP】今年英语高考,CMU用重构预训练交出134高分,大幅超越GPT3
mysql整体架构和语句的执行流程
深入理解和把握数字经济的基本特征
论文笔记:LBCF: A Large-Scale Budget-Constrained Causal Forest Algorithm
随机推荐
Cocoapods installation in 2021
Helsinki traffic safety improvement project deploys velodyne lidar Intelligent Infrastructure Solution
File operation, serialization, recursive copy
_ 19_ IO stream summary
Perfect shuffle problem
【NLP】今年英语高考,CMU用重构预训练交出134高分,大幅超越GPT3
Preliminary understanding of JVM
[Third Party framework] retrofit2 (1) of network request framework -- Getting Started Guide
炮打司令部,别让一个UI框架把你毁了
The database records are read through the system time under the Android system, causing the problem of incomplete Reading Records!
Detailed explanation of IVX low code platform series -- Overview (I)
ES6 deconstruction assignment rename
Read mysql45 - a simple understanding of global locks and table locks
Built in function globals() locals()
[problem solving] dialogfragment can not be attached to a container view
Generate post order traversal according to pre order traversal and mid order traversal
【蓝桥杯集训100题】scratch指令移动 蓝桥杯scratch比赛专项预测编程题 集训模拟练习题第14题
Apijson simple to use
Unity技术手册 - 生命周期旋转RotationOverLifetime-速度旋转RotationBySpeed-外力ExternalForces
【精通高并发】深入理解C语言基础与汇编下的C语言