当前位置:网站首页>Bayesian network explanation
Bayesian network explanation
2022-06-26 17:50:00 【lmn_】
0x01 Overview of Bayesian networks
Sometimes we need to calculate the probability of an uncertain cause and give some observed evidence , In these cases, Bayesian methods can be used .
Bayesian network (bayesian network)
Is a probabilistic graphical model , It explicitly captures the known conditional dependencies of directed edges in the graphical model , It passes through a directed acyclic graph (DAG) Represents a set of variables and their conditional dependencies .
Bayesian networks are well suited for capturing events that have occurred and predicting any of several possible known causes . for example , Bayesian networks can represent the probabilistic relationship between diseases and symptoms . Given symptoms , The network can be used to calculate the probability of the existence of various diseases .
Efficient algorithms can be used for reasoning and learning in Bayesian networks .
Sequence of variables ( for example Speech signal or protein sequence ) The Bayesian network for modeling is called dynamic Bayesian network , The generalization of Bayesian networks that can represent and solve decision problems under uncertainty is called influence graph .
Unlike manual builds , Automatic learning does not require expert knowledge of the underlying domain . Bayesian networks can be automatically learned directly from databases using experience based algorithms that are usually built into appropriate software . But the disadvantage is that the automation construction requires high data .
0x02 Bayesian network model
stay 0x01 Already mentioned in , Bayesian networks are directed acyclic graphs (DAG), Its nodes represent variables in the Bayesian sense .
- There are many types of variables , There are observable quantities 、 Potential variables 、 Unknown parameters or assumptions .
- Use edges to represent conditional dependencies ; Unconnected nodes represent variables that are conditionally independent of each other .
Each node is associated with a probability function , The probability function takes a set of specific values of the node's parent variables as input , The probability of variables represented by nodes is given .
Simplest graph , It can be expressed in this way :
Wiki gives an example
There are two ways to make grass wet : Watering car & It's raining .
Rain will affect the travel of sprinkler , There are two results when the grass becomes wet :T( It means true ) and F( On behalf of false ).
Chain rule of probability for joint probability function :
- G =“ Grass becomes wet ( really / false )”
- S =“ Sprinkler on ( really / false )”
- R =“ It's raining ( really / false )”
The model can answer questions about the existence effect ( The so-called inverse probability ) Whether there is a problem of cause .
By using the conditional probability formula and summing all the variables :
Use the expansion of the joint probability function Pr(G,S,R)} And the conditional probability table described in the figure (CPT) Conditional probabilities in , Each term in the numerator and denominator can be evaluated .
Then the numerical results ( Subscript by the value of the relevant variable ) yes
Infer unobserved variables
Bayesian networks are often used to answer probabilistic queries about them , When there are other variables , The network can be used to update the knowledge of variable subset states . This process of calculating the posterior distribution is called probabilistic reasoning . When selecting values for a subset of variables , Some expected loss functions can be minimized , For example, the probability of wrong decision . So Bayesian network can be considered as a complicated problem with the mechanism of automatically applying Bayesian theorem .
Parameter learning
Specified in X The parent node of is conditional X Probability distribution of ,X The distribution conditional on the parent can take any form . Discrete or Gaussian distributions are usually used , Because it simplifies the calculation . Sometimes you only know the limits of distribution ; The maximum entropy principle can then be used to determine the individual distributions , That is, the distribution with maximum entropy under given constraints .
Structural learning
The model of Bayesian network can be divided into several forms :
Direct connection : Indicates that the direction of a directed graph is in a straight line , And point in the same direction .
Expressed as :G -> S -> R
Huilian : Two independent nodes point to the same node .
Expressed as :G -> S <- R
Successive : One node points to two different nodes .
Expressed as :G <- S -> R
A particularly fast and accurate BN The learning method is to transform the problem into an optimization problem , And use integer programming to solve . In the solution process, the acyclic constraint is added to the integer program in the form of cutting plane (IP) in , This method can handle up to 100 A variable problem .
Software
Useful software for processing graphical models , There are several options . The most common software packages are Genie、Hugin、BUGS etc. .
边栏推荐
- Lm06 the mystery of constructing the bottom and top trading strategy only by trading volume
- Preparing for the Blue Bridge Cup and ccf-csp
- Uncover the secret of Agora lipsync Technology: driving portraits to simulate human speech through real-time voice
- Dos et détails de la méthode d'attaque
- The king of Internet of things protocol: mqtt
- MySQL exports all table indexes in the database
- Quantitative contract system development analysis case - detailed explanation of contract quantitative system development scheme
- [C language] static modifies local variables
- Please advise tonghuashun which securities firm to choose for opening an account? Is it safe to open an account online now?
- 【Unity】在Unity中使用C#执行外部文件,如.exe或者.bat
猜你喜欢
ACL 2022 | zero sample multilingual extracted text summarization based on neural label search
Uncover the secret of Agora lipsync Technology: driving portraits to simulate human speech through real-time voice
LeetCode——226. 翻转二叉树(BFS)
并发之线程安全
MySql 导出数据库中的全部表索引
ACL 2022 | 基于神经标签搜索的零样本多语言抽取式文本摘要
mysql Add column 失败 因为之前有数据,不是默认null 不行
SIGIR 2022 | 港大等提出超图对比学习在推荐系统中的应用
Tsinghua & Shangtang & Shanghai AI & CUHK proposed Siamese image modeling, which has both linear probing and intensive prediction performance!
【推荐系统学习】推荐系统的技术栈
随机推荐
MySQL add column failed because there was data before, not null by default
Strength and appearance Coexist -- an exclusive interview with Liu Yu, a member of Apache pulsar PMC
Leetcode HOT100 (22--- bracket generation)
[buuctf.reverse] 126-130
数据加密标准(DES)概念及工作原理
Alibaba's "high concurrency" tutorial "basic + actual combat + source code + interview + Architecture" is a god class
ACL 2022 | 基于神经标签搜索的零样本多语言抽取式文本摘要
解决pycharm里面每个字母占一格空格的问题
Ndroid development from introduction to mastery Chapter 2: view and ViewGroup
玩轉Linux,輕松安裝配置MySQL
pycharm的plt.show()如何保持不关闭
Today, I met a "migrant worker" who took out 38K from Tencent, which let me see the ceiling of the foundation
Please advise tonghuashun which securities firm to choose for opening an account? Is it safe to open an account online now?
决策树与随机森林
如何将应用加入到deviceidle 白名单?
Army chat -- registration of Registration Center
Hello, is it safe to open an online stock account and buy stocks now?
国信证券怎么开户?通过链接办理股票开户安全吗
【代码随想录-动态规划】T583、两个字符串的删除操作
Leetcode daily [2022 - 02 - 16]