
Summary of statistical learning methods

2022-06-21 14:26:00 Record brother

Common statistical learning methods fall into the following categories:

I. Supervised learning

  1. Generative methods: learn the joint probability distribution P(X, Y) from the data, then derive the conditional distribution P(Y|X) = P(X, Y) / P(X) as the prediction model. The model describes how a given input X generates the output Y. Examples: naive Bayes, hidden Markov models (HMM).
  2. Discriminative methods: learn the decision function f(X) or the conditional probability distribution P(Y|X) directly from the data as the prediction model. A discriminative method cares only about which output Y should be predicted for a given input X. Examples: k-nearest neighbors, perceptron, decision trees, logistic regression (LR), maximum entropy models, SVM, CRF.
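The generative side of this contrast can be made concrete with a tiny naive Bayes classifier. This is a minimal pure-Python sketch on invented toy data (the weather/play examples and all names are illustrative, not from the original article): it estimates the joint distribution via the prior P(y) and likelihood P(x|y), then predicts with Bayes' rule.

```python
from collections import Counter, defaultdict

# Toy data: feature = outlook, label = play / no-play (invented for illustration).
data = [("sunny", "no"), ("sunny", "no"), ("overcast", "yes"),
        ("rain", "yes"), ("rain", "yes"), ("sunny", "yes"),
        ("overcast", "yes"), ("rain", "no")]

# Generative view: estimate the joint P(x, y) through P(y) and P(x | y),
# then predict with Bayes' rule: argmax_y P(y) * P(x | y).
label_counts = Counter(y for _, y in data)
cond_counts = defaultdict(Counter)
for x, y in data:
    cond_counts[y][x] += 1

def predict(x):
    n = len(data)
    scores = {}
    for y, cy in label_counts.items():
        p_y = cy / n                          # prior P(y)
        p_x_given_y = cond_counts[y][x] / cy  # likelihood P(x | y)
        scores[y] = p_y * p_x_given_y
    return max(scores, key=scores.get)

print(predict("overcast"))  # → "yes": both "overcast" examples have label "yes"
```

A discriminative method such as logistic regression would instead fit P(y|x) directly, without modeling how x itself is generated.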

  1. Classification

  1. Perceptron (a binary linear classifier) --> neural network (adds a hidden layer to the perceptron; the output layer can have more than one neuron) --> deep neural network (a neural network with many hidden layers; a multilayer neural network and a deep neural network (DNN) are essentially the same thing)
  2. KNN
  3. Probabilistic models: naive Bayes (NB), logistic regression (LR), maximum entropy model (MEM; like LR, a log-linear classification model)
  4. Support vector machine (SVM)
  5. Decision trees (ID3, C4.5, CART)
  6. Ensemble learning {
        Boosting {
            Gradient Boosting {
                GBDT
                XGBoost (traditional GBDT uses CART as the base learner; XGBoost also supports linear base learners, in which case it is equivalent to logistic regression (for classification) or linear regression (for regression) with L1 and L2 regularization terms. XGBoost is an efficient system implementation of gradient boosting, not a single new algorithm.)
            }
            AdaBoost
        }
        Bagging {
            Random forests
        }
        Stacking
    }
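The Bagging branch above can be illustrated with a minimal sketch: decision stumps trained on bootstrap resamples of a toy 1-D dataset, combined by majority vote. All data and names here are invented for illustration.

```python
import random

# Toy 1-D data: points below 5 are class 0, points above are class 1.
X = [1.0, 2.0, 3.0, 4.0, 6.0, 7.0, 8.0, 9.0]
y = [0, 0, 0, 0, 1, 1, 1, 1]

def fit_stump(xs, ys):
    """Pick the threshold minimizing training error (predict 1 if x > t)."""
    best_t, best_err = xs[0], float("inf")
    for t in xs:
        err = sum((x > t) != lab for x, lab in zip(xs, ys))
        if err < best_err:
            best_t, best_err = t, err
    return best_t

def bagging(X, y, n_estimators=25, seed=0):
    """Bagging: train each stump on a bootstrap resample, predict by majority vote."""
    rng = random.Random(seed)
    thresholds = []
    for _ in range(n_estimators):
        idx = [rng.randrange(len(X)) for _ in range(len(X))]
        thresholds.append(fit_stump([X[i] for i in idx], [y[i] for i in idx]))
    def predict(x):
        votes = sum(x > t for t in thresholds)
        return int(votes * 2 > len(thresholds))
    return predict

model = bagging(X, y)
print(model(2.5), model(8.5))
```

A random forest follows the same recipe with full decision trees plus a random feature subset at each split; boosting instead trains learners sequentially, each focusing on the previous ones' errors.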

     

  2. Regression

  1. Linear regression
  2. Tree regression
  3. Ridge regression
  4. Lasso regression
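For a single centered feature with no intercept, these regressions have simple closed forms, which makes the shrinkage effect easy to see (toy numbers, illustrative only): ridge's L2 penalty shrinks the least-squares weight smoothly, while lasso's L1 penalty soft-thresholds it and can set it exactly to zero.

```python
# Toy data: y ≈ 2x, single centered feature, no intercept.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.1, 3.9, 6.2, 7.8]

sxy = sum(x * y for x, y in zip(xs, ys))
sxx = sum(x * x for x in xs)

w_ols = sxy / sxx                # ordinary least squares
lam = 3.0                        # regularization strength (arbitrary here)
w_ridge = sxy / (sxx + lam)      # ridge: L2 penalty shrinks the weight toward 0
# lasso: soft-thresholding; a large enough lam drives the weight exactly to 0
w_lasso = max(abs(sxy) - lam, 0.0) / sxx * (1 if sxy > 0 else -1)

print(round(w_ols, 3), round(w_ridge, 3), round(w_lasso, 3))
```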

  3. Tagging

     Probabilistic graphical models: HMM, MEMM (maximum-entropy Markov model), CRF. (Tagging is essentially a generalization of the classification problem, and also a simple form of the more complex structured-prediction problem. The input is an observation sequence and the output is a sequence of tags or states; the goal is to learn a model that predicts a tag sequence for a given observation sequence. Commonly used statistical learning methods are the hidden Markov model and the conditional random field, applied e.g. to part-of-speech tagging.)
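The decoding step shared by these models can be sketched with the Viterbi algorithm on a two-state toy HMM (all states and probabilities are invented for illustration): dynamic programming finds the most probable tag sequence for an observation sequence.

```python
# Toy two-state HMM for POS-style tagging: N = noun, V = verb.
states = ["N", "V"]
start = {"N": 0.6, "V": 0.4}
trans = {"N": {"N": 0.3, "V": 0.7}, "V": {"N": 0.8, "V": 0.2}}
emit = {"N": {"fish": 0.6, "swim": 0.4}, "V": {"fish": 0.3, "swim": 0.7}}

def viterbi(obs):
    # V[t][s] = probability of the best state path ending in s after t+1 observations
    V = [{s: start[s] * emit[s][obs[0]] for s in states}]
    back = []
    for o in obs[1:]:
        row, ptr = {}, {}
        for s in states:
            prev = max(states, key=lambda p: V[-1][p] * trans[p][s])
            row[s] = V[-1][prev] * trans[prev][s] * emit[s][o]
            ptr[s] = prev
        V.append(row)
        back.append(ptr)
    # Backtrack from the best final state to recover the tag sequence.
    best = max(states, key=lambda s: V[-1][s])
    path = [best]
    for ptr in reversed(back):
        path.append(ptr[path[-1]])
    return list(reversed(path))

print(viterbi(["fish", "swim"]))  # → ['N', 'V']
```

An HMM learns these tables generatively from data; a CRF scores whole tag sequences discriminatively, but decoding still uses the same Viterbi recursion.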

II. Unsupervised learning

  1. Clustering

  1. Basic clustering (K-means, bisecting K-means, K-medians, GMM clustering)
  2. Hierarchical clustering
  3. Density clustering
  4. Spectral clustering
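Basic clustering can be sketched with a minimal 1-D K-means in pure Python (naive initialization and toy data, for illustration only; a real implementation would use k-means++ initialization and multiple restarts):

```python
# Minimal 1-D K-means (k=2): alternate assignment and mean-update steps.
def kmeans_1d(points, k=2, iters=20):
    centers = sorted(points)[:k]          # naive init: the k smallest points
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:                  # assignment step: nearest center
            i = min(range(k), key=lambda j: abs(p - centers[j]))
            clusters[i].append(p)
        # update step: move each center to its cluster mean
        centers = [sum(c) / len(c) if c else centers[i]
                   for i, c in enumerate(clusters)]
    return sorted(centers)

data = [1.0, 1.2, 0.8, 10.0, 10.5, 9.5]   # two obvious groups near 1 and 10
print(kmeans_1d(data))
```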

  2. Topic models

  1. pLSA
  2. LDA (latent Dirichlet allocation; used to extract topic features from documents)

  3. Association analysis

  1. Apriori algorithm
  2. FP-growth algorithm
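A minimal Apriori sketch in pure Python (toy transactions, invented for illustration; a production version would also prune candidates that have an infrequent subset before counting):

```python
from itertools import combinations

# Toy market-basket transactions.
transactions = [
    {"milk", "bread"}, {"milk", "diapers"}, {"bread", "diapers"},
    {"milk", "bread", "diapers"}, {"bread"},
]

def apriori(transactions, min_support=0.4):
    """Return every itemset whose support is >= min_support."""
    n = len(transactions)
    items = sorted({i for t in transactions for i in t})
    frequent, k_sets = {}, [frozenset([i]) for i in items]
    while k_sets:
        # count support of each candidate; keep only the frequent ones
        survivors = []
        for c in k_sets:
            sup = sum(c <= t for t in transactions) / n
            if sup >= min_support:
                frequent[c] = sup
                survivors.append(c)
        # candidate generation: join frequent k-sets into (k+1)-sets
        k_sets = list({a | b for a, b in combinations(survivors, 2)
                       if len(a | b) == len(a) + 1})
    return frequent

freq = apriori(transactions)
print(sorted(tuple(sorted(s)) for s in freq))
```

FP-growth finds the same frequent itemsets without repeated candidate generation, by compressing the transactions into an FP-tree.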

  4. Dimensionality reduction

  1. PCA (principal component analysis)
  2. SVD (singular value decomposition)
  3. LDA (linear discriminant analysis)
  4. LLE (locally linear embedding)
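For 2-D data, PCA reduces to an analytic eigen-decomposition of the 2x2 covariance matrix. A pure-Python sketch on toy data (the numbers are illustrative): center the data, take the eigenvector of the largest eigenvalue, and project onto it.

```python
import math

# Toy 2-D data (illustrative only).
data = [(2.5, 2.4), (0.5, 0.7), (2.2, 2.9), (1.9, 2.2), (3.1, 3.0),
        (2.3, 2.7), (2.0, 1.6), (1.0, 1.1), (1.5, 1.6), (1.1, 0.9)]

n = len(data)
mx = sum(x for x, _ in data) / n
my = sum(y for _, y in data) / n
centered = [(x - mx, y - my) for x, y in data]

# sample covariance matrix [[a, b], [b, c]]
a = sum(x * x for x, _ in centered) / (n - 1)
b = sum(x * y for x, y in centered) / (n - 1)
c = sum(y * y for _, y in centered) / (n - 1)

# closed-form eigenvalues of a symmetric 2x2 matrix
mean_ev = (a + c) / 2
d = math.sqrt(((a - c) / 2) ** 2 + b ** 2)
lam1 = mean_ev + d                 # largest eigenvalue = variance explained
vx, vy = b, lam1 - a               # its eigenvector (unnormalized)
norm = math.hypot(vx, vy)
vx, vy = vx / norm, vy / norm

projected = [x * vx + y * vy for x, y in centered]  # 1-D representation
print(round(lam1, 4))
```

SVD of the centered data matrix gives the same components; that is how PCA is usually computed in practice.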

  5. Anomaly detection

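The simplest anomaly-detection baseline flags points whose z-score exceeds a threshold. A minimal sketch on toy data (illustrative only; robust variants would use the median and MAD, since a large outlier inflates the mean and standard deviation themselves):

```python
import math

# Flag points whose distance from the mean exceeds `threshold` standard deviations.
def zscore_outliers(xs, threshold=3.0):
    n = len(xs)
    mean = sum(xs) / n
    std = math.sqrt(sum((x - mean) ** 2 for x in xs) / n)
    return [x for x in xs if abs(x - mean) / std > threshold]

data = [10.1, 9.9, 10.0, 10.2, 9.8, 10.1, 50.0]  # 50.0 is the obvious outlier
print(zscore_outliers(data, threshold=2.0))      # → [50.0]
```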

III. Semi-supervised learning

      

 That is, semi-supervised learning: methods that automatically exploit the distribution information contained in unlabeled samples, without relying on external interaction. The training set contains both labeled and unlabeled sample data. Examples: semi-supervised SVM, semi-supervised clustering, etc.
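The self-training idea behind many semi-supervised methods can be sketched in a few lines: pseudo-label the unlabeled points the current model is confident about, add them to the training set, and repeat. The 1-NN classifier, the confidence rule, and the toy data below are all invented for illustration.

```python
# Labeled points: (value, class). Class 0 clusters near 1-2, class 1 near 9-10.
labeled = [(1.0, 0), (2.0, 0), (9.0, 1), (10.0, 1)]
unlabeled = [1.5, 2.5, 8.5, 9.5, 5.2]

def nearest_label(x, labeled):
    """1-NN prediction: the label of the closest labeled point."""
    return min(labeled, key=lambda p: abs(x - p[0]))[1]

def self_train(labeled, unlabeled, confidence_radius=1.0):
    """Repeatedly pseudo-label unlabeled points close to a labeled point."""
    labeled, unlabeled = list(labeled), list(unlabeled)
    changed = True
    while changed and unlabeled:
        changed = False
        for x in list(unlabeled):
            dist = min(abs(x - p[0]) for p in labeled)
            if dist <= confidence_radius:   # "confident" pseudo-label
                labeled.append((x, nearest_label(x, labeled)))
                unlabeled.remove(x)
                changed = True
    return labeled

result = self_train(labeled, unlabeled)
print(len(result))  # 4 original + 4 pseudo-labeled points; 5.2 stays ambiguous
```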

IV. Reinforcement learning

      

 That is, reinforcement learning. Its essence is solving decision-making problems: making decisions automatically, and making them sequentially. Examples: autonomous driving, AlphaGo, game playing.
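Tabular Q-learning on a tiny corridor environment shows this sequential decision-making loop concretely. The environment, rewards, and hyperparameters below are invented for illustration.

```python
import random

# Tiny corridor MDP: cells 0..4, start at cell 0, reward +1 on reaching cell 4.
# Actions: 0 = move left, 1 = move right.
N_STATES, GOAL = 5, 4
ALPHA, GAMMA, EPS = 0.5, 0.9, 0.3

def step(s, a):
    """Environment dynamics: move, clamp to the corridor, reward at the goal."""
    s2 = max(0, min(N_STATES - 1, s + (1 if a == 1 else -1)))
    return s2, (1.0 if s2 == GOAL else 0.0), s2 == GOAL

rng = random.Random(0)
Q = [[0.0, 0.0] for _ in range(N_STATES)]
for _ in range(300):                       # training episodes
    s, done = 0, False
    while not done:
        # epsilon-greedy action selection
        if rng.random() < EPS:
            a = rng.randrange(2)
        else:
            a = max((0, 1), key=lambda act: Q[s][act])
        s2, r, done = step(s, a)
        # Q-learning update toward the bootstrapped target r + gamma * max Q(s', .)
        Q[s][a] += ALPHA * (r + GAMMA * max(Q[s2]) - Q[s][a])
        s = s2

policy = [max((0, 1), key=lambda act: Q[s][act]) for s in range(N_STATES - 1)]
print(policy)  # → [1, 1, 1, 1]: the learned policy always moves right
```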

 

Original article

Copyright notice: this article was written by [Record brother]; please include a link to the original when reposting. Thanks.
https://yzsam.com/2022/02/202202221425206422.html