[Data Mining] Final Review: Chapter 1
2022-06-21 06:12:00 【A delicious little pig】
Chapter 1: Introduction
1. Definition of data mining
Technical perspective: Data mining is the process of extracting implicit, previously unknown, but potentially useful information from large volumes of incomplete, noisy, fuzzy, and random real-world application data.
Business perspective: Data mining is a new business information processing technology. Its main feature is that it extracts, transforms, analyzes, and models business data in order to obtain the key data that supports business decision-making.
2. Tasks of data mining
Predictive tasks: predict the value of a particular attribute from the values of other attributes, for example classification, regression, and outlier detection.
Descriptive tasks: find patterns that summarize the underlying relationships in the data, for example cluster analysis and association analysis.
- Classification analysis
Classification analysis examines the data to build an accurate description, an analytical model, or a set of classification rules for each class, and then uses this classification model to classify other records.
- Clustering analysis
Cluster analysis finds the similarities and differences in a data set and groups objects with common characteristics into the same class. Clustering can help determine which groupings are more meaningful.
- Association analysis
Association analysis discovers co-occurrence relationships between items. It usually means discovering frequently occurring patterns (also known as association rules) in a given data set. Association analysis is widely used in marketing and transaction analysis.
- Outlier analysis
Outlier analysis mines the data points that deviate from most of the data. Examples include automatic detection of commercial fraud, network intrusion detection, and financial fraud detection.
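To make the idea of co-occurrence in association analysis concrete, here is a minimal sketch that computes the two usual measures of a rule, support and confidence, on a toy transaction set. The transactions, item names, and helper function names are made up for illustration and do not come from the course material.

```python
# Toy transactions; the item names are hypothetical.
transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def confidence(antecedent, consequent, transactions):
    """Estimate of P(consequent | antecedent) from co-occurrence counts."""
    both = support(set(antecedent) | set(consequent), transactions)
    base = support(antecedent, transactions)
    # An antecedent that never occurs gives no evidence for the rule.
    return both / base if base > 0 else 0.0


# Rule under test: {diapers} -> {beer}
rule_support = support({"diapers", "beer"}, transactions)
rule_confidence = confidence({"diapers"}, {"beer"}, transactions)
print(f"support={rule_support:.2f}, confidence={rule_confidence:.2f}")
```

A rule is normally kept only when both values exceed thresholds chosen by the analyst; real miners such as Apriori add a search over candidate itemsets on top of these two measures.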
3. Objects of data mining
These include spatial databases, time-series databases, stream data, multimedia databases, text data, and the World Wide Web.
4. Main steps of knowledge discovery
- Data cleaning. Remove noise and data that is clearly irrelevant to the mining topic.
- Data integration. Combine related data from multiple data sources.
- Data transformation. Convert the data into a form that is suitable for mining.
- Data mining. Apply intelligent methods to extract data patterns or regularities.
- Pattern evaluation. Select the meaningful knowledge from the mining results according to the evaluation criteria.
- Knowledge presentation. Use visualization and knowledge representation techniques to show users the mined knowledge.
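The six steps above can be sketched as a chain of small functions. This is only an illustrative toy under stated assumptions: the two purchase sources, the record layout, the function names, and the `min_count` threshold are all hypothetical, and the mining step is a plain frequency count rather than any particular algorithm.

```python
from collections import Counter

# Two hypothetical sources of (customer, item) purchase records.
store_a = [("ann", "Milk "), ("bob", "bread"), ("ann", None)]
store_b = [("cat", "milk"), ("bob", "MILK"), ("cat", "tea")]


def clean(records):
    """Data cleaning: drop records with a missing item."""
    return [(who, item) for who, item in records if item is not None]


def integrate(*sources):
    """Data integration: combine records from several sources."""
    return [rec for src in sources for rec in src]


def transform(records):
    """Data transformation: normalize item names to a common form."""
    return [(who, item.strip().lower()) for who, item in records]


def mine(records):
    """Data mining: count how often each item is bought."""
    return Counter(item for _, item in records)


def evaluate(counts, min_count=2):
    """Pattern evaluation: keep items that meet a frequency threshold."""
    return {item: n for item, n in counts.items() if n >= min_count}


def present(patterns):
    """Knowledge presentation: render the patterns as readable lines."""
    return [f"{item}: bought {n} times" for item, n in sorted(patterns.items())]


records = transform(integrate(clean(store_a), clean(store_b)))
patterns = evaluate(mine(records))
report = present(patterns)
print("\n".join(report))
```

Keeping each step as its own function mirrors the list: any stage can be swapped for a more realistic one without touching the others.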
5. Background and application areas of data mining
Background: "data glut", "information explosion", and "knowledge scarcity" leave people drowning in data and struggling to make the right decisions.
- Business applications: market analysis and management, corporate analysis and risk management, fraud detection, automatic trend forecasting, ...
- Computing applications: intrusion detection, spam filtering, Internet content and usage mining, intelligent question-answering systems, ...
- Other fields: industrial manufacturing, mining of bioinformatics or genetic data, ...