当前位置:网站首页>What is data mining?
What is data mining?
2022-06-26 06:43:00 【The code family】
1、 The concept of data mining
Data mining is from a large number of 、 Not completely 、 Noisy 、 Vague 、 In random actual data , Extract the implied , What people don't know in advance , But the process of potentially useful information and knowledge .
The data source used for data mining must be real and massive , And may be incomplete and include some interference data items . The discovered information and knowledge must be of interest and usefulness to the user . In general , The result of data mining is not required to be completely accurate knowledge , It's about finding a big trend .
Data mining can be simply understood as the operation of a large amount of data , The process of discovering useful knowledge . It is an interdisciplinary subject covering a wide range of fields , Including machine learning 、 mathematical statistics 、 neural network 、 database 、 pattern recognition 、 Rough set 、 Fuzzy mathematics and other related technologies .
2、 Application of data mining
For specific applications , Data mining is a process of using various analysis tools to find the relationship between models and data in massive data , These models and relationships can be used to make predictions .
Knowledge discovery in data mining , It is not to discover the truth that is universal , Nor is it to discover new natural science theorems and pure mathematical formulas , It is not a machine theorem proving . actually , All found knowledge is relative , There are specific premises and constraints , Oriented to a particular field , At the same time, it should be easy for users to understand , It is better to express the findings in natural language .
Data mining is actually a kind of deep-seated data analysis method . Data analysis itself has a long history , But in In the past , The purpose of data collection and analysis is for scientific research . in addition , Due to the limitation of computing power at that time , Complex data analysis methods that analyze large amounts of data are greatly limited .
3、 Value types of data mining
Data mining is to find valuable data in the mass of data , Provide basis for business decision-making . Value usually includes relevance 、 Trends and characteristics .
1) The correlation
Correlation analysis refers to the analysis of two or more variable elements with correlation , So as to measure the Degree of correlation .
Correlation analysis can only be carried out when there is a certain relationship or probability between elements . Correlation is not causality , The scope and field covered almost every aspect we have seen . Correlation analysis is used to determine changes between data , That is, whether the change of one or several attributes will affect other attributes , What is the impact . chart 1 These are examples of several common correlations .
2) trend
Trend analysis refers to the results that will actually be achieved , Compare with the historical data of similar indicators in the financial statements of different periods , To determine the financial position 、 An analytical method for the change trend and law of operating results and cash flow . The trend and trend of data can be predicted through the line chart , It can also be achieved through the link comparison 、 The results of the comparison are explained in a year-on-year manner , Pictured 2 Shown .
3) features
Feature analysis refers to finding the features of the main objects according to the specific analysis contents . for example , Internet data mining is to find out all aspects of the characteristics of users to portrait users , And according to different users, the user group will be labeled accordingly . Pictured 3 Shown .
边栏推荐
- Research Report on market supply and demand and strategy of China's microneedle device industry
- Load balancer does not have available server for client: userService问题解决
- 遇到女司机业余开滴滴,日入500!
- 宝塔服务器搭建及数据库远程连接
- Zotero使用之自定义参考文献格式
- STM 32 使用cube 生成TIM触发ADC并通过DMA传输的问题
- How to make the main thread wait for the sub thread to execute before executing
- 在公司逮到一个阿里10年的测试开发,聊过之后大彻大悟...
- DS18B20详解
- Mysql delete in 不走索引的
猜你喜欢
同步通信和异步通信的区别以及优缺点
Open source demo| you draw and I guess -- make your life more interesting
直播预告丨消防安全讲师培训“云课堂”即将开讲!
Pytorch uses multi GPU parallel training and its principle and precautions
Differences, advantages and disadvantages between synchronous communication and asynchronous communication
Customer Stories | Netease spring breeze: the "spring breeze" of the fun industry, reaching out to all areas through in-depth interaction
Library management system
SecureCRT运行SparkShell 删除键出现乱码的解法
MYSQL触发器要如何设置,简单教程新手一看就会
Connexion et déconnexion TCP, détails du diagramme de migration de l'état
随机推荐
Go语言学习笔记 1.2-变量篇
Reasons why MySQL indexes are not effective
MYSQL触发器要如何设置,简单教程新手一看就会
Dpdk - tcp/udp protocol stack server implementation (I)
LabVIEW Arduino tcp/ip remote smart home system (project part-5)
“试用期避免被辞退“ 指南攻略
Go language learning notes 1.2- variables
Lightgbm-- parameter adjustment notes
遇到女司机业余开滴滴,日入500!
Mysql delete in 不走索引的
Failed to configure a DataSource: ‘url‘ attribute is not specified and no embedded datasource could
SQL中空值的判断
Experience the new features of Milvus 2.0 together
I caught a 10-year-old Alibaba test developer in the company. After chatting with him, I realized everything
寶塔服務器搭建及數據庫遠程連接
Market trend report, technical innovation and market forecast of China's valeryl chloride
闭包问题C# Lua
My SQL(二)
China micro cultivator market trend report, technical dynamic innovation and market forecast
Marketing skills: compared with the advantages of the product, it is more effective to show the use effect to customers