Data-driven anomaly detection and early warning: Problem C of the 2021 May Day Mathematical Modeling Competition
2022-06-25 09:26:00 【Building block mathematical modeling】
Original, high-quality solution ideas are available from us at the link below.

High quality and original, for just 9.9; 50% off through the official account.
Link:
https://mianbaoduo.com/o/bread/YZmVlJdw
Data description: 24 hours of time series data from 100 sensors (no variable names are given)

Data description diagram
Data processing: there are 100 sensors and the variable names are unknown, so the variables need to be filtered. First compute the Euclidean distances between the 100 columns, cluster the columns on those distances, and then select columns from the clusters. This is the simplest effective approach.
Update 1: I tried to reduce the 100 columns with MATLAB's cluster function, but it appeared to fail (perhaps because the dimension is high; I am not sure why). For now R is used for this step instead, and the dimension-reduced data set is provided directly.
Update 2: the pdist function solves the problem, and the correlation coefficient is not needed. Note that the data should not be standardized or normalized for this dimension reduction (not explained further here). The results are given below; just select a few columns from each cluster as the new data set. The code and data results will be posted soon.
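The column-filtering idea described above can be sketched in a few lines. This is only an illustration, not the author's MATLAB or R code: it uses plain Python, average-linkage agglomerative clustering (an assumption, since the post does not name a linkage), and a hypothetical toy data set of 6 columns instead of the real 100. The function names `column_distances`, `cluster_columns`, and `pick_representatives` are made up for this example. Following the note above, the raw values are used without standardization.

```python
import math

def column_distances(columns):
    """Pairwise Euclidean distances between columns (each column is a list of readings)."""
    n = len(columns)
    dist = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d = math.dist(columns[i], columns[j])
            dist[i][j] = dist[j][i] = d
    return dist

def cluster_columns(columns, n_clusters):
    """Agglomerative clustering with average linkage; returns lists of column indices."""
    dist = column_distances(columns)
    clusters = [[i] for i in range(len(columns))]

    def linkage(a, b):
        # Average distance between all cross-cluster pairs of columns.
        return sum(dist[i][j] for i in a for j in b) / (len(a) * len(b))

    while len(clusters) > n_clusters:
        best = None
        for p in range(len(clusters)):
            for q in range(p + 1, len(clusters)):
                d = linkage(clusters[p], clusters[q])
                if best is None or d < best[0]:
                    best = (d, p, q)
        _, p, q = best
        clusters[p] = clusters[p] + clusters[q]  # merge the closest pair
        del clusters[q]
    return clusters

def pick_representatives(columns, clusters):
    """From each cluster, keep the column with the smallest total distance to its members."""
    dist = column_distances(columns)
    reps = [min(members, key=lambda i: sum(dist[i][j] for j in members))
            for members in clusters]
    return sorted(reps)

# Hypothetical toy data: 6 sensor columns that form 3 obvious groups.
toy = [
    [1.0, 2.0, 3.0, 4.0],
    [1.1, 2.1, 3.1, 4.1],
    [10.0, 10.0, 10.0, 10.0],
    [10.2, 9.9, 10.1, 10.0],
    [50.0, 40.0, 50.0, 40.0],
    [50.5, 40.5, 50.5, 40.5],
]
groups = cluster_columns(toy, n_clusters=3)
selected = pick_representatives(toy, groups)
print(groups)    # clusters of column indices
print(selected)  # one representative column index per cluster
```

The sketch keeps one column per cluster; to keep "a few of each class" as suggested above, the selection step could return the several members with the smallest total distance instead.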
After running the code to obtain the 10 processed columns (relatively independent data), take a brief look at their trends and characteristics:

As can be seen, each processed column has its own characteristics, which suggests that our processing method is basically reliable (for reference only).
The time series plots above basically match the properties given in the problem statement, such as the isolated points in the lower-right panel and the periodicity in the upper-middle panel.
Question 1: Build a detection model for risk-related abnormal data (give an assessment of risk versus non-risk)
Based on the figure above, the four properties of the monitoring time series are analyzed one by one.
The following reference schemes are given:
1. Detecting non-risk outliers
Note! There is a conceptual distinction to keep in mind: the "abnormal data" in this problem shares its name with outlier detection in machine learning, so be careful to distinguish the two when searching for material. Abnormal data in this article refers to a logical abnormality of the sensor.
Scheme 1) Judge outliers within a single column as non-risk (not very realistic given the problem statement; for learning only)
The problem statement mentions that isolated points in a time series are one of the characteristics of non-risk data. The first step of the abnormal data detection model is therefore to remove the outliers of each individual column using simple outlier detection (single-column outlier elimination).
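As a rough illustration of single-column outlier elimination, the sketch below flags isolated points with a robust z-score based on the median and the median absolute deviation (MAD). The post only says "simple outlier detection", so the choice of this rule, the threshold of 3.5, the function names, and the toy readings are all assumptions made for the example.

```python
import statistics

def isolated_outliers(series, threshold=3.5):
    """Return indices whose robust z-score (median/MAD based) exceeds the threshold."""
    med = statistics.median(series)
    mad = statistics.median([abs(x - med) for x in series])
    if mad == 0:
        return []  # at least half the readings equal the median; this rule cannot score them
    # The factor 0.6745 rescales the MAD so the score is comparable to a standard z-score.
    return [i for i, x in enumerate(series)
            if 0.6745 * abs(x - med) / mad > threshold]

def remove_isolated(series, threshold=3.5):
    """Return a copy of the series with flagged points replaced by the column median."""
    med = statistics.median(series)
    flagged = set(isolated_outliers(series, threshold))
    return [med if i in flagged else x for i, x in enumerate(series)]

# Hypothetical readings from one sensor with a single isolated spike at index 4.
readings = [10.0, 10.2, 9.9, 10.1, 25.0, 10.0, 9.8, 10.3]
flagged = isolated_outliers(readings)
cleaned = remove_isolated(readings)
print(flagged)
print(cleaned)
```

A median-based rule is used here because a single large spike inflates the mean and standard deviation of a short series, which can hide the spike from a plain 3-sigma test. For strongly periodic columns the rule would more sensibly be applied to residuals after removing the periodic component.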
Scheme 2*) Judge global outliers as non-risk
Fluctuations in a sensor (outliers) may or may not indicate risk. Fluctuations caused by changes in the external environment are considered risk-free. If at some point ...........................