Hands-on Data Analysis: Data Reconstruction
2022-06-21 12:00:00 【includeSteven】
Data reconstruction
Introduction
Once the data has been cleaned (duplicates and missing values handled), it can be reconstructed. Data reconstruction is needed because, in practice, the data may be spread across multiple files, related pieces of data may need to be integrated, and useful information (such as maxima, averages, and other statistics) has to be extracted from the data. The techniques below cover these cases.
Data merging
Merging DataFrames
DataFrame provides merge, join, and append for this. Note that append was deprecated in pandas 1.4 and removed in pandas 2.0; pd.concat is its replacement.
merge: a method of the DataFrame object that combines two tables on one or more key columns. merge can also combine tables on their index.
join: a more convenient way to merge on the index.
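The merging methods above can be sketched as follows. This is a minimal illustration rather than the original post's code: the tables `left` and `right` and their column names are made up for the example, and pd.concat stands in for append, which no longer exists in pandas 2.0 and later.

```python
import pandas as pd

# Hypothetical sample tables; the column names are illustrative only.
left = pd.DataFrame({"id": [1, 2, 3], "name": ["Ann", "Bob", "Cat"]})
right = pd.DataFrame({"id": [2, 3, 4], "age": [30, 25, 41]})

# merge on a shared key column (inner join by default).
merged = left.merge(right, on="id")

# merge on the index instead of a column.
by_index = left.set_index("id").merge(
    right.set_index("id"), left_index=True, right_index=True
)

# join aligns on the index and keeps every row of the left table by default.
joined = left.set_index("id").join(right.set_index("id"))

# Row-wise stacking, the job DataFrame.append used to do.
stacked = pd.concat([left, left], ignore_index=True)
```

Passing how="left", how="right", or how="outer" to merge changes which unmatched rows are kept.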
Concatenation along an axis
There are two ways:
- Use numpy: np.concatenate([arr, arr], axis=1)
- Use pandas: pd.concat(), which takes a list of Series or DataFrame objects and an axis argument
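A short sketch of both approaches, assuming a small made-up array `arr` and a DataFrame `df` built from it:

```python
import numpy as np
import pandas as pd

# Hypothetical 2x3 array used for both examples.
arr = np.arange(6).reshape(2, 3)

# numpy: axis=1 places the arrays side by side (more columns).
wide = np.concatenate([arr, arr], axis=1)

# pandas: axis=1 concatenates columns, axis=0 stacks rows.
df = pd.DataFrame(arr, columns=["a", "b", "c"])
side_by_side = pd.concat([df, df], axis=1)
stacked = pd.concat([df, df], axis=0, ignore_index=True)
```

Unlike np.concatenate, pd.concat aligns on labels, so objects with different indexes or columns produce missing values rather than an error.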
Grouping and aggregation of data
Grouping data
Use the groupby method of a DataFrame to split the rows into groups by the values of one or more columns.
Data aggregation
- sum: calculates the sum of each group
- mean: calculates the average of each group
- max: calculates the maximum value of each group
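Grouping and the three aggregations listed above might look like this. The DataFrame and its "group" and "value" columns are invented for illustration:

```python
import pandas as pd

# Hypothetical data: two groups, "a" and "b".
df = pd.DataFrame({
    "group": ["a", "a", "b", "b", "b"],
    "value": [1, 3, 2, 4, 6],
})

# Split the rows by the "group" column and select the column to aggregate.
grouped = df.groupby("group")["value"]

totals = grouped.sum()    # a: 4, b: 12
means = grouped.mean()    # a: 2.0, b: 4.0
maxima = grouped.max()    # a: 3, b: 6

# agg computes several statistics at once, one column per statistic.
summary = grouped.agg(["sum", "mean", "max"])
```

The result of each aggregation is indexed by the group key, so `totals["a"]` looks up the sum for group "a".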
References
- merge in pandas
- concat in pandas
- groupby in pandas
- Hands-on Data Analysis, Chapters 8 and 10