当前位置:网站首页>MySql如何删除所有多余的重复数据
MySql如何删除所有多余的重复数据
2022-06-26 04:48:00 【唯空城】
MySql如何删除所有多余的重复数据
- 需要处理的数据,如:
- 出现重复的数据,如:
- 先用SELECT查询看看结果:
-- 方法一
SELECT * FROM t_user WHERE user_name IN (
SELECT user_name FROM t_user GROUP BY user_name HAVING COUNT(1)>1
)
AND id NOT IN (
SELECT MIN(id) FROM t_user GROUP BY user_name HAVING COUNT(1)>1
)
- 方法一查询出的所有多余的重复记录:
-- 方法二
SELECT * FROM t_user WHERE id NOT IN (
SELECT MIN(id) FROM t_user GROUP BY user_name
)
- 方法二查询出的所有多余的重复记录(与方法一的结果相同):
-- 方法三
SELECT * FROM t_user AS t1 WHERE t1.id <> (
SELECT MAX(t2.id) FROM t_user AS t2 WHERE t1.user_name=t2.user_name
)
- 方法三查询出的所有多余的重复记录:
这里方法三因为用了MAX()方法(也可改用MIN()),查询结果记录的id不太一样,但也可以被视为重复多余的数据,关键是你希望选择保留哪一条记录而已。
- 下面是对上面的SELECT语句稍作修改并加入了DELETE
-- 方法一(笨方法但容易理解)
DELETE FROM t_user WHERE user_name IN (
SELECT t1.user_name FROM (
-- 查询出所有重复的user_name
SELECT user_name FROM t_user GROUP BY user_name HAVING COUNT(1)>1
) t1
)
AND id NOT IN (
SELECT t2.min_id FROM (
-- 查询出所有重复的记录并各自只取其中一条(MIN(id)或MAX(id)都可以)
SELECT MIN(id) AS min_id FROM t_user GROUP BY user_name HAVING COUNT(1)>1
) t2
)
-- 方法二(推荐方法也容易理解)
DELETE FROM t_user WHERE id NOT IN (
SELECT t.min_id FROM (
-- 过滤出重复多余的数据,比如,如果所有记录中存在1条记录是user_name=zhangsan的,那么就取出它;
-- 如果所有记录中存在多条记录是user_name=lisi的,那么只取其中1条,其他的不查询出来
SELECT MIN(id) AS min_id FROM t_user GROUP BY user_name
) t
)
-- 方法三(推荐方法但不太容易理解)
DELETE FROM t_user WHERE id IN (
SELECT t.id FROM (
-- 1. 关于所有存在相同user_name的记录,只查询出(保留)重复记录中的1条,假设这样查询出来的集合为A集合。
-- 2. 在所有记录中,只要id不在A集合中的,都把它们查询出来
SELECT t1.id FROM t_user AS t1 WHERE t1.id <> (SELECT MAX(t2.id) FROM t_user AS t2 WHERE t1.user_name=t2.user_name)
) t
)
-- 或
DELETE FROM t_user t1
WHERE t1.id <> (
SELECT t2.max_id FROM (
SELECT MAX(t3.id) AS max_id FROM t_user t3 WHERE t1.user_name=t3.user_name
) t2
)
- 最后删除成功之后,显示数据已经没有重复的了
边栏推荐
- Create alicloud test instances
- LeetCode 94. Middle order traversal of binary tree
- Is education important or ability important in software testing
- Thinkphp6 parsing QR code
- Stm8 MCU ADC sampling function is triggered by timer
- An unexpected attempt (Imperial CMS list template filters spaces and newlines in smalltext introduction)
- DBeaver 安装及配置离线驱动
- 2022.2.13
- TP5 distinct method paging problem
- Multipass Chinese document - share data with instances
猜你喜欢
2021-02-07
企业的产品服务怎么进行口碑营销?口碑营销可以找人代做吗?
How to carry out word-of-mouth marketing for enterprises' products and services? Can word of mouth marketing be done on behalf of others?
08_ Spingboot integrated redis
钟珊珊:被爆锤后的工程师会起飞|OneFlow U
1.24 learning summary
2022.2.17
ROS notes (07) - Implementation of client and server
Stm8 MCU ADC sampling function is triggered by timer
Differences between TCP and UDP
随机推荐
2022.2.11
A method of quickly transplanting library function code to register code by single chip microcomputer
防撤回测试记录
Numpy data input / output
Multipass中文文档-使用Multipass服务授权客户端
Thinkphp6 parsing QR code
微信小程序保存图片的方法
做软件测试学历重要还是能力重要
LISP programming language
Jenkins introduces custom jars
[H5 development] 02 take you to develop H5 list page ~ including query, reset and submission functions
Hash problem
08_ Spingboot integrated redis
I like you!
Multipass Chinese document - use instance command alias
1.19 learning summary
Selection of programming language
2.22.2.14
Multipass Chinese document - share data with instances
Use shell script to analyze system CPU, memory and network throughput