Delete duplicates in MySQL table

Question

I am trying to write my first MySQL query. I need to delete rows that share the same article_number value. I wrote this query:

SELECT
    article_number, COUNT(*)
FROM
    article_numbers
GROUP BY
    article_number
HAVING
    COUNT(*) > 1

It shows me every article number that occurs more than once. But how can I delete all but one row for each duplicated article number?

Thanks

EDIT:

I tried this query:

delete article_numbers from article_numbers inner join 
    (select  article_number
     from article_numbers 
     group by article_number
     having count(1) > 1) as duplicates
   on (duplicates.article_number = article_numbers.article_number)

but it gives me this error:

Cannot delete or update a parent row: a foreign key constraint fails (api.products, CONSTRAINT products_article_number_id_foreign FOREIGN KEY (article_number_id) REFERENCES article_numbers (id))

EDIT 2:

I disabled the foreign key check temporarily, and now my delete query works. But how can I modify it so that one row of each set of duplicates is kept?

You have to distinguish one row from its duplicate somehow. (Any id column, or timestamp? What's the primary key?)
– jarlh, Commented Sep 21, 2015 at 11:48
Also, don't tag products not involved. Are you using MySQL or MS SQL Server?
– jarlh, Commented Sep 21, 2015 at 11:48
Possible duplicate of How to delete duplicate rows in sql server?
– Evaldas Buinauskas, Commented Sep 21, 2015 at 11:48

Ullas · Accepted Answer · 2015-09-21 12:24:42Z

2

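Use a self-join (written here as a comma join, that is, a CROSS JOIN filtered in the WHERE clause). It deletes every row for which another row with the same article_number and a lower id exists, so the row with the lowest id in each group is kept.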

Query

delete t1
from article_numbers t1,
article_numbers t2
where t1.id > t2.id 
and t1.article_number = t2.article_number;
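
Before running the DELETE it can be worth previewing what it would remove. The sketch below reuses the same join condition in a SELECT, then re-runs the question's GROUP BY check afterwards. It assumes id is unique and article_number is never NULL (rows with a NULL article_number never satisfy the = comparison, so the DELETE leaves them alone).

```sql
-- Preview: every row that has a same-numbered row with a lower id,
-- i.e. exactly the rows the DELETE above would remove.
SELECT DISTINCT t1.*
FROM article_numbers AS t1
JOIN article_numbers AS t2
  ON t1.article_number = t2.article_number
 AND t1.id > t2.id;

-- After the DELETE: this should return no rows
-- (assumption: article_number is never NULL).
SELECT article_number, COUNT(*) AS copies
FROM article_numbers
GROUP BY article_number
HAVING COUNT(*) > 1;
```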

Fiddle demo

edited Sep 21, 2015 at 12:24

answered Sep 21, 2015 at 12:05

Ullas

John Bell · Answer · 2015-09-21 11:52:04Z

0

I use a rather simple query to remove dupes:

;WITH DEDUPE AS (
SELECT ROW_NUMBER() OVER(
    PARTITION BY article_number
        ORDER BY (SELECT 1)) AS RN
FROM article_numbers)
DELETE FROM DEDUPE
WHERE RN != 1

answered Sep 21, 2015 at 11:52

John Bell

2 Comments

Ullas Over a year ago

The OP needs a MySQL query.

John Bell Over a year ago

This wasn't apparent at the time of my posting.

koushik veldanda · Answer · 2015-09-21 11:54:22Z

0

Delete c
from (select *, row_number() over(partition by article_number order by id) as r from article_numbers) c
where c.r != 1

answered Sep 21, 2015 at 11:54

koushik veldanda

jarlh · Answer · 2015-09-21 11:54:45Z

0

Delete a row if another row with the same article_number but a higher id exists (so the row with the highest id is kept):

delete from article_numbers t1
where exists (select 1 from article_numbers t2
              where t2.article_number = t1.article_number
                and t2.id > t1.id)

Core ANSI SQL, so I suppose it works with both MySQL and SQL Server.
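
The syntax error reported in the comments below appears to point at the alias t1 in the single-table DELETE. On MySQL and MariaDB the same rule can be written as a multi-table DELETE, which does accept table aliases. A sketch, assuming id is a unique key:

```sql
-- Same rule as the EXISTS version: remove t1 whenever a row t2 with the
-- same article_number and a higher id exists, so the highest id survives.
DELETE t1
FROM article_numbers AS t1
JOIN article_numbers AS t2
  ON t2.article_number = t1.article_number
 AND t2.id > t1.id;
```

Swapping the comparison to t2.id < t1.id keeps the lowest id instead.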

answered Sep 21, 2015 at 11:54

jarlh

2 Comments

razer Over a year ago

You have an error in your SQL syntax; check the manual that corresponds to your MariaDB server version for the right syntax to use near 't1 where exists (select 1 from article_numbers t2               where t2.article' at line 1

jarlh Over a year ago

Sorry, apparently MySQL doesn't support it. (Conforms to Core SQL-2003.)

ProblemSolver · Answer · 2015-09-21 11:59:27Z

0

I think this would help:

    WITH tblTemp as
    (
    SELECT ROW_NUMBER() Over(PARTITION BY Name,Department ORDER BY Name)
       As RowNumber,* FROM <table_name>
    )
    DELETE FROM tblTemp where RowNumber >1
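
For the table in the question, the same ROW_NUMBER idea might look like this on MySQL. This is only a sketch: it assumes a server with window functions (MySQL 8.0 or later, MariaDB 10.2 or later) and a unique id column. As far as I know MySQL cannot delete through a CTE the way SQL Server does, so the numbering goes into a derived table that is joined back on id.

```sql
-- Number the rows inside each article_number group, lowest id first,
-- then delete every row that is not number 1 in its group.
DELETE a
FROM article_numbers AS a
JOIN (
    SELECT id,
           ROW_NUMBER() OVER (PARTITION BY article_number ORDER BY id) AS rn
    FROM article_numbers
) AS ranked ON ranked.id = a.id
WHERE ranked.rn > 1;
```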

answered Sep 21, 2015 at 11:59

ProblemSolver

2 Comments

Ullas Over a year ago

The OP needs a MySQL query.

ProblemSolver Over a year ago

Oh, this is for SQL Server. It can be converted to MySQL.

razer · Answer · 2015-09-21 12:23:36Z

I modified my query and I think it works now:

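SET FOREIGN_KEY_CHECKS=0;  -- skip foreign key checks for this session
-- keep the lowest id per article_number, delete the other rows of each duplicated group
-- note: rows in products that pointed at a deleted id are left dangling
delete article_numbers from article_numbers inner join 
    (select  min(id) minid, article_number
     from article_numbers 
     group by article_number
     having count(1) > 1) as duplicates
   on (duplicates.article_number = article_numbers.article_number and duplicates.minid <> article_numbers.id);
SET FOREIGN_KEY_CHECKS=1;  -- turn the checks back on afterwards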

But it seems very complex. I will check @Ullas's method to see if it works, too.
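
A possible alternative that leaves the foreign key checks on: repoint the products rows at the surviving id first, then delete the duplicates. This is a sketch only. The table and column names come from the error message in the question (products.article_number_id references article_numbers.id), and it assumes that products is the only table referencing article_numbers and that the lowest id in each group is the one to keep.

```sql
-- Step 1: point every product at the lowest id for its article_number.
UPDATE products AS p
JOIN article_numbers AS a
  ON a.id = p.article_number_id
JOIN (
    SELECT article_number, MIN(id) AS keep_id
    FROM article_numbers
    GROUP BY article_number
) AS survivors
  ON survivors.article_number = a.article_number
SET p.article_number_id = survivors.keep_id
WHERE p.article_number_id <> survivors.keep_id;

-- Step 2: the duplicates are no longer referenced, so they can be
-- deleted without disabling the foreign key checks.
DELETE a
FROM article_numbers AS a
JOIN (
    SELECT article_number, MIN(id) AS keep_id
    FROM article_numbers
    GROUP BY article_number
) AS survivors
  ON survivors.article_number = a.article_number
WHERE a.id <> survivors.keep_id;
```

Running both statements inside one transaction would let you roll back if the result is not what you expected.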
