konhon

          忘掉過去,展望未來。找回自我,超越自我。
          逃避不一定躲的過, 面對不一定最難過, 孤單不一定不快樂, 得到不一定能長久, 失去不一定不再擁有, 可能因為某個理由而傷心難過, 但我卻能找個理由讓自己快樂.

          Google

          BlogJava 首頁 新隨筆 聯(lián)系 聚合 管理
            203 Posts :: 0 Stories :: 61 Comments :: 0 Trackbacks
           

          學(xué)習(xí)sql有一段時間了,發(fā)現(xiàn)在我建了一個用來測試的表(沒有建索引)中出現(xiàn)了許多的重復(fù)記錄。后來總結(jié)了一些刪除重復(fù)記錄的方法,在Oracle中,可以通過唯一rowid實現(xiàn)刪除重復(fù)記錄;還可以建臨時表來實現(xiàn)...這個只提到其中的幾種簡單實用的方法,希望可以和大家分享(以表employee為例)。

          SQL> desc employee

           Name                                      Null?    Type
           ----------------------------------------- -------- ------------------

          emp_id                                                NUMBER(10)
          emp_name                                           VARCHAR2(20)

          salary                                                  NUMBER(10,2)

           

           

          可以通過下面的語句查詢重復(fù)的記錄:

          SQL> select * from employee;

           

              EMP_ID EMP_NAME                                  SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   1 sunshine                                      10000

                   2 semon                                         20000

                   2 semon                                         20000

                   3 xyz                                           30000

                   2 semon                                         20000

           


          SQL>
          select distinct * from employee;

              EMP_ID EMP_NAME                                     SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   2 semon                                         20000

                   3 xyz                                             30000

          SQL>  select * from employee group by emp_id,emp_name,salary having count (*)>1

              EMP_ID EMP_NAME                                     SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   2 semon                                          20000


          SQL>
          select * from employee e1

          where rowid in (select max(rowid) from employe e2
           
          where e1.emp_id=e2.emp_id and

            e1.emp_name=e2.emp_name and e1.salary=e2.salary);

           

              EMP_ID EMP_NAME                                     SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   3 xyz                                             30000

                   2 semon                                         20000

           

           

          2. 刪除的幾種方法:

           

          1)通過建立臨時表來實現(xiàn)

          SQL>create table temp_emp as (select distinct * from employee) 

          SQL> truncate table employee; (清空employee表的數(shù)據(jù))

          SQL> insert into employee select * from temp_emp;  (再將臨時表里的內(nèi)容插回來)

           

          ( 2)通過唯一rowid實現(xiàn)刪除重復(fù)記錄.Oracle中,每一條記錄都有一個rowid,rowid在整個數(shù)據(jù)庫中是唯一的,rowid確定了每條記錄是在Oracle中的哪一個數(shù)據(jù)文件、塊、行上。在重復(fù)的記錄中,可能所有列的內(nèi)容都相同,但rowid不會相同,所以只要確定出重復(fù)記錄中那些具有最大或最小rowid的就可以了,其余全部刪除。

          SQL>delete from employee e2 where rowid not in (
                 
          select max(e1.rowid) from employee e1 where

                  e1.emp_id=e2.emp_id and e1.emp_name=e2.emp_name and e1.salary=e2.salary);--這里用min(rowid)也可以。

           

          SQL>delete from employee e2 where rowid <(
                 
          select max(e1.rowid) from employee e1 where
                  e1.emp_id
          =e2.emp_id and e1.emp_name=e2.emp_name and

                            e1.salary=e2.salary);

           

          3)也是通過rowid,但效率更高。

          SQL>delete from employee where rowid not in (
                 
          select max(t1.rowid) from employee t1 group by

                   t1.emp_id,t1.emp_name,t1.salary);--這里用min(rowid)也可以。

           

              EMP_ID EMP_NAME                                     SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   3 xyz                                             30000

                   2 semon                                         20000

           

           

           

          SQL> desc employee

           Name                                      Null?    Type
           ----------------------------------------- -------- ------------------

          emp_id                                                NUMBER(10)
          emp_name                                           VARCHAR2(20)

          salary                                                  NUMBER(10,2)

           

           

          可以通過下面的語句查詢重復(fù)的記錄:

          SQL> select * from employee;

           

              EMP_ID EMP_NAME                                  SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   1 sunshine                                      10000

                   2 semon                                         20000

                   2 semon                                         20000

                   3 xyz                                           30000

                   2 semon                                         20000

           


          SQL>
          select distinct * from employee;

              EMP_ID EMP_NAME                                     SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   2 semon                                         20000

                   3 xyz                                             30000

          SQL>  select * from employee group by emp_id,emp_name,salary having count (*)>1

              EMP_ID EMP_NAME                                     SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   2 semon                                          20000


          SQL>
          select * from employee e1

          where rowid in (select max(rowid) from employe e2
           
          where e1.emp_id=e2.emp_id and

            e1.emp_name=e2.emp_name and e1.salary=e2.salary);

           

              EMP_ID EMP_NAME                                     SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   3 xyz                                             30000

                   2 semon                                         20000

           

           

          2. 刪除的幾種方法:

           

          1)通過建立臨時表來實現(xiàn)

          SQL>create table temp_emp as (select distinct * from employee) 

          SQL> truncate table employee; (清空employee表的數(shù)據(jù))

          SQL> insert into employee select * from temp_emp;  (再將臨時表里的內(nèi)容插回來)

           

          ( 2)通過唯一rowid實現(xiàn)刪除重復(fù)記錄.Oracle中,每一條記錄都有一個rowid,rowid在整個數(shù)據(jù)庫中是唯一的,rowid確定了每條記錄是在Oracle中的哪一個數(shù)據(jù)文件、塊、行上。在重復(fù)的記錄中,可能所有列的內(nèi)容都相同,但rowid不會相同,所以只要確定出重復(fù)記錄中那些具有最大或最小rowid的就可以了,其余全部刪除。

          SQL>delete from employee e2 where rowid not in (
                 
          select max(e1.rowid) from employee e1 where

                  e1.emp_id=e2.emp_id and e1.emp_name=e2.emp_name and e1.salary=e2.salary);--這里用min(rowid)也可以。

           

          SQL>delete from employee e2 where rowid <(
                 
          select max(e1.rowid) from employee e1 where
                  e1.emp_id
          =e2.emp_id and e1.emp_name=e2.emp_name and

                            e1.salary=e2.salary);

           

          3)也是通過rowid,但效率更高。

          SQL>delete from employee where rowid not in (
                 
          select max(t1.rowid) from employee t1 group by

                   t1.emp_id,t1.emp_name,t1.salary);--這里用min(rowid)也可以。

           

              EMP_ID EMP_NAME                                     SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   3 xyz                                             30000

                   2 semon                                         20000

          posted on 2005-08-25 02:40 konhon 優(yōu)華 閱讀(753) 評論(0)  編輯  收藏 所屬分類: MS SQL Server
          主站蜘蛛池模板: 黎川县| 四子王旗| 丰城市| 迁西县| 汝州市| 望奎县| 海晏县| 瑞昌市| 兰州市| 乌拉特中旗| 宁城县| 政和县| 满城县| 太原市| 额济纳旗| 灵丘县| 彰化县| 张掖市| 墨玉县| 青河县| 称多县| 寿光市| 天长市| 辽阳市| 大厂| 凭祥市| 淄博市| 南和县| 田东县| 通河县| 永清县| 鄄城县| 莫力| 肇东市| 西平县| 大理市| 康乐县| 福清市| 定南县| 和平县| 盐津县|