konhon

          忘掉過去,展望未來。找回自我,超越自我。
          逃避不一定躲的過, 面對不一定最難過, 孤單不一定不快樂, 得到不一定能長久, 失去不一定不再擁有, 可能因為某個理由而傷心難過, 但我卻能找個理由讓自己快樂.

          Google

          BlogJava 首頁 新隨筆 聯系 聚合 管理
            203 Posts :: 0 Stories :: 61 Comments :: 0 Trackbacks
           

          學習sql有一段時間了,發(fā)現在我建了一個用來測試的表(沒有建索引)中出現了許多的重復記錄。后來總結了一些刪除重復記錄的方法,在Oracle中,可以通過唯一rowid實現刪除重復記錄;還可以建臨時表來實現...這個只提到其中的幾種簡單實用的方法,希望可以和大家分享(以表employee為例)。

          SQL> desc employee

           Name                                      Null?    Type
           ----------------------------------------- -------- ------------------

          emp_id                                                NUMBER(10)
          emp_name                                           VARCHAR2(20)

          salary                                                  NUMBER(10,2)

           

           

          可以通過下面的語句查詢重復的記錄:

          SQL> select * from employee;

           

              EMP_ID EMP_NAME                                  SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   1 sunshine                                      10000

                   2 semon                                         20000

                   2 semon                                         20000

                   3 xyz                                           30000

                   2 semon                                         20000

           


          SQL>
          select distinct * from employee;

              EMP_ID EMP_NAME                                     SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   2 semon                                         20000

                   3 xyz                                             30000

          SQL>  select * from employee group by emp_id,emp_name,salary having count (*)>1

              EMP_ID EMP_NAME                                     SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   2 semon                                          20000


          SQL>
          select * from employee e1

          where rowid in (select max(rowid) from employe e2
           
          where e1.emp_id=e2.emp_id and

            e1.emp_name=e2.emp_name and e1.salary=e2.salary);

           

              EMP_ID EMP_NAME                                     SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   3 xyz                                             30000

                   2 semon                                         20000

           

           

          2. 刪除的幾種方法:

           

          1)通過建立臨時表來實現

          SQL>create table temp_emp as (select distinct * from employee) 

          SQL> truncate table employee; (清空employee表的數據)

          SQL> insert into employee select * from temp_emp;  (再將臨時表里的內容插回來)

           

          ( 2)通過唯一rowid實現刪除重復記錄.Oracle中,每一條記錄都有一個rowidrowid在整個數據庫中是唯一的,rowid確定了每條記錄是在Oracle中的哪一個數據文件、塊、行上。在重復的記錄中,可能所有列的內容都相同,但rowid不會相同,所以只要確定出重復記錄中那些具有最大或最小rowid的就可以了,其余全部刪除。

          SQL>delete from employee e2 where rowid not in (
                 
          select max(e1.rowid) from employee e1 where

                  e1.emp_id=e2.emp_id and e1.emp_name=e2.emp_name and e1.salary=e2.salary);--這里用min(rowid)也可以。

           

          SQL>delete from employee e2 where rowid <(
                 
          select max(e1.rowid) from employee e1 where
                  e1.emp_id
          =e2.emp_id and e1.emp_name=e2.emp_name and

                            e1.salary=e2.salary);

           

          3)也是通過rowid,但效率更高。

          SQL>delete from employee where rowid not in (
                 
          select max(t1.rowid) from employee t1 group by

                   t1.emp_id,t1.emp_name,t1.salary);--這里用min(rowid)也可以。

           

              EMP_ID EMP_NAME                                     SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   3 xyz                                             30000

                   2 semon                                         20000

           

           

           

          SQL> desc employee

           Name                                      Null?    Type
           ----------------------------------------- -------- ------------------

          emp_id                                                NUMBER(10)
          emp_name                                           VARCHAR2(20)

          salary                                                  NUMBER(10,2)

           

           

          可以通過下面的語句查詢重復的記錄:

          SQL> select * from employee;

           

              EMP_ID EMP_NAME                                  SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   1 sunshine                                      10000

                   2 semon                                         20000

                   2 semon                                         20000

                   3 xyz                                           30000

                   2 semon                                         20000

           


          SQL>
          select distinct * from employee;

              EMP_ID EMP_NAME                                     SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   2 semon                                         20000

                   3 xyz                                             30000

          SQL>  select * from employee group by emp_id,emp_name,salary having count (*)>1

              EMP_ID EMP_NAME                                     SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   2 semon                                          20000


          SQL>
          select * from employee e1

          where rowid in (select max(rowid) from employe e2
           
          where e1.emp_id=e2.emp_id and

            e1.emp_name=e2.emp_name and e1.salary=e2.salary);

           

              EMP_ID EMP_NAME                                     SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   3 xyz                                             30000

                   2 semon                                         20000

           

           

          2. 刪除的幾種方法:

           

          1)通過建立臨時表來實現

          SQL>create table temp_emp as (select distinct * from employee) 

          SQL> truncate table employee; (清空employee表的數據)

          SQL> insert into employee select * from temp_emp;  (再將臨時表里的內容插回來)

           

          ( 2)通過唯一rowid實現刪除重復記錄.Oracle中,每一條記錄都有一個rowidrowid在整個數據庫中是唯一的,rowid確定了每條記錄是在Oracle中的哪一個數據文件、塊、行上。在重復的記錄中,可能所有列的內容都相同,但rowid不會相同,所以只要確定出重復記錄中那些具有最大或最小rowid的就可以了,其余全部刪除。

          SQL>delete from employee e2 where rowid not in (
                 
          select max(e1.rowid) from employee e1 where

                  e1.emp_id=e2.emp_id and e1.emp_name=e2.emp_name and e1.salary=e2.salary);--這里用min(rowid)也可以。

           

          SQL>delete from employee e2 where rowid <(
                 
          select max(e1.rowid) from employee e1 where
                  e1.emp_id
          =e2.emp_id and e1.emp_name=e2.emp_name and

                            e1.salary=e2.salary);

           

          3)也是通過rowid,但效率更高。

          SQL>delete from employee where rowid not in (
                 
          select max(t1.rowid) from employee t1 group by

                   t1.emp_id,t1.emp_name,t1.salary);--這里用min(rowid)也可以。

           

              EMP_ID EMP_NAME                                     SALARY

          ---------- ---------------------------------------- ----------

                   1 sunshine                                      10000

                   3 xyz                                             30000

                   2 semon                                         20000

          posted on 2005-08-25 02:40 konhon 優(yōu)華 閱讀(748) 評論(0)  編輯  收藏 所屬分類: MS SQL Server
          主站蜘蛛池模板: 兴义市| 连平县| 海伦市| 鄂尔多斯市| 剑川县| 宣城市| 错那县| 太原市| 平舆县| 弥渡县| 泗阳县| 普格县| 大港区| 原平市| 左云县| 武汉市| 宁陵县| 商丘市| 海林市| 辛集市| 湟源县| 兴文县| 如东县| 刚察县| 卢氏县| 鸡泽县| 兴宁市| 翼城县| 武胜县| 中超| 永仁县| 富阳市| 清镇市| 行唐县| 怀远县| 光山县| 开封市| 大厂| 三河市| 炎陵县| 深州市|