13 oracle数据库坏块-逻辑坏块(模拟/修复)

Posted yhq1314

tags:

篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了13 oracle数据库坏块-逻辑坏块(模拟/修复)相关的知识,希望对你有一定的参考价值。

13 oracle数据库坏块-逻辑坏块

逻辑数据坏块的场景
1)oracle bug也可能导致逻辑坏块的产生. 特别是parallel dml. 例如:
Bug 5621677 Logical corruption with PARALLEL update
Bug 6994194 Logical corruption from UPDATE DML
Bug 15980234 ORA-1400 / logical corruption from direct path INSERT ALL(fail with
ORA-1400, in direct path / PDML) 。
2)多数情况下逻辑坏块可能都是软件问题导致,当然数据库异常也可能导致。比如掉电的情况下,
就可能导致块内数据不一致
逻辑数据坏块检测工具
RMAN备份恢忽略soft corruption
soft corruption的块不计入maxcorrupt
media recovery会忽略soft corrupt
RMAN validate命令不会在alert中记录soft corrupt的信息,但是会在v$database_block_corruption中记录
DBV可以检测soft corruption
如果不设置event10231或者类似事件,那么soft corruption的块再次访问时报ORA-1578

逻辑数据坏块的检测

exp\\expdp     该方式存在缺陷,是发现逻辑坏块后就会报错,不再继续检测.如果是索引逻辑坏块,那么exp是检测不到的
rman        同检测物理坏块一样,rman备份也具备检测逻辑坏块的功能,而且用的很多.
           通常我们在进行数据库不完全恢复时,很多时候可能需要使用隐含参数强制打开数据库,打开之后我们都建议
         使用rman进行检查,确认是否存在逻辑坏块等等,以免open后数据库出现异常
bbed        上面有介绍
sql查询       select select count(1) table_name ;
         select /*+index_ffs(xxx) */ count(1) from table_name;
analyze命令       analyze index idx_name validate structure;
         analyze table idx_name validate structure cascade online;
db_block_checking参数 在10g版本中,该参数默认是false,如果设置为true,那么可以进行逻辑坏块的检测,实际上设置以后, oracle在写入时就会进行check,避免产生逻辑坏块

逻辑坏块的分类
data block     通常我们是最大程度的将segment内的数据抢救出来,然后处理掉坏块.(如果是个别坏块,可以手工修复)
index block     对于index logical 坏块,一般来讲,我们通过rebuild就可以解决(有时或许不行,可以drop然后重建)
segment header/bitmap block ...etc   这种情况下比较复杂

模拟逻辑坏块
在很多情况下逻辑坏块都发生在索引上,所以这里我们以模拟索引逻辑坏块为例进行说明

[email protected] orcl >create table yhqtest_6 as select owner,object_id,object_name from dba_objects where rownum<300;
Table created.
[email protected] orcl >create index idx_yhqt6_id on yhqtest_6(object_id);
Index created.
[email protected] orcl >select owner,object_id from dba_objects where object_name=upper(‘idx_yhqt6_id‘);
OWNER				OBJECT_ID
------------------------------ ----------
YHQT				    88407
[email protected] orcl >alter session set events ‘immediate trace name treedump level 88407‘;
Session altered.

--查看trace结果

[[email protected] ~]$ more /u01/app/oracle/diag/rdbms/orcl/orcl/trace/orcl_ora_2354.trc
*** 2019-07-15 16:00:53.250
*** MODULE NAME:(SQL*Plus) 2019-07-15 16:00:53.250
*** ACTION NAME:() 2019-07-15 16:00:53.250
 
----- begin tree dump
leaf: 0x28000db 41943259 (0: nrow: 299 rrow: 299)
----- end tree dump
[email protected] orcl >select dbms_utility.data_block_address_file(TO_NUMBER(‘28000db‘,‘XXXXXXXX‘)) file_id,
dbms_utility.data_block_address_block(TO_NUMBER(‘28000db‘,‘XXXXXXXX‘)) block_id from dual;  2  

   FILE_ID   BLOCK_ID
---------- ----------
	10	  219

--使用bbed查看并修改

BBED> set file 10 block 219
	FILE#          	10
	BLOCK#         	219
BBED> map /v
 File: /u01/app/oracle/oradata/orcl/yhqt01.dbf (10)
 Block: 219                                   Dba:0x028000db
------------------------------------------------------------

BBED> p kdxle
struct kdxle, 32 bytes                      @100     
   struct kdxlexco, 16 bytes                @100     
      ub1 kdxcolev                          @100      0x00
      ub1 kdxcolok                          @101      0x00
      ub1 kdxcoopc                          @102      0x80
      ub1 kdxconco                          @103      0x02
      ub4 kdxcosdc                          @104      0x00000000
      sb2 kdxconro                          @108      299
      sb2 kdxcofbo                          @110      634
      sb2 kdxcofeo                          @112      4246
      sb2 kdxcoavs                          @114      3612
   sb2 kdxlespl                             @116      0
   sb2 kdxlende                             @118      0
   ub4 kdxlenxt                             @120      0x00000000
   ub4 kdxleprv                             @124      0x00000000
   ub1 kdxledsz                             @128      0x00
   ub1 kdxleflg                             @129      0x00 (NONE)

BBED> dump /v offset 108 count 32
 File: /u01/app/oracle/oradata/orcl/yhqt01.dbf (10)
 Block: 219     Offsets:  108 to  139  Dba:0x028000db
-------------------------------------------------------
 2b017a02 96101c0e 00000000 00000000 l +.z.............
[email protected] orcl >select to_char(299,‘xxxxxx‘) from dual;

TO_CHAR
-------
    12b
BBED> modify /x 2a offset 108
 File: /u01/app/oracle/oradata/orcl/yhqt01.dbf (10)
 Block: 219              Offsets:  108 to  139           Dba:0x028000db
------------------------------------------------------------------------
 2a017a02 96101c0e 00000000 00000000 00000000 00000000 601f0000 541f481f 

 <32 bytes per line>

BBED> sum apply
Check value for File 10, Block 219:
current = 0xf239, required = 0xf239

 使用rman进行查看

RMAN> backup validate check logical database;
File Status Marked Corrupt Empty Blocks Blocks Examined High SCN
---- ------ -------------- ------------ --------------- ----------
5    FAILED 0              81           64011           10373342  
  File Name: /home/oracle/backup/test01.tts
  Block Type Blocks Failing Blocks Processed
  ---------- -------------- ----------------
  Data       3              20628           
  Index      0              13088           
  Other      0              30203        
File Status Marked Corrupt Empty Blocks Blocks Examined High SCN
---- ------ -------------- ------------ --------------- ----------
10   FAILED 0              6190         6400            10411019  
  File Name: /u01/app/oracle/oradata/orcl/yhqt01.dbf
  Block Type Blocks Failing Blocks Processed
  ---------- -------------- ----------------
  Data       2              53              
  Index      1              2               
  Other      1              155  

用视图查询

[email protected] orcl >select * from v$database_block_corruption;

     FILE#     BLOCK#	  BLOCKS CORRUPTION_CHANGE# CORRUPTIO
---------- ---------- ---------- ------------------ ---------
	10	  135	       1		  0 FRACTURED
	10	  160	       1		  0 ALL ZERO
	 5	 2199	       1	    9533291 CORRUPT
	 5	 2207	       1	    9579846 CORRUPT
	 5	 2231	       1	    9620379 CORRUPT
	10	  151	       1	   10410172 CORRUPT
	10	  219	       1	   10411020 CORRUPT
CORRUPTION_CHANGE#值= 10410172,我们可以发现,corruption_change#值是大于0,那么表示是逻辑坏块

--查询坏块所在的对象

set autot off
set lines 150
col segment_name for a15
col owner for a20
SELECT tablespace_name, segment_type, owner, segment_name
FROM dba_extents
WHERE file_id = 10
and 219 between block_id AND block_id + blocks - 1; 
[email protected] orcl >SELECT tablespace_name, segment_type, owner, segment_name
FROM dba_extents
WHERE file_id = 10
and 219 between block_id AND block_id + blocks - 1;  2    3    4  

TABLESPACE_NAME            SEGMENT_TYPE      OWNER            SEGMENT_NAME
------------------------------ ------------------ -------------------- ---------------
YHQT                   INDEX          YHQT               IDX_YHQT6_ID

使用dbms_repair检测

创建repair table
[email protected] orcl >conn / as sysdba
[email protected] orcl >BEGIN
   DBMS_REPAIR.ADMIN_TABLES (
   TABLE_NAME => REPAIR_TABLE,
   TABLE_TYPE => dbms_repair.repair_table,
   ACTION => dbms_repair.create_action,
   TABLESPACE => YHQT);
END;
/
检查对象上是否存在坏块
[email protected] orcl >set serveroutput on
DECLARE num_corrupt INT;
BEGIN
num_corrupt := 0;
DBMS_REPAIR.CHECK_OBJECT (
SCHEMA_NAME  => YHQT,
OBJECT_NAME  => YHQTEST_6,
REPAIR_TABLE_NAME  => REPAIR_TABLE,
corrupt_count  => num_corrupt);
DBMS_OUTPUT.PUT_LINE(number corrupt:  || TO_CHAR (num_corrupt));
END;
/
通过check后,就可以查询repair table了
S[email protected] orcl >col CORRUPT_DESCRIPTION for a100
[email protected] orcl >select OBJECT_ID,RELATIVE_FILE_ID,BLOCK_ID,CORRUPT_TYPE,OBJECT_NAME,CORRUPT_DESCRIPTION FROM repair_table;

---修复逻辑块错误--index

[email protected] orcl >conn yhqt/***
Connected.
[email protected] orcl >alter index IDX_YHQT6_ID rebuild;
Index altered.
[email protected] orcl >select * from v$database_block_corruption;
     FILE#     BLOCK#      BLOCKS CORRUPTION_CHANGE# CORRUPTIO
---------- ---------- ---------- ------------------ ---------
    10      135           1          0 FRACTURED
    10      160           1          0 ALL ZERO
     5     2199           1        9533291 CORRUPT
     5     2207           1        9579846 CORRUPT
     5     2231           1        9620379 CORRUPT
    10      151           1       10410172 CORRUPT
    10      219           1       10411020 CORRUPT
7 rows selected.

[email protected] orcl >analyze table yhqtest_6 validate structure cascade online;
analyze table yhqtest_6 validate structure cascade online
*
ERROR at line 1:
ORA-01499: table/index cross reference failure - see trace file
[email protected] orcl >drop index IDX_YHQT6_ID;
Index dropped.
[email protected] orcl >create index idx_yhqt6_id on yhqtest_6(object_id);
Index created.
[email protected] orcl >analyze table yhqtest_6 validate structure cascade online;
Table analyzed.

 使用expdp备份

[[email protected] ~]$ expdp system/**** DIRECTORY=dump_file_dir_test compression=all schemas=YHQT dumpfile=schemas_YHQT_%U.DMP parallel=2 logfile=schemas_YHQT_20190715.log

技术图片

修复上一篇物理坏块测试中的

2.The block is Fractured/Incomplete - header and footer of the block do not match

分析alert日志

Mon Jul 15 11:12:19 2019
Hex dump of (file 10, block 143) in trace file /u01/app/oracle/diag/rdbms/orcl/orcl/trace/orcl_ora_2354.trc
Corrupt block relative dba: 0x0280008f (file 10, block 143)
Fractured block found during multiblock buffer read
Data in bad block:
 type: 6 format: 2 rdba: 0x0280008f
 last change scn: 0x0000.009e53f5 seq: 0x1 flg: 0x06
 spare1: 0x0 spare2: 0x0 spare3: 0x0
 consistency value in tail: 0x54f50601 ===>>>tail=2 lower bytes of SCN Base+type+seq=53f5+06+01 而改变之后的值是54f50601
 check value in block header: 0x46ca
 computed block checksum: 0x0
Reading datafile /u01/app/oracle/oradata/orcl/yhqt01.dbf for corruption at rdba: 0x0280008f (file 10, block 143)
Reread (file 10, block 143) found same corrupt data (no logical check)
Mon Jul 15 11:12:19 2019
Corrupt Block Found
         TSN = 10, TSNAME = YHQT
         RFN = 10, BLK = 143, RDBA = 41943183
         OBJN = 88385, OBJD = 88385, OBJECT = YHQTEST_2, SUBOBJECT = 
         SEGMENT OWNER = YHQT, SEGMENT TYPE = Table Segment
Corrupt Block Found
         TSN = 10, TSNAME = YHQT
         RFN = 10, BLK = 143, RDBA = 41943183
         OBJN = 88385, OBJD = 88385, OBJECT = YHQTEST_2, SUBOBJECT = 
         SEGMENT OWNER = YHQT, SEGMENT TYPE = Table Segment
Errors in file /u01/app/oracle/diag/rdbms/orcl/orcl/trace/orcl_ora_2354.trc  (incident=67366):
ORA-01578: ORACLE data block corrupted (file # 10, block # 143)
ORA-01110: data file 10: /u01/app/oracle/oradata/orcl/yhqt01.dbf
Incident details in: /u01/app/oracle/diag/rdbms/orcl/orcl/incident/incdir_67366/orcl_ora_2354_i67366.trc
Mon Jul 15 11:12:21 2019
Sweep [inc][67366]: completed
Errors in file /u01/app/oracle/diag/rdbms/orcl/orcl/trace/orcl_ora_2354.trc  (incident=67367):
ORA-01578: ORACLE data block corrupted (file # 10, block # 143)
ORA-01110: data file 10: /u01/app/oracle/oradata/orcl/yhqt01.dbf
Incident details in: /u01/app/oracle/diag/rdbms/orcl/orcl/incident/incdir_67367/orcl_ora_2354_i67367.trc
Hex dump of (file 10, block 135) in trace file /u01/app/oracle/diag/rdbms/orcl/orcl/incident/incdir_67366/orcl_m000_2539_i67366_a.trc

技术图片

用bbed修改

BBED> p tailchk
ub4 tailchk                                 @8188     0x54f50601
BBED> modify /x 0106f553 offset 8188
Warning: contents of previous BIFILE will be lost. Proceed? (Y/N) y
 File: /u01/app/oracle/oradata/orcl/yhqt01.dbf (10)
 Block: 143              Offsets: 8188 to 8191           Dba:0x0280008f
------------------------------------------------------------------------
 0106f553 
 <32 bytes per line>
BBED> sum apply
Check value for File 10, Block 143:
current = 0x41ca, required = 0x41ca
[email protected] orcl >alter system flush buffer_cache;
[email protected] orcl >set linesize 999
[email protected] orcl >select * from yhqtest_2;

    ID NAME
---------- ---------===查询正常,数据也还在
     1 yhq

以上是关于13 oracle数据库坏块-逻辑坏块(模拟/修复)的主要内容,如果未能解决你的问题,请参考以下文章

怎么修复数据库 修复数据库坏块方法

实战篇:Oracle 数据坏块的 N 种修复方式

对Oracle数据库坏块的理解

使用BBED修复Oracle坏块恢复方法

12 oracle 数据库坏块--物理坏块-ORA-01578/ORA-01110

实战篇:Oracle 数据坏块的 N 种修复方式