ALTER DISKGROUP data SET ATTRIBUTE 'disk_repair_time' = '6h'; Schedule weekly manual checks using:
Introduction If you are an Oracle Database Administrator (DBA) managing an Oracle Real Application Clusters (RAC) environment, you have likely encountered a cryptic but critical message in your alert logs or monitoring console: "ASM Health Checker found 1 new failures updated." asm health checker found 1 new failures updated
SELECT group_number, disk_number, name, path, state, mode_status, failgroup FROM v$asm_disk WHERE state != 'NORMAL'; Disks in FORCING state (attempting recovery) or OFFLINE state are the culprits. Is your disk group about to crash
At first glance, this message can induce panic. Does it mean data loss? Is your disk group about to crash? Will your production database go offline? Fortunately, in most cases, this alert is a proactive warning from Oracle’s Automatic Storage Management (ASM) diagnostics framework. However, ignoring it can lead to severe performance degradation or service interruption. However, ignoring it can lead to severe performance
This comprehensive guide will dissect every aspect of this error message. We will explore what the ASM Health Checker is, why it triggers this alert, how to diagnose the specific failure, and step-by-step remediation strategies. The ASM Health Checker is an internal diagnostic tool embedded within Oracle ASM (introduced in Oracle 11g Release 2 and enhanced in 12c, 18c, 19c, and 21c). Its primary function is to periodically validate the integrity, accessibility, and configuration of ASM disk groups and their underlying disks.
SELECT name, state, type, total_mb, free_mb, offline_disks FROM v$asm_diskgroup; If offline_disks > 0, you have confirmed physical disk failures.
WARNING: ASM Health Checker found 1 new failures updated Failure details: Disk group: DATA Disk: DATA_0003 (path: /dev/mapper/data_disk3) Failure type: OFFLINE Timestamp: 2025-01-15T14:23:10 Connect to your ASM instance using sqlplus / as sysasm and run the following diagnostic queries: