Asm Health Checker Found 1 New Failures Updated ~upd~

: Set DISK_REPAIR_TIME to an optimal value (e.g., 4h ) to prevent premature disk dropping during brief storage outages.

If you are an Oracle Database Administrator, seeing the alert in your logs or monitoring dashboard (like Enterprise Manager) can be a bit jarring. This message is the Oracle Automatic Storage Management (ASM) framework’s way of telling you that its internal diagnostic engine has detected an issue that could compromise the health of your storage layer.

When an underlying hardware or connectivity issue disrupts the storage fabric, the ASM Health Checker works in tandem with the Rebalance ( RBAL ) and Group Monitor ( GMON ) background processes. The process typically unfolds across four technical milestones: 1. Persistent Storage Timeout

Share public link

It is crucial to understand that this is an immediate fatal error. ASM is reporting that it found something abnormal, but the disk group may still be mounted and operational (often in a degraded state).

A new failure was identified during the latest ASM health check scan. Current Action Plan: incidents to identify the specific failure. Verifying disk group redundancy and member disk status. Running a manual health check via Oracle AHF/ORAchk to confirm if the issue is persistent.

You can also find the traditional text-based alert log for ASM, typically in: asm health checker found 1 new failures updated

The alert message is recorded in the ASM instance's alert log file. This log is your primary source of information for understanding the health of your ASM environment. It captures all significant events, errors, and warnings, including those from the Health Checker.

If one of the disks in a disk group is taken offline (due to a lost path, a multipath issue, or a silent I/O error) and the disk group’s redundancy level cannot be maintained, the health checker records a failure. In a NORMAL redundancy disk group, losing one failure group member often triggers the failure alert.

SELECT * FROM V$ASM_HEALTH_CHECK;

The CHECK ALL option validates both the disk group metadata and the individual files stored inside it.

If you've received a notification that the "ASM health checker found 1 new failures updated," it's essential to understand what this message means and take prompt action to resolve the issue. In this article, we'll delve into the details of ASM health checking, explore the possible causes of this error, and provide step-by-step guidance on how to troubleshoot and fix the problem.

When the checker detects a new anomaly that was not present in its previous scan, it increments a failure count and writes the message: to the ASM alert log. : Set DISK_REPAIR_TIME to an optimal value (e