By Owen Allen on Jul 25, 2013
Some users of Ops Center 12.1.4 are seeing an issue where, after upgrading or doing a new install, their assets are getting dozens of warning incidents that say "Management access point failed authentication during login to asset." Eventually, the severity increases as the warnings pile up, and communication with the asset can be disrupted.
You can prevent this issue, or halt it if you've already hit it, by logging in to the Enterprise Controller and creating a /var/tmp/no_os_selfhealing file if it doesn't exist. For example:
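A minimal sketch of that step, assuming a shell session on the Enterprise Controller with write access to /var/tmp. The `MARKER` variable and the existence check are illustrative additions; only the file path comes from the description above.

```shell
# Path of the file named above; MARKER is just a local convenience variable.
MARKER=/var/tmp/no_os_selfhealing

# Create the file only if it is missing.
[ -e "$MARKER" ] || touch "$MARKER"

# Confirm the file is in place.
ls -l "$MARKER"
```

A plain `touch` on the path would also work, since `touch` creates the file when it is absent and otherwise only updates its timestamp.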
You can use this fix before you do the upgrade or installation, to prevent the issue from occurring at all.
If you have assets that are already having communication trouble because of the issue, then once you've added the file as described above, you can solve the issue by deleting the Access Point for the Service Processor, and then rediscovering just the hardware. If you have a Proxy Controller or Enterprise Controller that's been affected, you should restart it after applying this fix.
Edit: You can also fix the issue by downloading the latest version of the OCDoctor (4.24) and running it with the --troubleshoot option. Take a look at this bulletin for more info.
Edit 2: Updated to use a less destructive fix for affected assets.