Showing posts with label reset memory errors ucs. Show all posts
Showing posts with label reset memory errors ucs. Show all posts

Sunday 5 June 2016

How to reset memory or DIMM errors of CISCO UCS Blade servers?

Or, Reset ECC memory error on CISCO UCS blade server

In my case: I want to reset memory or DIMM errors on CISCO UCS Blade server number 8, chassis number 1.

Steps:
1. Access your CISCO UCS Management IP using putty
2. Run below command in the sequence
------------------------------------------------------------------------------------------------------------------------
ca-1-A# scope server 1/8 (here 1 is Chassis number and 8 is blade server number)
ca-1-A /chassis/server # reset-all-memory-errors
ca-1-A /chassis/server* # commit
------------------------------------------------------------------------------------------------------------------------

You are done now… cheers...

Thursday 18 February 2016

Explained: EccSinglebitErrors and EccMultibitErrors in CISCO UCS Blade Servers

However there are lots of architectural differences in memory errors EccSinglebitErrors and EccMultibitErrors, the most logical difference is as below:

EccSinglebitErrors: These errors are also known as correctable errors which means if you will swap the  memory encountered with this error with another healthy memory in the same blade server, the erroneous memory will become healthy. But if the error counts are huge in number, then it may not work sometime by swapping it.
In this scenario, you have to replace the memory module with new one.

EccMultibitErrors: These errors are also known as non-correctable or in-correctable  errors which means there is no way to solve this error at your end. Even a Single Multibit error is indication of it's hardware fault and it will be resolved only after replacement of DIMM(as per my working experience on such issues with CISCO Tech support).

It looks like below: