High fan speed in lake1 after powering it back on #39
After removing the burnt voltage regulator and replacing it with a new one (see #22), the node boots and seems to be stable, but it draws more power than the other nodes (240 W rather than around 120 W) and the fans spin at 20,000 RPM (the maximum). The temperatures read from all the sensors on the board seem fine, so the extra power consumption could simply be caused by the high fan speed.
I will keep monitoring the node closely in case of any temperature or power surge.
I'm trying to find a way to bring the fans back to normal speed, but I haven't succeeded yet.
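A minimal sketch of how the fan monitoring mentioned above could be automated. It assumes that `ipmitool sdr type Fan` prints pipe-separated lines whose last column is a reading such as `2500 RPM`. The helper names (`parse_fan_rpm`, `fans_above`, `read_fans`) and any threshold passed in are hypothetical and not part of the original report.

```python
import subprocess


def parse_fan_rpm(sdr_text):
    """Map sensor name -> RPM from pipe-separated SDR text.

    Assumption: the sensor name is the first column and the reading
    (e.g. "2500 RPM") is the last one. Other lines are skipped.
    """
    fans = {}
    for line in sdr_text.splitlines():
        fields = [field.strip() for field in line.split("|")]
        if len(fields) < 2:
            continue
        parts = fields[-1].split()
        if len(parts) == 2 and parts[1].upper() == "RPM":
            try:
                fans[fields[0]] = float(parts[0])
            except ValueError:
                continue  # reading was not numeric
    return fans


def fans_above(fans, limit_rpm):
    """Return the sorted names of fans spinning faster than limit_rpm."""
    return sorted(name for name, rpm in fans.items() if rpm > limit_rpm)


def read_fans():
    """Query the local BMC; needs ipmitool and suitable permissions."""
    result = subprocess.run(
        ["ipmitool", "sdr", "type", "Fan"],
        capture_output=True, text=True, check=True,
    )
    return parse_fan_rpm(result.stdout)
```

Calling `fans_above(read_fans(), 10000)` on the node would then list any fan above a chosen limit. The value 10000 is only an illustrative choice and should be adjusted to what the other nodes report.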
From https://www.intel.com/content/dam/support/us/en/documents/motherboards/server/sb/updating_frusdr_on_epsd_server.pdf :
More possible causes: https://www.intel.com/content/www/us/en/support/articles/000036464/server-products/server-boards.html
From the photos, it seems that the power supply was originally in the second slot, but is now in the first one:
Removing the two DIMM modules donated to eudy may also have had an effect.
The PS2 doesn't show in hut:
I did the following tests:
As I suspected, the BMC has previously seen two power supplies, in the PS1 and PS2 sockets, but it now detects only one. To change this recorded information I need to perform an FRU update.
I moved the PS2 back to owl1 and then I swapped the PS1 back to the PS2 socket.
As a side effect, owl1 can now properly read the power consumption 🤷
I opened the lake1 BMC web interface and went to the Configuration > SDR Configuration page. With the "Enable SDR Auto-configuration" setting set to "Enabled", I clicked the "Save" button and then "Parse". This made the BMC re-scan the hardware, and it now detects only PS1.
Now the fans are running at 2500 RPM and the power consumption has dropped to around 100 W. Here is the info from ipmitool:
The PS2 info is now gone.
I will reboot the node and check that this still holds when it boots. If so, this issue can be considered solved.
The fans went up to 8000 RPM as the node airflow increased, but they remain at a reasonable speed:
Fixed.