The optix OSN3500 device loses power due to a power outage in the server room, and the master, crossover, and business boards report a series of alarms after the power is restored to the OSN3500, resulting in the service being unavailable.
Host version: 5.21.20.55, single-support master, dual-support crossover, 9-slot crossover board is primary at the time of failure.
7-slot EGS2 Parameters: 0X01 0X00 0X06 0XFF 0XFF
13-slot EFS0 Parameters: 0X01 0X00 0X06 0XFF 0XFF
18-slot GSCC Parameters: 0X02 0XFF 0XFF 0 XFF
9-slot SXCSA Parameters: 0X02 0X00 0X04 0XFF 0XFF
CHIP_FAIL:
9-slot SXCSA Parameters: 0X00 0X00 0X00 0X01 0X00
2-slot PQ1
OOL
9-slot SXCSA Parameters: 03 00 01 ff ff
10-slot SXCSA Parameters: 01 00 01 ff ff
Temp_over
9-slot SXCSA Parameters: 01 00 01 01 ff
HSC _UNAVAIL
9-slot SXCSA Parameters: 03 01 09 ff ff
Bus_err
10-slot SXCSA Parameters: 0d 01 03 01 ff
Syn_bad
10-slot SXCSA Parameters: 08 01 ff ff ff
1、Site test voltage -54V, belongs to normal range.
2. Synchronize and check the alarms again, AUX does not have any alarms, combined with the normal status of the single board indicator at the site, if AUX is abnormal the single board can not start.
3, the network element reported more alarms, using the command line query single board physical board and logical board status is normal, the site feedback board indicator is also normal, taking into account the business is fully blocked, so the main control and cross board failure is the most likely. By analyzing the HARD_BAD alarm of the single master control, the parameter positioning is the 2-slot PQ1 abnormality, and the master control problem is unlikely. Continuing to analyze, it is found that there are more alarms on the 9-slot (primary) cross-board. Attempts to reverse the network management to reset the cross-board failed.
4, network management feedback 10-slot cross board active to master state, the number of alarms and parameters did not change, network management hard reset 9-slot, the number of alarms and parameters continue to be unchanged.
5, network management query cross-board temperature, command behavior (:cfg-get-bdtemp:9), the temperature is 70 degrees, has exceeded the temperature threshold, so reported temp over normal, the scene to verify that the air conditioning of the room did not work after the power outage, the temperature of the room is high. Therefore, it is suspected that the 9-slot single board is working abnormally, and the temperature is related.
5, it is recommended to pull out 9 slots on site to observe, while coordinating spare parts. On-site feedback after pulling out 9 slots and waiting for a few minutes, all the alarms gradually disappeared, and verified that the business also resumed.
6, in order to prepare for the positioning of the 9-slot cross board anomaly is caused by the temperature (before the single board continues to report temp over), the single board re-inserted into the 9-slot, observe the business continues to be normal, the query cross the temperature is 10 degrees lower than before.
Positioning cleaning fan dust net, control the temperature and humidity of the computer room.
The temperature and humidity requirements for normal operation of OptiX OSN equipment are: (The measurement points for temperature and humidity are the values measured 1.5m above the floor and 0.4m in front of the rack when there is no protective plate in front of or behind the rack.)
Long-term operation temperature : 0℃~45℃
Short-term operation temperature (Short-term operation means no more than 96 hours of continuous operation and no more than 15 days per year cumulatively.) : -5℃~55℃
Long-term operation humidity 5%~85%
Short-term operation humidity 5%~95%
At the same time, in order to enhance the reliability of product application, the server room should be equipped with special precision air conditioning for the server room, which controls the temperature and humidity in the following range:
Air conditioning control temperature: 15-30℃.
Air conditioning control humidity: 40%-75%.
Note: Air conditioners are prohibited to be installed above the equipment, air conditioner vents should be avoided to blow directly to the equipment, air conditioners should be installed as far as possible away from the window to avoid the moisture through the window through the air conditioner blowing to the equipment.
END


Chinese
English





