(Closed)Part of the supercomputer system failed because of a outage(Aug 23)

August 23, 2017, at 7:00 p.m. all system get back to normal.
 
----
 
August 23, a.m. 11:30 add

Some nodes of SX-ACE and IXS(Interconnect of SX-ACE) went down because of a power failure.
Therefore, all multi-nodes jobs do not run on SX-ACE. Now, We are rebooting all nodes of SX-ACE for restoration.
then We will pause running jobs and restart it after the reboot by the checkpoint-restart function of SX-ACE.

The following system will not be stopped.
- VCC, HCC
- Frontend server
- Login server
- Storage service
- Application software(License server)
- HPCI Shibboleth server
- WEB system(利用者ポータルサイト,利用者管理WEBシステム,WEB利用申請システム)

 
----
 
August 23, a.m. 8:50
August 23, 2017, at 5:50 p.m. some nodes went down because of a power failure.
 

We'll get contact with persons qualified.
 

We apologize for the inconvenience this has caused you.




Posted : August 23,2017