Difference between revisions of "Main Page"

From SciNet Users Documentation
Jump to: navigation, search
Line 23: Line 23:
  
 
<!-- Current Messages: -->
 
<!-- Current Messages: -->
<b> September 1, 2020, 9:27 AM EST: </b> The Niagara cluster has moved to a new default software stack, NiaEnv/2019b.  If your job scripts used the previous default software stack before (NiaEnv/2018a), please put the command "module load NiaEnv/2018a" before other module commands in those scripts, to ensure they will continue to work, or try the new stack (recommended).
 
  
<b> August 24, 2020, 7:37 PM EST: </b> Connectivity is back to normal
+
<b> September 1, 2020, 2:15 PM EST: </b> A short power glitch caused a number of Niagara compute nodes to lose power; jobs running on them would have failed. Please check your jobs and resubmit.
  
<b> August 24, 2020, 6:35 PM EST: </b> We have partial connectivity back, but are still investigating.
+
<b> September 1, 2020, 9:27 AM EST: </b> The Niagara cluster has moved to a new default software stack, NiaEnv/2019b.  If your job scripts used the previous default software stack before (NiaEnv/2018a), please put the command "module load NiaEnv/2018a" before other module commands in those scripts, to ensure they will continue to work, or try the new stack (recommended).
 
 
<b> August 24, 2020, 3:15 PM EST: </b> There are issues connecting to the data centre. We're investigating.
 
 
 
<b> August 21, 2020, 6:00 PM EST: </b> The pump has been repaired, cooling is restored, systems are up.  <br/>Scratch purging is postponed until the evening of Friday Aug 28th, 2020.
 
 
 
<b>August 19, 2020, 4:40 PM EST:</b> Update: The current estimate is to have the cooling restored on Friday and we hope to have the systems available for users on Saturday August 22, 2020.
 
 
 
<b>August 17, 2020, 4:00 PM EST:</b> Unfortunately after taking the pump apart it was determined there was a more serious failure of the main drive shaft, not just the seal. As a new one will need to be sourced or fabricated we're estimating that it will take at least a few more days to get the part and repairs done to restore cooling. Sorry for the inconvenience. 
 
 
 
<b>August 15, 2020, 1:00 PM EST:</b> Due to parts availablity to repair the failed pump and cooling system it is unlikely that systems will be able to be restored until Monday afternoon at the earliest.
 
 
 
<b>August 15, 2020, 00:04 AM EST:</b>  A primary pump seal in the cooling infrastructure has blown and parts availability will not be able be determined until tomorrow. All systems are shut down as there is no cooling.  If parts are available, systems may be back at the earliest late tomorrow. Check here for updates. 
 
 
 
<b>August 14, 2020, 21:04 AM EST:</b> Tomorrow's /scratch purge has been postponed.
 
 
 
<b>August 14, 2020, 21:00 AM EST:</b> Staff at the datacenter. Looks like one of the pumps has a seal that is leaking badly.
 
 
 
<b>August 14, 2020, 20:37 AM EST:</b> We seem to be undergoing a thermal shutdown at the datacenter.
 
 
 
<b>August 14, 2020, 20:20 AM EST:</b> Network problems to niagara/mist. We are investigating.
 
 
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
{|style="border-spacing: 10px;width: 100%"
 
{|style="border-spacing: 10px;width: 100%"

Revision as of 18:44, 1 September 2020

System Status

Niagara HPSS Mist Teach
Jupyter Hub Scheduler File system Burst Buffer
Login Nodes External Network Globus


September 1, 2020, 2:15 PM EST: A short power glitch caused a number of Niagara compute nodes to lose power; jobs running on them would have failed. Please check your jobs and resubmit.

September 1, 2020, 9:27 AM EST: The Niagara cluster has moved to a new default software stack, NiaEnv/2019b. If your job scripts used the previous default software stack before (NiaEnv/2018a), please put the command "module load NiaEnv/2018a" before other module commands in those scripts, to ensure they will continue to work, or try the new stack (recommended).

QuickStart Guides

Tutorials, Manuals, etc.