Difference between revisions of "Main Page"

From SciNet Users Documentation
Jump to: navigation, search
(System Status)
(System Status)
 
(183 intermediate revisions by 9 users not shown)
Line 9: Line 9:
 
|{{Up|Niagara|Niagara_Quickstart}}
 
|{{Up|Niagara|Niagara_Quickstart}}
 
|{{Up|HPSS|HPSS}}
 
|{{Up|HPSS|HPSS}}
|{{Up|SOSCIP GPU|SOSCIP_GPU}}
+
|{{Up|Mist|Mist}}
|{{Down|P8|P8}}
+
|{{Up|Teach|Teach}}
 
|-
 
|-
|{{Up|Teach|Teach}}
+
|{{Up|Jupyter Hub|Jupyter_Hub}}
|{{Down|Jupyter Hub|Jupyter_Hub}}
 
 
|{{Up|Scheduler|Niagara_Quickstart#Submitting_jobs}}
 
|{{Up|Scheduler|Niagara_Quickstart#Submitting_jobs}}
 
|{{Up|File system|Niagara_Quickstart#Storage_and_quotas}}
 
|{{Up|File system|Niagara_Quickstart#Storage_and_quotas}}
 +
|{{Up|Burst Buffer|Burst_Buffer}}
 
|-
 
|-
 +
|{{Up|Login Nodes|Niagara_Quickstart#Logging_in}}
 
|{{Up|External Network|Niagara_Quickstart#Logging_in}}  
 
|{{Up|External Network|Niagara_Quickstart#Logging_in}}  
 
|{{Up|Globus|Globus}}
 
|{{Up|Globus|Globus}}
 
|}
 
|}
<!-- Current Messages: --><p>
 
<b>Sat, Nov 2 2019, 1:30 PM (update):</b>  Chiller has been fixed, all systems are no operational.   
 
</p>
 
<b>Fri, Nov 1 2019, 4:30 PM (update):</b>  We are operating in free cooling so have brought up about 1/2 of the Niagara compute nodes to reduce the cooling load.  Access, storage, and other systems should now be available. 
 
  
<b>Fri, Nov 1 2019, 12:05 PM (update):</b> A power module in the chiller has failed and needs to be replaced.   We should be able to operate in free cooling if the temperature stays cold enough, but we may not be able to run all systems. No eta yet on when users will be able to log back in.  
+
<!-- Current Messages: -->
 +
<b> October 9, 2020, 12:57 PM: </b> A short power glitch caused many of the Niagara compute nodes to lose power; jobs running on them would have failed. Please check your jobs and resubmit.
 +
 
 +
<b> October 8, 2020, 9:50 PM: </b> Jupyterhub service is back up.
 +
 
 +
<b> October 8, 2020, 5:40 PM: </b> Jupyterhub service is down. We are investigating.
 +
 
 +
<b> September 28, 2020, 11:00 AM EST: </b> A short power glitch caused many of the Niagara compute nodes to lose power; jobs running on them would have failed. Please check your jobs and resubmit.
  
<b>Fri, Nov 1 2019, 9:15 AM (update):</b> There was a automated shutdown because of rising temperatures, causing all systems to go down. We are investigating, check here for updates.
+
<b> September 1, 2020, 2:15 PM EST: </b> A short power glitch caused about half of the Niagara compute nodes to lose power; jobs running on them would have failed. Please check your jobs and resubmit.
  
<p><b>Fri, Nov 1 2019, 8:16 AM:</b> Unexpected data centre issue: Check here for updates.
+
<b> September 1, 2020, 9:27 AM EST: </b> The Niagara cluster has moved to a new default software stack, NiaEnv/2019b.  If your job scripts used the previous default software stack before (NiaEnv/2018a), please put the command "module load NiaEnv/2018a" before other module commands in those scripts, to ensure they will continue to work, or try the new stack (recommended).
</p>
 
<p>
 
<b>Announcement:</b>
 
The SciNet datacentre will undergo a maintenance shutdown on
 
Friday November 15th 2019, from 7 am to 11 pm (EST), with no access
 
to any of the SciNet systems (Niagara, P8, SGC, HPSS, Teach cluster,
 
or the filesystems) during that time.  
 
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
{|style="border-spacing: 10px;width: 100%"
 
{|style="border-spacing: 10px;width: 100%"
Line 45: Line 42:
 
* [[HPSS | HPSS archival storage]]
 
* [[HPSS | HPSS archival storage]]
 
* [[SOSCIP_GPU | SOSCIP GPU cluster]]
 
* [[SOSCIP_GPU | SOSCIP GPU cluster]]
* [[P8|Experimental Power 8 GPU cluster]]
+
* [[Mist| Mist Power 9 GPU cluster]]
 
* [[Teach|Teach cluster]]
 
* [[Teach|Teach cluster]]
 
* [[FAQ | FAQ (frequently asked questions)]]
 
* [[FAQ | FAQ (frequently asked questions)]]
Line 52: Line 49:
  
 
== Tutorials, Manuals, etc. ==
 
== Tutorials, Manuals, etc. ==
* [https://courses.scinet.utoronto.ca SciNet education material]
+
* [https://support.scinet.utoronto.ca/education/browse.php SciNet education material]
 
* [https://www.youtube.com/c/SciNetHPCattheUniversityofToronto SciNet's YouTube channel]
 
* [https://www.youtube.com/c/SciNetHPCattheUniversityofToronto SciNet's YouTube channel]
 
* [[Modules specific to Niagara|Software Modules specific to Niagara]]  
 
* [[Modules specific to Niagara|Software Modules specific to Niagara]]  
Line 58: Line 55:
 
* [[Burst Buffer]]
 
* [[Burst Buffer]]
 
* [[SSH Tunneling]]
 
* [[SSH Tunneling]]
 +
* [[SSH#Two-Factor_authentication|Two-Factor Authentication]]
 
* [[Visualization]]
 
* [[Visualization]]
 
* [[Running Serial Jobs on Niagara]]
 
* [[Running Serial Jobs on Niagara]]
 
* [[Jupyter Hub]]
 
* [[Jupyter Hub]]
 
|}
 
|}

Latest revision as of 17:09, 9 October 2020

System Status

Niagara HPSS Mist Teach
Jupyter Hub Scheduler File system Burst Buffer
Login Nodes External Network Globus

October 9, 2020, 12:57 PM: A short power glitch caused many of the Niagara compute nodes to lose power; jobs running on them would have failed. Please check your jobs and resubmit.

October 8, 2020, 9:50 PM: Jupyterhub service is back up.

October 8, 2020, 5:40 PM: Jupyterhub service is down. We are investigating.

September 28, 2020, 11:00 AM EST: A short power glitch caused many of the Niagara compute nodes to lose power; jobs running on them would have failed. Please check your jobs and resubmit.

September 1, 2020, 2:15 PM EST: A short power glitch caused about half of the Niagara compute nodes to lose power; jobs running on them would have failed. Please check your jobs and resubmit.

September 1, 2020, 9:27 AM EST: The Niagara cluster has moved to a new default software stack, NiaEnv/2019b. If your job scripts used the previous default software stack before (NiaEnv/2018a), please put the command "module load NiaEnv/2018a" before other module commands in those scripts, to ensure they will continue to work, or try the new stack (recommended).

QuickStart Guides

Tutorials, Manuals, etc.