Difference between revisions of "Main Page"

From SciNet Users Documentation
 
(170 intermediate revisions by 11 users not shown)
Line 7: Line 7:
 
<!-- Use "Up" or "Down"; these are templates. -->
 
<!-- Use "Up" or "Down"; these are templates. -->
 
{|style="width:100%"  
 
{|style="width:100%"  
|{{Up|Niagara|Niagara_Quickstart}}
+
|{{Up |Niagara|Niagara_Quickstart}}
|{{Up|HPSS|HPSS}}
+
|{{Up |Mist|Mist}}
|{{Up|Mist|Mist}}
+
|{{Up |Teach|Teach}}
|{{Up|Teach|Teach}}
+
|{{Up |Rouge|Rouge}}
 
|-
 
|-
|{{Up|Jupyter Hub|Jupyter_Hub}}
+
|{{Up |Jupyter Hub|Jupyter_Hub}}
|{{Up|Scheduler|Niagara_Quickstart#Submitting_jobs}}
+
|{{Up |Scheduler|Niagara_Quickstart#Submitting_jobs}}
|{{Up|File system|Niagara_Quickstart#Storage_and_quotas}}
+
|{{Up |File system|Niagara_Quickstart#Storage_and_quotas}}
|{{Up|Burst Buffer|Burst_Buffer}}
+
|{{Up |Burst Buffer|Burst_Buffer}}
 
|-
 
|-
|{{Up|Login Nodes|Niagara_Quickstart#Logging_in}}  
+
|{{Up|HPSS|HPSS}}
|{{Up|External Network|Niagara_Quickstart#Logging_in}}  
+
|{{Up |Login Nodes|Niagara_Quickstart#Logging_in}}  
|{{Up|Globus|Globus}}
+
|{{Up |External Network|Niagara_Quickstart#Logging_in}}  
 +
|{{Up |Globus |Globus}}
 
|}
 
|}
  
 
<!-- Current Messages: -->
 
<!-- Current Messages: -->
<b> August 21, 2020, 6:00 PM EST: </b> The pump has been repaired, cooling is restored, systems are up. <br/>Scratch purging is postponed until the evening of Friday Aug 28th, 2020.
+
<b>Mon Nov 22 1:40 PM EST 2021</b> The Mist login node is back.
 +
 
 +
<b>Mon Nov 22 12:40 PM EST 2021</b> The Mist login node is experiencing issues; we are investigating.
  
<b>August 19, 2020, 4:40 PM EST:</b> Update: The current estimate is to have the cooling restored on Friday and we hope to have the systems available for users on Saturday August 22, 2020.
+
<b>Fri Nov 5 19:35 EDT 2021 </b> The filesystem issue from earlier in the afternoon is resolved.
  
<b>August 17, 2020, 4:00 PM EST:</b> Unfortunately, after taking the pump apart, it was determined that the main drive shaft had suffered a more serious failure, not just the seal. As a new one will need to be sourced or fabricated, we estimate that it will take at least a few more days to get the part and complete the repairs to restore cooling. Sorry for the inconvenience.
+
<b>Fri Nov 5 16:58 EDT 2021 </b> We are experiencing filesystem issues; login to the clusters may not be possible until they are resolved.
  
<b>August 15, 2020, 1:00 PM EST:</b> Because of limited availability of the parts needed to repair the failed pump and cooling system, it is unlikely that systems can be restored before Monday afternoon at the earliest.
+
<b>Tue Oct 19 noon EDT - Thu Oct 21 noon EDT:</b> <b><i>Niagara at Scale:</i></b> Only users of selected projects run at large scale during these 48 hours. Other users can still log in, access their files, and submit jobs to run after the event. SOSCIP and Mist users are not affected.
  
<b>August 15, 2020, 00:04 EST:</b> A primary pump seal in the cooling infrastructure has blown, and parts availability cannot be determined until tomorrow. All systems are shut down as there is no cooling. If parts are available, systems may be back late tomorrow at the earliest. Check here for updates.
+
<b>Tue Oct 12 14:30 EDT 2021 </b> Mist login node is back up.
  
<b>August 14, 2020, 21:04 EST:</b> Tomorrow's /scratch purge has been postponed.
+
<b>Tue Oct 12 12:30 EDT 2021 </b> Mist login node is down for maintenance.
  
<b>August 14, 2020, 21:00 EST:</b> Staff are at the datacenter. It looks like one of the pumps has a badly leaking seal.
+
<b>Mon Sep 27 16:11 EDT 2021 </b> HPSS is back online.
  
<b>August 14, 2020, 20:37 EST:</b> We seem to be undergoing a thermal shutdown at the datacenter.
+
<b>Wed Sep 23 17:23 EDT 2021 </b> Systems are being brought back online. HPSS may be down for a few more days.
  
<b>August 14, 2020, 20:20 EST:</b> Network problems reaching Niagara/Mist. We are investigating.
 
 
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
{|style="border-spacing: 10px;width: 100%"
 
{|style="border-spacing: 10px;width: 100%"
Line 48: Line 49:
 
* [[Niagara Quickstart]]
 
* [[Niagara Quickstart]]
 
* [[HPSS | HPSS archival storage]]
 
* [[HPSS | HPSS archival storage]]
* [[SOSCIP_GPU | SOSCIP GPU cluster]]
 
 
* [[Mist| Mist Power 9 GPU cluster]]
 
* [[Mist| Mist Power 9 GPU cluster]]
 
* [[Teach|Teach cluster]]
 
* [[Teach|Teach cluster]]
Line 56: Line 56:
  
 
== Tutorials, Manuals, etc. ==
 
== Tutorials, Manuals, etc. ==
* [https://support.scinet.utoronto.ca/education/browse.php SciNet education material]
+
* [https://education.scinet.utoronto.ca SciNet education material]
 
* [https://www.youtube.com/c/SciNetHPCattheUniversityofToronto SciNet's YouTube channel]
 
* [https://www.youtube.com/c/SciNetHPCattheUniversityofToronto SciNet's YouTube channel]
 
* [[Modules specific to Niagara|Software Modules specific to Niagara]]  
 
* [[Modules specific to Niagara|Software Modules specific to Niagara]]  
 +
* [[Modules for Mist]]
 
* [[Commercial software]]
 
* [[Commercial software]]
 
* [[Burst Buffer]]
 
* [[Burst Buffer]]
 +
* [[SSH keys]]
 
* [[SSH Tunneling]]
 
* [[SSH Tunneling]]
 
* [[SSH#Two-Factor_authentication|Two-Factor Authentication]]
 
* [[SSH#Two-Factor_authentication|Two-Factor Authentication]]

Latest revision as of 18:54, 22 November 2021

System Status

Niagara | Mist | Teach | Rouge
Jupyter Hub | Scheduler | File system | Burst Buffer
HPSS | Login Nodes | External Network | Globus

Mon Nov 22 1:40 PM EST 2021 The Mist login node is back.

Mon Nov 22 12:40 PM EST 2021 The Mist login node is experiencing issues; we are investigating.

Fri Nov 5 19:35 EDT 2021 The filesystem issue from earlier in the afternoon is resolved.

Fri Nov 5 16:58 EDT 2021 We are experiencing filesystem issues; login to the clusters may not be possible until they are resolved.

Tue Oct 19 noon EDT - Thu Oct 21 noon EDT: Niagara at Scale: Only users of selected projects run at large scale during these 48 hours. Other users can still log in, access their files, and submit jobs to run after the event. SOSCIP and Mist users are not affected.

Tue Oct 12 14:30 EDT 2021 Mist login node is back up.

Tue Oct 12 12:30 EDT 2021 Mist login node is down for maintenance.

Mon Sep 27 16:11 EDT 2021 HPSS is back online.

Wed Sep 23 17:23 EDT 2021 Systems are being brought back online. HPSS may be down for a few more days.

QuickStart Guides

Tutorials, Manuals, etc.