Difference between revisions of "Main Page"

From SciNet Users Documentation
 
(186 intermediate revisions by 12 users not shown)
Line 7: Line 7:
 
<!-- Use "Up" or "Down"; these are templates. -->
 
<!-- Use "Up" or "Down"; these are templates. -->
 
{|style="width:100%"  
 
{|style="width:100%"  
|{{Up|Niagara|Niagara_Quickstart}}
+
|{{Up |Niagara|Niagara_Quickstart}}
|{{Up|HPSS|HPSS}}
+
|{{Up |Mist|Mist}}
|{{Up|Mist|Mist}}
+
|{{Up |Teach|Teach}}
|{{Up|Teach|Teach}}
+
|{{Up |Rouge|Rouge}}
 
|-
 
|-
|{{Up|Jupyter Hub|Jupyter_Hub}}
+
|{{Up |Jupyter Hub|Jupyter_Hub}}
|{{Up|Scheduler|Niagara_Quickstart#Submitting_jobs}}
+
|{{Up |Scheduler|Niagara_Quickstart#Submitting_jobs}}
|{{Up|File system|Niagara_Quickstart#Storage_and_quotas}}
+
|{{Up |File system|Niagara_Quickstart#Storage_and_quotas}}
|{{Up|Burst Buffer|Burst_Buffer}}
+
|{{Up |Burst Buffer|Burst_Buffer}}
 
|-
 
|-
|{{Up|Login Nodes|Niagara_Quickstart#Logging_in}}  
+
|{{Up|HPSS|HPSS}}
|{{Down|External Network|Niagara_Quickstart#Logging_in}}  
+
|{{Up |Login Nodes|Niagara_Quickstart#Logging_in}}  
|{{Up|Globus|Globus}}
+
|{{Up |External Network|Niagara_Quickstart#Logging_in}}  
 +
|{{Up |Globus |Globus}}
 
|}
 
|}
  
 
<!-- Current Messages: -->
 
<!-- Current Messages: -->
<b> August 24, 2020, 3:15 PM EST: </b> There are issues connecting to the data centre. We're investigating.
 
  
<b> August 21, 2020, 6:00 PM EST: </b> The pump has been repaired, cooling is restored, systems are up. <br/>Scratch purging is postponed until the evening of Friday Aug 28th, 2020.
+
<b>Sat Jan 8 11:42 EST AM 2022</b> The emergency maintenance is complete. Systems are up and available.
 +
 
 +
<b>Fri Jan 7 14:34 EST PM 2022</b> The SciNet shutdown is in progress. Systems are expected back on Saturday, Jan 8.
  
<b>August 19, 2020, 4:40 PM EST:</b> Update: The current estimate is to have the cooling restored on Friday and we hope to have the systems available for users on Saturday August 22, 2020.
+
<b><span style="color:red">Emergency shutdown Friday January 7, 2022</span></b>: An emergency shutdown of all SciNet to replace a crucial file system component is planned to take place on Friday January 7, 2022, starting at 8am EST, and will require at least 12 hours of downtime.  Updates will be posted during the day.
  
<b>August 17, 2020, 4:00 PM EST:</b> Unfortunately, after taking the pump apart, it was determined that there was a more serious failure of the main drive shaft, not just the seal. As a new one will need to be sourced or fabricated, we estimate that it will take at least a few more days to get the part and complete the repairs needed to restore cooling. Sorry for the inconvenience.
+
<b>Thu Jan 6 08:20 EST AM 2022</b> The SciNet filesystem is having issues. We are investigating.
  
<b>August 15, 2020, 1:00 PM EST:</b> Because of limited availability of the parts needed to repair the failed pump and cooling system, it is unlikely that systems can be restored before Monday afternoon.
 
  
<b>August 15, 2020, 00:04 AM EST:</b> A primary pump seal in the cooling infrastructure has blown, and parts availability cannot be determined until tomorrow. All systems are shut down as there is no cooling. If parts are available, systems may be back late tomorrow at the earliest. Check here for updates.
+
<b>Fri Dec 24 13:31 EST PM 2021</b> Please note the following scheduled network maintenance, which will result in loss of connectivity to the SciNet datacentre: Start time
 +
Dec 29, 00:30 EST  Estimated duration  4 hours and 30 minutes.  
  
<b>August 14, 2020, 21:04 AM EST:</b> Tomorrow's /scratch purge has been postponed.
+
<b>Mon Dec 20 4:29 EST PM 2021</b> Filesystem is back to normal.  
  
<b>August 14, 2020, 21:00 AM EST:</b> Staff at the datacenter. Looks like one of the pumps has a seal that is leaking badly.
+
<b>Mon Dec 20 2:53 EST PM 2021</b> Filesystem problem - We are investigating.  
  
<b>August 14, 2020, 20:37 AM EST:</b> We seem to be undergoing a thermal shutdown at the datacenter.
 
  
<b>August 14, 2020, 20:20 AM EST:</b> Network problems to niagara/mist. We are investigating.
 
 
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
{|style="border-spacing: 10px;width: 100%"
 
{|style="border-spacing: 10px;width: 100%"
Line 50: Line 49:
 
* [[Niagara Quickstart]]
 
* [[Niagara Quickstart]]
 
* [[HPSS | HPSS archival storage]]
 
* [[HPSS | HPSS archival storage]]
* [[SOSCIP_GPU | SOSCIP GPU cluster]]
 
 
* [[Mist| Mist Power 9 GPU cluster]]
 
* [[Mist| Mist Power 9 GPU cluster]]
 
* [[Teach|Teach cluster]]
 
* [[Teach|Teach cluster]]
Line 58: Line 56:
  
 
== Tutorials, Manuals, etc. ==
 
== Tutorials, Manuals, etc. ==
* [https://support.scinet.utoronto.ca/education/browse.php SciNet education material]
+
* [https://education.scinet.utoronto.ca SciNet education material]
 
* [https://www.youtube.com/c/SciNetHPCattheUniversityofToronto SciNet's YouTube channel]
 
* [https://www.youtube.com/c/SciNetHPCattheUniversityofToronto SciNet's YouTube channel]
 
* [[Modules specific to Niagara|Software Modules specific to Niagara]]  
 
* [[Modules specific to Niagara|Software Modules specific to Niagara]]  
 +
* [[Modules for Mist]]
 
* [[Commercial software]]
 
* [[Commercial software]]
 
* [[Burst Buffer]]
 
* [[Burst Buffer]]
 +
* [[SSH keys]]
 
* [[SSH Tunneling]]
 
* [[SSH Tunneling]]
 
* [[SSH#Two-Factor_authentication|Two-Factor Authentication]]
 
* [[SSH#Two-Factor_authentication|Two-Factor Authentication]]

Latest revision as of 17:36, 8 January 2022

System Status

Niagara Mist Teach Rouge
Jupyter Hub Scheduler File system Burst Buffer
HPSS Login Nodes External Network Globus


Sat Jan 8 11:42 EST AM 2022 The emergency maintenance is complete. Systems are up and available.

Fri Jan 7 14:34 EST PM 2022 The SciNet shutdown is in progress. Systems are expected back on Saturday, Jan 8.

Emergency shutdown Friday January 7, 2022: An emergency shutdown of all SciNet to replace a crucial file system component is planned to take place on Friday January 7, 2022, starting at 8am EST, and will require at least 12 hours of downtime. Updates will be posted during the day.

Thu Jan 6 08:20 EST AM 2022 The SciNet filesystem is having issues. We are investigating.


Fri Dec 24 13:31 EST PM 2021 Please note the following scheduled network maintenance, which will result in loss of connectivity to the SciNet datacentre. Start time: Dec 29, 00:30 EST. Estimated duration: 4 hours and 30 minutes.

Mon Dec 20 4:29 EST PM 2021 Filesystem is back to normal.

Mon Dec 20 2:53 EST PM 2021 Filesystem problem - We are investigating.


QuickStart Guides

Tutorials, Manuals, etc.