Difference between revisions of "Main Page"

From SciNet Users Documentation
Jump to: navigation, search
(System Status)
 
(38 intermediate revisions by 8 users not shown)
Line 13: Line 13:
 
|-
 
|-
 
|{{Up |Jupyter Hub|Jupyter_Hub}}
 
|{{Up |Jupyter Hub|Jupyter_Hub}}
|{{up |Scheduler|Niagara_Quickstart#Submitting_jobs}}
+
|{{Up |Scheduler|Niagara_Quickstart#Submitting_jobs}}
 
|{{Up |File system|Niagara_Quickstart#Storage_and_quotas}}
 
|{{Up |File system|Niagara_Quickstart#Storage_and_quotas}}
 
|{{Up |Burst Buffer|Burst_Buffer}}
 
|{{Up |Burst Buffer|Burst_Buffer}}
 
|-
 
|-
|{{Up |HPSS|HPSS}}
+
|{{Up|HPSS|HPSS}}
 
|{{Up |Login Nodes|Niagara_Quickstart#Logging_in}}  
 
|{{Up |Login Nodes|Niagara_Quickstart#Logging_in}}  
 
|{{Up |External Network|Niagara_Quickstart#Logging_in}}  
 
|{{Up |External Network|Niagara_Quickstart#Logging_in}}  
|{{Up|Globus|Globus}}
+
|{{Up |Globus |Globus}}
 
|}
 
|}
  
 
<!-- Current Messages: -->
 
<!-- Current Messages: -->
<b> July 23th, 2021, 9:00 AM :</b> <b> Security update: </b> Due to a severe vulnerability in the Linux kernel (CVE-2021-33909), our team is currently patching and rebooting all login nodes and compute nodes.  There should be no affect on running jobs, however sessions on login and datamover nodes will be disrupted.  
+
<b>Mon Nov 22 1:40 EST PM 2021</b> The Mist login node is back.
  
<b> July 20th, 2021, 7:00 PM :</b> <b> SLURM configuration</b> - Changed the default behaviour to kill a job step if any task exits with a non-zero exit code. If your code is able to handle failures gracefully, please add srun's option --no-kill to recover the previous default behaviour.
+
<b>Mon Nov 22 12:40 EST PM 2021</b> The Mist login node is experiencing issues, we are investigating.
  
<b> July 20th, 2021, 7:00 PM :</b> Maintenance finished, systems are back online.  
+
<b>Fri Nov 5 19:35 EDT 2021 </b> The filesystem issue from earlier in the afternoon is resolved.
  
<b>SciNet Downtime July 20th, 2021 (Tuesday):</b> There will be a maintenance shutdown of the SciNet data center on Tuesday July 20th, starting at 7 am EDT. There will be no access to any of the SciNet systems (Niagara, Mist, HPSS, Teach cluster, or the file systems) during this time.  We expect to be able to bring the systems back online in the evening of July 20th.  The status of the Niagara cluster can be checked on status.computecanada.ca. For up-to-date and more detailed information on the status of all the SciNet systems, you can always check back here.
+
<b>Fri Nov 5 16:58 EDT 2021 </b> We are experiencing filesystem issues, login to the clusters may not be possible until they are resolved.
  
<b>June 28th, 2021, 4:06 PM:</b> Mist OS upgrade is complete.
+
<b>Tue Oct 19 noon EDT - Thu Oct 21 noon EDT:</b> <b><i>Niagara at Scale:</i></b> Only users of selected projects run at large scale during these 48 hours. Other users can still login and access their files, and submit jobs for after the event.  SOSCIP and Mist users are not affected.
  
<b>May 27, 2021:</b> Datamovers addresses have changed to improve high bandwidth connectivity and cybersecurity. The new addresses are 142.1.174.227 for nia-datamover1.scinet.utoronto.ca, and 142.1.174.228 for nia-datamover2.scinet.utoronto.ca.
+
<b>Tue Oct 12 14:30 EDT 2021 </b> Mist login node is back up.
  
If you have jobs that need to connect to a software license server using an ssh tunnel through nia-gw (which actually resolves to datamover1 or datamover2), you may need to ask the system administrators of that license server to allow incoming connections from the new addresses above.
+
<b>Tue Oct 12 12:30 EDT 2021 </b> Mist login node is down for maintenance.
 +
 
 +
<b>Mon Sep 27 16:11 EDT 2021 </b> HPSS is back online.
 +
 
 +
<b>Wed Sep 23 17:23 EDT 2021 </b> Systems being brought back online. HPSS may be down for some more days.
  
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
Line 52: Line 56:
  
 
== Tutorials, Manuals, etc. ==
 
== Tutorials, Manuals, etc. ==
* [https://support.scinet.utoronto.ca/education/browse.php SciNet education material]
+
* [https://education.scinet.utoronto.ca SciNet education material]
 
* [https://www.youtube.com/c/SciNetHPCattheUniversityofToronto SciNet's YouTube channel]
 
* [https://www.youtube.com/c/SciNetHPCattheUniversityofToronto SciNet's YouTube channel]
 
* [[Modules specific to Niagara|Software Modules specific to Niagara]]  
 
* [[Modules specific to Niagara|Software Modules specific to Niagara]]  
 +
* [[Modules for Mist]]
 
* [[Commercial software]]
 
* [[Commercial software]]
 
* [[Burst Buffer]]
 
* [[Burst Buffer]]
 +
* [[SSH keys]]
 
* [[SSH Tunneling]]
 
* [[SSH Tunneling]]
 
* [[SSH#Two-Factor_authentication|Two-Factor Authentication]]
 
* [[SSH#Two-Factor_authentication|Two-Factor Authentication]]

Latest revision as of 18:54, 22 November 2021

System Status

Niagara Mist Teach Rouge
Jupyter Hub Scheduler File system Burst Buffer
HPSS Login Nodes External Network Globus

Mon Nov 22 1:40 EST PM 2021 The Mist login node is back.

Mon Nov 22 12:40 EST PM 2021 The Mist login node is experiencing issues, we are investigating.

Fri Nov 5 19:35 EDT 2021 The filesystem issue from earlier in the afternoon is resolved.

Fri Nov 5 16:58 EDT 2021 We are experiencing filesystem issues, login to the clusters may not be possible until they are resolved.

Tue Oct 19 noon EDT - Thu Oct 21 noon EDT: Niagara at Scale: Only users of selected projects run at large scale during these 48 hours. Other users can still login and access their files, and submit jobs for after the event. SOSCIP and Mist users are not affected.

Tue Oct 12 14:30 EDT 2021 Mist login node is back up.

Tue Oct 12 12:30 EDT 2021 Mist login node is down for maintenance.

Mon Sep 27 16:11 EDT 2021 HPSS is back online.

Wed Sep 23 17:23 EDT 2021 Systems being brought back online. HPSS may be down for some more days.

QuickStart Guides

Tutorials, Manuals, etc.