Difference between revisions of "Main Page"

From SciNet Users Documentation
Jump to navigation Jump to search
 
(114 intermediate revisions by 12 users not shown)
Line 13: Line 13:
 
|-
 
|-
 
|{{Up |Jupyter Hub|Jupyter_Hub}}
 
|{{Up |Jupyter Hub|Jupyter_Hub}}
|{{up |Scheduler|Niagara_Quickstart#Submitting_jobs}}
+
|{{Up |Scheduler|Niagara_Quickstart#Submitting_jobs}}
 
|{{Up |File system|Niagara_Quickstart#Storage_and_quotas}}
 
|{{Up |File system|Niagara_Quickstart#Storage_and_quotas}}
 
|{{Up |Burst Buffer|Burst_Buffer}}
 
|{{Up |Burst Buffer|Burst_Buffer}}
Line 20: Line 20:
 
|{{Up |Login Nodes|Niagara_Quickstart#Logging_in}}  
 
|{{Up |Login Nodes|Niagara_Quickstart#Logging_in}}  
 
|{{Up |External Network|Niagara_Quickstart#Logging_in}}  
 
|{{Up |External Network|Niagara_Quickstart#Logging_in}}  
|{{Up|Globus|Globus}}
+
|{{Up |Globus |Globus}}
 
|}
 
|}
  
<!-- Current Messages: -->
+
'''Fri May 6th, 2022, 11:35 am:''' HPSS scheduler upgrade also finished.
<b> July 23th, 2021, 8:00 AM :</b> <b> Security update </b> Due to a severe vulnerability in the Linux kernel (CVE-2021-33909), our team is currently patching and rebooting all login nodes and compute nodes.  There should be no affect on running jobs, however sessions on login and datamover nodes will be disrupted.  
 
  
<b> July 20th, 2021, 7:00 PM :</b> <b> SLURM configuration</b> - Changed the default behaviour to kill a job step if any task exits with a non-zero exit code. If your code is able to handle failures gracefully, please add srun's option --no-kill to recover the previous default behaviour.
+
'''Thu May 5th, 2022, 7:45 pm:''' Upgrade of the scheduler has finished, with the exception of HPSS.
  
<b> July 20th, 2021, 7:00 PM :</b> Maintenance finished, systems are back online.  
+
'''Thu May 5th, 2022, 7:00 am - 3:00 pm EDT (approx):''' Starting from 7:00 am EDT, an upgrade of the scheduler of the Niagara, Mist, and Rouge clusters will be applied.  This requires the scheduler to be down for about 5-6 hours, and all compute and login nodes to be rebooted.
 +
Jobs cannot be submitted during this maintenance, but jobs submitted beforehand will remain in the queue.  For most of the time, the login nodes of the clusters will be available so that users may access their files on the home, scratch, and project file systems.
  
<b>SciNet Downtime July 20th, 2021 (Tuesday):</b> There will be a maintenance shutdown of the SciNet data center on Tuesday July 20th, starting at 7 am EDT. There will be no access to any of the SciNet systems (Niagara, Mist, HPSS, Teach cluster, or the file systems) during this time. We expect to be able to bring the systems back online in the evening of July 20th.  The status of the Niagara cluster can be checked on status.computecanada.ca. For up-to-date and more detailed information on the status of all the SciNet systems, you can always check back here.
+
'''Monday May 2nd, 2022, 9:30 - 11:00 am EDT:''' the Niagara login nodes, the jupyter hub, and nia-datamover2 will get rebooted for updates.  In the process, any login sessions will get disconnected, and servers on the jupyterhub will stop. Jobs in the Niagara queue will not be affected.
  
<b>June 28th, 2021, 4:06 PM:</b> Mist OS upgrade is complete.
+
'''Tue Apr 26, 11:20 AM EDT:''' A Rolling update of the Mist cluster is taking a bit longer than expected, affecting logins to Mist.  
 +
 +
<!--  When removing system status entries, please archive them to: -->
 +
[[Previous messages]]
  
<b>May 27, 2021:</b> Datamovers addresses have changed to improve high bandwidth connectivity and cybersecurity. The new addresses are 142.1.174.227 for nia-datamover1.scinet.utoronto.ca, and 142.1.174.228 for nia-datamover2.scinet.utoronto.ca.
 
 
If you have jobs that need to connect to a software license server using an ssh tunnel through nia-gw (which actually resolves to datamover1 or datamover2), you may need to ask the system administrators of that license server to allow incoming connections from the new addresses above.
 
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
 
{|style="border-spacing: 10px;width: 100%"
 
{|style="border-spacing: 10px;width: 100%"
 
|valign="top" style="margin: 1em; padding:1em; padding-top:.1em; border:2px solid #000; background-color:#fff; border-radius:7px; width: 49.5%" |
 
|valign="top" style="margin: 1em; padding:1em; padding-top:.1em; border:2px solid #000; background-color:#fff; border-radius:7px; width: 49.5%" |
Line 52: Line 50:
  
 
== Tutorials, Manuals, etc. ==
 
== Tutorials, Manuals, etc. ==
* [https://support.scinet.utoronto.ca/education/browse.php SciNet education material]
+
* [https://education.scinet.utoronto.ca SciNet education material]
 
* [https://www.youtube.com/c/SciNetHPCattheUniversityofToronto SciNet's YouTube channel]
 
* [https://www.youtube.com/c/SciNetHPCattheUniversityofToronto SciNet's YouTube channel]
 
* [[Modules specific to Niagara|Software Modules specific to Niagara]]  
 
* [[Modules specific to Niagara|Software Modules specific to Niagara]]  
 +
* [[Modules for Mist]]
 
* [[Commercial software]]
 
* [[Commercial software]]
 
* [[Burst Buffer]]
 
* [[Burst Buffer]]
 +
* [[SSH#SSH Keys|SSH keys]]
 
* [[SSH Tunneling]]
 
* [[SSH Tunneling]]
 
* [[SSH#Two-Factor_authentication|Two-Factor Authentication]]
 
* [[SSH#Two-Factor_authentication|Two-Factor Authentication]]

Latest revision as of 11:27, 9 May 2022

System Status

Niagara Mist Teach Rouge
Jupyter Hub Scheduler File system Burst Buffer
HPSS Login Nodes External Network Globus

Fri May 6th, 2022, 11:35 am: HPSS scheduler upgrade also finished.

Thu May 5th, 2022, 7:45 pm: Upgrade of the scheduler has finished, with the exception of HPSS.

Thu May 5th, 2022, 7:00 am - 3:00 pm EDT (approx): Starting from 7:00 am EDT, an upgrade of the scheduler of the Niagara, Mist, and Rouge clusters will be applied. This requires the scheduler to be down for about 5-6 hours, and all compute and login nodes to be rebooted. Jobs cannot be submitted during this maintenance, but jobs submitted beforehand will remain in the queue. For most of the time, the login nodes of the clusters will be available so that users may access their files on the home, scratch, and project file systems.

Monday May 2nd, 2022, 9:30 - 11:00 am EDT: the Niagara login nodes, the jupyter hub, and nia-datamover2 will get rebooted for updates. In the process, any login sessions will get disconnected, and servers on the jupyterhub will stop. Jobs in the Niagara queue will not be affected.

Tue Apr 26, 11:20 AM EDT: A Rolling update of the Mist cluster is taking a bit longer than expected, affecting logins to Mist.

Previous messages

QuickStart Guides

Tutorials, Manuals, etc.