Difference between revisions of "Main Page"

From SciNet Users Documentation
m
(System Status)
(47 intermediate revisions by 6 users not shown)
Line 7: Line 7:
 
<!-- Use "Up" or "Down"; these are templates. -->
 
<!-- Use "Up" or "Down"; these are templates. -->
 
{|style="width:100%"  
 
{|style="width:100%"  
|{{Down|Niagara|Niagara_Quickstart}}
+
|{{Up|Niagara|Niagara_Quickstart}}
|{{Down|HPSS|HPSS}}
+
|{{Up|HPSS|HPSS}}
|{{Up|SOSCIP&nbsp;GPU|SOSCIP_GPU}}
+
|{{Up|Mist|Mist}}
|{{Down|Mist|Mist}}
+
|{{Up|Teach|Teach}}
 
|-
 
|-
|{{Down|Teach|Teach}}
+
|{{Up|Jupyter Hub|Jupyter_Hub}}
|{{Down|Jupyter Hub|Jupyter_Hub}}
+
|{{Up|Scheduler|Niagara_Quickstart#Submitting_jobs}}
|{{Down|Scheduler|Niagara_Quickstart#Submitting_jobs}}
+
|{{Up|File system|Niagara_Quickstart#Storage_and_quotas}}
|{{Down|File system|Niagara_Quickstart#Storage_and_quotas}}
+
|{{Up|Burst Buffer|Burst_Buffer}}
 
|-
 
|-
|{{Down|Login Nodes|Niagara_Quickstart#Logging_in}}  
+
|{{Up|Login Nodes|Niagara_Quickstart#Logging_in}}  
|{{Down|External Network|Niagara_Quickstart#Logging_in}}  
+
|{{Up|External Network|Niagara_Quickstart#Logging_in}}  
|{{Down|Globus|Globus}}
+
|{{Up|Globus|Globus}}
|{{Down|Burst Buffer|Burst_Buffer}}
 
 
|}
 
|}
 
<!-- Current Messages: -->
 
<!-- Current Messages: -->
<b> Wed Mar 25 7:00:00 EDT 2020:</b>  SciNet/Niagara downtime started.
 
  
<b> Mon Mar 23 18:45:10 EDT 2020:</b>  File system issues were resolved.
+
<b> June 29, 6:21:00  PM:</b> Systems are available again.  
  
<b> Mon Mar 23 18:01:19 EDT 2020:</b> There is currently an issue with the main Niagara filesystems. This affects all systems, and all jobs have been killed. The issue is being investigated.  
+
<b> June 29, 12:30:00  PM:</b> A power outage caused a thermal shutdown.
  
<b> Fri Mar 20 13:15:33 EDT 2020: </b> There was a power glitch at the datacentre earlier this morning, which resulted in jobs getting killed.  Please resubmit failed jobs.  
+
<b>June 20, 2020, 10:24 PM:</b> File systems are back up.  Unfortunately, all running jobs would have died and users are asked to resubmit them.
  
<b> COVID-19 Impact on SciNet Operations, March 18, 2020</b>
+
<b>June 20, 2020, 9:48 PM:</b> An issue with the file systems is causing trouble.  We are investigating the cause.
  
Although the University of Toronto is closing some of its
+
<b>June 15, 2020, 10:30 PM:</b> A <b>power glitch</b> caused some compute nodes to be rebooted: jobs running at the time may have failed; users are asked to resubmit these jobs.
research operations on Friday March 20 at 5 pm EDT, this does not
 
affect the SciNet systems (such as Niagara, Mist, and HPSS), which
 
will remain operational.
 
 
 
<b> SciNet/Niagara Downtime Announcement, March 25-26, 2020</b>
 
 
 
All resources at SciNet will undergo a two-day maintenance shutdown on March 25th and 26th 2020, starting at 7 am EDT on Wednesday March 25th.  There will be no access to any of the SciNet systems (Niagara, Mist, HPSS, Teach cluster, or the file systems) during this time.
 
 
 
This shutdown is necessary to finish the expansion of the Niagara cluster and its storage system.
 
 
 
We expect to bring the systems back online on the evening of March 26th.
 
  
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
Line 67: Line 54:
 
* [[Burst Buffer]]
 
* [[Burst Buffer]]
 
* [[SSH Tunneling]]
 
* [[SSH Tunneling]]
 +
* [[SSH#Two-Factor_authentication|Two-Factor Authentication]]
 
* [[Visualization]]
 
* [[Visualization]]
 
* [[Running Serial Jobs on Niagara]]
 
* [[Running Serial Jobs on Niagara]]
 
* [[Jupyter Hub]]
 
* [[Jupyter Hub]]
 
|}
 
|}

Revision as of 12:43, 30 June 2020

System Status

Niagara HPSS Mist Teach
Jupyter Hub Scheduler File system Burst Buffer
Login Nodes External Network Globus

June 29, 6:21:00 PM: Systems are available again.

June 29, 12:30:00 PM: A power outage caused a thermal shutdown.

June 20, 2020, 10:24 PM: File systems are back up. Unfortunately, all running jobs would have died and users are asked to resubmit them.

June 20, 2020, 9:48 PM: An issue with the file systems is causing trouble. We are investigating the cause.

June 15, 2020, 10:30 PM: A power glitch caused some compute nodes to be rebooted: jobs running at the time may have failed; users are asked to resubmit these jobs.

QuickStart Guides

Tutorials, Manuals, etc.