Difference between revisions of "Main Page"

From SciNet Users Documentation
m (System Status)
(System Status)
(45 intermediate revisions by 6 users not shown)
Line 7: Line 7:
 
<!-- Use "Up" or "Down"; these are templates. -->
 
<!-- Use "Up" or "Down"; these are templates. -->
 
{|style="width:100%"  
 
{|style="width:100%"  
|{{Down|Niagara|Niagara_Quickstart}}
+
|{{Up|Niagara|Niagara_Quickstart}}
|{{Down|HPSS|HPSS}}
+
|{{Up|HPSS|HPSS}}
|{{Up|SOSCIP&nbsp;GPU|SOSCIP_GPU}}
+
|{{Up|Mist|Mist}}
|{{Down|Mist|Mist}}
+
|{{Up|Teach|Teach}}
 
|-
 
|-
|{{Down|Teach|Teach}}
+
|{{Up|Jupyter Hub|Jupyter_Hub}}
|{{Down|Jupyter Hub|Jupyter_Hub}}
+
|{{Up|Scheduler|Niagara_Quickstart#Submitting_jobs}}
|{{Down|Scheduler|Niagara_Quickstart#Submitting_jobs}}
+
|{{Up|File system|Niagara_Quickstart#Storage_and_quotas}}
|{{Down|File system|Niagara_Quickstart#Storage_and_quotas}}
+
|{{Up|Burst Buffer|Burst_Buffer}}
 
|-
 
|-
|{{Down|Login Nodes|Niagara_Quickstart#Logging_in}}  
+
|{{Up|Login Nodes|Niagara_Quickstart#Logging_in}}  
|{{Down|External Network|Niagara_Quickstart#Logging_in}}  
+
|{{Up|External Network|Niagara_Quickstart#Logging_in}}  
|{{Down|Globus|Globus}}
+
|{{Up|Globus|Globus}}
|{{Down|Burst Buffer|Burst_Buffer}}
 
 
|}
 
|}
 
<!-- Current Messages: -->
 
<!-- Current Messages: -->
  
<b> Thu Mar 26 23:05:00 EDT 2020:</b> Some aspects of the maintenance took longer than expected. The systems will not be back up until some time tomorrow, Friday March 27, 2020.
+
<b> June 29, 6:21:00 PM:</b> Systems are available again.   
  
<b> Wed Mar 25 7:00:00 EDT 2020:</b> SciNet/Niagara downtime started.
+
<b> June 29, 12:30:00 PM:</b> Power Outage caused thermal shutdown.
  
<b> Mon Mar 23 18:45:10 EDT 2020:</b>  File system issues were resolved.
+
<b>June 20, 2020, 10:24 PM:</b> File systems are back up. Unfortunately, all running jobs would have died and users are asked to resubmit them.
  
<b> Mon Mar 23 18:01:19 EDT 2020:</b> There is currently an issue with the main Niagara filesystems. This affects all systems; all jobs have been killed. The issue is being investigated.
+
<b>June 20, 2020, 9:48 PM:</b> An issue with the file systems is causing trouble. We are investigating the cause.
  
<b> Fri Mar 20 13:15:33 EDT 2020: </b> There was a power glitch at the datacentre earlier this morning, which resulted in jobs getting killed.  Please resubmit failed jobs.
+
<b>June 15, 2020, 10:30 PM:</b> A <b>power glitch</b> caused some compute nodes to be rebooted: jobs running at the time may have failed; users are asked to resubmit these jobs.
 
 
<b> COVID-19 Impact on SciNet Operations, March 18, 2020</b>
 
 
 
Although the University of Toronto is closing some of its
 
research operations on Friday March 20 at 5 pm EDT, this does not
 
affect the SciNet systems (such as Niagara, Mist, and HPSS), which
 
will remain operational.
 
 
 
<b> SciNet/Niagara Downtime Announcement, March 25-26, 2020</b>
 
 
 
All resources at SciNet will undergo a two-day maintenance shutdown on March 25th and 26th 2020, starting at 7 am EDT on Wednesday March 25th.  There will be no access to any of the SciNet systems (Niagara, Mist, HPSS, Teach cluster, or the file systems) during this time.
 
 
 
This shutdown is necessary to finish the expansion of the Niagara cluster and its storage system.
 
 
 
We expect to be able to bring the systems back online the evening of March 26th.
 
  
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
Line 70: Line 54:
 
* [[Burst Buffer]]
 
* [[Burst Buffer]]
 
* [[SSH Tunneling]]
 
* [[SSH Tunneling]]
 +
* [[SSH#Two-Factor_authentication|Two-Factor Authentication]]
 
* [[Visualization]]
 
* [[Visualization]]
 
* [[Running Serial Jobs on Niagara]]
 
* [[Running Serial Jobs on Niagara]]
 
* [[Jupyter Hub]]
 
* [[Jupyter Hub]]
 
|}
 
|}

Revision as of 12:43, 30 June 2020

System Status

Niagara HPSS Mist Teach
Jupyter Hub Scheduler File system Burst Buffer
Login Nodes External Network Globus

June 29, 6:21:00 PM: Systems are available again.

June 29, 12:30:00 PM: Power Outage caused thermal shutdown.

June 20, 2020, 10:24 PM: File systems are back up. Unfortunately, all running jobs would have died and users are asked to resubmit them.

June 20, 2020, 9:48 PM: An issue with the file systems is causing trouble. We are investigating the cause.

June 15, 2020, 10:30 PM: A power glitch caused some compute nodes to be rebooted: jobs running at the time may have failed; users are asked to resubmit these jobs.

QuickStart Guides

Tutorials, Manuals, etc.