Difference between revisions of "Main Page"

From SciNet Users Documentation
Jump to: navigation, search
(System Status)
(System Status)
(40 intermediate revisions by 5 users not shown)
Line 8: Line 8:
 
{|style="width:100%"  
 
{|style="width:100%"  
 
|{{Up|Niagara|Niagara_Quickstart}}
 
|{{Up|Niagara|Niagara_Quickstart}}
|{{Down|HPSS|HPSS}}
+
|{{Up|HPSS|HPSS}}
|{{Up|SOSCIP GPU|SOSCIP_GPU}}
 
 
|{{Up|Mist|Mist}}
 
|{{Up|Mist|Mist}}
 +
|{{Up|Teach|Teach}}
 
|-
 
|-
|{{Up|Teach|Teach}}
+
|{{Up|Jupyter Hub|Jupyter_Hub}}
|{{Down|Jupyter Hub|Jupyter_Hub}}
 
 
|{{Up|Scheduler|Niagara_Quickstart#Submitting_jobs}}
 
|{{Up|Scheduler|Niagara_Quickstart#Submitting_jobs}}
 
|{{Up|File system|Niagara_Quickstart#Storage_and_quotas}}
 
|{{Up|File system|Niagara_Quickstart#Storage_and_quotas}}
 +
|{{Up|Burst Buffer|Burst_Buffer}}
 
|-
 
|-
 
|{{Up|Login Nodes|Niagara_Quickstart#Logging_in}}  
 
|{{Up|Login Nodes|Niagara_Quickstart#Logging_in}}  
 
|{{Up|External Network|Niagara_Quickstart#Logging_in}}  
 
|{{Up|External Network|Niagara_Quickstart#Logging_in}}  
 
|{{Up|Globus|Globus}}
 
|{{Up|Globus|Globus}}
|{{Down|Burst Buffer|Burst_Buffer}}
 
 
|}
 
|}
 
<!-- Current Messages: -->
 
<!-- Current Messages: -->
<b> Fri Mar 27 15:29:00 EDT 2020:</b> SciNet systems are back up.
 
 
<b> Thu Mar 26 23:05:00 EDT 2020:</b>  Some aspects of the maintenance took longer than expected. The systems will not be back up until some time tomorrow, Friday March 27, 2020. 
 
 
<b> Wed Mar 25 7:00:00 EDT 2020:</b>  SciNet/Niagara downtime started.
 
 
<b> Mon Mar 23 18:45:10 EDT 2020:</b>  File system issues were resolved.
 
 
<b> Mon Mar 23 18:01:19 EDT 2020:</b> There is currently an issue with the main Niagara filesystems. This effects all systems, all jobs have been killed. The issue is being investigated.
 
 
<b> Fri Mar 20 13:15:33 EDT 2020: </b> There was a power glitch at the datacentre at 8:50 AM, which resulted in jobs getting killed.  Please resubmit failed jobs.
 
 
<b> COVID-19 Impact on SciNet Operations, March 18, 2020</b>
 
  
Although the University of Toronto is closing of some of its
+
<b> June 29, 6:21:00  PM:</b> Systems are available again.
research operations on Friday March 20 at 5 pm EDT, this does not
 
affect the SciNet systems (such as Niagara, Mist, and HPSS), which
 
will remain operational.
 
  
<b> SciNet/Niagara Downtime Announcement, March 25-26, 2020</b>
+
<b> June 29, 12:30:00  PM:</b> Power Outage caused thermal shutdown.
  
All resources at SciNet will undergo a two-day maintenance shutdown on March 25th and 26th 2020, starting at 7 am EDT on Wednesday March 25thThere will be no access to any of the SciNet systems (Niagara, Mist, HPSS, Teach cluster, or the file systems) during this time.
+
<b>June 20, 2020, 10:24 PM:</b> File systems are back upUnfortunately, all running jobs would have died and users are asked to resubmit them.
  
This shutdown is necessary to finish the expansion of the Niagara cluster and its storage system.
+
<b>June 20, 2020, 9:48 PM:</b> An issue with the file systems is causing trouble.  We are investigating the cause.
  
We expect to be able to bring the systems back online the evening of March 26th.
+
<b>June 15, 2020, 10:30 PM:</b> A <b>power glitch</b> caused some compute nodes to be rebooted: jobs running at the time may have failed; users are asked to resubmit these jobs.
  
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
Line 71: Line 54:
 
* [[Burst Buffer]]
 
* [[Burst Buffer]]
 
* [[SSH Tunneling]]
 
* [[SSH Tunneling]]
 +
* [[SSH#Two-Factor_authentication|Two-Factor Authentication]]
 
* [[Visualization]]
 
* [[Visualization]]
 
* [[Running Serial Jobs on Niagara]]
 
* [[Running Serial Jobs on Niagara]]
 
* [[Jupyter Hub]]
 
* [[Jupyter Hub]]
 
|}
 
|}

Revision as of 12:43, 30 June 2020

System Status

Niagara HPSS Mist Teach
Jupyter Hub Scheduler File system Burst Buffer
Login Nodes External Network Globus

June 29, 6:21:00 PM: Systems are available again.

June 29, 12:30:00 PM: Power Outage caused thermal shutdown.

June 20, 2020, 10:24 PM: File systems are back up. Unfortunately, all running jobs would have died and users are asked to resubmit them.

June 20, 2020, 9:48 PM: An issue with the file systems is causing trouble. We are investigating the cause.

June 15, 2020, 10:30 PM: A power glitch caused some compute nodes to be rebooted: jobs running at the time may have failed; users are asked to resubmit these jobs.

QuickStart Guides

Tutorials, Manuals, etc.