Difference between revisions of "Main Page"

From SciNet Users Documentation
(System Status)
 
(20 intermediate revisions by 3 users not shown)
Line 23: Line 23:
  
 
<!-- Current Messages: -->
 
<!-- Current Messages: -->
<b> December 11, 2020, 12:00 AM EST: </b> Cooling issue resolved. Systems back.
 
  
<b> December 11, 2020, 6:00 PM EST: </b> Cooling issue at datacenter. All systems down.
+
From Tue Mar 30 at 12 noon EST to Thu Apr 1 at 12 noon EST, there will be a two-day reservation for the "Niagara at Scale" pilot event. During these 48 hours, only "Niagara at Scale" projects will run on the compute nodes (as well as SOSCIP projects, on a subset of nodes). All other users can still log in, access their data, and submit jobs throughout this event, but the jobs will not run until after the event. The debugjob queue will remain available to everyone as well.
  
<b> December 7, 2020, 7:25 PM EST: </b>All systems back; users can log in again.
+
The scheduler will not start batch jobs that cannot finish before the start of this event. Users who submit small, short jobs can take advantage of this, as the scheduler may be able to fit these jobs onto the otherwise idle nodes before the event starts.
  
<b> December 7, 2020, 6:46 PM EST: </b>User connectivity to data center not yet ready, but queued jobs on Mist and Niagara have been started.
+
Tue 23 Mar 2021 12:19:07 PM EDT - Planned external network maintenance 12pm-1pm Tuesday, March 23rd.  
 
<b> December 7, 2020, 7:00 AM EST: </b>Maintenance shutdown in effect. This is a one-day maintenance shutdown.  There will be no access to Niagara, Mist, HPSS or teach, nor to their file systems during this time.  We expect to be able to bring the systems back online this evening.
 
 
 
<b> December 2, 2020, 9:10 PM EST: </b>Power is back, systems are coming up. Please resubmit any jobs that failed because of this incident.
 
 
 
<b> December 2, 2020, 6:00 PM EST: </b>A power glitch at the data center caused about half of the compute nodes to go down.  The power issue is not yet resolved.
 
 
 
<b> <span style="color:#dd1111">Announcing a Maintenance Shutdown on December 7th, 2020</span></b> <br/>There will be a one-day maintenance shutdown on December 7th 2020, starting at 7 am EST.  There will be no access to Niagara, Mist, HPSS or teach, nor to their file systems during this time.  We expect to be able to bring the systems back online in the evening of the same day.
 
  
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
Line 46: Line 37:
 
* [[Niagara Quickstart]]
 
* [[Niagara Quickstart]]
 
* [[HPSS | HPSS archival storage]]
 
* [[HPSS | HPSS archival storage]]
* [[SOSCIP_GPU | SOSCIP GPU cluster]]
 
 
* [[Mist| Mist Power 9 GPU cluster]]
 
* [[Mist| Mist Power 9 GPU cluster]]
 
* [[Teach|Teach cluster]]
 
* [[Teach|Teach cluster]]

Latest revision as of 17:32, 1 April 2021

System Status

Niagara | HPSS | Mist | Teach
Jupyter Hub | Scheduler | File system | Burst Buffer
Login Nodes | External Network | Globus


From Tue Mar 30 at 12 noon EST to Thu Apr 1 at 12 noon EST, there will be a two-day reservation for the "Niagara at Scale" pilot event. During these 48 hours, only "Niagara at Scale" projects will run on the compute nodes (as well as SOSCIP projects, on a subset of nodes). All other users can still log in, access their data, and submit jobs throughout this event, but the jobs will not run until after the event. The debugjob queue will remain available to everyone as well.

The scheduler will not start batch jobs that cannot finish before the start of this event. Users who submit small, short jobs can take advantage of this, as the scheduler may be able to fit these jobs onto the otherwise idle nodes before the event starts.
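
The rule described above can be illustrated with a rough sketch. This is only a simplified model of the idea, not the scheduler's actual algorithm: the function name `can_start_before_event` is hypothetical, the reservation start is taken from the notice, and time zones and node availability are ignored.

```python
from datetime import datetime, timedelta

# Reservation start as stated in the notice (Tue Mar 30, 2021, 12 noon).
# Naive local time is an assumption made to keep the sketch simple.
RESERVATION_START = datetime(2021, 3, 30, 12, 0)


def can_start_before_event(now, requested_walltime,
                           reservation_start=RESERVATION_START):
    """Return True if a job started at `now` would end by the reservation start.

    Hypothetical helper: it only compares the requested time limit with the
    time remaining until the reservation begins.
    """
    return now + requested_walltime <= reservation_start


# Three hours before the event: a 2-hour job fits, a 4-hour job does not.
three_hours_before = datetime(2021, 3, 30, 9, 0)
fits = can_start_before_event(three_hours_before, timedelta(hours=2))
too_long = can_start_before_event(three_hours_before, timedelta(hours=4))
```

Under this model, requesting a shorter time limit widens the window in which a job could still be started before the event.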

Tue 23 Mar 2021 12:19:07 PM EDT - Planned external network maintenance 12pm-1pm Tuesday, March 23rd.

QuickStart Guides

Tutorials, Manuals, etc.