Difference between revisions of "Main Page"

From SciNet Users Documentation
Jump to: navigation, search
(System Status)
(System Status)
 
(236 intermediate revisions by 10 users not shown)
Line 9: Line 9:
 
|{{Up|Niagara|Niagara_Quickstart}}
 
|{{Up|Niagara|Niagara_Quickstart}}
 
|{{Up|HPSS|HPSS}}
 
|{{Up|HPSS|HPSS}}
|{{Up|SOSCIP GPU|SOSCIP_GPU}}
+
|{{Up|Mist|Mist}}
|{{Down|P8|P8}}
+
|{{Up|Teach|Teach}}
 
|-
 
|-
|{{Up|Teach|Teach}}
+
|{{Up|Jupyter Hub|Jupyter_Hub}}
|{{Down|Jupyter Hub|Jupyter_Hub}}
 
 
|{{Up|Scheduler|Niagara_Quickstart#Submitting_jobs}}
 
|{{Up|Scheduler|Niagara_Quickstart#Submitting_jobs}}
 
|{{Up|File system|Niagara_Quickstart#Storage_and_quotas}}
 
|{{Up|File system|Niagara_Quickstart#Storage_and_quotas}}
 +
|{{Up|Burst Buffer|Burst_Buffer}}
 
|-
 
|-
 +
|{{Up|Login Nodes|Niagara_Quickstart#Logging_in}}
 
|{{Up|External Network|Niagara_Quickstart#Logging_in}}  
 
|{{Up|External Network|Niagara_Quickstart#Logging_in}}  
 
|{{Up|Globus|Globus}}
 
|{{Up|Globus|Globus}}
 
|}
 
|}
<!-- Current Messages: --><p>
 
<b>Sat, Nov 2 2019, 1:30 PM (update):</b>  Chiller has been fixed, all systems are no operational.   
 
</p>
 
<b>Fri, Nov 1 2019, 4:30 PM (update):</b>  We are operating in free cooling so have brought up about 1/2 of the Niagara compute nodes to reduce the cooling load.  Access, storage, and other systems should now be available. 
 
  
<b>Fri, Nov 1 2019, 12:05 PM (update):</b> A power module in the chiller has failed and needs to be replaced.  We should be able to operate in free cooling if the temperature stays cold enough, but we may not be able to run all systems. No eta yet on when users will be able to log back in.  
+
<!-- Current Messages: -->
 +
 
 +
From Tue Mar 30 at 12 noon EST to Thu Apr 1 at 12 noon EST, there will be a two-day reservation for the "Niagara at Scale" pilot  event.  During these 48 hours, only "Niagara at Scale" projects will run on the compute notes (as well as SOSCIP projects, on a subset of nodes).  All other users can still login, access their data, and submit jobs throughout this event, but the jobs will not run until after the event.  The debugjob queue will remain available to everyone as well.
 +
 
 +
The scheduler will not start batch jobs that cannot finish before the start of this event. Users can submit small and short jobs can take advantage of this, as the scheduler may be able to fit these jobs in before the event starts on the otherwise idle nodes.
  
<b>Fri, Nov 1 2019, 9:15 AM (update):</b> There was a automated shutdown because of rising temperatures, causing all systems to go down. We are investigating, check here for updates.
+
Tue 23 Mar 2021 12:19:07 PM EDT - Planned external network maintenance 12pm-1pm Tuesday, March 23rd.  
  
<p><b>Fri, Nov 1 2019, 8:16 AM:</b> Unexpected data centre issue: Check here for updates.
 
</p>
 
<p>
 
<b>Announcement:</b>
 
The SciNet datacentre will undergo a maintenance shutdown on
 
Friday November 15th 2019, from 7 am to 11 pm (EST), with no access
 
to any of the SciNet systems (Niagara, P8, SGC, HPSS, Teach cluster,
 
or the filesystems) during that time.
 
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
{|style="border-spacing: 10px;width: 100%"
 
{|style="border-spacing: 10px;width: 100%"
Line 44: Line 37:
 
* [[Niagara Quickstart]]
 
* [[Niagara Quickstart]]
 
* [[HPSS | HPSS archival storage]]
 
* [[HPSS | HPSS archival storage]]
* [[SOSCIP_GPU | SOSCIP GPU cluster]]
+
* [[Mist| Mist Power 9 GPU cluster]]
* [[P8|Experimental Power 8 GPU cluster]]
 
 
* [[Teach|Teach cluster]]
 
* [[Teach|Teach cluster]]
 
* [[FAQ | FAQ (frequently asked questions)]]
 
* [[FAQ | FAQ (frequently asked questions)]]
Line 52: Line 44:
  
 
== Tutorials, Manuals, etc. ==
 
== Tutorials, Manuals, etc. ==
* [https://courses.scinet.utoronto.ca SciNet education material]
+
* [https://support.scinet.utoronto.ca/education/browse.php SciNet education material]
 
* [https://www.youtube.com/c/SciNetHPCattheUniversityofToronto SciNet's YouTube channel]
 
* [https://www.youtube.com/c/SciNetHPCattheUniversityofToronto SciNet's YouTube channel]
 
* [[Modules specific to Niagara|Software Modules specific to Niagara]]  
 
* [[Modules specific to Niagara|Software Modules specific to Niagara]]  
Line 58: Line 50:
 
* [[Burst Buffer]]
 
* [[Burst Buffer]]
 
* [[SSH Tunneling]]
 
* [[SSH Tunneling]]
 +
* [[SSH#Two-Factor_authentication|Two-Factor Authentication]]
 
* [[Visualization]]
 
* [[Visualization]]
 
* [[Running Serial Jobs on Niagara]]
 
* [[Running Serial Jobs on Niagara]]
 
* [[Jupyter Hub]]
 
* [[Jupyter Hub]]
 
|}
 
|}

Latest revision as of 17:32, 1 April 2021

System Status

Niagara HPSS Mist Teach
Jupyter Hub Scheduler File system Burst Buffer
Login Nodes External Network Globus


From Tue Mar 30 at 12 noon EST to Thu Apr 1 at 12 noon EST, there will be a two-day reservation for the "Niagara at Scale" pilot event. During these 48 hours, only "Niagara at Scale" projects will run on the compute notes (as well as SOSCIP projects, on a subset of nodes). All other users can still login, access their data, and submit jobs throughout this event, but the jobs will not run until after the event. The debugjob queue will remain available to everyone as well.

The scheduler will not start batch jobs that cannot finish before the start of this event. Users can submit small and short jobs can take advantage of this, as the scheduler may be able to fit these jobs in before the event starts on the otherwise idle nodes.

Tue 23 Mar 2021 12:19:07 PM EDT - Planned external network maintenance 12pm-1pm Tuesday, March 23rd.

QuickStart Guides

Tutorials, Manuals, etc.