Difference between revisions of "Main Page"

From SciNet Users Documentation
Jump to: navigation, search
m (System Status)
(System Status)
 
(92 intermediate revisions by 6 users not shown)
Line 7: Line 7:
 
<!-- Use "Up" or "Down"; these are templates. -->
 
<!-- Use "Up" or "Down"; these are templates. -->
 
{|style="width:100%"  
 
{|style="width:100%"  
|{{Partial|Niagara|Niagara_Quickstart}}
+
|{{Up|Niagara|Niagara_Quickstart}}
 
|{{Up|HPSS|HPSS}}
 
|{{Up|HPSS|HPSS}}
|{{Up|BGQ|BGQ}}
 
 
|{{Up|SOSCIP&nbsp;GPU|SOSCIP_GPU}}
 
|{{Up|SOSCIP&nbsp;GPU|SOSCIP_GPU}}
 +
|{{Up|P8|P8}}
 
|-
 
|-
|{{Down|P7|P7}}
+
|{{Up|Teach|Teach}}
|{{Down|P8|P8}}
+
|{{Up|Jupyter Hub|Jupyter_Hub}}
|{{Down|Teach|Teach}}
 
|{{Down|Jupyter Hub|Jupyter_Hub}}
 
|-
 
 
|{{Up|Scheduler|Niagara_Quickstart#Submitting_jobs}}
 
|{{Up|Scheduler|Niagara_Quickstart#Submitting_jobs}}
 
|{{Up|File system|Niagara_Quickstart#Storage_and_quotas}}
 
|{{Up|File system|Niagara_Quickstart#Storage_and_quotas}}
 +
|-
 
|{{Up|External Network|Niagara_Quickstart#Logging_in}}  
 
|{{Up|External Network|Niagara_Quickstart#Logging_in}}  
 
|{{Up|Globus|Globus}}
 
|{{Up|Globus|Globus}}
Line 24: Line 22:
 
<!-- Current Messages: -->
 
<!-- Current Messages: -->
  
<b>Thursday May 30, 2019 11:00:00 PM:</b>
+
<b> Jan 21, 2020, 4:05PM: </b>   The was a partial power outage the took down a large amount of the compute nodes.  If your job died during this period please resubmit.
The maintenance downtime of SciNet's data center has finished, and systems are being brought online now.  You can check the progress here. Some systems might not be available until Friday morning. Login access to Niagara is not yet enabled, but jobs that were in the queue are running.
 
 
 
Some action on the part of users will be required when they first connect again to a Niagara login nodes or datamoversThis is due to the security upgrade of the Niagara cluster, which is now in line with currently accepted best practices.  
 
 
 
The details of the required actions can be found on the [[SSH_Changes_in_May_2019]] wiki page.
 
 
 
'''SCHEDULED SHUTDOWN''':
 
  
Please be advised that on '''Wednesday May 29th through Thursday May 30th''', the SciNet datacentre will undergo a two-day maintenance shutdown, starting at 7 am EDT on Wednesday May 29th.  There will be no access to any of the SciNet systems (Niagara, P7, P8, BGQ, SGC, HPSS, Teach cluster, or the file systems) during this time.
+
<b>Jan 13, 2020, 7:35 PM:</b> Maintenance finished.
  
This is necessary to finish the installation of an emergency power generator, to perform the annual cooling tower maintenance, and to enhance login security.
+
<b>Jan 13, 2020, 8:20 AM:</b> The announced maintenance downtime started (see below).
  
We expect to be able to bring the systems back online the evening of May 30th.  Due to the enhanced login security, the ssh applications of users will need to update their known host list. More detailed information on this procedure will be sent shortly before the systems are back online.
+
<b>Jan 9 2020, 11:30 AM:</b> External ssh connectivity restored, issue related to the university network.
  
Fri 5 Apr 2019: Software updates on Niagara: The default CCEnv software stack now uses avx512 on Niagara, and there is now a NiaEnv/2019b stack ("epoch").
+
<b>Jan 9 2020, 9:24 AM:</b> We received reports of users having trouble connecting into the SciNet data centre; we're investigating.  Systems are up and running and jobs are fine.<p>
 +
As a work around, in the meantime, it appears to be possible to log into graham, cedar or beluga, and then ssh to niagara.</p>
  
Thu 4 Apr 2019: The 2019 compute and storage allocations have taken effect on Niagara.  
+
<b>Downtime announcement:</b>
 +
To prepare for the upcoming expansion of Niagara, there will be a
 +
one-day maintenance shutdown on <b>January 13th 2020, starting at 8 am
 +
EST</b>.  There will be no access to Niagara, Mist, HPSS or teach, nor
 +
to their file systems during this time.
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
{|style="border-spacing: 10px;width: 100%"
 
{|style="border-spacing: 10px;width: 100%"
Line 47: Line 43:
  
 
== QuickStart Guides ==
 
== QuickStart Guides ==
* [[Niagara_Quickstart|Niagara Quickstart]]
+
* [[Niagara Quickstart]]
 
* [[HPSS | HPSS archival storage]]
 
* [[HPSS | HPSS archival storage]]
* [[BGQ | SOSCIP BlueGene/Q cluster]]
 
 
* [[SOSCIP_GPU | SOSCIP GPU cluster]]
 
* [[SOSCIP_GPU | SOSCIP GPU cluster]]
* [[P7|Experimental Power 7 cluster]]
 
 
* [[P8|Experimental Power 8 GPU cluster]]
 
* [[P8|Experimental Power 8 GPU cluster]]
 
* [[Teach|Teach cluster]]
 
* [[Teach|Teach cluster]]
 
* [[FAQ | FAQ (frequently asked questions)]]
 
* [[FAQ | FAQ (frequently asked questions)]]
* [[Acknowledging_SciNet | Acknowledging SciNet]]
+
* [[Acknowledging SciNet]]
 
| valign="top" style="margin: 1em; padding:1em; padding-top:.1em; border:2px solid #000; background-color:#fff; border-radius:7px; width: 49.5%" |
 
| valign="top" style="margin: 1em; padding:1em; padding-top:.1em; border:2px solid #000; background-color:#fff; border-radius:7px; width: 49.5%" |
  
 
== Tutorials, Manuals, etc. ==
 
== Tutorials, Manuals, etc. ==
* [https://courses.scinet.utoronto.ca SciNet education material]
+
* [https://support.scinet.utoronto.ca/education/browse.php SciNet education material]
 
* [https://www.youtube.com/c/SciNetHPCattheUniversityofToronto SciNet's YouTube channel]
 
* [https://www.youtube.com/c/SciNetHPCattheUniversityofToronto SciNet's YouTube channel]
 
* [[Modules specific to Niagara|Software Modules specific to Niagara]]  
 
* [[Modules specific to Niagara|Software Modules specific to Niagara]]  

Latest revision as of 21:16, 21 January 2020

System Status

Niagara HPSS SOSCIP GPU P8
Teach Jupyter Hub Scheduler File system
External Network Globus

Jan 21, 2020, 4:05PM: The was a partial power outage the took down a large amount of the compute nodes. If your job died during this period please resubmit.

Jan 13, 2020, 7:35 PM: Maintenance finished.

Jan 13, 2020, 8:20 AM: The announced maintenance downtime started (see below).

Jan 9 2020, 11:30 AM: External ssh connectivity restored, issue related to the university network.

Jan 9 2020, 9:24 AM: We received reports of users having trouble connecting into the SciNet data centre; we're investigating. Systems are up and running and jobs are fine.

As a work around, in the meantime, it appears to be possible to log into graham, cedar or beluga, and then ssh to niagara.

Downtime announcement: To prepare for the upcoming expansion of Niagara, there will be a one-day maintenance shutdown on January 13th 2020, starting at 8 am EST. There will be no access to Niagara, Mist, HPSS or teach, nor to their file systems during this time.

QuickStart Guides

Tutorials, Manuals, etc.