Difference between revisions of "Main Page"

From SciNet Users Documentation
 
(885 intermediate revisions by 16 users not shown)
Line 5: Line 5:
 
==System Status==
 
==System Status==
  
<!-- Use "Up" or "Down"; these are templates. "Up2" and "Down2" allow for external references. -->
+
<!-- Use "Up", "Partial" or "Down"; these are templates. -->
 
+
{|style="width:100%"  
{|style="width:65%"  
+
|{{Up   |Niagara|Niagara_Quickstart}}
|style="width:10%"|{{Up|Niagara|Niagara_Quickstart}}
+
|{{Up  |Mist|Mist}}
|style="width:10%"|{{Up|HPSS|HPSS}}
+
|{{Up  |Teach|Teach}}
|style="width:10%"|{{Up2|BGQ|https://wiki.scinet.utoronto.ca/wiki/index.php/BGQ}}
+
|{{Up   |Rouge|Rouge}}
|style="width:10%"|{{Up2|SGC|https://wiki.scinet.utoronto.ca/wiki/index.php/SOSCIP_GPU}}
+
|-
 +
|{{Up  |Jupyter Hub|Jupyter_Hub}}
 +
|{{Up  |Scheduler|Niagara_Quickstart#Submitting_jobs}}
 +
|{{Up  |File system|Niagara_Quickstart#Storage_and_quotas}}
 +
|{{Up  |Burst Buffer|Burst_Buffer}}
 
|-
 
|-
|style="width:10%"|{{Up2|P7|https://wiki.scinet.utoronto.ca/wiki/index.php/P7_Linux_Cluster}}
+
|{{Up  |HPSS|HPSS}}
|style="width:10%"|{{Up2|P8|https://wiki.scinet.utoronto.ca/wiki/index.php/P8}}
+
|{{Up  |Login Nodes|Niagara_Quickstart#Logging_in}}  
|style="width:10%"|{{Up|Scheduler|Niagara_Quickstart#Submitting_jobs}}
+
|{{Up   |External Network|Niagara_Quickstart#Logging_in}}  
|style="width:10%"|{{Up|External Network|External Network}}
+
|{{Up   |Globus |Globus}}
 
|}
 
|}
  
Current Messages:
+
'''Mon Jun 5, 2023, 2:35 PM EDT:''' All systems are operational again.
 +
 
 +
'''Mon Jun 5, 2023, 11:55 AM EDT:''' There were issues with the cooling system.  The login nodes and file systems are now accessible again, but compute nodes are still off.
  
<!--  When removing system status entries, please archive them to:    https://docs.scinethpc.ca/wiki/index.php/Previous_messages -->
+
'''Mon Jun 5, 2023, 6:55 AM EDT:''' Issues at the data center; we are investigating.
* May 4, 2018: [[HPSS]] is now operational on Niagara.
 
* May 3, 2018: [[Burst Buffer]] is available upon request.
 
* May 3, 2018: The [https://docs.computecanada.ca/wiki/Globus Globus] endpoint for Niagara is available: computecanada#niagara.
 
* May 1, 2018: System status moved here.
 
* April 10, 2018: Niagara commissioned.
 
  
|}
+
'''Sat May 27, 2023, 9:00 PM EDT:''' We have been able to mitigate the UPS issue for now, until new parts arrive sometime during the week. The system will be accessible soon.
 +
 
 +
'''Sat May 27, 2023, 4:00 PM EDT:''' We identified a UPS/power-related issue at the data center that is adversely affecting several components, in particular all file systems. Out of an abundance of caution, we are shutting down the cluster until the UPS situation is resolved. Ongoing jobs will be canceled.
 +
 
 +
'''Sat May 27, 2023, 11:18 AM EDT:''' File system issues; we are investigating.
 +
 
 +
<!--  When removing system status entries, please archive them to: -->
 +
[[Previous messages]]
  
{|style="border-spacing: 10px;width: 95%"
+
{|style="border-spacing: 10px;width: 100%"
 
|valign="top" style="margin: 1em; padding:1em; padding-top:.1em; border:2px solid #000; background-color:#fff; border-radius:7px; width: 49.5%" |
 
|valign="top" style="margin: 1em; padding:1em; padding-top:.1em; border:2px solid #000; background-color:#fff; border-radius:7px; width: 49.5%" |
  
 
== QuickStart Guides ==
 
== QuickStart Guides ==
* [[Niagara_Quickstart|Niagara cluster for large parallel jobs]]
+
* [[Niagara Quickstart]]
 
* [[HPSS | HPSS archival storage]]
 
* [[HPSS | HPSS archival storage]]
* [https://wiki.scinet.utoronto.ca/wiki/index.php/BGQ SOSCIP BlueGene/Q cluster]
+
* [[Mist| Mist Power 9 GPU cluster]]
* [https://wiki.scinet.utoronto.ca/wiki/index.php/SOSCIP_GPU SOSCIP GPU cluster]
+
* [[Teach|Teach cluster]]
* [https://wiki.scinet.utoronto.ca/wiki/index.php/P7_Linux_Cluster Experimental Power 7 cluster]
 
* [https://wiki.scinet.utoronto.ca/wiki/index.php/P8 Experimental Power 8 GPU cluster]
 
 
* [[FAQ | FAQ (frequently asked questions)]]
 
* [[FAQ | FAQ (frequently asked questions)]]
* [[Acknowledging_SciNet | Acknowledging SciNet]]
+
* [[Acknowledging SciNet]]
 
| valign="top" style="margin: 1em; padding:1em; padding-top:.1em; border:2px solid #000; background-color:#fff; border-radius:7px; width: 49.5%" |
 
| valign="top" style="margin: 1em; padding:1em; padding-top:.1em; border:2px solid #000; background-color:#fff; border-radius:7px; width: 49.5%" |
  
 
== Tutorials, Manuals, etc. ==
 
== Tutorials, Manuals, etc. ==
* [https://courses.scinet.utoronto.ca SciNet education material]
+
* [https://education.scinet.utoronto.ca SciNet education material]
* [https://www.youtube.com/channel/UC42CaO-AAQhwqa8RGzE3daQ SciNet's YouTube channel]
+
* [https://www.youtube.com/c/SciNetHPCattheUniversityofToronto SciNet's YouTube channel]
* [[Modules specific to Niagara]]  
+
* [[Modules specific to Niagara|Software Modules specific to Niagara]]
 +
* [[Modules for Mist]]
 +
* [[Commercial software]]
 
* [[Burst Buffer]]
 
* [[Burst Buffer]]
 +
* [[SSH#SSH Keys|SSH keys]]
 
* [[SSH Tunneling]]
 
* [[SSH Tunneling]]
 +
* [[SSH#Two-Factor_authentication|Two-Factor Authentication]]
 
* [[Visualization]]
 
* [[Visualization]]
 +
* [[Running Serial Jobs on Niagara]]
 +
* [[Jupyter Hub]]
 
|}
 
|}

Latest revision as of 18:38, 5 June 2023

System Status

Niagara Mist Teach Rouge
Jupyter Hub Scheduler File system Burst Buffer
HPSS Login Nodes External Network Globus

Mon Jun 5, 2023, 2:35 PM EDT: All systems are operational again.

Mon Jun 5, 2023, 11:55 AM EDT: There were issues with the cooling system. The login nodes and file systems are now accessible again, but compute nodes are still off.

Mon Jun 5, 2023, 6:55 AM EDT: Issues at the data center; we are investigating.

Sat May 27, 2023, 9:00 PM EDT: We have been able to mitigate the UPS issue for now, until new parts arrive sometime during the week. The system will be accessible soon.

Sat May 27, 2023, 4:00 PM EDT: We identified a UPS/power-related issue at the data center that is adversely affecting several components, in particular all file systems. Out of an abundance of caution, we are shutting down the cluster until the UPS situation is resolved. Ongoing jobs will be canceled.

Sat May 27, 2023, 11:18 AM EDT: File system issues; we are investigating.

Previous messages

QuickStart Guides

Tutorials, Manuals, etc.