Difference between revisions of "Main Page"

From SciNet Users Documentation
m (System Status)
(82 intermediate revisions by 8 users not shown)
Line 8: Line 8:
 
{|style="width:100%"  
 
{|style="width:100%"  
 
|{{Up|Niagara|Niagara_Quickstart}}
 
|{{Up|Niagara|Niagara_Quickstart}}
|{{Up|HPSS|HPSS}}
+
|{{up|HPSS|HPSS}}
|{{Up|SOSCIP GPU|SOSCIP_GPU}}
 
 
|{{Up|Mist|Mist}}
 
|{{Up|Mist|Mist}}
 +
|{{Up|Teach|Teach}}
 
|-
 
|-
|{{Up|Teach|Teach}}
 
 
|{{Up|Jupyter Hub|Jupyter_Hub}}
 
|{{Up|Jupyter Hub|Jupyter_Hub}}
 
|{{Up|Scheduler|Niagara_Quickstart#Submitting_jobs}}
 
|{{Up|Scheduler|Niagara_Quickstart#Submitting_jobs}}
 
|{{Up|File system|Niagara_Quickstart#Storage_and_quotas}}
 
|{{Up|File system|Niagara_Quickstart#Storage_and_quotas}}
 +
|{{Up|Burst Buffer|Burst_Buffer}}
 
|-
 
|-
 
|{{Up|Login Nodes|Niagara_Quickstart#Logging_in}}  
 
|{{Up|Login Nodes|Niagara_Quickstart#Logging_in}}  
Line 21: Line 21:
 
|{{Up|Globus|Globus}}
 
|{{Up|Globus|Globus}}
 
|}
 
|}
 +
 
<!-- Current Messages: -->
 
<!-- Current Messages: -->
 +
<b>July 30, 2020, 9:00 AM:</b> Project backup in progress but incomplete. After we deployed the new, larger storage appliance for scratch and project two months ago, we started a full backup of project (1.5 PB). This backup is taking a while to complete, and a few areas have not yet been fully backed up. Please be careful not to delete anything from project that you still need, particularly recently added material.
  
<b> March 7, 2020, 10:15 PM:</b> File system issues have been cleared.
+
<b>July 27, 2020, 5:00 PM:</b> Scheduler issues resolved.
 
 
<b> March 6, 2020, 7:30 PM:</b> File system issues. We are investigating.
 
 
 
<b> March 2, 2020, 1:30 PM:</b> For the extension of Niagara, the operating system on all Niagara nodes has been upgraded
 
from CentOS 7.4 to 7.6.  This required all
 
nodes to be rebooted. Running compute jobs are allowed to finish
 
before the compute node gets rebooted. Login nodes have all been rebooted, as have the datamover nodes and the jupyterhub service.
 
 
 
<b> Feb 24, 2020, 1:30PM: </b> The [[Mist]] login node got rebooted.  It is back, but we are still monitoring the situation.
 
 
 
<b> Feb 12, 2020, 11:00AM: </b> The [[Mist]] GPU cluster is now available to users.
 
 
 
<b> Feb 11, 2020, 2:00PM: </b> The Niagara compute nodes were accidentally rebooted, killing all running jobs.
 
 
 
<b> Feb 10, 2020, 7:00PM: </b> HPSS is back to normal.
 
  
<b> Jan 30, 2020, 12:01PM: </b> We are having an issue with HPSS, in which the disk-cache is full. We put a reservation on the whole system (Globus, plus archive and vfs queues), until it has had a chance to clear some space on the cache.
+
<b>July 27, 2020, 3:00 PM:</b> Scheduler issues. We are investigating.
  
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
 
<!--  When removing system status entries, please archive them to: https://docs.scinet.utoronto.ca/index.php/Previous_messages -->
Line 63: Line 50:
 
* [[Burst Buffer]]
 
* [[Burst Buffer]]
 
* [[SSH Tunneling]]
 
* [[SSH Tunneling]]
 +
* [[SSH#Two-Factor_authentication|Two-Factor Authentication]]
 
* [[Visualization]]
 
* [[Visualization]]
 
* [[Running Serial Jobs on Niagara]]
 
* [[Running Serial Jobs on Niagara]]
 
* [[Jupyter Hub]]
 
* [[Jupyter Hub]]
 
|}
 
|}

Revision as of 20:23, 30 July 2020

System Status

Niagara | HPSS | Mist | Teach
Jupyter Hub | Scheduler | File system | Burst Buffer
Login Nodes | External Network | Globus

July 30, 2020, 9:00 AM: Project backup in progress but incomplete. After we deployed the new, larger storage appliance for scratch and project two months ago, we started a full backup of project (1.5 PB). This backup is taking a while to complete, and a few areas have not yet been fully backed up. Please be careful not to delete anything from project that you still need, particularly recently added material.

July 27, 2020, 5:00 PM: Scheduler issues resolved.

July 27, 2020, 3:00 PM: Scheduler issues. We are investigating.

QuickStart Guides

Tutorials, Manuals, etc.