Main Page

From SciNet Users Documentation
Revision as of 18:21, 28 February 2024 by Chen (talk | contribs) (→‎System Status)
Jump to navigation Jump to search

System Status

Niagara Mist Teach Rouge
Jupyter Hub Scheduler File system Burst Buffer
HPSS Login Nodes External Network Globus
Balam CCEnv

February 28, 2024, 1:00 PM EDT: A loop pump fault caused many compute nodes overheat. If you jobs failed around this time, please resubmit and report issues to support@scinet.utoronto.ca.

February 22, 2024, 5:45 PM EDT: Maintenance finished and system restored. Please report issues to support@scinet.utoronto.ca.

February 21, 2024, 7:00 AM EDT: Maintenance starting. Niagara login nodes and the file system are kept up as much as possible, but will be rebooted at some point.

February 20, 2024, 3:45 PM EDT: Cooling tower has been restored, all systems are in production.

February 20, 2024, 1:30 AM EDT: Cooling tower malfunction, all compute nodes are shutdown, the root cause will be addressed earliest in the morning.

February 21 and 22, 2024: SciNet Data Centre Maintenance:
This annual winter maintenance involves a full data centre shutdown starting at 7:00 am EST on Wednesday, February 21st. None of the SciNet systems (Niagara, Mist, Rouge, Teach, the file systems, as well as hosted equipment) will be accessible. All systems should be fully available again in the last afternoon of the 22nd.

The scheduler will hold jobs that cannot finish before the start of the shutdown. Users are encouraged to submit small and short jobs that can take advantage of this, as the scheduler may be able to fit these jobs in before the maintenance on otherwise idle nodes.

Previous messages

QuickStart Guides

Tutorials, Manuals, etc.