
Cluster graham.sharcnet.ca

Links System documentation in the SHARCNET Help Wiki

Manufacturer Huawei
Operating System CentOS 7
Interconnect EDR + FDR Infiniband
Total processors/cores 33448
Nodes
1‑800:
    Cores: 32 (2 sockets x 16 cores per socket)
    CPU: Intel E5-2683 v4 (Broadwell) @ 2.1 GHz
    Type: Compute
    Notes: Base profile compute nodes.
    Memory: 128.0 GB
    Local storage: 1.2 TB
801‑803:
    Cores: 56 (4 sockets x 14 cores per socket)
    CPU: Intel E7-4850 v3 (Haswell) @ 2.2 GHz
    Type: Compute
    Memory: 3072.0 GB
    Local storage: 1.2 TB
804‑827:
    Cores: 32 (2 sockets x 16 cores per socket)
    CPU: Intel E5-2683 v4 (Broadwell) @ 2.1 GHz
    Type: Compute
    Memory: 512.0 GB
    Local storage: 1.2 TB
828‑987:
    Cores: 32 (2 sockets x 16 cores per socket)
    CPU: Intel E5-2683 v4 (Broadwell) @ 2.1 GHz
    Type: Compute
    Notes: Accelerated compute nodes with 2 × NVIDIA Pascal P100 GPUs (12 GB HBM2 each).
    Memory: 128.0 GB
    Local storage: 800 TB
988‑1043:
    Cores: 32 (2 sockets x 16 cores per socket)
    CPU: Intel E5-2683 v4 (Broadwell) @ 2.1 GHz
    Type: Compute
    Notes: Cloud configuration.
    Memory: 256.0 GB
    Local storage: 1.2 TB
Total attached storage 14500 TB
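
The total of 33448 cores quoted above can be cross-checked against the node ranges in the listing. The following is a minimal sketch of that arithmetic; the node ranges and per-node core counts are transcribed from this page, while the names `NODE_GROUPS`, `total_nodes`, and `total_cores` are illustrative only and do not correspond to any SHARCNET tool.

```python
# Node groups transcribed from the listing above:
# (first node number, last node number, cores per node).
# The names used here are illustrative, not part of any SHARCNET tooling.
NODE_GROUPS = [
    (1, 800, 32),     # base compute nodes
    (801, 803, 56),   # large-memory nodes (3072 GB)
    (804, 827, 32),   # 512 GB nodes
    (828, 987, 32),   # GPU nodes (2 x P100)
    (988, 1043, 32),  # cloud configuration
]


def total_nodes(groups):
    """Count nodes across all groups (ranges are inclusive)."""
    return sum(last - first + 1 for first, last, _ in groups)


def total_cores(groups):
    """Sum cores over all groups: nodes in each range times cores per node."""
    return sum((last - first + 1) * cores for first, last, cores in groups)


if __name__ == "__main__":
    print("nodes:", total_nodes(NODE_GROUPS))
    print("cores:", total_cores(NODE_GROUPS))
```

Under these figures the per-group products sum to the 33448 cores listed for the cluster, spread over 1043 nodes.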
Suitable use

Heterogeneous cluster, suitable for a variety of workloads.

Software available

ADF/BAND, STAR-CCM+, HDF, AMBER, BIOPERL, MAP, BLAST, DDT, PETSC_SLEPC, DAR, LSDYNA, ANSYS, ESPRESSO, MATLAB, COMSOL, GAUSSIAN, INTEL


Recent System Notices

Date / Status Notes
Jan 11 2019, 09:32AM
(about 1 month ago)

System is fully operational

Oct 02 2018, 02:18PM
(5 months ago)

Starting Tuesday, October 9th, 2018 at 10 p.m. ET, the Graham cluster will be unavailable to all users and running jobs will be terminated. This outage is required because of electrical work being done by the regional utility, which will affect half of the Waterloo campus. We will take advantage of this unexpected downtime to update the cluster and improve its stability, performance, and overall security.

The cluster is expected to reopen on Thursday, October 11th.

Sep 28 2018, 08:23AM
(5 months ago)

The patching of the project file system went well and the cluster has been reopened.

For more details: http://status.computecanada.ca/

Sep 26 2018, 12:32PM
(5 months ago)

Starting Thursday, September 27, 2018 at 8 a.m. ET, the Graham cluster will be unavailable to all users and running jobs will be terminated. During this outage we will be patching and upgrading the /project file system. This is required to properly clean up after the recent file system issue and to help prevent a recurrence.

Graham will reopen to users by 8 a.m. ET on Friday, September 28.

For more details: http://status.computecanada.ca/

Sep 07 2018, 03:05PM
(5 months ago)

Graham returned to service on Sept 5, with a problem that affects some files on the /project filesystem. We expect to have all the affected files restored to normal in about a week.

For more details: http://status.computecanada.ca/
