[Lilab] Fwd: UCR HPCC Nov 2 Outage Update

Wei Vivian Li weil at ucr.edu
Sun Nov 3 09:23:47 PST 2024


FYI. If you are using hpcc in the past few days, please check if your jobs
were impacted.

Vivian


-------------------------------
Wei Vivian Li

Assistant Professor
Department of Statistics
University of California, Riverside
vivianli.org



---------- Forwarded message ---------
From: HPCC_noreply <no-reply at hpcc.ucr.edu>
Date: Sun, Nov 3, 2024 at 6:53 AM
Subject: UCR HPCC Nov 2 Outage Update
To: <weil at ucr.edu>


UCR HPCC Nov 2 Outage Update
[image: UCR High Performance Computing Center]
------------------------------

When

Saturday, Nov 2 2024 ~10AM - Ongoing

Reason

An air conditioning unit in the UCR HPCC server room suffered a critical
failure. The high room temperatures caused our battery backup system to
overheat, and in turn, cut power to our systems. At this time, we believe
the battery backup system did not sustain any permanent damage, and power
has been fully restored. However, the failed AC unit will need to undergo
extensive repairs, beginning Monday at the earliest. The HPCC cluster can
not operate at full capacity until both of our AC units are functional.

What to Expect

As of this e-mail, you may log into the cluster with SSH to retrieve your
files and do some light work. Although the Slurm scheduler is online, all
compute nodes are still offline.

A limited number of compute nodes may come online on Sunday, depending on
sysadmin discretion and cooling constraints.

We apologize for the disruption to your research and teaching workflows.
Thanks for your understanding.
------------------------------

Contacts

Austin Leong
Sr. HPC Systems Administrator
austin.leong at ucr.edu

Emerson Jacobson
HPC Systems Administrator
emerson.jacobson at ucr.edu

Thomas Girke
Professor
Director of HPC Center
thomas.girke at ucr.edu
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.ucr.edu/pipermail/lilab/attachments/20241103/ae843327/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: noname
Type: image/png
Size: 19073 bytes
Desc: not available
URL: <https://lists.ucr.edu/pipermail/lilab/attachments/20241103/ae843327/attachment-0001.png>


More information about the LiLab mailing list