GPU server issues june 16

vbironchef
Legit Extremist
Posts: 2301
Joined: Tue Mar 27, 2007 3:35 pm

Post by vbironchef »

June 16, 2009
GPU server issues
We've had a rough night with the GPU servers. One was down for most of yesterday (it crashed hard and now can't find its / partition; the admins are attempting a rescue-disk fsck this morning). Two more went down last night (PST) under heavy load, but those were easy to bring back and are up now.



We are stretched a bit thin because we are implementing the new server infrastructure in parallel with the old one. The upshot is that once the new infrastructure is deployed, we will have much more functional collection servers (CSs), as well as work servers (WSs) that should not need to be restarted nearly as often under heavy load.


We are beginning the rollout of the new WS (v5) code onto the GPU servers this week, although these issues have slowed us down a bit.



What is WS(v5) code?
Darkstar
Legit Extremist
Posts: 1910
Joined: Thu Feb 01, 2007 12:24 pm
Location: San Diego

Re: GPU server issues june 16

Post by Darkstar »

My guess would be that WS stands for Work Server, since they seem to be talking about the new server infrastructure.

:drinkers:
Phenom II 1075T, Phenom II 1090T, Intel i7 870
Gigabyte 890XA-UD3
EVGA GTX 460
8 GB Corsair
Agility2 120GB SSD
Dual 24" Samsung LCDs