low performance on HPC cluster (sge) when running multiple jobs
Posted
by
Yotam
on Super User
See other posts from Super User
or by Yotam
Published on 2011-11-24T13:57:29Z
Indexed on
2012/06/24
9:18 UTC
Read the original article
Hit count: 304
O know this is a long-shot but I'm clueless here. I'm running several computer simulations on High Performance Computation cluster (HPC) of oracale grid engine (sge). A single job runs at a certain speed (roughly 80 steps per second) when I add jobs to the machine, at a certain treshhold, the speed is recuded by two. On one machine (I don't know the cpu kind) the treshold is 11 jobs for 16 cpu's. On another one with the same number and kind of cpu's , the treshold is 8.
I thought at first that this is a memory issue but each job takes about 60MB - 100MB and I have 16GB of ram on each of those machine.
Did any of you encountered such a problem? is there any way to analyz this?
Thanks.
© Super User or respective owner