low performance on HPC cluster (sge) when running multiple jobs
- by Yotam
O know this is a long-shot but I'm clueless here. I'm running several computer simulations on High Performance Computation cluster (HPC) of oracale grid engine (sge). A single job runs at a certain speed (roughly 80 steps per second) when I add jobs to the machine, at a certain treshhold, the speed is recuded by two.
On one machine (I don't know the cpu kind) the treshold is 11 jobs for 16 cpu's. On another one with the same number and kind of cpu's , the treshold is 8.
I thought at first that this is a memory issue but each job takes about 60MB - 100MB and I have 16GB of ram on each of those machine.
Did any of you encountered such a problem? is there any way to analyz this?
Thanks.