4:57:05 PM PST - Wed, Jan 9th 2013 |
|
I found that I can run on more than 4 nodes if I only use 1 or 2 of the processors on each node (i.e 24 nodes x 1 processor per node). Also, if I set ARMCI_DEFAULT_SHMMAX (i.e. 4084) or even install with a different DDFLT_TOT_MEM value the number of nodes/processors I can use changes, but I can't seem to eliminate the problem altogether.
If I set DDFLT_TOT_MEM to 16777216, I can use 2 nodes at best. If I compiled with DDFLT_TOT_MEM=259738112 (tried larger values), the value I obtained from running the getmem.nwchem script, I can use ~4 nodes and all the processors. I best that I have been able to to is 6 nodes x 16 processors per node. I have the same problem when running DFT, HF, or MP2.
I guess it is some memory allocation problem, but I have no idea how to fix it... I want to run very large jobs 50+ nodes.
Suggestions are welcome,
Thanks
|
|