NWChem 6.1.1 CCSD(T) parallel running


Click here for full thread
Forum Vet
Diego
I have managed to get this input working on a Infiniband cluster using NWChem 6.3.
Here is some details of what I have done on a run using 224 processors(16 processors on each one of the 14 nodes)

1) Increased global memory input line to 1.6GB
memory global 1600 mb heap 100 mb stack 600 mb

2) Set ARMCI_DEFAULT_SHMMAX=8192

3) You need to have the system administrators to modify some of the kernel driver options for your Infiniband Hardware
Here are some webpages related to this very topic
http://www.open-mpi.org/faq/?category=openfabrics#ib-low-reg-mem
http://community.mellanox.com/docs/DOC-1120

In my case, the cluster I am using has the following parameter for the mlx4_core driver (but older
hardware might require different setting, as mentioned in the two webpages above)
log_num_mtt=20
log_mtts_per_seg=4

Cheers, Edo