TCE error ival=4 problem


Click here for full thread
Forum Vet
Looks like your running on an Infiniband cluster. Setting the following in your job script may help:

  setenv ARMCI_DEFAULT_SHMMAX 2048
unsetenv MA_USE_ARMCI_MEM

If the output from your job below was run on 80 processors, then it is running slow. Ran it on our cluster at 40 seconds a step for the Lambda iterations...

Bert



Quote:Bert Oct 6th 11:44 pm
Sven,

I was able to run it successfully with your input deck.

What kind of hardware are you running on?

Do you set ARMCI_DEFAULT_SHMMAX in your job script or environment?

Do you set MA_USE_ARMCI_MEM in your job script or environment?

Bert


Quote: Oct 6th 9:39 am
Mhh, this is strange it is maybe a compilation problem. Unfortunately, I do not know how the program is compiled and what libraries are used.
The rest of the error message is

CCSD Lambda iterations
---------------------------------------------
Iter Residuum Cpu Wall
---------------------------------------------
1 65.6301924841714 407.4 417.5
0: error ival=4
(rank:0 hostname:u4n075 pid:24296):ARMCI DASSERT fail. openib.c:armci_call_data_server():2010 cond:(pdscr->status==IBV_WC_SUCCESS)
4: error ival=4
(rank:4 hostname:u4n053 pid:29430):ARMCI DASSERT fail. openib.c:armci_call_data_server():2010 cond:(pdscr->status==IBV_WC_SUCCESS)

Please let me know if you are able to run the calculation with io ga. If so I think I have to contact the admin of the cluster.

Sven