memory problem in parallel running "ARMCI DASSERT fail"


Click here for full thread
Clicked A Few Times
 argument  1 = fh2.nw
(rank:12 hostname:compute-11-3.local pid:1523):ARMCI DASSERT fail. ../../ga-5-1/armci/src/devices/openib/openib.c:armci_server_register_region():1124 cond:(memhdl->memhndl!=((void *)0))
Last System Error Message from Task 12:: Cannot allocate memory
application called MPI_Abort(comm=0x84000003, 1) - process 12
(rank:0 hostname:compute-11-32.local pid:4764):ARMCI DASSERT fail. ../../ga-5-1/armci/src/devices/openib/openib.c:armci_server_register_region():1124 cond:(memhdl->memhndl!=((void *)0))
Last System Error Message from Task 0:: Cannot allocate memory
rank 12 in job 2  i11-32_41208   caused collective abort of all ranks
  exit status of rank 12: killed by signal 9 
[5:i11-32] unexpected disconnect completion event from [12:i11-3]
Assertion failed in file ../../dapl_conn_rc.c at line 1128: 0
internal ABORT - process 5


Thanks!


Quote:Edoapra Nov 5th 9:54 am
What is the error when you set ARMCI_DEFAULT_SHMMAX=8192 ?
Thanks, Edo