The err--the parallel runing by compiled version from source and binary


Clicked A Few Times
Hello, everyone

I run the binary version of Nwchem by parallel execute when I set .nwchem file.
../../bin/parallel ../../bin/nwchem /home/li/workdir/nwchem6.0/examples/tcepolar/ccsdt_polar_small.nw


The err happens as below:

It is the same problem with the compiled version from the source of NWChem.



============================================================
sock=0, pid=1, nsent=-1, len=56
 1: snd_remote: writing message to socket 199999999 (0xbebc1ff).
1: snd_remote: writing message to socket 199999999 (0xbebc1ff).
system error message: Bad file descriptor
0:Child process terminated prematurely, status=: 256
(rank:0 hostname:localhost.localdomain pid:16914):ARMCI DASSERT fail. signaltrap.c:SigChldHandler():167 cond:0
Last System Error Message from Task 0:: Inappropriate ioctl for device
 0: ARMCI aborting 0 (0).
0: ARMCI aborting 0 (0).
system error message: Illegal seek
 2: interrupt(1)
WaitAll: No children or error in wait?

=============================================================

what is wrong with it ? How do I solve it?

Do I need install TCGMSG again when I install Nwchem from the source?

My system is redhat 5.2 and SUSE 11.0

Thanks

javacfish

Forum Vet
Quote:Javacfish Nov 12th 4:04 am
Hello, everyone

I run the binary version of Nwchem by parallel execute when I set .nwchem file.
../../bin/parallel ../../bin/nwchem /home/li/workdir/nwchem6.0/examples/tcepolar/ccsdt_polar_small.nw


The err happens as below:

It is the same problem with the compiled version from the source of NWChem.



============================================================
sock=0, pid=1, nsent=-1, len=56
 1: snd_remote: writing message to socket 199999999 (0xbebc1ff).
1: snd_remote: writing message to socket 199999999 (0xbebc1ff).
system error message: Bad file descriptor
0:Child process terminated prematurely, status=: 256
(rank:0 hostname:localhost.localdomain pid:16914):ARMCI DASSERT fail. signaltrap.c:SigChldHandler():167 cond:0
Last System Error Message from Task 0:: Inappropriate ioctl for device
 0: ARMCI aborting 0 (0).
0: ARMCI aborting 0 (0).
system error message: Illegal seek
 2: interrupt(1)
WaitAll: No children or error in wait?

=============================================================

what is wrong with it ? How do I solve it?

Do I need install TCGMSG again when I install Nwchem from the source?

My system is redhat 5.2 and SUSE 11.0

Thanks

javacfish


Please read http://nwchemgit.github.io/index.php/Running#Parallel_execution_on_UNIX-based_parallel_machi... on properly running with the parallel command. The send argument is NOT the binary, but rather the nwchem.p file that you have defined.

Bert


Forum >> NWChem's corner >> Compiling NWChem