ECCE 6.3 apps won't start


Jump to page 1Prev 162Next 16Last
Gets Around
Mike,

Have you successfully run a previous version (before 6.3) of the ECCE Builder? If you have the GL test application glxgears on your system, I'd try running that. It should be in /usr/bin if you have it. That would tell you whether it's an ECCE-related issue or a generic OpenGL issue you are having. You can do an "ldd /usr/bin/glxgears" to make sure it is using the exact same GL libraries as ECCE.

Do you get any terminal output at all from running ebuilder (when the ldd command isn't there)? Does it give a "Segmentation fault" message or anything? Edit the ebuilder script again, duplicate the last line that invokes builder, comment out one of the duplicate lines with a "#" sign, and then modify the other by removing removing the first "|&" and everything after it (that logic is used to filter out some annoying warnings that normally don't indicate an actual problem). Send me the output from that.

I do agree that building from source code is a reasonable next step. Another option is trying to get the 32-bit distribution of ECCE working on your 64-bit cluster. While doing all the source code packaging for the ECCE 6.3 release I did see that there are 64-bit vs. 32-bit OpenGL issues so I wouldn't be surprised if the 32-bit distribution worked. More than likely you already have the necessary 32-bit compatibility libraries on your system (definitely you do if you ran ECCE prior to 6.3 on this cluster). The most likely ones you don't have would again be the GL libraries. But, you can install those with yum (package names are mesa-libGL.i386 and mesa-libGLU.i386). The only thing you lose by going with a 32-bit distribution is a bit of speed perhaps in some areas like Viewer MO calculations. Finally, if you try the 32-bit distribution and it doesn't work for you, you can modify the $ECCE_HOME/siteconfig/site_runtime file to specify that the OpenGL libraries shipped with ECCE be used instead of the local one you installed (search for "MESA" in site_runtime and comment out the two lines setting the related variables).

Gary

Clicked A Few Times
Gary,

This is the first time we've tried to use ECCE on this cluster. We have ECCE 6.0 on another cluster, but I see it's the 32-bit version. I ran glxgears and it works fine, it's using the same GL libraries as ECCE.

If I run ebuilder I do get a segmentation fault. I made the edits you suggested to the ebuilder script such that the last line looks like "./builder -standalone $file". When I run it, again all I get is segmentation fault.

I will try the 32-bit version.

Mike

Clicked A Few Times
Gary,

The 32-bit version works! Apparently there is a problem with using 64-bit OpenGL.

Mike

Gets Around
Mike,

Did you try glxgears? If that didn't work for you, then yes you wouldn't be able to get 64-bit ECCE to work without resolving the issue so that glxgears would run. If you want I can provide you with the 64-bit OpenGL libraries that I (successfully) use when running ECCE on RHEL 5.8 and tell you where to put them so your 64-bit distribution of ECCE uses those. Or, you can stick with the 32-bit distribution since that obviously didn't take much work to get going.

Gary

Clicked A Few Times
Gary,

I did run glxgears and it did work, so I think my OpenGL libraries are ok. I'm fine to stick with the 32-bit version right now since apparently that's what we've used up to now anyway.

Mike

Gets Around
Yes, ECCE 6.3 is the first time we've ever had a native 64-bit distribution (applications). Since all the work was done to be able to build 64-bit applications from the ECCE source code distribution there was no reason not to distribute a 64-bit binary distribution to save people time/effort if they didn't already have 32-bit compatibility libraries installed, 32-bit OpenGL, etc.

Gets Around
Hi Mike,

I went ahead and created a new ECCE 64-bit distribution in our normal download area that bunldes the Mesa OpenGL 64-bit libraries used on my RHEL 5.8 build. It would be great if you could try out that version to see if it also fixes your problem like the 32-bit ECCE distribution does.

You'll need to make one minor change though in order for ECCE to use those bundled libraries instead of your /usr/lib64 ones. After installing the new download and before starting ebuilder or ecce, edit the $ECCE_HOME/siteconfig/site_runtime file. Search for "ECCE_MESA_EXCEPT" and comment that line out by adding a "#" sign in the first column. Then try starting ebuilder and let me know if it works for you. Of course I wouldn't remove the 32-bit install of ECCE right away since that one is known to work.

Thanks,
Gary

Clicked A Few Times
Gary,

Thanks, I think I will try that. We're seeing a problem with the 32-bit version working from PC clients that logon to the cluster using X-win32 for X-windows support. An older version of X-win32 works, but newer versions don't. We think they may not be handling the 32-bit version. It works fine from Linux workstations.

So I'll try the new 64-bit version and let you know.

Thanks,
Mike

Gets Around
Hmmm, that seems odd. Worth trying the 64-bit version, but I'm not up on these X Windows emulation packages to offer any guidance.

Gary

Clicked A Few Times
Gary,

I tried the new 64-bit version, however when I try to run ebuilder I get:
builder: error while loading shared libraries: libnvidia-tls.so.290.10: cannot open shared object file: No such file or directory

I don't find this file anywhere in the ecce installation.

Mike

Gets Around
Mike,

I unknowingly packaged up the NVidia specific version of libGL.so that I run on my 64-bit RHEL 5.8 workstation. I see two possible fixes. One is that I go back to using software-only OpenGL on my workstation and the other is packaging the libnvidia libraries in a new 64-bit ECCE distribution and seeing if that works for you.

I'm going to try the second approach initially because I prefer to keep using the NVidia drivers if I can. I just updated the distributions that can be downloaded. Download the latest 64-bit one and try again. I'll cross my fingers, but I think there's a good chance this won't work without having the required graphics card. It's worth a try though. Thanks for hanging in there.

Gary

Gets Around
Mike,

I just tried building ECCE with the "software-only" OpenGL libs available via yum (no NVidia linkages). That included recompiling all of the ECCE viz-related code using these versions of the libraries and include files. Unfortunately I'm getting a segmentation fault. That means that packaging up the NVidia shared libraries as I have done is the only solution I'll have for you.

Let me know if that version works. If it doesn't then you'll need to go back to the 32-bit distribution of ECCE and try to figure out the issue with the X-win32 application. I'll also remove those OpenGL libraries completely from the 64-bit ECCE binary distribution because they wouldn't work for anyone.

Gary

Gets Around
Hi Mike,

If you can't get the latest 64-bit ECCE binary distribution to work I think it's finally time to try to build ECCE from the source code distribution on your cluster. I'm pretty certain that will resolve your issue because it will insure consistency between all the libraries that are used. I'm not at all sure though whether it will fix your X-win32 issue. Do let me know though how it goes with trying the latest ECCE 64-bit distribution. You can also just install and try the standalone builder distribution since I updated tha as well and the OpenGL problems you are having will show up in the builder alone. If it doesn't work for you I'm going to remove all the Mesa OpenGL libraries I'm currently packaging with the binary distribution because I know they won't work for others either. Finally, don't forget when you install the latest ECCE distribution to edit the $ECCE_HOME/siteconfig/site_runtime file and comment out the ECCE_MESA_EXCEPT setting. Otherwise it will fall back to using the your local OpenGL libraries rather than the ones bundled with ECCE.

Thanks,
Gary

Clicked A Few Times
Gary,

The builder application starts with the last updated 64-bit distribution you made. I do get the message:
"Xlib: extension "NV-GLX" missing on display ":11.0".", but the application comes up anyway. I'm having one of the users try it out to see if it works ok for them.

Mike

Clicked A Few Times
Gary,

The user is still having the X-win32 problem. So I guess I'll try the source build. I'm not sure if that will fix it, but if it doesn't, at least I'll know I have to get the X-win32 vendor to fix something on their end.

Mike

Clicked A Few Times
Gary,

After building ecce from source, I'm back to my original problem with the first 64-bit install, ebuilder seg-faults. The only working version has been the 32-bit version, albeit with the X-win32 issue. I guess I'll have to address that issue and use the 32-bit version.

Mike


Forum >> ECCE: Extensible Computational Chemistry Environment >> General ECCE Topics
Jump to page 1Prev 162Next 16Last