Problems and Solutions on SPL Machine Blog
Current Problems on Debugging for SCIRun on SPL Machines
With the install of Fedora 7 we have been having some problems, namely with the OpenGL driver recognition by SCIRun.
For example when we utilize the tools on the previous page SCIRun gets pointed at the correct drivers as shown here:
spl_tm64_1:/workspace/mjolley/Modeling/trunk/SCIRun/bin% ldd scirun | grep GL
libGL.so.1 => /usr/lib64/nvidia/libGL.so.1 (0x0000003f8e200000) libGLU.so.1 => /usr/lib64/libGLU.so.1 (0x000000360c200000) libGLcore.so.1 => /usr/lib64/nvidia/libGLcore.so.1 (0x0000003f80e00000)
When these drivers are recognized SCIRun runs as expected. However, each time SCIRun is run it reverts back to the wrong drivers as evidenced here.
spl_tm64_1:/workspace/mjolley/Modeling/trunk/SCIRun/bin% ldd scirun | grep GL
libGL.so.1 => /usr/lib64/libGL.so.1 (0x0000003f84800000) libGLU.so.1 => /usr/lib64/libGLU.so.1 (0x000000360c200000)
After this, any OpenGL dependent modules crash upon opening. If you repeat the steps "unsetenv" and run Dav's script again you get back to:
spl_tm64_1:/workspace/mjolley/Modeling/trunk/SCIRun/bin% ldd scirun | grep GL
libGL.so.1 => /usr/lib64/nvidia/libGL.so.1 (0x0000003f8e200000) libGLU.so.1 => /usr/lib64/libGLU.so.1 (0x000000360c200000) libGLcore.so.1 => /usr/lib64/nvidia/libGLcore.so.1 (0x0000003f80e00000)
To summarize:
unset LD_LIBRARY_PATH
ldd scirun | grep GL
<correct GL is reported>
run script
ldd scirun | grep GL
<wrong GL is reported>
So I put:
unset LD_LIBRARY_PATH into the first line of the script(run in bash) and into my .bashrc and I still have the same behavior where it switches back from the good OpenGL setup to the mesa drivers after initially being pointed to the correct ones.
This still didn't seem to do it so we put in the following line in the script:
create_scirun_script() {
echo "scirun -E ${NETWORK} --logfile ALL.log" >/tmp/script-fe.sh chmod 0770 /tmp/script-fe.sh }
to
create_scirun_script() { echo "unset LD_LIBRARY_PATH" > /tmp/script-fe.sh echo "scirun -E ${NETWORK} --logfile ALL.log" >> /tmp/script-fe.sh chmod 0770 /tmp/script-fe.sh }
And the above behavior with it switching back to the MESA drivers appears to have stopped, but the script still always hangs at JoinField in when the third net is run:
That said it still will not run past the third net. I am going back an confirming this set of .bdl files runs on my ubuntu machine and in the manual nets on the SPL machine. Looking at ALL.log this is what it typically does, hangs on the first JoinField after loading the second or third .bdl, never on on the first:
SCIRUN: CREATED NEW NETWORK
SCHEDULER: START SCHEDULING
SCHEDULER: START SCHEDULING
REMARK(SCIRun_DataIO_ReadBundle_0): loading file /projects/cardio/Clinical-HClean//Electrodes_Plus_Torso/10ybdls//One/10y-Left-abd-can+10cm-right-parasternal-T4-top.bdl
REMARK(SCIRun_DataIO_ReadMatrix_0): loading file SCIRun_Scripts/Permutations/P1-500-0.mat
REMARK(SCIRun_Math_SolveLinearSystem_0): Done in 98.1578 seconds.
REMARK(SCIRun_Math_SolveLinearSystem_0): Conjugate Gradient done in 98.1578 seconds.
REMARK(SCIRun_DataIO_WriteString_0): saving file /workspace/mjolley/Results/Clinical/mjolley-21-Aug/Script2-FE-refine-elec-dilate-5-100x100x150-all-cases-permut-matrix-10y-Left-abd-can+10cm-right-parasternal-T4-top-P1-500-0.txt
REMARK(SCIRun_DataIO_WriteString_1): saving file /workspace/mjolley/Results/Clinical/mjolley-21-Aug/Script2-FE-refine-elec-dilate-5-100x100x150-all-cases-permut-matrix-10y-Left-abd-can+10cm-right-parasternal-T4-top-P1-500-0.db.txt
REMARK(SCIRun_DataIO_WriteBundle_0): saving file /workspace/mjolley/Results/Clinical/mjolley-21-Aug/Script2-FE-refine-elec-dilate-5-100x100x150-all-cases-permut-matrix-10y-Left-abd-can+10cm-right-parasternal-T4-top-P1-500-0.bdl
SCIRUN: CREATED NEW NETWORK
SCHEDULER: START SCHEDULING
SCHEDULER: START SCHEDULING
REMARK(SCIRun_DataIO_ReadMatrix_0): loading file SCIRun_Scripts/Permutations/P1-500-0.mat
REMARK(SCIRun_DataIO_ReadBundle_0): loading file /projects/cardio/Clinical-HClean//Electrodes_Plus_Torso/10ybdls//One/10y-Left-abd-can+10cm-right-parasternal-T6-top.bdl
It always hangs that this same position in the net with no error message or red box in the Module:
.--Mjolley 13:38, 20 August 2007 (EDT)