Installation of GridWay 5.6.1
Documentation ¶
The following guides are available:
Tests ¶
- grid-proxy-init
Your identity: /C=DE/O=GermanGrid/OU=ZAH/CN=Klaus Rieger Enter GRID pass phrase for this identity: Creating proxy ..................................... Done Your proxy is valid until: Fri Feb 3 06:44:17 2012
- gsissh mintaka.ari.uni-heidelberg.de
Last login: Thu Feb 2 18:43:34 2012 from asterope.ari.uni-heidelberg.de Linux mintaka 2.6.18-6-xen-amd64 #1 SMP Thu Dec 25 22:21:42 UTC 2008 x86_64 The programs included with the Debian GNU/Linux system are free software; the exact distribution terms for each program are described in the individual files in /usr/share/doc/*/copyright. Debian GNU/Linux comes with ABSOLUTELY NO WARRANTY, to the extent permitted by applicable law.
All necessary tests for three hosts (dgsi.zah.uni-heidelberg.de, mintaka.ari.uni-heidelberg.de, titan.ari.uni-heidelberg.de) are described following. If you like to use more hosts, use AstroGridTest ( SVN PDF) for testing.
Regardless of the method: Use hosts for GridWay, only, if they have passed all tests!
Pre-WS Tests ¶
Authorization Test ¶
- globusrun -a -r dgsi.zah.uni-heidelberg.de
GRAM Authentication test successful
- globusrun -a -r mintaka.ari.uni-heidelberg.de
GRAM Authentication test successful
- globusrun -a -r titan.ari.uni-heidelberg.de
GRAM Authentication test successful
Submission Test ¶
- globus-job-run dgsi.zah.uni-heidelberg.de /bin/uname -a
Linux dgsi.zah.uni-heidelberg.de 2.6.18-194.3.1.el5 #1 SMP Fri May 7 01:52:57 EDT 2010 i686 athlon i386 GNU/Linux
- globus-job-run mintaka.ari.uni-heidelberg.de /bin/uname -a
Linux mintaka 2.6.18-6-xen-amd64 #1 SMP Thu Dec 25 22:21:42 UTC 2008 x86_64 GNU/Linux
- globus-job-run titan.ari.uni-heidelberg.de /bin/uname -a
Linux titan.ari.uni-heidelberg.de 2.6.18-274.3.1.el5 #1 SMP Tue Sep 6 18:52:56 EDT 2011 x86_64 x86_64 x86_64 GNU/Linux
File Transfer Test ¶
- echo "TEST" > test.txt
- globus-url-copy file:///home/Agrid/agrid107/test.txt gsiftp://dgsi.zah.uni-heidelberg.de/home/agrid/agrid107/test.txt
- globus-url-copy gsiftp://dgsi.zah.uni-heidelberg.de/home/agrid/agrid107/test.txt file:///home/Agrid/agrid107/test_dgsi.txt
- globus-url-copy file:///home/Agrid/agrid107/test.txt gsiftp://titan.ari.uni-heidelberg.de/home/Tit2/Agrid/agrid107/test.txt
- globus-url-copy gsiftp://titan.ari.uni-heidelberg.de/home/Tit2/Agrid/agrid107/test.txt file:///home/Agrid/agrid107/test_titan.txt
- ls -l
total 18 -rw-r--r-- 1 agrid107 agrid 5 Feb 2 18:58 test.txt -rw-r--r-- 1 agrid107 agrid 5 Feb 2 19:10 test_dgsi.txt -rw-r--r-- 1 agrid107 agrid 5 Feb 2 19:10 test_titan.txt
WS Tests ¶
Submission Test ¶
- globusrun-ws -submit -F dgsi.zah.uni-heidelberg.de -s -c /bin/uname -a
Delegating user credentials...Done. Submitting job...Done. Job ID: uuid:eec2b812-fbe1-11e0-8414-000c296d7bcb Termination time: 10/22/2011 12:37 GMT Current job state: Active Current job state: CleanUp-Hold Linux dgsi.zah.uni-heidelberg.de 2.6.18-194.3.1.el5 #1 SMP Fri May 7 01:52:57 EDT 2010 i686 athlon i386 GNU/Linux Current job state: CleanUp Current job state: Done Destroying job...Done. Cleaning up any delegated credentials...Done.
- globusrun-ws -submit -F mintaka.ari.uni-heidelberg.de -s -c /bin/uname -a
Delegating user credentials...Done. Submitting job...Done. Job ID: uuid:a28ee11e-fbe1-11e0-9aca-000c296d7bcb Termination time: 10/22/2011 12:38 GMT Current job state: Active Current job state: CleanUp-Hold Linux mintaka 2.6.18-6-xen-amd64 #1 SMP Thu Dec 25 22:21:42 UTC 2008 x86_64 GNU/Linux Current job state: CleanUp Current job state: Done Destroying job...Done. Cleaning up any delegated credentials...Done.
- globusrun-ws -submit -F titan.ari.uni-heidelberg.de -s -c /bin/uname -a
Delegating user credentials...Done. Submitting job...Done. Job ID: uuid:bf96867c-fbe1-11e0-9d69-000c296d7bcb Termination time: 10/22/2011 12:39 GMT Current job state: Active Current job state: CleanUp-Hold Linux titan.ari.uni-heidelberg.de 2.6.18-274.3.1.el5 #1 SMP Tue Sep 6 18:52:56 EDT 2011 x86_64 x86_64 x86_64 GNU/Linux Current job state: CleanUp Current job state: Done Destroying job...Done. Cleaning up any delegated credentials...Done.
Information Retrieval Test ¶
- wsrf-query -s https://dgsi.zah.uni-heidelberg.de:8443/wsrf/services/DefaultIndexService
<ns0:IndexRP xmlns:glue="http://mds.globus.org/glue/ce/1.1" xmlns:ns0="http://mds.globus.org/index" ... <Site UniqueID="dgsi.zah.uni-heidelberg.de" xmlns="http://infnforge.cnaf.infn.it/glueinfomodel/Spec/V12/R2" xmlns:ns1="http://infnforge.cnaf.infn.it/glueinfomodel/Spec/V12/R2"> <Description>AstroGrid-D services at dgsi.zah.uni-heidelberg.de</Description> <Latitude>49.41776</Latitude> <Location>Moenchhofstrasse 12-14, 69120 Heidelberg, Germany</Location> <Longitude>8.68790</Longitude> <Name>Astronomisches Rechen-Institut</Name> <OtherInfo>0running GT 4.0.8</OtherInfo> <SecurityContact>rieger@ari.uni-heidelberg.de</SecurityContact> <Sponsor>http://www.bmbf.de</Sponsor> <SysAdminContact>rieger@ari.uni-heidelberg.de</SysAdminContact> <UserSupportContact>rieger@ari.uni-heidelberg.de</UserSupportContact> <Web>http://www.ari.uni-heidelberg.de</Web> </Site> </ns0:IndexRP>
- wsrf-query -s https://mintaka.ari.uni-heidelberg.de:8443/wsrf/services/DefaultIndexService
<ns0:IndexRP xmlns:glue="http://mds.globus.org/glue/ce/1.1" xmlns:ns0="http://mds.globus.org/index" ... <Site UniqueID="mintaka.ari.uni-heidelberg.de" xmlns="http://infnforge.cnaf.infn.it/glueinfomodel/Spec/V12/R2" xmlns:ns1="http://infnforge.cnaf.infn.it/glueinfomodel/Spec/V12/R2"> <Description>AstroGrid-D services at mintaka</Description> <Latitude>49.41780</Latitude> <Location>Moenchhofstrasse 12-14, 69120 Heidelberg, Germany</Location> <Longitude>8.68790</Longitude> <Name>Astronomisches Rechen-Institut</Name> <OtherInfo>0running GT 4.0.5</OtherInfo> <SecurityContact>tbruese@ari.uni-heidelberg.de</SecurityContact> <Sponsor>http://www.bmbf.de</Sponsor> <SysAdminContact>admin@ari.uni-heidelberg.de</SysAdminContact> <UserSupportContact>rieger@ari.uni-heidelberg.de</UserSupportContact> <Web>http://www.ari.uni-heidelberg.de</Web> </Site> </ns0:IndexRP>
- wsrf-query -s https://titan.ari.uni-heidelberg.de:8443/wsrf/services/DefaultIndexService
<ns0:IndexRP xmlns:glue="http://mds.globus.org/glue/ce/1.1" xmlns:ns0="http://mds.globus.org/index" ... <Site UniqueID="titan.ari.uni-heidelberg.de" xmlns="http://infnforge.cnaf.infn.it/glueinfomodel/Spec/V12/R2" xmlns:ns1="http://infnforge.cnaf.infn.it/glueinfomodel/Spec/V12/R2"> <Description>AstroGrid-D services at titan</Description> <Latitude>49.41778</Latitude> <Location>Moenchhofstrasse 12-14, 69120 Heidelberg, Germany</Location> <Longitude>8.68780</Longitude> <Name>Astronomisches Rechen-Institut</Name> <OtherInfo>0running GT 4.0.8 </OtherInfo> <SecurityContact>tbruese@ari.uni-heidelberg.de</SecurityContact> <Sponsor>http://www.bmbf.de</Sponsor> <SysAdminContact>titan-admin@ari.uni-heidelberg.de</SysAdminContact> <UserSupportContact>rieger@ari.uni-heidelberg.de</UserSupportContact> <Web>http://www.ari.uni-heidelberg.de</Web> </Site> </ns0:IndexRP>
Debug Test ¶
- wsrf-query -s https://dgsi.zah.uni-heidelberg.de:8443/wsrf/services/DefaultIndexService | grep -i DEBUG
- wsrf-query -s https://mintaka.ari.uni-heidelberg.de:8443/wsrf/services/DefaultIndexService | grep -i DEBUG
- wsrf-query -s https://titan.ari.uni-heidelberg.de:8443/wsrf/services/DefaultIndexService | grep -i DEBUG
Hint: OK, if there is no output!
- exit
Connection to mintaka.ari.uni-heidelberg.de closed.
Add User GridWay ¶
- ssh root@mintaka.ari.uni-heidelberg.de
- useradd --create-home --gid globus gwadmin --home-dir /opt/d-grid/gridway
- passwd gwadmin
Changing password for user gwadmin. New UNIX password: Retype new UNIX password: passwd: all authentication tokens updated successfully.
- exit
logout
Install GridWay ¶
Download ¶
- Download GridWay 5.6.1
- scp gridway_5.6.1.tar.gz root@mintaka.ari.uni-heidelberg.de:
Complete Universal User Profile ¶
- ssh root@mintaka.ari.uni-heidelberg.de
- cd /etc/profile.d/
For sh-style shells (sh, ksh, ash, bash):
- vi globus.sh
- Edit (i)
# User specific environment and startup programs export GLOBUS_LOCATION=/usr/local/globus-4.0.8 export GW_LOCATION=/usr/local/gridway-5.6.1 export GLOBUS_PATH=$GLOBUS_LOCATION/sbin:$GLOBUS_LOCATION/bin export GLOBUS_TCP_PORT_RANGE=20000,25000 PATH=$PATH:$GLOBUS_PATH:$GW_LOCATION/bin export ANT_HOME=/usr/share/ant export JAVA_HOME=/usr/lib/jvm/java-1.6.0-sun export PATH=$PATH:/usr/lib/jvm/java-sun/bin export PATH
- Save (ESC :wq)
- Edit (i)
For csh-style shells (csh, tcsh):
- vi globus.csh
- Edit (i)
# User specific environment and startup programs setenv GLOBUS_LOCATION /usr/local/globus-4.0.8 setenv GW_LOCATION /usr/local/gridway-5.6.1 setenv GLOBUS_PATH $GLOBUS_LOCATION/sbin:$GLOBUS_LOCATION/bin setenv GLOBUS_TCP_PORT_RANGE 20000,25000 setenv ANT_HOME /usr/share/ant setenv JAVA_HOME /usr/lib/jvm/java-1.6.0-sun set path=($PATH:$GLOBUS_PATH:/usr/lib/jvm/java-sun/bin$:GW_LOCATION/bin)
- Save (ESC :wq)
- Edit (i)
- vi globus.sh
Configuration ¶
- mkdir /opt/d-grid/gridway
- chown gwadmin:agrid /opt/d-grid/gridway
- tar xzf gridway_5.6.1.tar.gz
- chown -R gwadmin:agrid gridway_5.6.1
- su - gwadmin
- source /opt/d-grid/globus/gt405/etc/globus-devel-env.sh
- cd /root/gridway_5.6.1
- ./configure --prefix=$GW_LOCATION --enable-jsdl --with-tests
checking for Globus Toolkit...ok checking for Globus Toolkit...ok configuring Globus build env...ok checking build system type... x86_64-unknown-linux-gnu ... checking whether stripping libraries is possible... yes configure: creating ./config.status config.status: creating Makefile config.status: creating src/Makefile config.status: creating pkgdata/pkg_data_src.gpt config.status: executing depfiles commands
Compile and Install ¶
- make 2>&1 | tee make_gw.log
Making all in src make[1]: Entering directory `/root/gridway_5.6.1/src' ... Note: Some input files use unchecked or unsafe operations. Note: Recompile with -Xlint:unchecked for details. jar cf ./cmds/gw_jsdl.jar -C ./cmds/package/ . make[1]: Leaving directory `/root/gridway_5.6.1/src' make[1]: Entering directory `/root/gridway_5.6.1' make[1]: Nothing to be done for `all-am'. make[1]: Leaving directory `/root/gridway_5.6.1'
- make install 2>&1 | tee make_install_gw.log
Making install in src make[1]: Entering directory `/root/gridway_5.6.1/src' ... test -z "" || mkdir -p -- "" make[2]: Leaving directory `/root/gridway_5.6.1/src' make[1]: Leaving directory `/root/gridway_5.6.1/src' make[1]: Entering directory `/root/gridway_5.6.1' make[2]: Entering directory `/root/gridway_5.6.1' make install-exec-hook make[3]: Entering directory `/root/gridway_5.6.1' make[3]: Leaving directory `/root/gridway_5.6.1' make install-data-hook make[3]: Entering directory `/root/gridway_5.6.1' mkdir -p /opt/d-grid/gridway/var/acct mkdir -p /opt/d-grid/gridway/etc mkdir -p /opt/d-grid/gridway/xml_schema cp -r ./etc/im_examples /opt/d-grid/gridway/etc/ cp ./etc/gwd.conf \ ./etc/job_template.default \ ./etc/sched.conf \ ./etc/gwrc \ /opt/d-grid/gridway/etc/ cp ./xml_schema/gridway.xsd /opt/d-grid/gridway/xml_schema make[3]: Leaving directory `/root/gridway_5.6.1' make[2]: Leaving directory `/root/gridway_5.6.1' make[1]: Leaving directory `/root/gridway_5.6.1'
Check Files ¶
- ls -l $GW_LOCATION
total 52 drwxr-xr-x 2 gwadmin agrid 4096 Feb 17 14:28 bin drwxr-xr-x 3 gwadmin agrid 4096 Feb 17 13:28 etc -rw-r--r-- 1 gwadmin agrid 4984 Feb 16 14:02 gwd.conf drwxr-xr-x 2 gwadmin agrid 4096 Feb 17 14:28 include drwxr-xr-x 2 gwadmin agrid 4096 Feb 17 14:28 lib drwxr-xr-x 3 gwadmin agrid 4096 Feb 17 13:28 libexec drwxr-xr-x 3 gwadmin agrid 4096 Feb 17 14:28 share drwxr-xr-x 3 gwadmin agrid 4096 Feb 17 14:28 test drwxr-xr-x 4 gwadmin agrid 4096 Feb 17 14:35 var drwxr-xr-x 2 gwadmin agrid 4096 Feb 17 13:28 xml_schema
- ls -l $GW_LOCATION/bin/
total 4640 -rwxr-xr-x 1 gwadmin agrid 720 Feb 17 14:28 gw_em_mad_ws -rwxr-xr-x 1 gwadmin agrid 452781 Feb 17 14:28 gw_flood_scheduler -rwxr-xr-x 1 gwadmin agrid 3868 Feb 17 14:28 gw_im_mad_common.sh -rwxr-xr-x 1 gwadmin agrid 3457 Feb 17 14:28 gw_im_mad_mds4 -rwxr-xr-x 1 gwadmin agrid 2047 Feb 17 14:28 gw_im_mad_mds4_thr -rwxr-xr-x 1 gwadmin agrid 1605 Feb 17 14:28 gw_im_mad_static -rw-r--r-- 1 gwadmin agrid 1959 Feb 17 14:28 gw_mad_common.sh -rwxr-xr-x 1 gwadmin agrid 448124 Feb 17 14:28 gw_sched -rwxr-xr-x 1 gwadmin agrid 1088 Feb 17 14:28 gw_tm_mad_ftp -rwxr-xr-x 1 gwadmin agrid 65760 Feb 17 14:28 gw_tm_mad_ftp.bin -rwxr-xr-x 1 gwadmin agrid 232337 Feb 17 14:28 gwacct -rwxr-xr-x 1 gwadmin agrid 1110623 Feb 17 14:28 gwd -rwxr-xr-x 1 gwadmin agrid 2589 Feb 17 14:28 gwdagman -rwxr-xr-x 1 gwadmin agrid 331119 Feb 17 14:28 gwhistory -rwxr-xr-x 1 gwadmin agrid 333819 Feb 17 14:28 gwhost -rwxr-xr-x 1 gwadmin agrid 331808 Feb 17 14:28 gwkill -rwxr-xr-x 1 gwadmin agrid 335177 Feb 17 14:28 gwps -rwxr-xr-x 1 gwadmin agrid 332455 Feb 17 14:28 gwsubmit -rwxr-xr-x 1 gwadmin agrid 330468 Feb 17 14:28 gwuser -rwxr-xr-x 1 gwadmin agrid 333960 Feb 17 14:28 gwwait -rwxr-xr-x 1 gwadmin agrid 2081 Feb 17 14:28 jsdl2gw
Set Sudo ¶
- exit
- whoami
root
- /usr/sbin/visudo -s
- Add at the end of the file (i):
# GridWay settings gwadmin ALL=(GPOOL) NOPASSWD: /opt/d-grid/gridway/bin/gw_em_mad_ws * gwadmin ALL=(GPOOL) NOPASSWD: /opt/d-grid/gridway/bin/gw_im_mad_common.sh * gwadmin ALL=(GPOOL) NOPASSWD: /opt/d-grid/gridway/bin/gw_im_mad_mds4 * gwadmin ALL=(GPOOL) NOPASSWD: /opt/d-grid/gridway/bin/gw_im_mad_mds4_thr * gwadmin ALL=(GPOOL) NOPASSWD: /opt/d-grid/gridway/bin/gw_im_mad_static * gwadmin ALL=(GPOOL) NOPASSWD: /opt/d-grid/gridway/bin/gw_tm_mad_ftp * gwadmin ALL=(GPOOL) NOPASSWD: /opt/d-grid/gridway/bin/gw_tm_mad_ftp.bin * gwadmin ALL=(GPOOL) NOPASSWD: /opt/d-grid/globus/gt405/bin/grid-proxy-info *
- Save (ESC :wq)
Hint: Sudoers allow particular users to run various commands as the root user without needing the root password.
Set Up GridWay ¶
Get version ¶
- su - gwadmin
- whoami
gwadmin
Hint: Never run gwd as root!
Get Version ¶
- gwd -v
GridWay 5.6.1 Copyright 2002-2009 GridWay Team, Distributed Systems Architecture Group (http://dsa-research.org), Universidad Complutense de Madrid GridWay is distributed and licensed for use under the terms of the Apache License, Version 2.0 (http://www.apache.org/licenses/LICENSE-2.0).
Test Run ¶
- gwd
- gwps
USER JID DM EM START END EXEC XFER EXIT NAME HOST
Edit gwd.conf ¶
- vi $GW_LOCATION/etc/gwd.conf
- Search "MAD Configuration for WS" (?MAD Configuration for WS)
- Change (i)
... # Example MAD Configuration for WS testbeds # #IM_MAD = mds4:gw_im_mad_mds4_thr:-s cygnus.dacya.ucm.es:gridftp:ws #EM_MAD = ws:gw_em_mad_ws::rsl2 #TM_MAD = gridftp:gw_tm_mad_ftp: ...
to... # Example MAD Configuration for WS testbeds # IM_MAD = mds4-lrz:gw_im_mad_mds4:-s mds-lrz.lrz.de:gridftp:ws IM_MAD = mds4-aip:gw_im_mad_mds4:-s astrogrid-mds.aip.de:gridftp:ws EM_MAD = ws:gw_em_mad_ws::rsl2 TM_MAD = gridftp:gw_tm_mad_ftp: ...
- Save (ESC :wq)
Final Run ¶
- pkill gwd (Kill GridWay daemon)
- gwd -m -c (Run GridWay daemon in multiuser mode clearing previous state)
- gwhost
HID PRIO OS ARCH MHZ %CPU MEM(F/T) DISK(F/T) N(U/F/T) LRMS HOSTNAME
After a few seconds:
- gwhost
HID PRIO OS ARCH MHZ %CPU MEM(F/T) DISK(F/T) N(U/F/T) LRMS HOSTNAME 0 1 0 0 0/0 0/0 0/0/0 astrogrid-mds.aip.de 1 1 0 0 0/0 0/0 0/0/0 titan.ari.uni-heidelberg.de 2 1 0 0 0/0 0/0 0/0/0 mintaka.ari.uni-heidelberg.de 3 1 0 0 0/0 0/0 0/0/0 astrodata.astrogrid-d.org 4 1 0 0 0/0 0/0 0/0/0 astar.aip.de 5 1 0 0 0/0 0/0 0/0/0 mardschana.zib.de 6 1 0 0 0/0 0/0 0/0/0 bladekemper21.informatik.tu-muenchen.de 7 1 0 0 0/0 0/0 0/0/0 dgsi.zah.uni-heidelberg.de 8 1 0 0 0/0 0/0 0/0/0 mds-lrz.lrz.de 9 1 0 0 0/0 0/0 0/0/0 ptgrid.it.irf.tu-dortmund.de 10 1 0 0 0/0 0/0 0/0/0 udo-mds01.grid.tu-dortmund.de 11 1 0 0 0/0 0/0 0/0/0 mds-dgi.lrz.de 12 1 0 0 0/0 0/0 0/0/0 srvgrid01.offis.uni-oldenburg.de 13 1 0 0 0/0 0/0 0/0/0 koios.rz.uni-ulm.de 14 1 0 0 0/0 0/0 0/0/0 globus.bfg.uni-freiburg.de 15 1 0 0 0/0 0/0 0/0/0 gt4.uni-tuebingen.de 16 1 0 0 0/0 0/0 0/0/0 mintaka.aip.de 17 1 0 0 0/0 0/0 0/0/0 gridmon.gwdg.de 18 1 0 0 0/0 0/0 0/0/0 c3grid.it.irf.tu-dortmund.de 19 1 0 0 0/0 0/0 0/0/0 stuttgart-globus.iao.fraunhofer.de
After a few minutes:
- gwhost
HID PRIO OS ARCH MHZ %CPU MEM(F/T) DISK(F/T) N(U/F/T) LRMS HOSTNAME 0 1 NULLNULL NULL 0 0 0/0 0/0 0/0/0 Fork astrogrid-mds.aip.de 1 1 Linux2.6.18-194 x86_6 3200 100 354/3942 61448/73162 0/1/4 Fork titan.ari.uni-heidelberg.de 2 1 NULLNULL NULL 0 0 0/0 0/0 0/0/0 GW mintaka.ari.uni-heidelberg.de 3 1 Linux2.6.18-238 x86_6 2411 199 230/7982 6690347/10892868 0/2/2 Fork astrodata.astrogrid-d.org 4 1 0 0 0/0 0/0 0/0/0 astar.aip.de 5 1 NULLNULL NULL 0 0 0/0 0/0 0/77/480 PBS mardschana.zib.de 6 1 0 0 0/0 0/0 0/0/0 bladekemper21.informatik.tu-muenchen.de 7 1 NULLNULL NULL 0 0 0/0 0/0 0/0/1 Fork dgsi.zah.uni-heidelberg.de 8 1 NULLNULL NULL 0 0 0/0 0/0 0/0/0 Fork mds-lrz.lrz.de 9 1 Linux2.6.35-25- x86_6 2999 200 3294/3663 24480/29255 0/138/388 PBS ptgrid.it.irf.tu-dortmund.de 10 1 NULLNULL NULL 0 0 0/0 0/0 0/0/0 Fork udo-mds01.grid.tu-dortmund.de 11 1 NULLNULL NULL 0 0 0/0 0/0 0/0/0 Fork mds-dgi.lrz.de 12 1 NULLNULL NULL 0 0 0/0 0/0 0/1/45 PBS srvgrid01.offis.uni-oldenburg.de 13 1 NULLNULL NULL 0 0 0/0 0/0 0/13/2240 PBS koios.rz.uni-ulm.de 14 1 NULLNULL NULL 0 0 0/0 0/0 0/107/1440 PBS globus.bfg.uni-freiburg.de 15 1 NULLNULL NULL 0 0 0/0 0/0 0/103/1936 PBS gt4.uni-tuebingen.de 16 1 NULLNULL NULL 0 0 0/0 0/0 0/0/0 Fork mintaka.aip.de 17 1 NULLNULL NULL 0 0 0/0 0/0 0/175/2484 PBS gridmon.gwdg.de 18 1 Linux2.6.35-25- x86_6 2999 187 3294/3663 24480/29255 0/138/388 PBS c3grid.it.irf.tu-dortmund.de 19 1 0 0 0/0 0/0 0/0/0 stuttgart-globus.iao.fraunhofer.de
Troubleshooting ¶
Hints ¶
- Repeat all Pre-WS and WS tests
- Check the time (date)
Login ¶
- ssh gwadmin@mintaka.ari.uni-heidelberg.de
or
- ssh root@mintaka.ari.uni-heidelberg.de
Enter passphrase for key '/home/Tux/rieger/.ssh/id_dsa': Last login: Sun Feb 5 10:19:06 2012 from asterope.ari.uni-heidelberg.de Linux mintaka 2.6.18-6-xen-amd64 #1 SMP Thu Dec 25 22:21:42 UTC 2008 x86_64 The programs included with the Debian GNU/Linux system are free software; the exact distribution terms for each program are described in the individual files in /usr/share/doc/*/copyright. Debian GNU/Linux comes with ABSOLUTELY NO WARRANTY, to the extent permitted by applicable law.
- su - gwadmin
Watch log files ¶
- tail -f $GW_LOCATION/var/sched.log
- tail -f $GW_LOCATION/var/gwd.log
Stop GridWay daemon ¶
- pkill gwd
Remove lock file ¶
- cd $GW_LOCATION/var
- ls -al
- rm .lock (if .look exists)
- rm globus-gw.log (and other files if huge)
Exclude problematic hosts ¶
- vi $GW_LOCATION/etc/sched.conf
- Search "RP_HOST" (?RP_HOST)
- Add problematic host, e.g. srvgrid01.offis.uni-oldenburg.de: (i)
... RP_HOST[srvgrid01.offis.uni-oldenburg.de] = 00 # Do not use srvgrid01 ...
- Save (ESC :wq)
Restart GridWay Daemon ¶
- gwd -m -c
Back to The GridWay Metascheduler (master document)
Forward to Using GridWay (next item)