Tags:
, view all tags

TESTS

  • Normal jobs work: OK

  • Dag jobs work: OK

  • Perusal jobs work: OK

  • MPICH jobs work: OK
Modified mpirun: Executing command: /home/dteam029/globus-tmp.griditwn03.7486.0/https_3a_2f_2fdevel17.cnaf.infn.it_3a9000_2fOsslm3cw4T7lgR09qJTR4g/cpi
 Process 0 of 1 on griditwn03.na.infn.it
 pi is approximately 3.1415926544231341, Error is 0.0000000008333410
 wall clock time = 10.001266

  • Submission of 270 collections of 100 jobs each (10 collections every 30 minutes), using 1 user and a fuzzy rank (used 90 lcg CEs):
    • Success > 99.99% OK
    • Cancelled about 1800 jobs due to a problem with the CEs at in2p3.fr

Check bugs:

  • BUG #13494: FIXED
    • checked by Laurence Field and ARC developers

  • BUG #21909: FIXED in the wmproxy startup script
         if ( /sbin/pidof $httpd ) >/dev/null 2>&1 ; then
          echo $httpd \(pid `/sbin/pidof $httpd`\) is running ....

  • BUG #23443: FIXED
    • Required documents are not put into the glite doc template in edms

    for edg_rm_command in $GLITE_LOCATION/bin/edg-rm                            $EDG_LOCATION/bin/edg-rm                            `which edg-rm 2>/dev/null`; do

  • BUG #24690: NOT COMPLETELY FIXED
    • The message error that you could find in the wmproxy log (also with level 5) is: edg_wll_JobStat GSSAPI Error
    • In any case now there is a dedicated cron script to renew host-proxy (e.g. it is not included in the cron-purger script)

  • BUG #26885: FIXED
    • Job wrongly kept in ICE cache with status UNKNOWN: checked with two subsequent submissions of 5 collections made of 50 nodes each. ICE does not leave any job with status UNKNOWN behind in the cache

  • BUG #27215: NOT COMPLETELY FIXED
[ale@cream-15 regression]$ ls -l /tmp/ale_StdrEDNZljNnxCLx45ILIw
 total 8
-rw-rw-r--  1 ale ale 30 Jul  8 16:02 out1.tail
-rw-rw-r--  1 ale ale 70 Jul  8 16:02 out2
-rw-rw-r--  1 ale ale  0 Jul  8 16:02 test.err
-rw-rw-r--  1 ale ale  0 Jul  8 16:02 test.out
It is not fixed instead using a CREAM -CE

[ale@cream-15 UI]$ glite-wms-job-logging-info -v 2 https://devel17.cnaf.infn.it:9000/Hr_TRdWT9XZrBux4DyWQsw | grep -A 2 Match | grep Dest
- Dest id                    =    ce103.cern.ch:2119/jobmanager-lcglsf-grid_dteam
- Dest id                    =    ce103.cern.ch:2119/jobmanager-lcglsf-grid_dteam
- Dest id                    =    ce103.cern.ch:2119/jobmanager-lcglsf-grid_dteam

  • BUG #28249: Hopefully fixed
    • bug posted by the developer

  • BUG #28498: FIXED
    • compilation error with gcc-4.x

[ale@cream-15 UI]$ cat /tmp/ale_zngnB9uVCWKT7B7MkSlBtA/env.out  | grep LD_LIBRARY
 LD_LIBRARY_PATH=.

  • BUG #29182: Hopefully fixed
    • not easy to reproduce

  • BUG #29538: Hopefully fixed
    • bug posted by the developer

  • BUG #30289: FIXED
    • Fixed by not using 'clog'

Master node is: node72.grid.pg.infn.it
 Is should run on the following nodes:
node72.grid.pg.infn.it
 node72.grid.pg.infn.it
 node71.grid.pg.infn.it
 node71.grid.pg.infn.it
*************************************
Current working directory is: /home/dteamsgm003/globus-tmp.node72.24167.0/https_3a_2f_2fdevel17.cnaf.infn.it_3a9000_2f-6An9ZDwkvot3aOLSzScdg
 List files on the working directory:
/home/dteamsgm003/globus-tmp.node72.24167.0/https_3a_2f_2fdevel17.cnaf.infn.it_3a9000_2f-6An9ZDwkvot3aOLSzScdg:
total 352
 drwxr-xr-x  2 dteamsgm003 dteamsgm   4096 Jun 30 11:03 .
drwx------  5 dteamsgm003 dteamsgm   4096 Jun 30 11:03 ..
-rwxr-xr-x  1 dteamsgm003 dteamsgm    822 Jun 30 11:03 30308_exe.sh
-rw-r--r--  1 dteamsgm003 dteamsgm   3687 Jun 30 11:03 .BrokerInfo
-rw-r--r--  1 dteamsgm003 dteamsgm    218 Jun 30 11:03 https_3a_2f_2fdevel17.cnaf.infn.it_3a9000_2f-6An9ZDwkvot3aOLSzScdg.output
-rw-r--r--  1 dteamsgm003 dteamsgm 330910 Jun 30 11:03 mpitest
-rw-r--r--  1 dteamsgm003 dteamsgm      0 Jun 30 11:03 test.err
-rw-r--r--  1 dteamsgm003 dteamsgm    385 Jun 30 11:03 test.out
-rw-------  1 dteamsgm003 dteamsgm      0 Jun 30 11:03 tmp.rdgPL24747
*********************************

  • BUG #30518: Hopefully fixed
    • not easy to reproduce

*************************************************************
BOOKKEEPING INFORMATION:

Status info for the Job : https://devel17.cnaf.infn.it:9000/LEzR7tTwyh3P-iYZrKlwxg
 Current Status:     Aborted
 Status Reason:      The maximum number of output sandbox files is reached
 Submitted:          Tue Jul  8 16:12:10 2008 CEST
*************************************************************


Error - WMProxy Server Error
The Operation is not allowed: The maximum number of perusal files is reached

Method: enableFilePerusal

  • BUG #31006: Hopefully FIXED
    • Not easy to reproduce

  • BUG #32345: FIXED
    • reproduced the problem by inserting a 500 sec sleep in the dirmanager and killing it by hand while unzipping the ISB. The job stays in status 'waiting' and is not forwarded to the WM.

  # customization point
  if [ -n "${GLITE_LOCAL_CUSTOMIZATION_DIR}" ]; then
    if [ -f "${GLITE_LOCAL_CUSTOMIZATION_DIR}/cp_1_5.sh" ]; then
      . "${GLITE_LOCAL_CUSTOMIZATION_DIR}/cp_1_5.sh"
    fi
  fi

  • BUG #33140: Hopefully FIXED
    • Not easy to reproduce

  • BUG #35878: FIXED
    • compilation error with gcc-4.x

  • BUG #36341: Hopefully fixed
    • bug posted by the developer

  • BUG #36466: Hopefully fixed
    • bug posted by the developer

  • BUG #36536: FIXED
    • submitted a normal job
    • waited until finished successfully
    • checked the job record is in the LBProxy mysql DB
    • retrieved the output via 'glite-wms-job-output'
    • checked the job record is no more in the LBProxy mysql DB

  • BUG #36870: FIXED
    • Fixed by removing the spec file

  • BUG #36876: Hopefully fixed
    • bug posted by the developer

  • BUG #36907: Hopefully fixed
    • Not easy to reproduce

  • BUG #37674: Hopefully FIXED
    • Not easy to reproduce

  • BUG #37756: NOT COMPLETELY FIXED
    • Tested using a short proxy to submit a longer job and ICE does not resubmit it, but afterwards the status is not updated to Done by ICE, due to another bug #39807

[root@wms008 init.d]# grep GLITE_LOCATION glite-wms-ice
GLITE_LOCATION=${GLITE_LOCATION:-/opt/glite}

  • BUG #37916: Hopefully fixed
    • bug posted by the developer

[ale@cream-15 UI]$ ls -l /tmp/ale_eRWc528nX8QpEcs7im-R7g
 total 8
-rw-rw-r--  1 ale ale 50 Jul  8 12:06 out1
-rw-rw-r--  1 ale ale  0 Jul  8 12:06 out2.tail
-rw-rw-r--  1 ale ale 50 Jul  8 12:06 out3
-rw-rw-r--  1 ale ale  0 Jul  8 12:06 out4.tail
-rw-rw-r--  1 ale ale  0 Jul  8 12:06 test.err
-rw-rw-r--  1 ale ale  0 Jul  8 12:06 test.out

-- AlessioGianelle - 27 Jun 2008

Edit | Attach | PDF | History: r61 | r59 < r58 < r57 < r56 | Backlinks | Raw View | More topic actions...
Topic revision: r57 - 2008-09-03 - AlessioGianelle
 
  • Edit
  • Attach
This site is powered by the TWiki collaboration platformCopyright © 2008-2023 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback