Line: 1 to 1 | ||||||||
---|---|---|---|---|---|---|---|---|
WMS Test Plan
| ||||||||
Line: 1428 to 1428 | ||||||||
bug #68944: Bug in ICE's start/stop script | ||||||||
Added: | ||||||||
> > | Test description.
[root@devel09 ~]# ps aux |grep ice glite 2716 0.0 0.0 220996 2192 ? Ss 09:36 0:00 /usr/bin/glite-wms-ice-safe --conf glite_wms.conf --daemon /tmp/glite-wms-ice-safe.pid glite 2720 0.0 0.0 63856 1108 ? S 09:36 0:00 sh -c /usr/bin/glite-wms-ice --conf glite_wms.conf /var/log/wms/ice_console.log 2>&1 glite 2721 0.0 0.1 483464 7404 ? Sl 09:36 0:02 /usr/bin/glite-wms-ice --conf glite_wms.conf /var/log/wms/ice_console.log root 31253 0.0 0.0 61184 788 pts/1 S+ 14:29 0:00 grep ice [root@devel09 ~]# /etc/init.d/glite-wms-ice restart stopping ICE... ok starting ICE... ok [root@devel09 ~]# ps aux |grep ice glite 31318 0.0 0.0 220996 2196 ? Ss 14:30 0:00 /usr/bin/glite-wms-ice-safe --conf glite_wms.conf --daemon /tmp/glite-wms-ice-safe.pid glite 31322 0.0 0.0 63856 1112 ? S 14:30 0:00 sh -c /usr/bin/glite-wms-ice --conf glite_wms.conf /var/log/wms/ice_console.log 2>&1 glite 31323 1.0 0.1 483464 7352 ? Sl 14:30 0:00 /usr/bin/glite-wms-ice --conf glite_wms.conf /var/log/wms/ice_console.log root 31363 0.0 0.0 61180 784 pts/1 S+ 14:30 0:00 grep iceConsidered successful if ice processes are restarted. bug #70061: WMS hates collections with 192 nodes!Test description.[mcecchi@cert-19 ~]$ glite-wms-job-submit -a -c devel11.conf coll_192.jdl Connecting to the service https://devel11.cnaf.infn.it:7443/g... ====================== glite-wms-job-submit Success ====================== The job has been successfully submitted to the WMProxy Your job identifier is: https://devel11.cnaf.infn.it:9000/T... ==========================================================================Considered successful if the collection is submitted and executed correctly. bug #70331: glite-wms-create-proxy "ambiguous redirect"Test description.[root@cream-44 cron.d]# su - glite [glite@cream-44 ~]$ /usr/sbin/glite-wms-create-proxy.sh /var/wms.proxy /var/log/wms/create_proxy.log [glite@cream-44 ~]$Considered successful if the command works with no error messages. bug #70824: environment values in JDL cannot have spacesTest description. Submit this jdl:[ Executable = "/bin/env"; Stdoutput = "env.out" ; StdError = "env.err" ; shallowretrycount = 2; retrycount = 3; OutputSandbox = { "env.out" ,"env.err" } ; Environment = { "ATHENA_RUN_EVENTS='[(152345L, 216721L)]'" }; ] ======================= glite-wms-job-status Success ===================== BOOKKEEPING INFORMATION: Status info for the Job : https://devel09.cnaf.infn.it:9000/g... Current Status: Done (Success) Exit code: 0 Status Reason: Job terminated successfully Destination: gridgate.ucc.ie:2119/jobmanager-pbs-sixhour Submitted: Thu Apr 21 17:11:46 2011 CEST ==========================================================================After output retrieval: [ale@cream-12 UI]$ glite-wms-job-output https://devel09.cnaf.infn.it:9000/g... Connecting to the service https://devel09.cnaf.infn.it:7443/g... ================================================================================ JOB GET OUTPUT OUTCOME Output sandbox files for the job: https://devel09.cnaf.infn.it:9000/g... have been successfully retrieved and stored in the directory: /tmp/jobOutput/ale_gvpbgkBN_KkLhly-uNjpxQ ================================================================================Check the output file: [ale@cream-12 UI]$ cat /tmp/jobOutput/ale_gvpbgkBN_KkLhly-uNjpxQ/env.out | grep ATHENA ATHENA_RUN_EVENTS=[(152345L, 216721L)]Considered successful if the environment contains the variable with spaces. bug #71863: JobWrapper tries to use "test -eq" for string comparisonTest description. Looking into a JW:fatal_error() # 1 - reason, 2 - transfer OSB { if [ "x${LM_done_failed_reason}" != "x" ]; then jw_echo "LM_log_done_begin" # parsed 'as is' by LM jw_echo "${LM_done_failed_reason}" jw_echo "LM_log_done_end" # parsed 'as is' by LM fi jw_echo "$1" log_done_failed 1 "$1" if [ "x$2" == "xOSB" ]; then OSB_transfer fi doExit 1 }Considered successful if the comparison is done via == bug #73192: Submission failed due to a credential problemTest description. Simply try a submit:[ale@cream-12 UI]$ glite-wms-job-submit -a -c etc/wmp_devel09.conf jdl/deepresub.jdl Connecting to the service https://devel09.cnaf.infn.it:7443/g... ====================== glite-wms-job-submit Success ====================== The job has been successfully submitted to the WMProxy Your job identifier is: https://devel09.cnaf.infn.it:9000/L... ==========================================================================Considered successful if the job is correctly submitted. bug #73699: Wrong retry count computationTest description. Submit this jdl:[ requirements = (other.GlueCEStateStatus == "Production"); Rank = -2 * other.GlueCEStateWaitingJobs; Executable = "/bin/ls"; prologue = "/bin/false"; shallowretrycount = -1; RetryCount = 3; usertags = [ exe = "deepresub" ]; requirements=regexp("8443/cream", other.GlueCEUniqueID); ]The job should file after 3 resubmissions: ======================= glite-wms-job-status Success ===================== BOOKKEEPING INFORMATION: Status info for the Job : https://devel09.cnaf.infn.it:9000/L... Current Status: Aborted Logged Reason(s): - Prologue failed with error 1 - reason=1; Prologue failed with error 1 - Transfer to CREAM failed due to exception: Failed to create a delegation id for job https://devel09.cnaf.infn.it:9000/L...: reason is Received NULL fault; the error is due to another cause: FaultString=[] - FaultCode=[SOAP-ENV:Server.generalException] - FaultSubCode=[SOAP-ENV:Server.generalException] - FaultDetail=[<ns1:DelegationException><msg>User C=IT,O=INFN,OU=Personal Certificate,L=Padova,CN=Alessio Gianelle not authorized for operation {http://www.gridsite.org/namespaces/...</msg></ns1:DelegationException><ns2:exceptionName xmlns:ns2="http://xml.apache.org/axis/">org.glite.security.delegation.DelegationException</ns2:exceptionName><ns3:hostname xmlns:ns3="http://xml.apache.org/axis/">cluster50.knu.ac.kr</ns3:hostname>] - Prologue failed with error 1 - Prologue failed with error 1 - Prologue failed with error 1 - reason=1; Prologue failed with error 1 Status Reason: hit job retry count (3) Destination: infnlns-ce-01.ct.pi2s2.it:8443/cream-lsf-cert Submitted: Thu Apr 21 16:47:33 2011 CESTInfact: [ale@cream-12 UI]$ glite-wms-job-logging-info --event Resubmission https://devel09.cnaf.infn.it:9000/L... ===================== glite-wms-job-logging-info Success ===================== LOGGING INFORMATION: Printing info for the Job : https://devel09.cnaf.infn.it:9000/L... --- Event: Resubmission - Result = WILLRESUB - Source = LogMonitor - Timestamp = Thu Apr 21 16:49:44 2011 CEST --- Event: Resubmission - Result = WILLRESUB - Source = WorkloadManager - Timestamp = Thu Apr 21 16:49:45 2011 CEST --- Event: Resubmission - Result = WILLRESUB - Source = LogMonitor - Timestamp = Thu Apr 21 16:49:50 2011 CEST --- Event: Resubmission - Result = WILLRESUB - Source = WorkloadManager - Timestamp = Thu Apr 21 16:49:51 2011 CEST --- Event: Resubmission - Result = WILLRESUB - Source = LogMonitor - Timestamp = Thu Apr 21 16:52:00 2011 CEST --- Event: Resubmission - Result = WILLRESUB - Source = WorkloadManager - Timestamp = Thu Apr 21 16:52:01 2011 CEST --- Event: Resubmission - Result = WILLRESUB - Source = LogMonitor - Timestamp = Thu Apr 21 16:54:11 2011 CEST ==========================================================================(Two events one from LM and the other from WM for every resubmission, plus the last try which failed because limit is reached.) Considered successful if the job is resubmitted the right number of times (three in this case). bug #73711: edg_wll_SetLoggingJobProxy with empty sequence code returns "no state in DB"Test description. This bug represents a development issue. It is simply verified by checking that submission for both a job and a collection returns the jobid.[mcecchi@cert-19 ~]$ glite-wms-job-submit -a -c devel11.conf job.jdl Connecting to the service https://devel11.cnaf.infn.it:7443/g... ====================== glite-wms-job-submit Success ====================== The job has been successfully submitted to the WMProxy Your job identifier is: https://devel11.cnaf.infn.it:9000/B... ========================================================================== [mcecchi@cert-19 ~]$ glite-wms-job-submit -a -c devel11.conf coll_10.jdl Connecting to the service https://devel11.cnaf.infn.it:7443/g... ====================== glite-wms-job-submit Success ====================== The job has been successfully submitted to the WMProxy Your job identifier is: https://devel11.cnaf.infn.it:9000/T... ==========================================================================Considered successful if the jobids are correctly returned. bug #73715: missing ReallyRunning event from LogMonitorTest description.[ale@cream-12 UI]$ glite-wms-job-logging-info -v 3 --event ReallyRunning https://devel09.cnaf.infn.it:9000/v... ===================== glite-wms-job-logging-info Success ===================== LOGGING INFORMATION: Printing info for the Job : https://devel09.cnaf.infn.it:9000/v... --- Event: ReallyRunning - Arrived = Thu Apr 21 16:08:28 2011 CEST - Host = grid003.fc.up.pt - Level = SYSTEM - Priority = synchronous - Seqcode = UI=000000:NS=0000000004:WM=000005:BH=0000000000:JSS=000002:LM=000002:LRMS=000003:APP=000000:LBS=000000 - Source = LRMS - Timestamp = Thu Apr 21 16:08:28 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle --- Event: ReallyRunning - Arrived = Thu Apr 21 16:09:43 2011 CEST - Host = devel09.cnaf.infn.it - Level = SYSTEM - Priority = synchronous - Seqcode = UI=000000:NS=0000000004:WM=000005:BH=0000000000:JSS=000002:LM=000007:LRMS=000000:APP=000000:LBS=000000 - Source = LogMonitor - Timestamp = Thu Apr 21 16:09:43 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy - Wn seq = UI=000000:NS=0000000004:WM=000005:BH=0000000000:JSS=000002:LM=000002:LRMS=000000:APP=000000:LBS=000000 Alessio Gianelle <ale> Project MemberConsidered successful if the reallyrunning event is effectively logged. bug #74259: Previous matches information is not taken into account if direct submission is usedTest description. Submit directly (i.e. use option -r[ale@cream-12 UI]$ glite-wms-job-logging-info -v 2 --event UserTag https://devel09.cnaf.infn.it:9000/Q... ===================== glite-wms-job-logging-info Success ===================== LOGGING INFORMATION: Printing info for the Job : https://devel09.cnaf.infn.it:9000/Q... --- Event: UserTag - Arrived = Fri Apr 22 10:10:37 2011 CEST - Host = devel09.cnaf.infn.it - Name = exe - Source = NetworkServer - Src instance = https://devel09.cnaf.infn.it:7443/g... - Timestamp = Fri Apr 22 10:10:37 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle - Value = deepresub --- Event: UserTag - Arrived = Fri Apr 22 10:10:39 2011 CEST - Host = devel09.cnaf.infn.it - Name = CEInfoHostName - Source = WorkloadManager - Src instance = 21624 - Timestamp = Fri Apr 22 10:10:39 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy/CN=proxy/CN=proxy - Value = alice16.spbu.ru --- Event: UserTag - Arrived = Fri Apr 22 10:10:41 2011 CEST - Host = devel09.cnaf.infn.it - Name = CEInfoHostName - Source = WorkloadManager - Src instance = 21624 - Timestamp = Fri Apr 22 10:10:41 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy/CN=proxy/CN=proxy - Value = alice16.spbu.ru --- Event: UserTag - Arrived = Fri Apr 22 10:10:43 2011 CEST - Host = devel09.cnaf.infn.it - Name = CEInfoHostName - Source = WorkloadManager - Src instance = 21624 - Timestamp = Fri Apr 22 10:10:43 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy/CN=proxy/CN=proxy - Value = alice16.spbu.ru --- Event: UserTag - Arrived = Fri Apr 22 10:10:45 2011 CEST - Host = devel09.cnaf.infn.it - Name = CEInfoHostName - Source = WorkloadManager - Src instance = 21624 - Timestamp = Fri Apr 22 10:10:45 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy/CN=proxy/CN=proxy - Value = alice16.spbu.ru ==========================================================================Considered successful if the tag "CEInfoHostName" reports the same value (equal to the chosen CE) for all the events. bug #74577: Wrong counter in ICE database is set at the job creationTest description. we can check that the first time a job is submitted the counter should be set to 1:[ale@cream-12 UI]$ glite-wms-job-submit -a -c etc/wmp_devel09.conf test.jdl Connecting to the service https://devel09.cnaf.infn.it:7443/g... ====================== glite-wms-job-submit Success ====================== The job has been successfully submitted to the WMProxy Your job identifier is: https://devel09.cnaf.infn.it:9000/Z... ========================================================================== and on the WMS: [root@devel09 persist_dir]# sqlite3 ice.db "select counter from proxy where myproxyurl='myproxy.cern.ch';" 1Considered successful if the counter is set to 1. bug #75223: wrong reason loggedTest description. submit a jdl like this one:executable="/bin/sleep"; arguments="6"; Stdoutput = "std.out"; OutputSandbox = {"std.out"}; requirements=regexp("8443/cream", other.GlueCEUniqueID); ShallowRetryCount = 2; prologue="/bin/false";and look for the Done failed reason in the logging info: [ale@cream-12 UI]$ glite-wms-job-logging-info -v 2 --event Done https://devel09.cnaf.infn.it:9000/T... ===================== glite-wms-job-logging-info Success ===================== LOGGING INFORMATION: Printing info for the Job : https://devel09.cnaf.infn.it:9000/T... --- Event: Done - Arrived = Thu Apr 21 13:31:27 2011 CEST - Exit code = 0 - Host = lcg1371.gridpp.rl.ac.uk - Reason = Prologue failed with error 1 - Source = LRMS - Status code = FAILED - Timestamp = Thu Apr 21 13:31:27 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle --- Event: Done - Arrived = Thu Apr 21 13:32:27 2011 CEST - Exit code = 0 - Host = devel09.cnaf.infn.it - Reason = Prologue failed with error 1 - Source = LogMonitor - Status code = FAILED - Timestamp = Thu Apr 21 13:32:27 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxyConsidered successful if the Reason contains only "Prologue failed with error 1". bug #75368: ICE should log a DONE_FAILED to LB every time the job is going to be resubmittedTest description. Submit a jdl to a failure Cream CE (i.e. a CE where you know that for some reasons job will fail) and check the logging-info, before "Resubmission" event ICE should log a "Done Failed" with the failure reason.Event: Done - Arrived = Thu Apr 21 13:24:09 2011 CEST - Exit code = 0 - Host = devel09.cnaf.infn.it - Reason = Transfer to CREAM failed due to exception: Failed to create a delegation id for job https://devel09.cnaf.infn.it:9000/6...: reason is Received NULL fault; the error is due to another cause: FaultString=[] - FaultCode=[SOAP-ENV:Server.generalException] - FaultSubCode=[SOAP-ENV:Server.generalException] - FaultDetail=[<ns1:DelegationException><msg>User C=IT,O=INFN,OU=Personal Certificate,L=Padova,CN=Alessio Gianelle not authorized for operation {http://www.gridsite.org/namespaces/...</msg></ns1:DelegationException><ns2:exceptionName xmlns:ns2="http://xml.apache.org/axis/">org.glite.security.delegation.DelegationException</ns2:exceptionName><ns3:hostname xmlns:ns3="http://xml.apache.org/axis/">foam.grid.kiae.ru</ns3:hostname>] - Source = LogMonitor - Status code = FAILED - Timestamp = Thu Apr 21 13:24:09 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy --- Event: Resubmission - Arrived = Thu Apr 21 13:24:09 2011 CEST - Host = devel09.cnaf.infn.it - Reason = Resubmitting because of exception Failed to create a delegation id for job https://devel09.cnaf.infn.it:9000/6...: reason is Received NULL fault; the error is due to another cause: FaultString=[] - FaultCode=[SOAP-ENV:Server.generalException] - FaultSubCode=[SOAP-ENV:Server.generalException] - FaultDetail=[<ns1:DelegationException><msg>User C=IT,O=INFN,OU=Personal Certificate,L=Padova,CN=Alessio Gianelle not authorized for operation {http://www.gridsite.org/namespaces/...</msg></ns1:DelegationException><ns2:exceptionName xmlns:ns2="http://xml.apache.org/axis/">org.glite.security.delegation.DelegationException</ns2:exceptionName><ns3:hostname xmlns:ns3="http://xml.apache.org/axis/">foam.grid.kiae.ru</ns3:hostname>] CEUrl https://foam.grid.kiae.ru:8443/ce-c... - Result = WILLRESUB - Source = LogMonitor - Tag = unavailable - Timestamp = Thu Apr 21 13:24:09 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy bug #75402: Synchronization loss between real validity of proxy and exp. time saved in ICE's databaseTest description. Rarely means that is not easy to reproduce, so we consider it hopefully fixed.bug #77004: Wrong myproxyserver string processing in ICETest description. Submit this jdl: executable="/bin/sleep"; arguments="6"; Stdoutput = "std.out"; OutputSandbox = {"std.out"}; requirements=regexp("8443/cream", other.GlueCEUniqueID); MyProxyServer = " "; Then check the database's table "proxy" of ice in the WMS to see if the proxy is associated to a fake myproxy server: [root@devel09 persist_dir]# sqlite3 ice.db "select * from proxy;" /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle-/dteam/Role=NULL/Capability=NULL||/var/ice/persist_dir/B23D0D7177A8B6234F1985493FA09FF41A4FA98C.proxy|1303415475|0 (the name of myproxy server should be in the second field)bug #77055: "MyProxyServer: wrong type caught for attribute" for parametric jobsTest description.[ale@cream-12 UI]$ cat param.jdl // Test parametric job. // The standar Output and Error are parametrizied such as the // enviroment variable: MYPATH_PARAM_. [ JobType = "parametric"; Executable = "/bin/env"; Environment = {"MYPATH_PARAM_=$PATH:/bin:/usr/bin:$HOME"}; StdOutput = "echo_PARAM_.out"; StdError = "echo_PARAM_.err"; OutputSandbox = {"echo_PARAM_.out","echo_PARAM_.err"}; ParameterStart = 0; ParameterStep = 2; Parameters = 10; myproxyserver=""; ] [ale@cream-12 UI]$ glite-wms-job-submit -a -c etc/wmp_cream-44.conf -o test.out param.jdl Connecting to the service https://cream-44.pd.infn.it:7443/gl... ====================== glite-wms-job-submit Success ====================== The job has been successfully submitted to the WMProxy Your job identifier is: https://cream-44.pd.infn.it:9000/H5... The job identifier has been saved in the following file: /home/ale/UI/test.out ==========================================================================Considered successful if the job is correctly submitted. bug #77694: Resource BDII for WMS needs to be revisitTest description. [glite@cream-44 ~]$ cat /opt/glite/yaim/node-info.d/glite-wms | grep config_gip_only [glite@cream-44 ~]$ Considered successful if the function config_gip_only has been removed.bug #77876: While purging DAGs/Collections the CLEAR event is only logged for the parent nodeTest description. Forcing a run of the cron purger you can then check the logging info of a "parent" node:[ale@cream-12 UI]$ glite-wms-job-logging-info -v 2 https://cream-44.pd.infn.it:9000/vR... ===================== glite-wms-job-logging-info Success ===================== LOGGING INFORMATION: Printing info for the Job : https://cream-44.pd.infn.it:9000/vR... --- Event: RegJob - Arrived = Wed Apr 20 14:01:25 2011 CEST - Host = cream-44.pd.infn.it - Jobtype = COLLECTION - Ns = https://cream-44.pd.infn.it:7443/gl... - Nsubjobs = 100 - Seed = WMPROXY - Source = NetworkServer - Src instance = https://cream-44.pd.infn.it:7443/gl... - Timestamp = Wed Apr 20 14:01:25 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle --- Event: Accepted - Arrived = Wed Apr 20 14:01:34 2011 CEST - From = NetworkServer - From host = cream-12.pd.infn.it - Host = cream-44.pd.infn.it - Source = NetworkServer - Src instance = https://cream-44.pd.infn.it:7443/gl... - Timestamp = Wed Apr 20 14:01:34 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle --- Event: EnQueued - Arrived = Wed Apr 20 14:01:34 2011 CEST - Host = cream-44.pd.infn.it - Queue = /var/workload_manager/jobdir - Result = START - Source = NetworkServer - Src instance = https://cream-44.pd.infn.it:7443/gl... - Timestamp = Wed Apr 20 14:01:34 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle --- Event: EnQueued - Arrived = Wed Apr 20 14:01:57 2011 CEST - Host = cream-44.pd.infn.it - Queue = /var/workload_manager/jobdir - Result = OK - Source = NetworkServer - Src instance = https://cream-44.pd.infn.it:7443/gl... - Timestamp = Wed Apr 20 14:01:57 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle --- Event: DeQueued - Arrived = Wed Apr 20 14:05:03 2011 CEST - Host = cream-44.pd.infn.it - Queue = /var/workload_manager/jobdir - Source = WorkloadManager - Src instance = 24901 - Timestamp = Wed Apr 20 14:05:03 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy --- Event: CollectionState - Arrived = Wed Apr 20 14:05:11 2011 CEST - Child = https://cream-44.pd.infn.it:9000/_y... - Child event = Running - Done code = 0 - Histogram = Submitted=0, Waiting=0, Ready=0, Scheduled=0, Running=1, Done=0, Cleared=0, Aborted=0, Cancelled=0, Unknown=99, Purged=0 - Host = cream-44.pd.infn.it - Source = LBServer - State = Running - Timestamp = Wed Apr 20 14:05:11 2011 CEST - User = /C=IT/O=INFN/OU=Host/L=Padova/CN=cream-44.pd.infn.it --- Event: CollectionState - Arrived = Wed Apr 20 14:29:59 2011 CEST - Child = https://cream-44.pd.infn.it:9000/H3... - Child event = Done - Done code = 0 - Histogram = Submitted=0, Waiting=0, Ready=0, Scheduled=0, Running=0, Done=82, Cleared=0, Aborted=7, Cancelled=0, Unknown=11, Purged=0 - Host = cream-44.pd.infn.it - Source = LBServer - State = Waiting - Timestamp = Wed Apr 20 14:29:59 2011 CEST - User = LBProxy --- Event: Clear - Arrived = Thu Apr 21 10:13:20 2011 CEST - Host = cream-44.pd.infn.it - Reason = 2 - Source = NetworkServer - Src instance = 20407 - Timestamp = Thu Apr 21 10:13:20 2011 CEST - User = /C=IT/O=INFN/OU=Host/L=Padova/CN=cream-44.pd.infn.it/CN=431015228 ==========================================================================Or you can check the logging-info on a parent node after the output retrieval: [ale@cream-12 UI]$ glite-wms-job-logging-info -v 2 https://cream-44.pd.infn.it:9000/tn... ===================== glite-wms-job-logging-info Success ===================== LOGGING INFORMATION: Printing info for the Job : https://cream-44.pd.infn.it:9000/tn... --- Event: RegJob - Arrived = Thu Apr 21 10:18:34 2011 CEST - Host = cream-44.pd.infn.it - Jobtype = COLLECTION - Ns = https://cream-44.pd.infn.it:7443/gl... - Nsubjobs = 3 - Seed = WMPROXY - Source = NetworkServer - Src instance = https://cream-44.pd.infn.it:7443/gl... - Timestamp = Thu Apr 21 10:18:34 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle --- Event: Accepted - Arrived = Thu Apr 21 10:18:34 2011 CEST - From = NetworkServer - From host = cream-12.pd.infn.it - Host = cream-44.pd.infn.it - Source = NetworkServer - Src instance = https://cream-44.pd.infn.it:7443/gl... - Timestamp = Thu Apr 21 10:18:34 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle --- Event: EnQueued - Arrived = Thu Apr 21 10:18:34 2011 CEST - Host = cream-44.pd.infn.it - Queue = /var/workload_manager/jobdir - Result = START - Source = NetworkServer - Src instance = https://cream-44.pd.infn.it:7443/gl... - Timestamp = Thu Apr 21 10:18:34 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle --- Event: EnQueued - Arrived = Thu Apr 21 10:18:35 2011 CEST - Host = cream-44.pd.infn.it - Queue = /var/workload_manager/jobdir - Result = OK - Source = NetworkServer - Src instance = https://cream-44.pd.infn.it:7443/gl... - Timestamp = Thu Apr 21 10:18:35 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle --- Event: DeQueued - Arrived = Thu Apr 21 10:18:36 2011 CEST - Host = cream-44.pd.infn.it - Queue = /var/workload_manager/jobdir - Source = WorkloadManager - Src instance = 15496 - Timestamp = Thu Apr 21 10:18:36 2011 CEST - User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy --- Event: CollectionState - Arrived = Thu Apr 21 10:18:42 2011 CEST - Child = https://cream-44.pd.infn.it:9000/d_... - Child event = Running - Done code = 0 - Histogram = Submitted=0, Waiting=0, Ready=0, Scheduled=0, Running=1, Done=0, Cleared=0, Aborted=0, Cancelled=0, Unknown=2, Purged=0 - Host = cream-44.pd.infn.it - Source = LBServer - State = Running - Timestamp = Thu Apr 21 10:18:42 2011 CEST - User = /C=IT/O=INFN/OU=Host/L=Padova/CN=cream-44.pd.infn.it --- Event: CollectionState - Arrived = Thu Apr 21 10:19:50 2011 CEST - Child = https://cream-44.pd.infn.it:9000/zo... - Child event = Done - Done code = 0 - Histogram = Submitted=0, Waiting=0, Ready=0, Scheduled=0, Running=0, Done=2, Cleared=0, Aborted=0, Cancelled=0, Unknown=1, Purged=0 - Host = cream-44.pd.infn.it - Source = LBServer - State = Waiting - Timestamp = Thu Apr 21 10:19:50 2011 CEST - User = LBProxy --- Event: CollectionState - Arrived = Thu Apr 21 10:24:12 2011 CEST - Child = https://cream-44.pd.infn.it:9000/as... - Child event = Running - Done code = 0 - Histogram = Submitted=0, Waiting=0, Ready=0, Scheduled=0, Running=1, Done=2, Cleared=0, Aborted=0, Cancelled=0, Unknown=0, Purged=0 - Host = cream-44.pd.infn.it - Source = LBServer - State = Running - Timestamp = Thu Apr 21 10:24:12 2011 CEST - User = /C=IT/O=INFN/OU=Host/L=Padova/CN=cream-44.pd.infn.it --- Event: CollectionState - Arrived = Thu Apr 21 10:28:04 2011 CEST - Child = https://cream-44.pd.infn.it:9000/as... - Child event = Done - Done code = 0 - Histogram = Submitted=0, Waiting=0, Ready=0, Scheduled=0, Running=0, Done=3, Cleared=0, Aborted=0, Cancelled=0, Unknown=0, Purged=0 - Host = cream-44.pd.infn.it - Source = LBServer - State = Done - Timestamp = Thu Apr 21 10:28:04 2011 CEST - User = LBProxy --- Event: CollectionState - Arrived = Thu Apr 21 10:33:00 2011 CEST - Child = https://cream-44.pd.infn.it:9000/zo... - Child event = Clear - Done code = 0 - Histogram = Submitted=0, Waiting=0, Ready=0, Scheduled=0, Running=0, Done=0, Cleared=3, Aborted=0, Cancelled=0, Unknown=0, Purged=0 - Host = cream-44.pd.infn.it - Source = LBServer - State = Cleared - Timestamp = Thu Apr 21 10:33:00 2011 CEST - User = LBProxy --- Event: Clear - Arrived = Thu Apr 21 10:33:00 2011 CEST - Host = cream-44.pd.infn.it - Reason = 1 - Source = NetworkServer - Src instance = 21720 - Timestamp = Thu Apr 21 10:33:00 2011 CEST - User = /C=IT/O=INFN/OU=Host/L=Padova/CN=cream-44.pd.infn.it/CN=431015228 bug #78030: Alternative GLITE_WMS_LOG_DESTINATION in the jobwrapperTest description. Stop the gatekeeper on the CE and check if the JW is able to log (looking for LRMS events into the logging-info).Test description. Submit a lot of jobs setting EnableWmsFeedback = true; in the jdl to trigger a replanning and then looking into the wm log file:20 Apr, 14:17:10 -I: [Info] operator()(/home/condor/execute/dir_24787/userdir/emi.wms.wms-manager/src/replanner.cpp:237): created replanning request for job https://cream-44.pd.infn.it:9000/JE... with token /var/SandboxDir/JE/https_3a_2f_2fcream-44.pd.infn.it_3a9000_2fJE5axeaAk2SU_5fG1ij5ljNg/token.txt_1you should see that replanning works as expected. [ale@cream-12 UI]$ glite-wms-job-status https://cream-44.pd.infn.it:9000/JE... ======================= glite-wms-job-status Success ===================== BOOKKEEPING INFORMATION: Status info for the Job : https://cream-44.pd.infn.it:9000/JE... Current Status: Done (Success) Logged Reason(s): - Cannot take token - job completed - Job Terminated Successfully Exit code: 0 Status Reason: Job Terminated Successfully Destination: ce202.cern.ch:8443/cream-lsf-grid_2nh_dteam Submitted: Wed Apr 20 14:01:25 2011 CEST Parent Job: https://cream-44.pd.infn.it:9000/vR... ========================================================================== bug #78406: [ yaim-wms ] yaim should set IsmIiLDAPCEFilterExt according to the supported VO(s)Test description.[root@devel11 libexec]# grep IsmIiLDAPCEFilterExt /etc/glite-wms/glite_wms.conf IsmIiLDAPCEFilterExt = "(|(GlueCEAccessControlBaseRule=VO:atlas)(GlueCEAccessControlBaseRule=VOMS:/atlas/)(GlueCEAccessControlBaseRule=VO:alice)(GlueCEAccessControlBaseRule=VOMS:/alice/)(GlueCEAccessControlBaseRule=VO:lhcb)(GlueCEAccessControlBaseRule=VOMS:/lhcb/)(GlueCEAccessControlBaseRule=VO:cms)(GlueCEAccessControlBaseRule=VOMS:/cms/)(GlueCEAccessControlBaseRule=VO:dteam)(GlueCEAccessControlBaseRule=VOMS:/dteam/)(GlueCEAccessControlBaseRule=VO:biomed)(GlueCEAccessControlBaseRule=VOMS:/biomed/)(GlueCEAccessControlBaseRule=VO:ops)(GlueCEAccessControlBaseRule=VOMS:/ops/)(GlueCEAccessControlBaseRule=VO:infngrid)(GlueCEAccessControlBaseRule=VOMS:/infngrid/))"; | |||||||
-- ElisabettaMolinari - 2010-02-24 |