Difference: WmsTestPlan (26 vs. 27)

Revision 27 - 2011-06-28 - FabioCapannini

Line: 1 to 1
 

WMS Test Plan

Line: 1428 to 1428
 

bug #68944: Bug in ICE's start/stop script

Added:
Test description.

[root@devel09 ~]# ps aux |grep ice
glite     2716  0.0  0.0 220996  2192 ?        Ss   09:36   0:00 /usr/bin/glite-wms-ice-safe --conf glite_wms.conf --daemon /tmp/glite-wms-ice-safe.pid
glite     2720  0.0  0.0  63856  1108 ?        S    09:36   0:00 sh -c /usr/bin/glite-wms-ice --conf glite_wms.conf /var/log/wms/ice_console.log 2>&1
glite     2721  0.0  0.1 483464  7404 ?        Sl   09:36   0:02 /usr/bin/glite-wms-ice --conf glite_wms.conf /var/log/wms/ice_console.log
root     31253  0.0  0.0  61184   788 pts/1    S+   14:29   0:00 grep ice
[root@devel09 ~]# /etc/init.d/glite-wms-ice restart
stopping ICE... ok
starting ICE... ok
[root@devel09 ~]# ps aux |grep ice
glite    31318  0.0  0.0 220996  2196 ?        Ss   14:30   0:00 /usr/bin/glite-wms-ice-safe --conf glite_wms.conf --daemon /tmp/glite-wms-ice-safe.pid
glite    31322  0.0  0.0  63856  1112 ?        S    14:30   0:00 sh -c /usr/bin/glite-wms-ice --conf glite_wms.conf /var/log/wms/ice_console.log 2>&1
glite    31323  1.0  0.1 483464  7352 ?        Sl   14:30   0:00 /usr/bin/glite-wms-ice --conf glite_wms.conf /var/log/wms/ice_console.log
root     31363  0.0  0.0  61180   784 pts/1    S+   14:30   0:00 grep ice

Considered successful if ice processes are restarted.
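
The check above can be scripted by comparing the ICE PIDs before and after the restart. This is only a sketch: the helper names ice_pids and ice_restarted are hypothetical, and it assumes the `ps aux` layout shown above (PID in column 2, command starting at column 11).

```shell
# Print the PIDs of ICE processes found in `ps aux` output read from stdin.
# Assumption: column 2 is the PID and column 11 is the start of the command.
ice_pids() {
    awk '$0 ~ /glite-wms-ice/ && $11 != "grep" { print $2 }' | sort -n
}

# Succeed only if the second PID list is not empty and shares no PID
# with the first one.
ice_restarted() {
    [ -n "$2" ] || return 1
    for pid in $1; do
        if printf '%s\n' $2 | grep -qx "$pid"; then
            return 1
        fi
    done
    return 0
}

# Possible usage on the WMS node:
#   before=$(ps aux | ice_pids)
#   /etc/init.d/glite-wms-ice restart
#   after=$(ps aux | ice_pids)
#   ice_restarted "$before" "$after" && echo "ICE processes restarted"
```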

bug #70061: WMS hates collections with 192 nodes!

Test description.

[mcecchi@cert-19 ~]$ glite-wms-job-submit -a -c devel11.conf coll_192.jdl

Connecting to the service https://devel11.cnaf.infn.it:7443/g...

====================== glite-wms-job-submit Success ======================

The job has been successfully submitted to the WMProxy
Your job identifier is:

https://devel11.cnaf.infn.it:9000/T...

==========================================================================

Considered successful if the collection is submitted and executed correctly.

bug #70331: glite-wms-create-proxy "ambiguous redirect"

Test description.

[root@cream-44 cron.d]# su - glite
[glite@cream-44 ~]$ /usr/sbin/glite-wms-create-proxy.sh /var/wms.proxy /var/log/wms/create_proxy.log
[glite@cream-44 ~]$

Considered successful if the command works with no error messages.

bug #70824: environment values in JDL cannot have spaces

Test description.

Submit this jdl:

[
Executable = "/bin/env";
Stdoutput = "env.out" ;
StdError = "env.err" ;
shallowretrycount = 2;
retrycount = 3;
OutputSandbox = { "env.out" ,"env.err" } ;
Environment = {
"ATHENA_RUN_EVENTS='[(152345L, 216721L)]'"
};

]

======================= glite-wms-job-status Success =====================
BOOKKEEPING INFORMATION:

Status info for the Job : https://devel09.cnaf.infn.it:9000/g...
Current Status: Done (Success)
Exit code: 0
Status Reason: Job terminated successfully
Destination: gridgate.ucc.ie:2119/jobmanager-pbs-sixhour
Submitted: Thu Apr 21 17:11:46 2011 CEST
==========================================================================

After output retrieval:

[ale@cream-12 UI]$ glite-wms-job-output https://devel09.cnaf.infn.it:9000/g...

Connecting to the service https://devel09.cnaf.infn.it:7443/g...

================================================================================

JOB GET OUTPUT OUTCOME

Output sandbox files for the job:
https://devel09.cnaf.infn.it:9000/g...
have been successfully retrieved and stored in the directory:
/tmp/jobOutput/ale_gvpbgkBN_KkLhly-uNjpxQ

================================================================================

Check the output file:

[ale@cream-12 UI]$ cat /tmp/jobOutput/ale_gvpbgkBN_KkLhly-uNjpxQ/env.out | grep ATHENA
ATHENA_RUN_EVENTS=[(152345L, 216721L)] 

Considered successful if the environment contains the variable with spaces.
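
A minimal sketch of the same check as a reusable function. The name env_has_value is hypothetical; the value is compared literally, so spaces, brackets and parentheses need no escaping, and trailing whitespace in the output file is ignored.

```shell
# Succeed if the environment dump read from stdin contains a line that is
# exactly NAME=VALUE (trailing whitespace ignored).
# $1 = variable name, $2 = expected value.
env_has_value() {
    WANT="$1=$2" awk '
        {
            line = $0
            sub(/[ \t\r]+$/, "", line)
            if (line == ENVIRON["WANT"]) found = 1
        }
        END { if (found) exit 0; exit 1 }
    '
}

# Possible usage (env.out is the file retrieved above):
#   env_has_value ATHENA_RUN_EVENTS '[(152345L, 216721L)]' < env.out && echo OK
```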

bug #71863: JobWrapper tries to use "test -eq" for string comparison

Test description.

Looking into a JW:

fatal_error() # 1 - reason, 2 - transfer OSB
{
if [ "x${LM_done_failed_reason}" != "x" ]; then
jw_echo "LM_log_done_begin" # parsed 'as is' by LM
jw_echo "${LM_done_failed_reason}"
jw_echo "LM_log_done_end" # parsed 'as is' by LM
fi
jw_echo "$1"
log_done_failed 1 "$1"
if [ "x$2" == "xOSB" ]; then
OSB_transfer
fi
doExit 1
} 

Considered successful if the comparison is done via ==
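
To illustrate why the operator matters, here is a small sketch with two hypothetical functions. The JobWrapper above uses ==, which bash accepts as a synonym for = inside [ ]; the sketch uses the POSIX = so that it also runs under plain sh. The second function is only an assumed reconstruction of the pre-fix behaviour, not code taken from an old JobWrapper.

```shell
# String comparison: true when the argument is the text "OSB".
wants_osb_transfer() {
    [ "x$1" = "xOSB" ]
}

# Integer comparison applied to strings (assumed pre-fix form): `test`
# reports an error for non-numeric operands and returns a non-zero status,
# so the branch is never taken, even when both strings are identical.
wants_osb_transfer_broken() {
    [ "x$1" -eq "xOSB" ] 2>/dev/null
}
```

With these definitions, wants_osb_transfer OSB succeeds while wants_osb_transfer_broken OSB fails, which is the symptom described in the bug title.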

bug #73192: Submission failed due to a credential problem

Test description.

Simply try a submit:

[ale@cream-12 UI]$ glite-wms-job-submit -a -c etc/wmp_devel09.conf jdl/deepresub.jdl

Connecting to the service https://devel09.cnaf.infn.it:7443/g...

====================== glite-wms-job-submit Success ======================

The job has been successfully submitted to the WMProxy
Your job identifier is:

https://devel09.cnaf.infn.it:9000/L...

========================================================================== 

Considered successful if the job is correctly submitted.

bug #73699: Wrong retry count computation

Test description.

Submit this jdl:

[
requirements = (other.GlueCEStateStatus == "Production");
Rank = -2 * other.GlueCEStateWaitingJobs;
Executable = "/bin/ls";
prologue = "/bin/false";
shallowretrycount = -1;
RetryCount = 3;
usertags = [ exe = "deepresub" ];
requirements=regexp("8443/cream", other.GlueCEUniqueID);

]

The job should fail after 3 resubmissions:

======================= glite-wms-job-status Success =====================
BOOKKEEPING INFORMATION:

Status info for the Job : https://devel09.cnaf.infn.it:9000/L...
Current Status: Aborted
Logged Reason(s):
- Prologue failed with error 1
- reason=1; Prologue failed with error 1
- Transfer to CREAM failed due to exception: Failed to create a delegation id for job https://devel09.cnaf.infn.it:9000/L...: reason is Received NULL fault; the error is due to another cause: FaultString=[] - FaultCode=[SOAP-ENV:Server.generalException] - FaultSubCode=[SOAP-ENV:Server.generalException] - FaultDetail=[<ns1:DelegationException><msg>User C=IT,O=INFN,OU=Personal Certificate,L=Padova,CN=Alessio Gianelle not authorized for operation {http://www.gridsite.org/namespaces/...</msg></ns1:DelegationException><ns2:exceptionName xmlns:ns2="http://xml.apache.org/axis/">org.glite.security.delegation.DelegationException</ns2:exceptionName><ns3:hostname xmlns:ns3="http://xml.apache.org/axis/">cluster50.knu.ac.kr</ns3:hostname>]
- Prologue failed with error 1
- Prologue failed with error 1
- Prologue failed with error 1
- reason=1; Prologue failed with error 1
Status Reason: hit job retry count (3)
Destination: infnlns-ce-01.ct.pi2s2.it:8443/cream-lsf-cert
Submitted: Thu Apr 21 16:47:33 2011 CEST

In fact:

[ale@cream-12 UI]$ glite-wms-job-logging-info --event Resubmission https://devel09.cnaf.infn.it:9000/L...

===================== glite-wms-job-logging-info Success =====================

LOGGING INFORMATION:

Printing info for the Job : https://devel09.cnaf.infn.it:9000/L...

---
Event: Resubmission
- Result = WILLRESUB
- Source = LogMonitor
- Timestamp = Thu Apr 21 16:49:44 2011 CEST
---
Event: Resubmission
- Result = WILLRESUB
- Source = WorkloadManager
- Timestamp = Thu Apr 21 16:49:45 2011 CEST
---
Event: Resubmission
- Result = WILLRESUB
- Source = LogMonitor
- Timestamp = Thu Apr 21 16:49:50 2011 CEST
---
Event: Resubmission
- Result = WILLRESUB
- Source = WorkloadManager
- Timestamp = Thu Apr 21 16:49:51 2011 CEST
---
Event: Resubmission
- Result = WILLRESUB
- Source = LogMonitor
- Timestamp = Thu Apr 21 16:52:00 2011 CEST
---
Event: Resubmission
- Result = WILLRESUB
- Source = WorkloadManager
- Timestamp = Thu Apr 21 16:52:01 2011 CEST
---
Event: Resubmission
- Result = WILLRESUB
- Source = LogMonitor
- Timestamp = Thu Apr 21 16:54:11 2011 CEST
==========================================================================

(Every resubmission produces two events, one from LM and one from WM, plus one more event for the last try, which failed because the limit was reached.)

Considered successful if the job is resubmitted the right number of times (three in this case).
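
Counting the events by hand is error-prone, so the count can be sketched as below. The function name is hypothetical, and treating the number of WorkloadManager events as the number of actual resubmissions is an assumption based on the note above.

```shell
# Count the Resubmission events logged by a given source in
# `glite-wms-job-logging-info --event Resubmission` output read from stdin.
# $1 = source name, e.g. WorkloadManager or LogMonitor.
count_resubmissions_from() {
    awk -v src="$1" '
        $1 == "Event:" { in_resub = ($2 == "Resubmission") }
        in_resub && $1 == "-" && $2 == "Source" && $4 == src { n++ }
        END { print n + 0 }
    '
}

# Possible usage (the job identifier is a placeholder):
#   glite-wms-job-logging-info --event Resubmission <jobid> \
#       | count_resubmissions_from WorkloadManager
```

For the log shown above this gives 3 for WorkloadManager and 4 for LogMonitor.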

bug #73711: edg_wll_SetLoggingJobProxy with empty sequence code returns "no state in DB"

Test description.

This bug represents a development issue. It is simply verified by checking that submission for both a job and a collection returns the jobid.

[mcecchi@cert-19 ~]$ glite-wms-job-submit -a -c devel11.conf job.jdl

Connecting to the service https://devel11.cnaf.infn.it:7443/g...

====================== glite-wms-job-submit Success ======================

The job has been successfully submitted to the WMProxy
Your job identifier is:

https://devel11.cnaf.infn.it:9000/B...

==========================================================================

[mcecchi@cert-19 ~]$ glite-wms-job-submit -a -c devel11.conf coll_10.jdl

Connecting to the service https://devel11.cnaf.infn.it:7443/g...

====================== glite-wms-job-submit Success ======================

The job has been successfully submitted to the WMProxy
Your job identifier is:

https://devel11.cnaf.infn.it:9000/T...

========================================================================== 

Considered successful if the jobids are correctly returned.

bug #73715: missing ReallyRunning event from LogMonitor

Test description.

[ale@cream-12 UI]$ glite-wms-job-logging-info -v 3 --event ReallyRunning https://devel09.cnaf.infn.it:9000/v...

===================== glite-wms-job-logging-info Success =====================

LOGGING INFORMATION:

Printing info for the Job : https://devel09.cnaf.infn.it:9000/v...

---
Event: ReallyRunning
- Arrived = Thu Apr 21 16:08:28 2011 CEST
- Host = grid003.fc.up.pt
- Level = SYSTEM
- Priority = synchronous
- Seqcode = UI=000000:NS=0000000004:WM=000005:BH=0000000000:JSS=000002:LM=000002:LRMS=000003:APP=000000:LBS=000000
- Source = LRMS
- Timestamp = Thu Apr 21 16:08:28 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle
---
Event: ReallyRunning
- Arrived = Thu Apr 21 16:09:43 2011 CEST
- Host = devel09.cnaf.infn.it
- Level = SYSTEM
- Priority = synchronous
- Seqcode = UI=000000:NS=0000000004:WM=000005:BH=0000000000:JSS=000002:LM=000007:LRMS=000000:APP=000000:LBS=000000
- Source = LogMonitor
- Timestamp = Thu Apr 21 16:09:43 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy
- Wn seq = UI=000000:NS=0000000004:WM=000005:BH=0000000000:JSS=000002:LM=000002:LRMS=000000:APP=000000:LBS=000000

Considered successful if the ReallyRunning event from LogMonitor is actually logged.

bug #74259: Previous matches information is not taken into account if direct submission is used

Test description.

Submit directly (i.e. use option -r) a job which triggers a resubmission (e.g. prologue="/bin/false").

Then, when it fails, check the "UserTag" events:

[ale@cream-12 UI]$ glite-wms-job-logging-info -v 2 --event UserTag https://devel09.cnaf.infn.it:9000/Q...
===================== glite-wms-job-logging-info Success =====================

LOGGING INFORMATION:

Printing info for the Job : https://devel09.cnaf.infn.it:9000/Q...

---
Event: UserTag
- Arrived = Fri Apr 22 10:10:37 2011 CEST
- Host = devel09.cnaf.infn.it
- Name = exe
- Source = NetworkServer
- Src instance = https://devel09.cnaf.infn.it:7443/g...
- Timestamp = Fri Apr 22 10:10:37 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle
- Value = deepresub
---
Event: UserTag
- Arrived = Fri Apr 22 10:10:39 2011 CEST
- Host = devel09.cnaf.infn.it
- Name = CEInfoHostName
- Source = WorkloadManager
- Src instance = 21624
- Timestamp = Fri Apr 22 10:10:39 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy/CN=proxy/CN=proxy
- Value = alice16.spbu.ru
---
Event: UserTag
- Arrived = Fri Apr 22 10:10:41 2011 CEST
- Host = devel09.cnaf.infn.it
- Name = CEInfoHostName
- Source = WorkloadManager
- Src instance = 21624
- Timestamp = Fri Apr 22 10:10:41 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy/CN=proxy/CN=proxy
- Value = alice16.spbu.ru
---
Event: UserTag
- Arrived = Fri Apr 22 10:10:43 2011 CEST
- Host = devel09.cnaf.infn.it
- Name = CEInfoHostName
- Source = WorkloadManager
- Src instance = 21624
- Timestamp = Fri Apr 22 10:10:43 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy/CN=proxy/CN=proxy
- Value = alice16.spbu.ru
---
Event: UserTag
- Arrived = Fri Apr 22 10:10:45 2011 CEST
- Host = devel09.cnaf.infn.it
- Name = CEInfoHostName
- Source = WorkloadManager
- Src instance = 21624
- Timestamp = Fri Apr 22 10:10:45 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy/CN=proxy/CN=proxy
- Value = alice16.spbu.ru
==========================================================================

Considered successful if the tag "CEInfoHostName" reports the same value (equal to the chosen CE) for all the events.
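
The comparison can be sketched as follows. Both function names are hypothetical, and the parsing assumes the layout shown above, where the "Name" line of an event comes before its "Value" line.

```shell
# Print the distinct values of the CEInfoHostName user tag found in
# `glite-wms-job-logging-info --event UserTag` output read from stdin.
ce_host_values() {
    awk '
        $1 == "Event:" { name = "" }
        $1 == "-" && $2 == "Name" { name = $4 }
        $1 == "-" && $2 == "Value" && name == "CEInfoHostName" { print $4 }
    ' | sort -u
}

# Succeed if at least one CEInfoHostName tag is present and all of them
# carry the expected CE host name given as $1.
same_ce_for_all() {
    [ "$(ce_host_values)" = "$1" ]
}

# Possible usage (the job identifier is a placeholder):
#   glite-wms-job-logging-info -v 2 --event UserTag <jobid> \
#       | same_ce_for_all alice16.spbu.ru && echo OK
```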

bug #74577: Wrong counter in ICE database is set at the job creation

Test description.

Check that the counter is set to 1 the first time a job is submitted:

[ale@cream-12 UI]$ glite-wms-job-submit -a -c etc/wmp_devel09.conf test.jdl

Connecting to the service https://devel09.cnaf.infn.it:7443/g...

====================== glite-wms-job-submit Success ======================

The job has been successfully submitted to the WMProxy
Your job identifier is:

https://devel09.cnaf.infn.it:9000/Z...

==========================================================================

and on the WMS:
[root@devel09 persist_dir]# sqlite3 ice.db "select counter from proxy where myproxyurl='myproxy.cern.ch';"
1 

Considered successful if the counter is set to 1.

bug #75223: wrong reason logged

Test description.

Submit a jdl like this one:

executable="/bin/sleep";
arguments="6";
Stdoutput = "std.out";
OutputSandbox = {"std.out"};
requirements=regexp("8443/cream", other.GlueCEUniqueID);
ShallowRetryCount = 2;
prologue="/bin/false";

and look for the Done failed reason in the logging info:

[ale@cream-12 UI]$ glite-wms-job-logging-info -v 2 --event Done https://devel09.cnaf.infn.it:9000/T...

===================== glite-wms-job-logging-info Success =====================

LOGGING INFORMATION:

Printing info for the Job : https://devel09.cnaf.infn.it:9000/T...

---
Event: Done
- Arrived = Thu Apr 21 13:31:27 2011 CEST
- Exit code = 0
- Host = lcg1371.gridpp.rl.ac.uk
- Reason = Prologue failed with error 1
- Source = LRMS
- Status code = FAILED
- Timestamp = Thu Apr 21 13:31:27 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle
---
Event: Done
- Arrived = Thu Apr 21 13:32:27 2011 CEST
- Exit code = 0
- Host = devel09.cnaf.infn.it
- Reason = Prologue failed with error 1
- Source = LogMonitor
- Status code = FAILED
- Timestamp = Thu Apr 21 13:32:27 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy 

Considered successful if the Reason contains only "Prologue failed with error 1".
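
A sketch of this check, assuming each reason is printed on a single line starting with "- Reason = " as in the output above. The function name all_reasons_are is hypothetical.

```shell
# Succeed if the logging-info output read from stdin has at least one
# "Reason" line and every such line is exactly the expected text in $1.
all_reasons_are() {
    EXPECTED="$1" awk '
        index($0, "- Reason = ") == 1 {
            seen = 1
            reason = substr($0, 12)          # text after "- Reason = "
            sub(/[ \t\r]+$/, "", reason)     # ignore trailing whitespace
            if (reason != ENVIRON["EXPECTED"]) bad = 1
        }
        END { if (seen && !bad) exit 0; exit 1 }
    '
}

# Possible usage (the job identifier is a placeholder):
#   glite-wms-job-logging-info -v 2 --event Done <jobid> \
#       | all_reasons_are 'Prologue failed with error 1' && echo OK
```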

bug #75368: ICE should log a DONE_FAILED to LB every time the job is going to be resubmitted

Test description.

Submit a jdl to a failing CREAM CE (i.e. a CE where you know that, for some reason, the job will fail) and check the logging-info: before the "Resubmission" event, ICE should log a "Done Failed" event with the failure reason.

Event: Done
- Arrived = Thu Apr 21 13:24:09 2011 CEST
- Exit code = 0
- Host = devel09.cnaf.infn.it
- Reason = Transfer to CREAM failed due to exception: Failed to create a delegation id for job https://devel09.cnaf.infn.it:9000/6...: reason is Received NULL fault; the error is due to another cause: FaultString=[] - FaultCode=[SOAP-ENV:Server.generalException] - FaultSubCode=[SOAP-ENV:Server.generalException] - FaultDetail=[<ns1:DelegationException><msg>User C=IT,O=INFN,OU=Personal Certificate,L=Padova,CN=Alessio Gianelle not authorized for operation {http://www.gridsite.org/namespaces/...</msg></ns1:DelegationException><ns2:exceptionName xmlns:ns2="http://xml.apache.org/axis/">org.glite.security.delegation.DelegationException</ns2:exceptionName><ns3:hostname xmlns:ns3="http://xml.apache.org/axis/">foam.grid.kiae.ru</ns3:hostname>]
- Source = LogMonitor
- Status code = FAILED
- Timestamp = Thu Apr 21 13:24:09 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy
---
Event: Resubmission
- Arrived = Thu Apr 21 13:24:09 2011 CEST
- Host = devel09.cnaf.infn.it
- Reason = Resubmitting because of exception Failed to create a delegation id for job https://devel09.cnaf.infn.it:9000/6...: reason is Received NULL fault; the error is due to another cause: FaultString=[] - FaultCode=[SOAP-ENV:Server.generalException] - FaultSubCode=[SOAP-ENV:Server.generalException] - FaultDetail=[<ns1:DelegationException><msg>User C=IT,O=INFN,OU=Personal Certificate,L=Padova,CN=Alessio Gianelle not authorized for operation {http://www.gridsite.org/namespaces/...</msg></ns1:DelegationException><ns2:exceptionName xmlns:ns2="http://xml.apache.org/axis/">org.glite.security.delegation.DelegationException</ns2:exceptionName><ns3:hostname xmlns:ns3="http://xml.apache.org/axis/">foam.grid.kiae.ru</ns3:hostname>] CEUrl https://foam.grid.kiae.ru:8443/ce-c...
- Result = WILLRESUB
- Source = LogMonitor
- Tag = unavailable
- Timestamp = Thu Apr 21 13:24:09 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy
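
The ordering of the two events can be checked with a sketch like the one below. The function name is hypothetical, and requiring the Done event to come immediately before the Resubmission event is an assumption drawn from the output above.

```shell
# Succeed if the logging-info output read from stdin contains at least one
# Resubmission event and each of them directly follows a Done event whose
# status code is FAILED.
done_failed_before_resubmission() {
    awk '
        $1 == "Event:" {
            if ($2 == "Resubmission") {
                seen = 1
                # cur and failed still describe the previous event here
                if (!(cur == "Done" && failed)) bad = 1
            }
            cur = $2
            failed = 0
        }
        $1 == "-" && $2 == "Status" && $3 == "code" && $5 == "FAILED" { failed = 1 }
        END { if (seen && !bad) exit 0; exit 1 }
    '
}

# Possible usage (the job identifier is a placeholder):
#   glite-wms-job-logging-info -v 2 <jobid> | done_failed_before_resubmission
```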

bug #75402: Synchronization loss between real validity of proxy and exp. time saved in ICE's database

Test description.

The problem occurs only rarely, so it is not easy to reproduce; we consider it, hopefully, fixed.

bug #77004: Wrong myproxyserver string processing in ICE

Test description.

Submit this jdl:

executable="/bin/sleep";
arguments="6";
Stdoutput = "std.out";
OutputSandbox = {"std.out"};
requirements=regexp("8443/cream", other.GlueCEUniqueID);
MyProxyServer = " ";

Then check the "proxy" table of the ICE database on the WMS to see if the proxy is associated with a fake myproxy server:

[root@devel09 persist_dir]# sqlite3 ice.db "select * from proxy;"
/C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle-/dteam/Role=NULL/Capability=NULL||/var/ice/persist_dir/B23D0D7177A8B6234F1985493FA09FF41A4FA98C.proxy|1303415475|0

(the name of myproxy server should be in the second field)

bug #77055: "MyProxyServer: wrong type caught for attribute" for parametric jobs

Test description.

[ale@cream-12 UI]$ cat param.jdl
// Test parametric job.
// The standar Output and Error are parametrizied such as the
// enviroment variable: MYPATH_PARAM_.
[
JobType = "parametric";
Executable = "/bin/env";
Environment = {"MYPATH_PARAM_=$PATH:/bin:/usr/bin:$HOME"};
StdOutput = "echo_PARAM_.out";
StdError = "echo_PARAM_.err";
OutputSandbox = {"echo_PARAM_.out","echo_PARAM_.err"};
ParameterStart = 0;
ParameterStep = 2;
Parameters = 10;
myproxyserver="";
]
[ale@cream-12 UI]$ glite-wms-job-submit -a -c etc/wmp_cream-44.conf -o test.out param.jdl

Connecting to the service https://cream-44.pd.infn.it:7443/gl...

====================== glite-wms-job-submit Success ======================

The job has been successfully submitted to the WMProxy
Your job identifier is:

https://cream-44.pd.infn.it:9000/H5...

The job identifier has been saved in the following file:
/home/ale/UI/test.out

========================================================================== 

Considered successful if the job is correctly submitted.

bug #77694: Resource BDII for WMS needs to be revisit

Test description.

[glite@cream-44 ~]$ cat /opt/glite/yaim/node-info.d/glite-wms | grep config_gip_only
[glite@cream-44 ~]$

Considered successful if the function config_gip_only has been removed.
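
An empty grep result is easy to misread when the file itself is missing, so a sketch of a stricter check follows. The function name name_absent_from is hypothetical.

```shell
# Succeed if the file $2 exists and does not contain the literal text $1.
name_absent_from() {
    [ -f "$2" ] && ! grep -qF -- "$1" "$2"
}

# Possible usage on the WMS node:
#   name_absent_from config_gip_only /opt/glite/yaim/node-info.d/glite-wms \
#       && echo "config_gip_only removed"
```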

bug #77876: While purging DAGs/Collections the CLEAR event is only logged for the parent node

Test description.

After forcing a run of the cron purger, check the logging info of a "parent" node:

[ale@cream-12 UI]$ glite-wms-job-logging-info -v 2 https://cream-44.pd.infn.it:9000/vR...

===================== glite-wms-job-logging-info Success =====================

LOGGING INFORMATION:

Printing info for the Job : https://cream-44.pd.infn.it:9000/vR...

---
Event: RegJob
- Arrived = Wed Apr 20 14:01:25 2011 CEST
- Host = cream-44.pd.infn.it
- Jobtype = COLLECTION
- Ns = https://cream-44.pd.infn.it:7443/gl...
- Nsubjobs = 100
- Seed = WMPROXY
- Source = NetworkServer
- Src instance = https://cream-44.pd.infn.it:7443/gl...
- Timestamp = Wed Apr 20 14:01:25 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle
---
Event: Accepted
- Arrived = Wed Apr 20 14:01:34 2011 CEST
- From = NetworkServer
- From host = cream-12.pd.infn.it
- Host = cream-44.pd.infn.it
- Source = NetworkServer
- Src instance = https://cream-44.pd.infn.it:7443/gl...
- Timestamp = Wed Apr 20 14:01:34 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle
---
Event: EnQueued
- Arrived = Wed Apr 20 14:01:34 2011 CEST
- Host = cream-44.pd.infn.it
- Queue = /var/workload_manager/jobdir
- Result = START
- Source = NetworkServer
- Src instance = https://cream-44.pd.infn.it:7443/gl...
- Timestamp = Wed Apr 20 14:01:34 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle
---
Event: EnQueued
- Arrived = Wed Apr 20 14:01:57 2011 CEST
- Host = cream-44.pd.infn.it
- Queue = /var/workload_manager/jobdir
- Result = OK
- Source = NetworkServer
- Src instance = https://cream-44.pd.infn.it:7443/gl...
- Timestamp = Wed Apr 20 14:01:57 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle
---
Event: DeQueued
- Arrived = Wed Apr 20 14:05:03 2011 CEST
- Host = cream-44.pd.infn.it
- Queue = /var/workload_manager/jobdir
- Source = WorkloadManager
- Src instance = 24901
- Timestamp = Wed Apr 20 14:05:03 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy
---
Event: CollectionState
- Arrived = Wed Apr 20 14:05:11 2011 CEST
- Child = https://cream-44.pd.infn.it:9000/_y...
- Child event = Running
- Done code = 0
- Histogram = Submitted=0, Waiting=0, Ready=0, Scheduled=0, Running=1, Done=0, Cleared=0, Aborted=0, Cancelled=0, Unknown=99, Purged=0
- Host = cream-44.pd.infn.it
- Source = LBServer
- State = Running
- Timestamp = Wed Apr 20 14:05:11 2011 CEST
- User = /C=IT/O=INFN/OU=Host/L=Padova/CN=cream-44.pd.infn.it
---
Event: CollectionState
- Arrived = Wed Apr 20 14:29:59 2011 CEST
- Child = https://cream-44.pd.infn.it:9000/H3...
- Child event = Done
- Done code = 0
- Histogram = Submitted=0, Waiting=0, Ready=0, Scheduled=0, Running=0, Done=82, Cleared=0, Aborted=7, Cancelled=0, Unknown=11, Purged=0
- Host = cream-44.pd.infn.it
- Source = LBServer
- State = Waiting
- Timestamp = Wed Apr 20 14:29:59 2011 CEST
- User = LBProxy
---
Event: Clear
- Arrived = Thu Apr 21 10:13:20 2011 CEST
- Host = cream-44.pd.infn.it
- Reason = 2
- Source = NetworkServer
- Src instance = 20407
- Timestamp = Thu Apr 21 10:13:20 2011 CEST
- User = /C=IT/O=INFN/OU=Host/L=Padova/CN=cream-44.pd.infn.it/CN=431015228
==========================================================================

Or you can check the logging-info on a parent node after the output retrieval:

[ale@cream-12 UI]$ glite-wms-job-logging-info -v 2 https://cream-44.pd.infn.it:9000/tn...

===================== glite-wms-job-logging-info Success =====================

LOGGING INFORMATION:

Printing info for the Job : https://cream-44.pd.infn.it:9000/tn...

---
Event: RegJob
- Arrived = Thu Apr 21 10:18:34 2011 CEST
- Host = cream-44.pd.infn.it
- Jobtype = COLLECTION
- Ns = https://cream-44.pd.infn.it:7443/gl...
- Nsubjobs = 3
- Seed = WMPROXY
- Source = NetworkServer
- Src instance = https://cream-44.pd.infn.it:7443/gl...
- Timestamp = Thu Apr 21 10:18:34 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle
---
Event: Accepted
- Arrived = Thu Apr 21 10:18:34 2011 CEST
- From = NetworkServer
- From host = cream-12.pd.infn.it
- Host = cream-44.pd.infn.it
- Source = NetworkServer
- Src instance = https://cream-44.pd.infn.it:7443/gl...
- Timestamp = Thu Apr 21 10:18:34 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle
---
Event: EnQueued
- Arrived = Thu Apr 21 10:18:34 2011 CEST
- Host = cream-44.pd.infn.it
- Queue = /var/workload_manager/jobdir
- Result = START
- Source = NetworkServer
- Src instance = https://cream-44.pd.infn.it:7443/gl...
- Timestamp = Thu Apr 21 10:18:34 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle
---
Event: EnQueued
- Arrived = Thu Apr 21 10:18:35 2011 CEST
- Host = cream-44.pd.infn.it
- Queue = /var/workload_manager/jobdir
- Result = OK
- Source = NetworkServer
- Src instance = https://cream-44.pd.infn.it:7443/gl...
- Timestamp = Thu Apr 21 10:18:35 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle
---
Event: DeQueued
- Arrived = Thu Apr 21 10:18:36 2011 CEST
- Host = cream-44.pd.infn.it
- Queue = /var/workload_manager/jobdir
- Source = WorkloadManager
- Src instance = 15496
- Timestamp = Thu Apr 21 10:18:36 2011 CEST
- User = /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy
---
Event: CollectionState
- Arrived = Thu Apr 21 10:18:42 2011 CEST
- Child = https://cream-44.pd.infn.it:9000/d_...
- Child event = Running
- Done code = 0
- Histogram = Submitted=0, Waiting=0, Ready=0, Scheduled=0, Running=1, Done=0, Cleared=0, Aborted=0, Cancelled=0, Unknown=2, Purged=0
- Host = cream-44.pd.infn.it
- Source = LBServer
- State = Running
- Timestamp = Thu Apr 21 10:18:42 2011 CEST
- User = /C=IT/O=INFN/OU=Host/L=Padova/CN=cream-44.pd.infn.it
---
Event: CollectionState
- Arrived = Thu Apr 21 10:19:50 2011 CEST
- Child = https://cream-44.pd.infn.it:9000/zo...
- Child event = Done
- Done code = 0
- Histogram = Submitted=0, Waiting=0, Ready=0, Scheduled=0, Running=0, Done=2, Cleared=0, Aborted=0, Cancelled=0, Unknown=1, Purged=0
- Host = cream-44.pd.infn.it
- Source = LBServer
- State = Waiting
- Timestamp = Thu Apr 21 10:19:50 2011 CEST
- User = LBProxy
---
Event: CollectionState
- Arrived = Thu Apr 21 10:24:12 2011 CEST
- Child = https://cream-44.pd.infn.it:9000/as...
- Child event = Running
- Done code = 0
- Histogram = Submitted=0, Waiting=0, Ready=0, Scheduled=0, Running=1, Done=2, Cleared=0, Aborted=0, Cancelled=0, Unknown=0, Purged=0
- Host = cream-44.pd.infn.it
- Source = LBServer
- State = Running
- Timestamp = Thu Apr 21 10:24:12 2011 CEST
- User = /C=IT/O=INFN/OU=Host/L=Padova/CN=cream-44.pd.infn.it
---
Event: CollectionState
- Arrived = Thu Apr 21 10:28:04 2011 CEST
- Child = https://cream-44.pd.infn.it:9000/as...
- Child event = Done
- Done code = 0
- Histogram = Submitted=0, Waiting=0, Ready=0, Scheduled=0, Running=0, Done=3, Cleared=0, Aborted=0, Cancelled=0, Unknown=0, Purged=0
- Host = cream-44.pd.infn.it
- Source = LBServer
- State = Done
- Timestamp = Thu Apr 21 10:28:04 2011 CEST
- User = LBProxy
---
Event: CollectionState
- Arrived = Thu Apr 21 10:33:00 2011 CEST
- Child = https://cream-44.pd.infn.it:9000/zo...
- Child event = Clear
- Done code = 0
- Histogram = Submitted=0, Waiting=0, Ready=0, Scheduled=0, Running=0, Done=0, Cleared=3, Aborted=0, Cancelled=0, Unknown=0, Purged=0
- Host = cream-44.pd.infn.it
- Source = LBServer
- State = Cleared
- Timestamp = Thu Apr 21 10:33:00 2011 CEST
- User = LBProxy
---
Event: Clear
- Arrived = Thu Apr 21 10:33:00 2011 CEST
- Host = cream-44.pd.infn.it
- Reason = 1
- Source = NetworkServer
- Src instance = 21720
- Timestamp = Thu Apr 21 10:33:00 2011 CEST
- User = /C=IT/O=INFN/OU=Host/L=Padova/CN=cream-44.pd.infn.it/CN=431015228 

bug #78030: Alternative GLITE_WMS_LOG_DESTINATION in the jobwrapper

Test description.

Stop the gatekeeper on the CE and check if the JW is able to log (looking for LRMS events into the logging-info).

Test description.

Submit a lot of jobs with EnableWmsFeedback = true; set in the jdl to trigger a replanning, then look into the wm log file:

20 Apr, 14:17:10 -I: [Info] operator()(/home/condor/execute/dir_24787/userdir/emi.wms.wms-manager/src/replanner.cpp:237): created replanning request for job https://cream-44.pd.infn.it:9000/JE... with token /var/SandboxDir/JE/https_3a_2f_2fcream-44.pd.infn.it_3a9000_2fJE5axeaAk2SU_5fG1ij5ljNg/token.txt_1

You should see that replanning works as expected.

[ale@cream-12 UI]$ glite-wms-job-status https://cream-44.pd.infn.it:9000/JE...

======================= glite-wms-job-status Success =====================
BOOKKEEPING INFORMATION:

Status info for the Job : https://cream-44.pd.infn.it:9000/JE...
Current Status: Done (Success)
Logged Reason(s):
- Cannot take token
- job completed
- Job Terminated Successfully
Exit code: 0
Status Reason: Job Terminated Successfully
Destination: ce202.cern.ch:8443/cream-lsf-grid_2nh_dteam
Submitted: Wed Apr 20 14:01:25 2011 CEST
Parent Job: https://cream-44.pd.infn.it:9000/vR...
==========================================================================

bug #78406: [ yaim-wms ] yaim should set IsmIiLDAPCEFilterExt according to the supported VO(s)

Test description.

[root@devel11 libexec]# grep IsmIiLDAPCEFilterExt /etc/glite-wms/glite_wms.conf
IsmIiLDAPCEFilterExt = "(|(GlueCEAccessControlBaseRule=VO:atlas)(GlueCEAccessControlBaseRule=VOMS:/atlas/)(GlueCEAccessControlBaseRule=VO:alice)(GlueCEAccessControlBaseRule=VOMS:/alice/)(GlueCEAccessControlBaseRule=VO:lhcb)(GlueCEAccessControlBaseRule=VOMS:/lhcb/)(GlueCEAccessControlBaseRule=VO:cms)(GlueCEAccessControlBaseRule=VOMS:/cms/)(GlueCEAccessControlBaseRule=VO:dteam)(GlueCEAccessControlBaseRule=VOMS:/dteam/)(GlueCEAccessControlBaseRule=VO:biomed)(GlueCEAccessControlBaseRule=VOMS:/biomed/)(GlueCEAccessControlBaseRule=VO:ops)(GlueCEAccessControlBaseRule=VOMS:/ops/)(GlueCEAccessControlBaseRule=VO:infngrid)(GlueCEAccessControlBaseRule=VOMS:/infngrid/))"; 
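
To compare the configured value with the supported VOs, the expected filter can be rebuilt from the VO list. This is a sketch only: the function name expected_ce_filter is hypothetical, and the pattern (one VO: clause and one VOMS: clause per VO, in the order given) is inferred from the value shown above, not from the yaim sources.

```shell
# Print the LDAP filter expected for the VO names given as arguments.
expected_ce_filter() {
    printf '(|'
    for vo in "$@"; do
        printf '(GlueCEAccessControlBaseRule=VO:%s)(GlueCEAccessControlBaseRule=VOMS:/%s/)' "$vo" "$vo"
    done
    printf ')\n'
}

# Possible usage: print the expected value and compare it by eye, or with
# grep -F, against the IsmIiLDAPCEFilterExt line in glite_wms.conf:
#   expected_ce_filter atlas alice lhcb cms dteam biomed ops infngrid
```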
  -- ElisabettaMolinari - 2010-02-24
 