Difference: EMI2CREAMTask31337 (14 vs. 15)

Revision 152012-08-30 - SergioTraldi

Line: 1 to 1
 
META TOPICPARENT name="TestingCreamDgas"

Testing report: IGIRTC-91

Line: 428 to 428
 
Bug #95593 CREAM cannot insert in the command queue if the lenght of the localUser field is > 14 chars FIXED
Added:
>
>
From CE:

Remeber to delete the user dteam* from /etc/passwd and if you submit job before this test my hint is to delete the contents of /etc/grid-security/gridmapdir

[root@cream-30 siteinfo]# grep dteam users.conf
18118:dteamlonguser001:2688:dteam:dteam::
18119:dteamlonguser002:2688:dteam:dteam::
50501:prddtmlonguser01:2689,2688:dteamprd,dteam:dteam:prd
50502:prddtmlonguser02:2689,2688:dteamprd,dteam:dteam:prd
60501:sgmdtmlonguser01:2690,2688:dteamsgm,dteam:dteam:sgm
60502:sgmdtmlonguser02:2690,2688:dteamsgm,dteam:dteam:sgm

[root@cream-30 ~]# /opt/glite/yaim/bin/yaim -c -d 6 -s /root/siteinfo/site-info.def -n creamCE -n TORQUE_server -n TORQUE_utils  2>&1 | tee /root/conf_EMI2-CE_Torque.`hostname -s`.`date +%Y%m%d-%H%M%S`.log


[root@cream-30 siteinfo]# grep dteam /etc/passwd
dteamlonguser001:x:18118:2688:mapped user for group dteam:/home/dteamlonguser001:/bin/bash
dteamlonguser002:x:18119:2688:mapped user for group dteam:/home/dteamlonguser002:/bin/bash
prddtmlonguser01:x:50501:2689:mapped user for group dteamprd:/home/prddtmlonguser01:/bin/bash
prddtmlonguser02:x:50502:2689:mapped user for group dteamprd:/home/prddtmlonguser02:/bin/bash
sgmdtmlonguser01:x:60501:2690:mapped user for group dteamsgm:/home/sgmdtmlonguser01:/bin/bash
sgmdtmlonguser02:x:60502:2690:mapped user for group dteamsgm:/home/sgmdtmlonguser02:/bin/bash

Note you have also to reconfigure with that users.conf the WNs in your cluster.

From UI:

 
Added:
>
>
[traldi@cert-25 ~]$ glite-ce-job-submit -r cream-30.pd.infn.it:8443/cream-pbs-cert -a simple2.jdl https://cream-30.pd.infn.it:8443/CREAM678472851
 
Added:
>
>
[traldi@cert-25 ~]$ glite-ce-job-status -L 2 https://cream-30.pd.infn.it:8443/CREAM678472851

**** JobID=[https://cream-30.pd.infn.it:8443/CREAM678472851] Current Status = [REALLY-RUNNING] Working Dir = reserved Grid JobID = [N/A] LRMS Abs JobID = reserved LRMS JobID = reserved Deleg Proxy ID = [60007c291353a7124c2b712fdca6fd3977226120] DelegProxyInfo = isRFC="false"; valid from="8/30/12 9:37 AM (GMT)"; valid to="8/30/12 8:36 PM (GMT)"; holder DN="CN=Sergio Traldi,L=Padova,OU=Personal Certificate,O=INFN,C=IT"; holder AC issuer="CN=proxy,CN=proxy,CN=Sergio Traldi,L=Padova,OU=Personal Certificate,O=INFN,C=IT"; VO="dteam"; AC issuer="CN=voms2.hellasgrid.gr,OU=hellasgrid.gr,O=HellasGrid,C=GR"; VOMS attributes={ /dteam/Role=NULL/Capability=NULL, /dteam/NGI_IT/Role=NULL/Capability=NULL } Worker Node = [N/A] Local User = [dteamlonguser002] CREAM ISB URI = [gsiftp://cream-30.pd.infn.it/var/cream_sandbox/dteam/CN_Sergio_Traldi_L_Padova_OU_Personal_Certificate_O_INFN_C_IT_dteam_Role_NULL_Capability_NULL_dteamlonguser002/67/CREAM678472851/ISB] CREAM OSB URI = [gsiftp://cream-30.pd.infn.it/var/cream_sandbox/dteam/CN_Sergio_Traldi_L_Padova_OU_Personal_Certificate_O_INFN_C_IT_dteam_Role_NULL_Capability_NULL_dteamlonguser002/67/CREAM678472851/OSB] JDL = BatchSystem = "pbs"; QueueName = "cert"; Executable = "/bin/sleep"; Arguments = "10"; VirtualOrganisation = "dteam" Type = [normal]

Job status changes: ------------------- Status = [REGISTERED] - [Thu 30 Aug 2012 11:42:11] (1346319731) Status = [PENDING] - [Thu 30 Aug 2012 11:42:12] (1346319732) Status = [RUNNING] - [Thu 30 Aug 2012 11:42:15] (1346319735) Status = [REALLY-RUNNING] - [Thu 30 Aug 2012 11:42:15] (1346319735)

Issued Commands: -------------------

* Command Name = [JOB_REGISTER] Command Category = [JOB_MANAGEMENT] Command Status = [SUCCESSFULL] Creation Time = [Thu 30 Aug 2012 11:42:11] (1346319731) Start Scheduling Time = [Thu 30 Aug 2012 11:42:11] (1346319731) Start Processing Time = [Thu 30 Aug 2012 11:42:11] (1346319731) Execution Completed Time = [Thu 30 Aug 2012 11:42:11] (1346319731)

* Command Name = [JOB_START] Command Category = [JOB_MANAGEMENT] Command Status = [CREATED] Creation Time = [Thu 30 Aug 2012 11:42:11] (1346319731) Start Scheduling Time = [Thu 30 Aug 2012 11:42:12] (1346319732) Start Processing Time = [Thu 30 Aug 2012 11:42:12] (1346319732)

[traldi@cert-25 ~]$ glite-ce-job-status https://cream-30.pd.infn.it:8443/CREAM678472851

**** JobID=[https://cream-30.pd.infn.it:8443/CREAM678472851] Status = [DONE-OK] ExitCode = [0]

 
Line: 458 to 538
 
Bug #93091 Add some resubmission machinery to CREAM FIXED
Added:
>
>
From UI:
[traldi@cert-25 ~]$ glite-ce-job-submit -r cream-30.pd.infn.it:8443/cream-pbs-cert -a simple2.jdl 
https://cream-30.pd.infn.it:8443/CREAM406480779


[traldi@cert-25 ~]$ glite-ce-job-status https://cream-30.pd.infn.it:8443/CREAM406480779

******  JobID=[https://cream-30.pd.infn.it:8443/CREAM406480779]
        Status        = [PENDING]

From CE:

[root@cream-30 siteinfo]# grep sleep /usr/libexec/pbs_submit.sh
sleep 10m


[root@cream-30 siteinfo]# tail -f /var/log/cream/glite-ce-cream.log
30 Aug 2012 11:59:55,508 org.glite.ce.commonj.authz.gjaf.VomsServicePDP - VOMS attribute authorized: /dteam/Role=NULL/Capability=NULL
30 Aug 2012 11:59:55,516 org.glite.ce.commonj.authz.axis2.AuthorizationHandler - request for OPERATION={http://glite.org/2007/11/ce/cream/types}JobStatus; REMOTE_REQUEST_ADDRESS=193.206.210.252; USER_DN=CN=Sergio Traldi,L=Padova,OU=Personal Certificate,O=INFN,C=IT; USER_FQAN={ /dteam/Role=NULL/Capability=NULL; /dteam/NGI_IT/Role=NULL/Capability=NULL; };  AUTHORIZED
30 Aug 2012 11:59:55,517 org.glite.ce.cream.cmdmanagement.CommandManager - new command [NAME="JOB_STATUS"; PRIORITY_LEVEL=1; IS_ASYNCHRONOUS=false; STATUS=ACCEPTED; CATEGORY="JOB_MANAGEMENT"; EXECUTOR_NAME="BLAHExecutor"; USER_ID="CN_Sergio_Traldi_L_Padova_OU_Personal_Certificate_O_INFN_C_IT_dteam_Role_NULL_Capability_NULL"; CREATION_TIME="Thu Aug 30 11:59:55 CEST 2012"; REMOTE_REQUEST_ADDRESS="193.206.210.252"; JOB_ID_LIST={ CREAM406480779 }; IS_ADMIN="false"; USER_FQAN={ /dteam/Role=NULL/Capability=NULL; /dteam/NGI_IT/Role=NULL/Capability=NULL }; USER_DN="CN=Sergio Traldi,L=Padova,OU=Personal Certificate,O=INFN,C=IT"]
30 Aug 2012 12:02:25,785 org.glite.ce.creamapi.jobmanagement.db.DBInfoManager - set submissionEnabled to 0 in the database
30 Aug 2012 12:02:25,785 org.glite.ce.creamapi.jobmanagement.cmdexecutor.JobSubmissionManager - AcceptNewJobs by script = true
30 Aug 2012 12:02:55,290 org.glite.ce.creamapi.jobmanagement.cmdexecutor.AbstractJobExecutor - submission to BLAH failed [jobId=CREAM406480779; reason=BLAH error: submission command failed (exit code = 143) (stdout:) (stderr: <blah> execute_cmd: 200 seconds timeout expired, killing child process.-) N/A (jobId = CREAM406480779); retry count=1/3]

 
 
This site is powered by the TWiki collaboration platformCopyright © 2008-2021 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback