Tags:
, view all tags

PATCH 3621

List Match

  • without data: Yes / Done
  • with data: No

Submission/GetOutput

  • Normal jobs through
    • ICE work: Yes / Done
    • JC work: Yes / Done
      • Sometimes the status of the job is not correctly computed due to a wrong WorkerNode SequenceCode:
        Event: ReallyRunning
        - Arrived                    =    Tue Feb 16 12:54:26 2010 CET
        - Host                       =    wms007.cnaf.infn.it
        - Level                      =    SYSTEM
        - Priority                   =    asynchronous
        - Seqcode                    =    UI=000000:NS=0000000005:WM=000004:BH=0000000000:JSS=000003:LM=000007:LRMS=000000:APP=000000:LBS=000000
        - Source                     =    LogMonitor
        - Src instance               =    unique
        - Timestamp                  =    Tue Feb 16 12:54:25 2010 CET
        - User                       =    /C=IT/O=INFN/OU=Personal Certificate/L=Padova/CN=Alessio Gianelle/CN=proxy/CN=proxy
        - Wn seq                     =    UI=000000:NS=0000000000:WM=000000:BH=0000000000:JSS=000000:LM=000000:LRMS=000003:APP=000000:LBS=000000
This happen with jobs sent to cclcgceliXX.in2p3.fr:2119/jobmanager-bqs-xxx CEs

  • Dag jobs through:
    • JC work: No FAILED
      • Failed with the following:
        glite-wms-job-status https://devel17.cnaf.infn.it:9000/Nf8jcFJuDKCKxB2eKuFo-w
         
        
        *************************************************************
        BOOKKEEPING INFORMATION:
        
        Status info for the Job : https://devel17.cnaf.infn.it:9000/Nf8jcFJuDKCKxB2eKuFo-w
        Current Status:     Done (Exit Code !=0)
        Exit code:          1
        Status Reason:      Warning: job exit code != 0
        Destination:        dagman
        Submitted:          Tue Feb 16 10:08:59 2010 CET
        *************************************************************
      • The "real" reason is: Unrecognized argument: -Condorlog. See also here.

  • Collection jobs through:
    • ICE work: Yes / Done
    • JC work: Yes / Done
    • also job-output for collections works even though only the parent node is set to 'Cleared'

  • Parametric jobs through:
    • ICE work: No
    • JC work: No
      • tested with the following
         [
          JobType = "parametric";
          Executable = "/usr/bin/env";
          Environment = {"MYPATH_PARAM_=$PATH:/bin:/usr/bin:$HOME"};
          StdOutput = "echo_PARAM_.out";
          StdError = "echo_PARAM_.err";
          OutputSandbox = {"echo_PARAM_.out","echo_PARAM_.err"};
          Parameters =  5;
                usertags = [ jdl = "parametric" ];
         ]

  • Bulk jobs sent both through ICE and JC and RetryCount = 0; :
    • Submit a bulk of 3 jobs -> success ???% No
    • Submit a bulk of 50 jobs -> success ???% No
    • Submit a bulk of 100 jobs -> success ???% No
    • Submit a bulk of 500 jobs -> success ???% No
    • Submit a bulk of 1000 jobs -> success ???% No

  • Perusal jobs through:
    • JC work: No
    • ICE work: No

  • MPICH jobs: No

Cancel

  • Normal jobs
    • ICE: No
    • JC: No
  • Dag: No
  • Collection: No
  • Node of a collection: No

Others

  • BrokerInfo
    • ICE creation No
    • JC creation: No
    • Verify all the glite-brokerinfo functions with the generated file No

  • Resubmission
    • Shallow: No
    • Deep: No

  • Job Recovery
    • Tested with a few collections re-starting the wm while some node jobs are still in a 'submitted or 'waiting' status No

  • Prologue and Epilogue jobs
    • ICE: No
    • JC: No



Check bugs:

  • Bug #42288: Problem in forwarding cerequirements to a CREAM CE

  • Bug #48910: Failure starting LM if its output jobdir doesn't exist; unprotected chown in WM/LM/JC startup scripts

  • Bug #55452: CMS production struck by waves of "Globus error 10: data transfer to the server failed"

  • Bug #61405: Missing proxy validity evaluation in ICE

-- AlessioGianelle - 2010-02-05

Edit | Attach | PDF | History: r75 | r8 < r7 < r6 < r5 | Backlinks | Raw View | More topic actions...
Topic revision: r6 - 2010-02-16 - AlessioGianelle
 
  • Edit
  • Attach
This site is powered by the TWiki collaboration platformCopyright © 2008-2024 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback