Tags:
create new tag
,
view all tags
%TOC% ---+ APEL Deployment We are changing the accounting system used in our infrastructure, from DGAS to APEL and the procedure is quite simple. Each resource centre needs to install a new node (the APEL Publisher) which receives the accounting information sent from the CE(s) by the APEL parser. Then the APEL publisher sends the data to the EGI central database using the messaging infrastructure. In the past months we tested two scenarios: * the accounting data are sent directly to the central databse (canonic installation) * the accounting data are sent to FAUST and to the EGI central database and we chose the second one ---++ Registration you need to register the APEL publisher in the GOC-DB: the service endpoint name to add is *glite-APEL* and fill in also the certificate subject information. Changes in GOCDB can take up to 4 hours to make it to the message brokers. this is necessary to authorize the publisher host in using the broker network. Do not touch the *APEL* service endpoint instead, otherwise nagios won't monitor the accounting data publication. (for reference https://wiki.egi.eu/wiki/MAN09_Accounting_data_publishing) ---++ APEL Publisher Installation and Configuration Follow the [[https://twiki.cern.ch/twiki/bin/view/EMI/GenericInstallationConfigurationEMI3][EMI3 generic installation guide]] and the APEL one https://twiki.cern.ch/twiki/pub/EMI/EMI3APELClient/APEL_Publisher_System_Administrator_Guide.pdf * Example of [[ApelClientCFG][client.cfg]] * Example of [[ApelSsmCfg][sender.cfg]] Use the production queue of the broker network: <verbatim> # Queue to which SSM will send messages (use this) destination: /queue/global.accounting.cpu.central </verbatim> and comment out or delete the testing one ---+++ Sending the data to FAUST In order to send the accounting data also to FAUST, after installing and configuring the APEL Publisher as explained in the section above, follow the instructions https://github.com/andreaguarise/ssm-dupl-send this mean that instead of apelclient script, *you have to use the ssm-dupl-send.sh one*. Among the important paramaters to set in the faust-sender.cfg there are the following ones: <verbatim> host: dgas-broker.to.infn.it port: 61613 use_ssl: false destination: apel.input </verbatim> Create with mkdir the directory: <verbatim> /var/spool/faust/outgoing </verbatim> *IMPORTANT*: for Tier1 and Tier2 it will be used a dedicated queue: <verbatim> destination: apel.<SITE-NAME>.input </verbatim> For instance, in the case of INFN-PISA, into file faust-sender.cfg it will be set: <verbatim> destination: apel.INFN-PISA.input </verbatim> * Example of [[FaustSsmCfg][faust-sender.cfg]] *IMPORTANT*: Run the FAUST script only after having launched the parser one on the CEs * Example of the cron: <verbatim> cat /etc/cron.d/ssm-dupl-send # Run APEL client once daily 05 01 * * * root /root/bin/ssm-dupl-send.sh </verbatim> ---++ APEL Parser Installation and Configuration Install and configure the APEL parser on each computing element of your resource centre. Follow the [[https://twiki.cern.ch/twiki/bin/view/EMI/GenericInstallationConfigurationEMI3][EMI3 generic installation guide]] and the APEL one https://twiki.cern.ch/twiki/pub/EMI/EMI3APELClient/APEL_Parsers_System_Administrator_Guide.pdf * [[ParserConf][Here an example]] of parser.cfg file *IMPORTANT*: Send the accounting data starting from September, because the previous ones have been already sent by DGAS to APEL, otherwise the will be overwritten causing some inconsistencies. Configure the parser accordingly to make process the proper files (or move the old logs in another directory). you can launch the apelparser script after the setting-up of the apelclient database an example of the cron: <verbatim> cat /etc/cron.d/apelparser # Run APEL parser once daily 04 22 * * * root /usr/bin/apelparser </verbatim> *NOTE*: the empty logfiles produce a CRITICAL error in the parsing operation: <verbatim> 2014-09-12 11:02:08,683 - apel.common.exceptions - CRITICAL - Unhandled exception raised! 2014-09-12 11:02:08,683 - apel.common.exceptions - CRITICAL - Please send a bug report with following information: 2014-09-12 11:02:08,684 - apel.common.exceptions - CRITICAL - UnboundLocalError: local variable 'line_number' referenced before assignment 2014-09-12 11:02:08,684 - apel.common.exceptions - CRITICAL - parse_file [/usr/bin/apelparser 139] 2014-09-12 11:02:08,684 - apel.common.exceptions - CRITICAL - scan_dir in /usr/bin/apelparser [187] 2014-09-12 11:02:08,684 - apel.common.exceptions - CRITICAL - handle_parsing in /usr/bin/apelparser [296] 2014-09-12 11:02:08,684 - apel.common.exceptions - CRITICAL - main in /usr/bin/apelparser [380] 2014-09-12 11:02:08,684 - apel.common.exceptions - CRITICAL - ? in /usr/bin/apelparser [392] </verbatim> so check for these empty files and delete them. ---+++ Bug in APEL 1.2.1 - Apply the EMI-3 update 20 The EMI-3 update 20 released a fix of the APEL software for a bug in the parser which prevents it from working in the most common cases: it is unable to open uncompressed accounting logs for parsing. Sites with this problem will have version 1.2.1 installed and see many log messages like this: <verbatim> 2014-08-11 12:54:11,819 - parser - ERROR - Cannot open file blahp.log-20140811: Not a gzipped file </verbatim> in their parser log file - usually at /var/log/apel/parser.log. Sites who have installed version 1.2.1 should upgrade to 1.2.2 immediately: http://www.eu-emi.eu/releases/emi-3-monte-bianco/updates/-/asset_publisher/5Na8/content/update-20-12-09-2014-v-3-11-0-1 ---+++ What about DGAS sensors? Because the DGAS server problems occurred at the beginning of September, stop dgas sensors on your computing element(s). ---++ Fast checks When you launch the apelparser script for the first time, if there are no errors, it will be filled in the tables BlahdRecords e EventRecords (database "apelclient"), so check if they really contain the data. Then with the execution of ssm-dupl-send.sh script on the publisher host, it will be done the join of those tables (filling in the JobRecords e VJobRecords ones), and the data will be sent to FAUST and to EGI, so you can perform the following check: <verbatim> mysql> use apelclient mysql> SELECT year(EndTime),Month(EndTime),InfrastructureDescription,count(*) FROM VJobRecords GROUP BY 1,2,3; +---------------+----------------+---------------------------+----------+ | year(EndTime) | Month(EndTime) | InfrastructureDescription | count(*) | +---------------+----------------+---------------------------+----------+ | 2014 | 5 | APEL-CREAM-PBS | 3747 | | 2014 | 6 | APEL-CREAM-PBS | 7243 | | 2014 | 7 | APEL-CREAM-PBS | 4852 | | 2014 | 8 | APEL-CREAM-PBS | 4770 | | 2014 | 9 | APEL-CREAM-PBS | 3882 | +---------------+----------------+---------------------------+----------+ 5 rows in set (0.13 sec) </verbatim> ---+ STATUS |*SITE NAME*|*TICKET*|*STATUS*|*INFO*| | BIOCOMP | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17459][17459]] | SOLVED || | CIRMMP | [[https://xgus.ggus.eu/ngi_it/?mode=ticket_info&ticket_id=12][12]] | SOLVED || | CNR-ILC-PISA | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17461][17461]] | SOLVED || | CRS4 | [[https://xgus.ggus.eu/ngi_it/index.php?mode=ticket_info&ticket_id=13][13]] | SOLVED || | FBF-Brescia-IT | [[https://xgus.ggus.eu/ngi_it/index.php?mode=ticket_info&ticket_id=14][14]] | SOLVED || | GARR-01-DIR | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17464][17464]] | SOLVED || | GILDA-INFN-CATANIA | [[https://xgus.ggus.eu/ngi_it/?mode=ticket_info&ticket_id=27][27]] | SOLVED || | GILDA-SIRIUS | [[https://xgus.ggus.eu/ngi_it/?mode=ticket_info&ticket_id=29][29]] | SOLVED || | GRISU-COMETA-INFN-CT | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17465][17465]] | SOLVED || | GRISU-UNINA | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17466][17466]] | SOLVED || | ICEAGE-CATANIA | [[https://xgus.ggus.eu/ngi_it/?mode=ticket_info&ticket_id=28][28]] | SOLVED || | INAF-TS | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17467][17467]] | SOLVED || | INFN-BARI | [[https://xgus.ggus.eu/ngi_it/index.php?mode=ticket_info&ticket_id=15][15]] | IN PROGRESS | in attesa del primo lancio | | INFN-BOLOGNA | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17468][17468]] | SOLVED || | INFN-BOLOGNA-T3 | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17469][17469]] | SOLVED | | INFN-CATANIA | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17449][17449]] | SOLVED | | INFN-CNAF-LHCB | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17470][17470]] | SOLVED | | INFN-COSENZA | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17471][17471]] | SOLVED || | INFN-FERRARA | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17472][17472]] | SOLVED | | INFN-FRASCATI | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17452][17452]] | SOLVED || | INFN-GENOVA | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17473][17473]] | SOLVED || | INFN-LECCE | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17474][17474]] | SOLVED | | INFN-LNL-2 | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17450][17450]] | SOLVED | | | INFN-MILANO-ATLASC | [[https://xgus.ggus.eu/ngi_it/index.php?mode=ticket_info&ticket_id=16][16]] | IN PROGRESS | in attesa della prima pubblicazione | | INFN-NAPOLI-ARGO | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17477][17477]] | SOLVED || | INFN-NAPOLI-ATLAS | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17453][17453]] | SOLVED || | INFN-NAPOLI-CMS | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17478][17478]] | SOLVED || | INFN-NAPOLI-PAMELA | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17479][17479]] | SOLVED || | INFN-PADOVA | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17480][17480]] | SOLVED | | | INFN-PAVIA | [[https://xgus.ggus.eu/ngi_it/index.php?mode=ticket_info&ticket_id=17][17]] | OPEN || | INFN-PERUGIA | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17483][17483]] | SOLVED || | INFN-PISA | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17454][17454]] | SOLVED || | INFN-ROMA1 | [[https://xgus.ggus.eu/ngi_it/index.php?mode=ticket_info&ticket_id=26][26]] | IN PROGRESS | manderà i dati assieme a INFN-ROMA1-VIRGO e INFN-ROMA1-CMS | | INFN-ROMA1-CMS | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17456][17456]] | SOLVED || | INFN-ROMA1-VIRGO | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17484][17484]] | OPEN || | INFN-ROMA2 | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17485][17485]] | SOLVED | | INFN-ROMA3 | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17486][17486]] | SOLVED | | INFN-T1 | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17457][17457]] | SOLVED | | INFN-TORINO | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17458][17458]] | SOLVED || | INFN-TRIESTE | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17487][17487]] | SOLVED || | RECAS-NAPOLI | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17488][17488]] | SOLVED || | SNS-PISA | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17489][17489]] | SOLVED | sito sospeso | | TRIGRID-INFN-CATANIA | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17490][17490]] | SOLVED || | UNI-PERUGIA | [[https://xgus.ggus.eu/ngi_it/index.php?mode=ticket_info&ticket_id=38][38]] | SOLVED || | UNINA-EGEE | [[https://ticketing.cnaf.infn.it/checklist-new/modules/xhelp/ticket.php?id=17492][17492]] | SOLVED || -- Main.AlessandroPaolini - 2014-06-13
E
dit
|
A
ttach
|
PDF
|
H
istory
: r59
<
r58
<
r57
<
r56
<
r55
|
B
acklinks
|
V
iew topic
|
M
ore topic actions
Topic revision: r59 - 2014-11-13
-
AlessandroPaolini
Home
Site map
CEMon web
CREAM web
Cloud web
Cyclops web
DGAS web
EgeeJra1It web
Gows web
GridOversight web
IGIPortal web
IGIRelease web
MPI web
Main web
MarcheCloud web
MarcheCloudPilotaCNAF web
Middleware web
Operations web
Sandbox web
Security web
SiteAdminCorner web
TWiki web
Training web
UserSupport web
VOMS web
WMS web
WMSMonitor web
WeNMR web
SiteAdminCorner Web
Create New Topic
Index
Search
Changes
Notifications
RSS Feed
Statistics
Preferences
View
Raw View
Print version
Find backlinks
History
More topic actions
Edit
Raw edit
Attach file or image
Edit topic preference settings
Set new parent
More topic actions
Account
Log In
E
dit
A
ttach
Copyright © 2008-2024 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki?
Send feedback