HANA Logcommandline backup errors at single glance

  • 15 August 2022
  • 1 reply
  • 198 views

Userlevel 2
Badge +3

SAP HANA Logcommandline Backup is invoked from HANA side (HANA Studio configuration), it automatically converted to Commvault backup jobs and normally there's nothing we should do, running every 15 minutes by default per HANA's setting.

But this job is slightly different from the other "normal" jobs, when any interim errors, like disconnection of networks, shortage of Commvault resources (typically # of streams on libraries or strage policy copy level), the job would fail.

As mentioned above, this job would repeat every 15 minutes (by default), so any failure would be recovered quickly so typically end user won't lose any of data. But sometimes there's another issues might be at CS/MA side, hard to keep watching even setting up alerts, job monitoring, etc.

 

This is to list up all failure reasons from CSDB quickly (for a Japanese customer), if any suspicious errors identified you can dig into the specific job for detailed research:

use CommServ



SET TRANSACTION ISOLATION LEVEL READ UNCOMMITTED;



SELECT distinct

bkstat.jobid

,DateAdd(second, bkstat.servStartDate+3600*9, '1970-01-01'). -- this converts EPOCH to local timezone, Japanese GMT+9

,bkstat.status

,bkstat.duration

,bkstat.failureReason

,apc.name

,apap.subclientName

,fr.messageId

,fr.client

,frp.data

,evm.Message

,evm2.Message

FROM [CommServ].[dbo].[JMBkpStats] bkstat

inner join APP_Application apap on bkstat.appId = apap.id

inner join APP_Client apc on apc.id = apap.clientId

inner join JMFailureReasonMsg fr on fr.jobId = bkstat.jobId

left outer join JMFailureReasonMsgParam frp on frp.msgId = fr.id

inner join EvLocaleMsgs evm on evm.MessageID = fr.messageId and evm.LocaleID = 0

left outer join EvLocaleMsgs evm2 on evm2.MessageID = try_cast(frp.data as bigint) and evm2.LocaleID = 0



where bkstat.appType in (135, 136)

and bkstat.status != 1

-- several filters

and fr.messageId not in (301990034) -- waiting for stream resource on library

and fr.messageId not in (318768431, 318768699, 318769028) -- cannot reach to client service

and fr.messageid not in (301990034, 318768357) -- HANA service down

and bkstat.duration < 200 -- timeout caused by waiting for stream resources

order by bkstat.jobid desc

Hope this helps a bit,


1 reply

Userlevel 7
Badge +23

Thanks for sharing, this is great!

Reply