Want a tool that may be able to assist you undertake these activities? If so then keep reading.
I previously posted a thread that contained DataStage objects that can be used to obtain the content of the Job Log. This thread takes that a step further. The content of the log can be rendered into a format suitable for processing with a tool that provides both a visual and a comparative means of presenting the information. This thread contains the
DataStage objects necessary to perform the rendering and contains a link to a location from where the Excel spreadsheet can be downloaded.
Aside from importing the DataStage objects within this thread, you are welcome to download the spreadsheet and use it. However, you may neither redistribute nor sell the spreadsheet nor may it be sold as part of any service that you or any company for which you work without explicit authorisation from me.
The DataStage objects and spreadsheet have been tested for DataStage 6.5 and Excel 2002. Any feedback on the success (or otherwise) for other version of DataStage and/or Excel is appreciated. Note that for versions on Excel prior to Excel 2002 will not display a "progress" window but will (should!) display progress messages within the Status Bar instead. This
is due to the absence of non-modal dialog windows in VBA prior to Excel 2002.
The Excel spreadsheet is available for downloading. The link also contains a couple of screenshots illustrate how I use this tool at a client site for profiling their DataStage job performance.
Two DataStage objects are contained in replies below:
- JobLogAnalysis. Required over and above that from the original thread. This job interprets the Job Log and renders it into the format suitable for input into the spreadsheet.
- DataStageJobLogContent, which is an alternative to that within the original post. It is almost identical to the original except that the hash files use the hash directory rather than the account mechanism for the location of the scratch hash files. The hash file directory is passed as a parameter to the instance of DataStageJobLogContent within JobLogAnalysis. Use whichever version complies with your environment's standards and, if necessary, drop (or ignore) the hash directory parameter to JobLogAnalysis.
- Import as many instances of the Job log that you wish. Each imported file should be given a distinct reference value.
- Click the "Process" button
The only known caveat about the objects that follow is that JobLogAnalysis should not be run when other DataStage jobs are running. I think that it hangs until the completion of the running jobs before continuing. If your experience is different then please reply to this post and dispel my misconception.
Hopefully this profiler will be as useful for you as it has been for me.
If you encounter any issues with the objects or spreadsheet, please post a reply to this thread and I'll follow up on it in due course.
David