Search found 65 matches

by djm
Tue Jan 10, 2006 4:41 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Capturing the Duplicates
Replies: 11
Views: 2762

If the data is in a flat file and you are happy executing UNIX commands (e.g. ExecSh in a "before-job subroutine), you may want to consider the UNIX "uniq" command. There is an option that allows you to only output duplicated rows. Try "man uniq" at the UNIX command line for...
by djm
Tue Jan 10, 2006 4:36 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Concatenation of Rows
Replies: 15
Views: 12288

Hi sengs,

this is often referred to in these forums using the term "vertical pivot". Search the forum and see what you can find. Another search option to try is "man paste" (Craig - don't go there!).

David.
by djm
Sun Jan 08, 2006 10:37 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Job Log - get the entire content using DS job
Replies: 12
Views: 14776

Hi Holymac, Let's see if the following helps you. 1) For each of the "code" blocks in the above thread, copy the text and paste it into a distinctly named file with a file name suffix of ".xml". 2) Run the DataStage Manager tool and select the menu option "Import/DataStage C...
by djm
Sun Jan 08, 2006 4:05 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Help with using sed, awk, nawk or tr
Replies: 22
Views: 5917

Thanks for checking it out Jim. I only get to feed my UNIX addiction Monday through Friday! :(

David
by djm
Sat Jan 07, 2006 7:40 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Help with using sed, awk, nawk or tr
Replies: 22
Views: 5917

Hence why, on reflection, I had offered the more robust alternative, ( sed 's/^\[[0-9]\{1,\},[0-9]\{1,\}\]//g' 's/\(|\)\[[0-9]\{1,\},[0-9]\{1,\}\]/\1/g' ), which deals with the coordinates at the start of the line explicitly and replaced "zero or more" instances of | with exactly one insta...
by djm
Sat Jan 07, 2006 1:52 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Help with using sed, awk, nawk or tr
Replies: 22
Views: 5917

Yes. Though searching for coordinates prefixed by the | reduces the likelihood of an unexpected coordinate-like pattern embedded in the data, which is meant to be there, being discarded.

D
by djm
Fri Jan 06, 2006 5:31 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Help with using sed, awk, nawk or tr
Replies: 22
Views: 5917

Ray, I guess you mean other than "man sed" and "man 5 regexp"? Basically the bit between the quotes is a command to sed (stream-editor). A breakdown of the particular command issued is as follows: s/\(|*\)\[[0-9][0-9]*,[0-9][0-9]*\]/\1/g s = substitute command / = delimiter for d...
by djm
Fri Jan 06, 2006 3:10 pm
Forum: Site/Forum
Topic: Grid Computing - donate spare CPU cycles to research
Replies: 17
Views: 8874

Look at this thread? :wink:

man paste indeed!

David
by djm
Fri Jan 06, 2006 2:31 pm
Forum: Site/Forum
Topic: Grid Computing - donate spare CPU cycles to research
Replies: 17
Views: 8874

and then there were 7
by djm
Thu Jan 05, 2006 11:12 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Transpose a row Question in Parallel jobs
Replies: 18
Views: 9626

To add my standard reply ...

If the number of rows in the data set is consistent (e.g. there are always "n" rows in the data set that have to be vertically pivoted), I'd lay money on the unix "paste" command doing this quickest. Try "man paste" for more details.

David
by djm
Thu Jan 05, 2006 11:03 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Help with using sed, awk, nawk or tr
Replies: 22
Views: 5917

On the presumption that the co-ordinates syntax does not appear within the useful data e.g. you don't have a field something like ...|[1,2]blah blah [3,5]blah|... see whether the following achieves the desired result. sed 's/\(|*\)\[[0-9][0-9]*,[0-9][0-9]*\]/\1/g' yourfile > newfile post the success...
by djm
Wed Jan 04, 2006 1:56 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: split file based on condition
Replies: 5
Views: 1576

Another alternative may be to do the splitting within UNIX. Run the command "man split" for more details. I would suggest it will be quicker using UNIX than using DS.

David
by djm
Wed Jan 04, 2006 3:15 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Retrieving the Job Version Number
Replies: 10
Views: 3065

ray.wurlod wrote:I'm not doing DataStage at all this week - conducting a Red Brick training class
Ray, if the training is in Sydney and the participant is Phil from NZ, you can stun him by telling him that the devil duck is waiting for him on his desk on his return. :lol:
by djm
Mon Dec 12, 2005 1:45 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Ex: feedscoming frm UK and my processing area isin Hokng,etc
Replies: 5
Views: 2341

I'm interpreting your request as "how can you extract data from a system in one time zone and display the data meaningfully for another time zone" e.g. if the data has a timestamp of 15:00 in one time zone, for a time zone that is two hours in front of it, you want to diplay the time as 17...
by djm
Fri Dec 09, 2005 11:42 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Rows to columns
Replies: 6
Views: 2267

As your are on unix, if the data is going to have a consistent number of rows each time (e.g. in your example always 3 rows to be turned into 3 columns), the unix "paste" command will achieve this. Try the commnad "man paste" for an explanation.

David.