Search found 3045 matches

by vmcburney
Mon Oct 10, 2005 8:11 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Generate Sequence Number
Replies: 3
Views: 1060

Parallel jobs have the surrogate key generation stage, you can also use a counter in a transformation stage as shown in the FAQ forum. Both methods will ensure you have unique generated numbers across your instances. You have to make sure you have a correct starting value for your sequence. You can ...
by vmcburney
Mon Oct 10, 2005 6:44 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Problem With Comma Separated File
Replies: 10
Views: 2321

Your file is perfectly valid as it uses string delimiters and comma seperators. If you setup your sequential file input stage to recognise " as delimiters then it will handle any commas inside the " ". To remove the comma from your data try a server routine that receives the string as a job paramete...
by vmcburney
Mon Oct 10, 2005 6:18 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: What is new in Hawk release?
Replies: 4
Views: 1211

Haven't seen the beta yet and it looks like it will be a couple months late. Maybe after IIIL2005.
by vmcburney
Mon Oct 10, 2005 5:56 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: XML Output Stage
Replies: 1
Views: 902

You will notice the XML stages are in the real time folder of Designer, XML is best suited for B2B transactions as individual documents or in small volumes, it is not suited for large volume batch processing. Do your own benchmark, write out the same file as XML and as sequential file, it does not m...
by vmcburney
Mon Oct 10, 2005 5:50 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Parallel Extender Memory issue.
Replies: 12
Views: 6917

Have you monitored your disk space usage while the job is running? If you have a 1.61G dataset remember that with partitioning you will get one of these for each partition, plus whatever other datasets you have in your job. If you are throwing around datasets approaching 2G then I suggest you bump y...
by vmcburney
Mon Oct 10, 2005 5:44 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Hawk/Serrano Release
Replies: 4
Views: 1956

I haven't seen the Hawk release yet but I would expect Server Edition Hawk to be an improvement over Server Edition 7.x. You get all the improvements in the Designer such as better metadata reporting, better impact analysis, better developer collaboration and improved job sequences. Ascential have a...
by vmcburney
Sun Oct 09, 2005 5:42 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: how to load summary and detail records in single target file
Replies: 1
Views: 848

You could expirement with sub records via the make subrecord, split subrecord stages. Haven't used them myself. You can also do it by splitting your input stream into two output streams and combining them again. - A transformer would split the streams and allocate a counter to each row. - A column e...
by vmcburney
Thu Oct 06, 2005 11:40 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Is the XML Output stage *really* this slow?
Replies: 2
Views: 924

Could you run a version writing to a sequential file (don't worry about XML tags) and post the comparison figures. Would be interesting to know what hte overhead of the XML Output is over a standard sequential file write. Probably fixed width sequential is the closest match to XML output.
by vmcburney
Thu Oct 06, 2005 5:22 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Datastage Capacity planning ...HELP
Replies: 11
Views: 5900

It gets more confusing. XE comes with something call the web services client pack, SOA comes with web services. XE can be taken as an SOA version, to give you web services to go with your web services client pack????
by vmcburney
Wed Oct 05, 2005 9:43 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Can we run a parallel Extender on s single processor
Replies: 13
Views: 3610

Yes you can run a parallel job on a single CPU. You can also type with the cap lock off and start sentences with a capital letter but that's getting off the topic. If you have a one CPU dev environment and you are delivering to a multiple CPU prod environment I recommend you configure at least two n...
by vmcburney
Wed Oct 05, 2005 8:02 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Multi Reference lookup counts
Replies: 8
Views: 2486

That's a VERY slow lookup, doing a group by statement on every row of input data, if you get performance problems consider preloading a hash file with group by statement: Select key1, key2, COUNT(*) FROM table Do a reference lookup against the hash file to get the count. The @INROWNUM and @O...
by vmcburney
Wed Oct 05, 2005 7:29 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Datastage Capacity planning ...HELP
Replies: 11
Views: 5900

So you only have server jobs, no parallel jobs. Goes back to my initial reply, if you have a lot of CPUs or multiple servers then parallel jobs provide easier scalability. If you are staying with server edition consider running all four data mart loads at once. This should keep all your CPUs busy an...
by vmcburney
Wed Oct 05, 2005 5:43 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: surrogate ID generation in DS EE
Replies: 18
Views: 6832

Okay, I was following you right up until that first fullstop. The idea of using DB2 sequences or id fields is that you don't need counters in your datastage job. Why would you be creating your own NEXTVAL value when the database is generating it?
by vmcburney
Wed Oct 05, 2005 5:33 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Allow duplicate values in Lookup properties
Replies: 5
Views: 1949

Blindly removing duplicate lookups via the "ignore duplicates" option is just asking for trouble. You have used a blunt technical feature to solve a business problem. Far better to get business input on how to make the non key columns unique for each key combination. Then design a job that prepares ...
by vmcburney
Wed Oct 05, 2005 12:19 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: single source record need load two records in the target
Replies: 10
Views: 2990

Assuming you have columns A B C D E on both transformer output links where column B is null on the first link you can use the funnel stage to turn it back into a single stream and write it to a sequential file.