Search found 3045 matches

by vmcburney
Fri Nov 18, 2005 3:35 pm
Forum: IBM QualityStage
Topic: Can QS MATCH be Run in Parallel Mode
Replies: 3
Views: 2370

You can use DataStage to provide both the primary input and the reference input into a QualityStage plugin that is doing matching. The safest design would be to sort and partition the primary input and set the partitioning of the reference data as entire. I doubt there is any partitioning method oth...
by vmcburney
Fri Nov 18, 2005 3:23 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Remove duplicate Error
Replies: 2
Views: 764

The sort keys on the sort stage need to be the same as the remove duplicate keys. Better yet you can remove duplicates in the sort stage using one of the properties. From memory the only thing the remove duplicates gives you that isn't in the sort stage is a "Last" option.
by vmcburney
Fri Nov 18, 2005 2:56 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: PX Aggregator Stage(Last & First Values)
Replies: 5
Views: 1705

The Remove Duplicates stage lets you keep the first or last instance based on key fields.
by vmcburney
Fri Nov 18, 2005 3:28 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Setting environment variable
Replies: 3
Views: 983

Since you are setting a value in a stage variable in a transformer you are setting it for every row of data. Therefore the easiest way to pass it to another stage is to put it into a new column. If you set an environment variable you are setting it potentially millions of times.
by vmcburney
Fri Nov 18, 2005 2:08 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: DS 7.5 EE Solaris to DS 7.5 EE windows
Replies: 2
Views: 844

Database stages can also be a pain. Sometimes the enterprise DB2 and Oracle stages do not get installed on the Windows server, or the databases cannot be accessed from a Windows server.
by vmcburney
Thu Nov 17, 2005 9:53 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: KeyMgtGetNextValueConcurrent(), Server Container, PX Job
Replies: 6
Views: 1854

If you dont need the key value elsewhere in your etl process and only care that it is a unique value then you should consider using a trigger on the table into which you are inserting the data... Trigger surrogate key generators are getting the cold shoulder nowadays, too much overhead on bulk inse...
by vmcburney
Thu Nov 17, 2005 9:47 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: error while viewing the data
Replies: 4
Views: 1145

I'll also have a wild stab in the dark, could it be that on line 6, and in the other lines mentioned, you have too many columns in the record?
by vmcburney
Thu Nov 17, 2005 8:38 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: KeyMgtGetNextValueConcurrent(), Server Container, PX Job
Replies: 6
Views: 1854

I agree, ranges are bad, you pass in the start value in one of two ways, either as a job parameter into the job (where it is retrieved via an operating system script) or via a lookup stage to the target table (as a max select). The start value can then be set in the surrogate key stage start value p...
by vmcburney
Thu Nov 17, 2005 4:09 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Max Lookups
Replies: 3
Views: 906

I don't know the answer but it seems to me that if you have to ask the question then you probably have the wrong job design. I can't see any job with more then 1000 lookup stages being buildable, compilable or maintainable.
by vmcburney
Thu Nov 17, 2005 4:07 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Extract data from HTML document.
Replies: 2
Views: 863

It's almost structured data if the file always has the same set of fields and you don't need to break down the data inside fields. Read it as a sequential file with a seperator of : and change the end of record character to something different so it reads all rows in a record as one record. You may ...
by vmcburney
Thu Nov 17, 2005 3:54 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: KeyMgtGetNextValueConcurrent(), Server Container, PX Job
Replies: 6
Views: 1854

Any reason why you are not using the surrogate key stage, which generates key much more efficiently, or a parallel transformer counter field as described in the FAQ forum. Both methods can be passed a start value and will concurrently increment across partitions.
by vmcburney
Thu Nov 17, 2005 3:46 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: documentation of engine command
Replies: 4
Views: 1205

It can be hard to find database products other then DB2 on the IBM website, there is a Universe link in the DataStage information FAQ:
http://www.dsxchange.com/viewtopic.php?t=88590
by vmcburney
Wed Nov 16, 2005 10:15 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Job Repository in version control
Replies: 8
Views: 5535

Has anyone tried using DSJOB against the VERSION project? For example dsjob -ljobs VERSION. This might give you a list of all job names without much hassle. You could use any of the DataStage job reporting options listed in the related FAQ such as exporting VERSION to an XML export file and running ...
by vmcburney
Tue Nov 15, 2005 11:55 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Job Seq Problem
Replies: 2
Views: 789

You really need to read the section on Sequence jobs in the Designer Guide as I'm sure it will answer this question and a lot of other questions that will follow it. Have a look at the page on Job Sequence Properties - General Page. It explains checkpoints and restarts.
by vmcburney
Tue Nov 15, 2005 11:46 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: logic to create multiple outputs and combine them
Replies: 3
Views: 1210

This is one crazy job. The transform and funnel look wrong. You should be able to write this job with just a lookup stage and retrieve whatever country values you need from that lookup. Why do you have 105 output links? The lookup stage has a multiple lookup matches option that lets you return multi...