Search found 53125 matches

by ray.wurlod
Mon Jan 17, 2005 8:42 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Surrogate Key Generation
Replies: 3
Views: 1684

There's nothing to stop you using database sequences or SERIAL data types when loading the dimension tables. Whatever mechanism you use to load the dimension tables, you will need to build a "reverse lookup" mechanism when loading the fact tables, because you will need to map the "business keys" enc...
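A minimal sketch of the "reverse lookup" idea, assuming the dimension table is available as (surrogate key, business key) pairs. The function names, the sample customer keys, and the -1 "unknown" key are hypothetical and only for illustration; in a server job the lookup would more likely be a hashed file than a Python dictionary.

```python
def build_reverse_lookup(dimension_rows):
    """Map each business key to the surrogate key assigned at dimension load."""
    lookup = {}
    for surrogate_key, business_key in dimension_rows:
        lookup[business_key] = surrogate_key
    return lookup


def resolve_fact_keys(fact_rows, lookup, unknown_key=-1):
    """Replace the business key on each fact row with its surrogate key.

    Business keys missing from the dimension get unknown_key
    (a hypothetical convention, not something the post prescribes).
    """
    resolved = []
    for business_key, measure in fact_rows:
        resolved.append((lookup.get(business_key, unknown_key), measure))
    return resolved


# Hypothetical sample data: (surrogate key, business key) pairs.
customer_dim = [(1, "CUST-A"), (2, "CUST-B")]
# Fact rows arrive carrying business keys.
facts = [("CUST-B", 10.0), ("CUST-A", 5.5), ("CUST-Z", 1.0)]

lookup = build_reverse_lookup(customer_dim)
resolved = resolve_fact_keys(facts, lookup)
```

The same lookup works regardless of whether the surrogate keys came from a database sequence, a SERIAL column, or a key generated in the job itself.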
by ray.wurlod
Mon Jan 17, 2005 8:39 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Recovery Mode
Replies: 7
Views: 1808

Not hard, just something you have to design for if you want it. 8)
by ray.wurlod
Mon Jan 17, 2005 8:38 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: CFF conversion issue
Replies: 3
Views: 735

Murphy's fourth postulate: If it's stupid but it works, it's not stupid. Your technique is entirely valid in server jobs, because the Engine does not use data types - everything is a string with implicit conversion governed by context (for example arithmetic). This helps DataStage to be tolerant of ...
by ray.wurlod
Mon Jan 17, 2005 3:18 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Redbrick Bulk Loader
Replies: 3
Views: 711

The point about parallel TMU ( rb_ptmu ) is that the indexes are updated by separate processes, thereby not slowing down the loading of rows into the main table. The main bottleneck is usually foreign key constraints (referential integrity). These must be checked by the process loading the main tabl...
by ray.wurlod
Mon Jan 17, 2005 3:12 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Bulk Loaders in DS
Replies: 6
Views: 1299

The only Red Brick stage in DataStage is the bulk loader stage. :wink:

Says something important about purpose-designed DW databases, I feel.
by ray.wurlod
Mon Jan 17, 2005 3:10 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Recovery Mode
Replies: 7
Views: 1808

So restarting is not automatic in DataStage. You can create designs so that restartability is automatic. Techniques used to be taught on the Ascential class "DataStage Best Practices" - is this still offered? There is a cost; you must create one, better two, staging areas as recovery points. You al...
by ray.wurlod
Mon Jan 17, 2005 3:13 am
Forum: IBM® Infosphere DataStage Server Edition
Topic: Usage of MS Sql Server & ODBC Stages from Unix DS
Replies: 3
Views: 668

I had replied to this earlier, just before the DSXChange server failed. :cry:

Read about the failure on the home page.
by ray.wurlod
Mon Jan 17, 2005 3:11 am
Forum: IBM® DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Failed to connect to Host ( server installed in HP UNIX)
Replies: 4
Views: 1988

You have not informed us whether or not the dsrpcd process is running. Without that vital clue we are powerless to assist further. Please advise the result of executing the command suggested by Craig. If the dsrpcd process is not running, you will never be able to connect. In that case, we will assi...
by ray.wurlod
Sun Jan 16, 2005 8:15 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Usage of MS Sql Server & ODBC Stages from Unix DS
Replies: 3
Views: 668

SAP Extract PACK and SAP BW Load PACK must be separately purchased. The additional charge is largely the cost of certification from SAP. The software, if you have it, is on a separate CD. ODBC drivers for UNIX are specialized pieces of software provided by third-party vendors. The drivers that ship ...
by ray.wurlod
Sun Jan 16, 2005 8:07 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Datastage in UNIX
Replies: 4
Views: 1301

It is absolutely vital that the UNIX kernel parameters referred to in the installation documentation are set to at least the advised values.
DataStage Engine relies heavily on shared memory and semaphores; the default UNIX values are simply insufficient. (Exception: AIX has a "self tuning kernel").
by ray.wurlod
Sun Jan 16, 2005 3:20 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Improve Performance
Replies: 5
Views: 1321

If the table has a key you can pre-load a hashed file with just the key values, and use that to check for existence (and therefore the insert versus update decision). This will run much faster than trying to do the lookups against the target table itself. If the key-only hashed file is sufficiently ...
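A rough sketch of the key-only existence check, with a Python set standing in for the pre-loaded hashed file. The function name, the `id` column, and the sample rows are assumptions for illustration, not part of any DataStage API.

```python
def split_inserts_and_updates(incoming_rows, existing_keys, key_field="id"):
    """Route each incoming row to 'insert' or 'update' using only the key set.

    existing_keys plays the role of the key-only hashed file loaded
    from the target table before the run starts.
    """
    known = set(existing_keys)
    inserts, updates = [], []
    for row in incoming_rows:
        key = row[key_field]
        if key in known:
            updates.append(row)
        else:
            inserts.append(row)
            # Remember the new key so a later row with the same key
            # in this run is treated as an update.
            known.add(key)
    return inserts, updates


# Hypothetical data: keys 1 and 2 already exist in the target table.
existing = [1, 2]
incoming = [
    {"id": 2, "name": "changed"},
    {"id": 3, "name": "new"},
    {"id": 3, "name": "new again"},
]

inserts, updates = split_inserts_and_updates(incoming, existing)
```

The decision never touches the target table itself, which is where the speed-up described above comes from.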
by ray.wurlod
Sun Jan 16, 2005 3:18 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: performance tuning
Replies: 2
Views: 744

Memory can also be a bottleneck if you're using memory-hungry stage types (Aggregator, Sort) on an underconfigured machine.

Another area of performance tuning is choosing efficient algorithms; this is largely experience, and depends on your data, so there's no magic list of cures.
by ray.wurlod
Sun Jan 16, 2005 3:16 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Getting deadlocks in Oracle via ODBC Driver Inserts
Replies: 6
Views: 2531

Probably just imps and gremlins (other users?) doing random inserts on just the page where your process wants to be.
by ray.wurlod
Sat Jan 15, 2005 5:12 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Improve Performance
Replies: 5
Views: 1321

Not really much information in your question. Are you doing a lot of computation (transformation) per row? How are you loading the Oracle table? If they are all new rows for Oracle, seriously consider an approach that uses sqlldr rather than INSERT. It is way faster. The best approach is to have you...
by ray.wurlod
Sat Jan 15, 2005 5:08 pm
Forum: IBM® Infosphere DataStage Server Edition
Topic: Getting deadlocks in Oracle via ODBC Driver Inserts
Replies: 6
Views: 2531

In my experience with DataStage, another common reason for deadlocks is updating the same table you are selecting from, and shooting for a huge transaction size (unlimited rows per transaction). Oof!
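As a sketch of the alternative to an unlimited transaction size, the loop below commits after a fixed number of rows so that locks are released regularly. It uses Python's built-in sqlite3 module purely as a stand-in: SQLite does not reproduce Oracle's locking behaviour, and the table name, column names, and the 1000-row figure are assumptions for illustration.

```python
import sqlite3


def load_in_batches(conn, rows, rows_per_transaction=1000):
    """Insert rows, committing every rows_per_transaction rows.

    Returns the number of commits issued.
    """
    commits = 0
    pending = 0
    for row in rows:
        conn.execute("INSERT INTO target (id, val) VALUES (?, ?)", row)
        pending += 1
        if pending >= rows_per_transaction:
            conn.commit()  # end the transaction and release its locks
            commits += 1
            pending = 0
    if pending:
        conn.commit()  # commit the final partial batch
        commits += 1
    return commits


# Hypothetical in-memory target table for demonstration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE target (id INTEGER PRIMARY KEY, val TEXT)")
conn.commit()

rows = [(i, f"v{i}") for i in range(2500)]
commit_count = load_in_batches(conn, rows)
row_count = conn.execute("SELECT COUNT(*) FROM target").fetchone()[0]
```

A bounded commit interval trades some restart complexity for shorter lock hold times; the right number depends on the database and the workload.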