Search found 4992 matches

by kcbland
Tue Jul 20, 2004 4:20 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Using dsjob from a system that doesn't host a DS Server
Replies: 4
Views: 1433

Research rsh, it allows you to execute a command on a remote server (Unix or NT). You could have your NT workstation execute an rsh command to a script on the DS server which invokes dsjob.
by kcbland
Mon Jul 19, 2004 10:00 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Info required about hashed files
Replies: 2
Views: 464

Search this forum, there's thousands of posts. Here's one:

http://www.dsxchange.com/viewtopic.php?t=85364
by kcbland
Mon Jul 19, 2004 8:51 am
Forum: General
Topic: Multiple instances of DataStage Job
Replies: 8
Views: 18480

Multiple instances are incredibly useful for partitioning datastreams and using multiple copies of the job to process the data. Why have 1 job crunch the numbers on a 16 cpu system when you can have 16 copies of the job with each crunching 1/16th of the input datastream.
by kcbland
Mon Jul 19, 2004 8:49 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Hashfile sizing
Replies: 5
Views: 1072

Okay, but are you processing ALL target rows every time? Probably not. You probably are processing a vastly insignificant portion of your data each load cycle. Therefore, you only NEED in your hash file the data pertinent to your current load cycle. Consider this approach. Parse your source data fil...
by kcbland
Sun Jul 18, 2004 4:02 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Hashfile sizing
Replies: 5
Views: 1072

Why is your hash file so big? Are you maintaining a copy of a target table (or subset of columns) in a persistent staging hash file? Why not simply populate the hash file at runtime with those rows necessary for reference? Your hash file is too big for caching, requires significant maintenance inves...
by kcbland
Tue Jul 13, 2004 1:19 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Some Orchestrate questions..
Replies: 2
Views: 553

These questions suspiciously look like those that would be asked in a job interview technical assessment....
by kcbland
Tue Jul 13, 2004 11:12 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Global Routines across projects
Replies: 6
Views: 1730

Routines and functions are stored in a localized file to each project (DS_ROUTINES). In order to make them global to all projects, you will either have to pick a project to house the routines and then destroy the localized file and replace it with a synonym pointing to the global; or, you'll have to...
by kcbland
Tue Jul 13, 2004 8:20 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Global Routines across projects
Replies: 6
Views: 1730

You can, using very specific steps, setup routines to be globally available to all projects for Server jobs. However, the effort involved IMO is not worth the trouble. Craig has given you the best solution available which has little maintenance effort. To register a function or subroutine to all pro...
by kcbland
Fri Jul 09, 2004 12:22 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Reading sequential file and sorter
Replies: 3
Views: 1234

I didn't say only one cpu was being used, I said the net effect should be about one cpu. You're only as fast as the slowest piece in a series. CoSort's plugin is available thru them, not Ascential. As for sorting, maybe you could clarify what you're trying to achieve and maybe we could give you some...
by kcbland
Fri Jul 09, 2004 11:58 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Reading sequential file and sorter
Replies: 3
Views: 1234

By sorter, I assume you mean a Sort stage? If so, then what you see happening is that the sorting is single threaded and happening as fast as the source data can be fed into it, which by a single threaded processes would just about give you a net result of using only a single cpu. Notice 25% of 4 is...
by kcbland
Fri Jul 09, 2004 5:08 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Datastage version 7 over version 5.2.1
Replies: 6
Views: 1496

There is no difference in the scheduler, as it is just a GUI over cron. There is no difference in the job logs, they are proprietary and closed. If you need data from the logs, you must extract it using the published APIs.
by kcbland
Thu Jul 08, 2004 2:29 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: execute a job for a list of file
Replies: 2
Views: 686

Use DS BASIC job control in an Batch job. Setup a loop to cycle looking in the directory of interest and execute the jobstream for each file present. As you finish a file, either move it or rename it to take it out of the processing loop. Continuously loop until the list of files are exhausted. You ...
by kcbland
Thu Jul 08, 2004 2:26 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: merging multiple line as one record -- Seq data file
Replies: 3
Views: 2198

Read the data from the file as a single column. Use stage variables to queue data and output on change of group --> the "}" character. Since your last row should be that } character you shouldn't have the issue of the last group not being output.
by kcbland
Thu Jul 08, 2004 8:05 am
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Password Encryption
Replies: 7
Views: 1508

DataStage has poor encryption capabilities. For example, even encrypting a password doesn't mean you can't expose the unencrypted value. Just simply map that parameter into an output file as a column derivation and you will see the value. One useful method for passing in the parameters is to have jo...