Merge Stage or Join Stage inconsistent output

Posted: Sat Mar 27, 2010 1:29 pm
by ajay.kumar2
In my parallel job I have a Merge stage with a master link and five update links.
The update links are the outputs of five different Modify stages.

I have set the Execution mode of Merge Stage to "sequential" and preserve partitioning to "clear". All other stages have default values (default parallel and default propagate).

The problem is that the number of output rows from the Merge stage is inconsistent. In one run I get 3 output rows for 3 master rows, but in other runs I get only 1 row. It should be 3 rows every time.
The input file, the data, and everything else are kept the same for both runs.

Note: my configuration file defines two nodes.

Posted: Sat Mar 27, 2010 1:45 pm
by ray.wurlod
What is your "unmatched masters" rule?

Posted: Sun Mar 28, 2010 11:27 pm
by ajay.kumar2
ray.wurlod wrote: What is your "unmatched masters" rule? ...
unmatched masters mode = "KEEP"

Posted: Mon Mar 29, 2010 12:19 am
by ray.wurlod
What stage type is downstream of the Merge stage, and is it executing in sequential or parallel mode? Does your Merge stage have a reject link and is it capturing any rows? One possibility is that you are trying to match the same key value three times, but are not aware that rows from the update inputs are consumed when there is a match.
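
To illustrate the point about update rows being consumed, here is a toy Python model. It is not DataStage code and makes no claim about how the engine is implemented; the names toy_merge and keep_unmatched are hypothetical, and the single-update-link setup is an assumption made to keep the sketch short. It only shows what happens when three master rows share one key value and only one update row exists for that key.

```python
from collections import defaultdict, deque


def toy_merge(masters, updates, key, keep_unmatched=True):
    """Toy model: each update row can be matched by at most one master row."""
    # Queue the update rows by key value so each one can be handed out once.
    pending = defaultdict(deque)
    for row in updates:
        pending[row[key]].append(row)

    output = []
    for master in masters:
        queue = pending[master[key]]
        if queue:
            update = queue.popleft()  # the update row is consumed here
            output.append({**master, **update})
        elif keep_unmatched:
            # Unmatched master is kept, but carries no update columns.
            output.append(dict(master))
        # Otherwise the unmatched master is dropped.
    return output


# Assumed sample data: three master rows with the same key, one update row.
masters = [{"id": 1, "seq": n} for n in (1, 2, 3)]
updates = [{"id": 1, "extra": "x"}]

kept = toy_merge(masters, updates, "id", keep_unmatched=True)
dropped = toy_merge(masters, updates, "id", keep_unmatched=False)
print(len(kept), len(dropped))
```

In this model, keeping unmatched masters yields three rows of which only the first carries the update column, while dropping them yields a single row. Whether this is what happens in the job above depends on the actual key values and on how the inputs are sorted and partitioned, so treat it as a way to reason about the symptom rather than a diagnosis.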