Jobs Hanging

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
diamondabhi
Premium Member
Premium Member
Posts: 108
Joined: Sat Feb 05, 2005 6:52 pm
Location: US

Jobs Hanging

Post by diamondabhi »

When I try to run any of my jobs, they are starting and hanging at the following message:
Parallel job initiated

When I grep for phantoms, a phantom is being created for the job. We rebooted the server and no other jobs are running on the server. Please help.

They were all running fine before, but cannot guess what changed. Please help.

DS Version 8.0.1
AIX 5.3
Oracle MetaData Repository

Abhi.
Every great mistake has a halfway moment, a split second when it can be recalled and perhaps remedied.
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Try doing a "truss-p <pid>" on the processes to see which, if any, system calls they are doing. Perhaps they are waiting on database locks in the repository. Also, try creating a dummy server job or job sequence and see if that has the same problems.
diamondabhi
Premium Member
Premium Member
Posts: 108
Joined: Sat Feb 05, 2005 6:52 pm
Location: US

Post by diamondabhi »

truss -p <pid>

I get the following output, but I do not understand what it means:

kread(18, "\0\0\0\0\0\0\0\0\0\0\0\0".., 4096) (sleeping...)
kread(18, "\0\0\0\0\0\0\0\0\0\0\0\0".., 4096) Err#82 ERESTART
Received signal #14, SIGALRM [caught]
sigprocmask(2, 0x300BF840, 0x00000000) = 0
incinterval(0, 0x2FF1B258, 0x2FF1B268) = 0
sigprocmask(0, 0x00000000, 0x2FF1B250) = 0
sigprocmask(2, 0xF0498790, 0x2FF1B1B0) = 0
_sigaction(14, 0x2FF1B270, 0x2FF1B260) = 0
thread_setmymask_fast(0x00000000, 0x00000000, 0x00000000, 0x101EC0C9, 0x00000000, 0x000002DC, 0x00000000, 0x00000000) = 0x00000000
appgettimer(9, 0x2FF1B330) = 0
sigprocmask(0, 0x00000000, 0x3027FD74) = 0
klseek(26, 0, 2048, 0x00000000) = 0
kread(26, "\0\0\b8C\0\0\08C\0\0\f03".., 2048) = 2048
klseek(22, 0, 4096, 0x00000000) = 0
kread(22, "\0\011 \\0\0\t \\0\0\f\v".., 2048) = 2048
sigprocmask(2, 0x3027FD74, 0x00000000) = 0
sigprocmask(0, 0x00000000, 0x2FF1B250) = 0
sigprocmask(2, 0xF0498790, 0x2FF1B1B0) = 0
_sigaction(14, 0x2FF1B270, 0x2FF1B260) = 0
thread_setmymask_fast(0x00000000, 0x00000000, 0x00000000, 0x101EC0C9, 0x00000000, 0x000002AC, 0x00000000, 0x00000000) = 0x00000000
incinterval(0, 0x2FF1B258, 0x2FF1B268) = 0
sigprocmask(0, 0x00000000, 0x300BF840) = 0
Every great mistake has a halfway moment, a split second when it can be recalled and perhaps remedied.
diamondabhi
Premium Member
Premium Member
Posts: 108
Joined: Sat Feb 05, 2005 6:52 pm
Location: US

Post by diamondabhi »

When I run a dummy server job, nothing happens and it stays in the compiled status only.
Every great mistake has a halfway moment, a split second when it can be recalled and perhaps remedied.
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

I am not certain what is happening in your case. Is your repository on the same machine as the Datastage server engine? If not, could you stop and restart both the DataStage engine as well as the database?
diamondabhi
Premium Member
Premium Member
Posts: 108
Joined: Sat Feb 05, 2005 6:52 pm
Location: US

Post by diamondabhi »

Well it was the new 5.3 ODBC drivers that messed up the things, reverting back to 5.2 drivers fixed the issue. IBM is working on getting their act together.

Thanks for your help.

Abhi.
Every great mistake has a halfway moment, a split second when it can be recalled and perhaps remedied.
diamondabhi
Premium Member
Premium Member
Posts: 108
Joined: Sat Feb 05, 2005 6:52 pm
Location: US

Post by diamondabhi »

Well it was the new 5.3 ODBC drivers that messed up the things, reverting back to 5.2 drivers fixed the issue. IBM is working on getting their act together.

Thanks for your help.

Abhi.
Every great mistake has a halfway moment, a split second when it can be recalled and perhaps remedied.
Post Reply