SIGSEGV, SIGBUS

Ultramundane
Participant
Posts: 407
Joined: Mon Jun 27, 2005 8:54 am
Location: Walker, Michigan

Re: SIGSEGV, SIGBUS

Post by Ultramundane »

This is usually caused by bad code that triggers a segmentation fault. It could also be failing hardware. Did the error daemon log anything?
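
If you can launch the failing program yourself from a script, one way to confirm which signal ended it is to look at its exit status. Here is a minimal Python sketch, assuming a POSIX system where a negative return code means "terminated by that signal number". The function name describe_exit is just an illustrative choice and is not part of DataStage.

```python
import signal
import subprocess
import sys


def describe_exit(returncode):
    """Translate a subprocess return code into a readable description."""
    if returncode < 0:
        # On POSIX, subprocess reports death-by-signal as the negated signal number.
        number = -returncode
        try:
            return f"killed by {signal.Signals(number).name}"
        except ValueError:
            # The number has no name in this platform's signal table.
            return f"killed by signal {number}"
    return f"exited with status {returncode}"


if __name__ == "__main__" and len(sys.argv) > 1:
    # Run the command given on the command line and report how it ended.
    completed = subprocess.run(sys.argv[1:])
    print(describe_exit(completed.returncode))
```

Saved under a file name of your choice and given a command to run, it prints either the exit status or the name of the signal that killed the command.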
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

The UNIX command man signal will tell you about signals. Here are the most common ones; the numbers can differ between UNIX variants, so check the values on your own platform. An asterisk indicates that a core image is dumped.

Value    Description
 1     (SIGHUP) hangup
 2     (SIGINT) interrupt
 3*    (SIGQUIT) quit
 4*    (SIGILL) illegal instruction
 5*    (SIGTRAP) trace trap
 6*    (SIGIOT) IOT instruction
 7*    (SIGEMT) EMT instruction
 8*    (SIGFPE) floating-point exception
 9     (SIGKILL) kill (cannot be caught or ignored)
10*    (SIGBUS) bus error
11*    (SIGSEGV) segmentation violation
12*    (SIGSYS) bad argument to system call
13     (SIGPIPE) write on a pipe with no one to read it
14     (SIGALRM) alarm clock
15     (SIGTERM) software termination signal
16     (SIGURG) urgent condition present on socket
17     (SIGSTOP) stop (cannot be caught or ignored)
18     (SIGTSTP) stop signal generated from keyboard
19     (SIGCONT) continue after stop
20     (SIGCHLD) child status has changed
21     (SIGTTIN) background read attempted from control terminal
22     (SIGTTOU) background write attempted to control terminal
23     (SIGIO) I/O is possible on a descriptor
24     (SIGXCPU) CPU time limit exceeded
25     (SIGXFSZ) file size limit exceeded
26     (SIGVTALRM) virtual time alarm
27     (SIGPROF) profiling timer alarm
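
Since signal numbers are platform-specific, a list like the one above may not match your server exactly. As a rough sketch, and assuming Python 3 is available on the machine, the standard signal module can print the numbers that apply there. The helper name signal_table is only illustrative.

```python
import signal


def signal_table():
    """Return {number: name} for the signals this platform defines."""
    # Iterating signal.Signals yields each signal once; alias names are skipped.
    return {int(sig): sig.name for sig in signal.Signals}


if __name__ == "__main__":
    for number, name in sorted(signal_table().items()):
        print(f"{number:2d}  {name}")
```

The output depends on the operating system, so run it on the same host where the job fails.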
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
kumar_s
Charter Member
Posts: 5245
Joined: Thu Jun 16, 2005 11:00 pm

Post by kumar_s »

Hi,

I guess this is an intermittent issue.
Please let us know what happens if you reset the job and rerun it when there is no load on the server.

-Kumar
kumar_s
Charter Member
Posts: 5245
Joined: Thu Jun 16, 2005 11:00 pm

Post by kumar_s »

I guess you approached Australian support for this.
We got the same resolution too :wink:
IBM did develop a patch, which fixed these problems.
Did you receive a patch for this?

I wish the Java code for jobmon could be upgraded. 8)

-Kumar
rhys.jones@target.com
Participant
Posts: 24
Joined: Mon Mar 14, 2005 6:42 pm
Location: Minneapolis, Minnesota

Post by rhys.jones@target.com »

Sorry, good ol' USA support engineers. :wink:

Yes, we did receive a patch, and it resolved the SIGBUS/SIGSEGV errors that were coming from this bug. We have now been able to re-enable time-based monitoring. Check with your support engineer for the patch under ecase e74035. We received a patch for both AIX and Sun.
kumar_s
Charter Member
Posts: 5245
Joined: Thu Jun 16, 2005 11:00 pm

Post by kumar_s »

Hi,

Does the patch do anything more than turning off the job monitor and converting time-based monitoring to size-based monitoring?
I guess I cannot access this case directly, right?

-Kumar
bmadhav
Charter Member
Posts: 50
Joined: Wed May 12, 2004 1:16 pm

Post by bmadhav »

You could have a memory leak while your job is running, with the operating system issuing a hard kill once the DataStage process has used up all the memory. You should monitor memory usage when your job starts up.
As the others suggested, you should contact IBM tech support and send them the job log.
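
One simple way to watch memory while a job runs is to sample the size of the process with ps at regular intervals. The sketch below is only an illustration: it assumes a ps that accepts the POSIX options -o vsz= and -p, and that you already know the PID of the process you care about. The function names parse_kb, virtual_size_kb, and watch are hypothetical helpers, not DataStage tools.

```python
import subprocess
import time


def parse_kb(ps_output):
    """Turn the output of `ps -o vsz=` into an int, or None if nothing was reported."""
    text = ps_output.strip()
    return int(text) if text.isdigit() else None


def virtual_size_kb(pid):
    """Ask ps for the virtual memory size (in 1024-byte units) of one process."""
    result = subprocess.run(
        ["ps", "-o", "vsz=", "-p", str(pid)],
        capture_output=True,
        text=True,
    )
    return parse_kb(result.stdout)


def watch(pid, interval_seconds=5.0, samples=12):
    """Print a timestamped size per sample; stop early if the process is gone."""
    for _ in range(samples):
        size = virtual_size_kb(pid)
        if size is None:
            print("process has ended")
            break
        print(time.strftime("%H:%M:%S"), size, "KB")
        time.sleep(interval_seconds)
```

Calling watch with the PID of the job's process prints one line per sample. A size that keeps climbing until the failure would support the memory-leak theory, and the output is useful to attach alongside the job log.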