Unix error=111,APT_Communicator::connectTo: connect() fail

A forum for discussing DataStage<sup>®</sup> basics. If you're not sure where your question goes, start here.

Moderators: chulett, rschirm, roy

Post Reply
wbjk
Participant
Posts: 24
Joined: Thu Aug 09, 2007 1:31 am

Unix error=111,APT_Communicator::connectTo: connect() fail

Post by wbjk »

//In the enviroment of my company,,there are 40,000 jobs running on 20 servers per day. I think maybe datastage run out of the tcp resourcs of the system? how to fix it?
//CCB_Decode is a cutom stage.
//
here are the details of log:

2007-09-13 02:28:46 FATAL event id: 17535
CCB_Decode,3: APT_Communicator::connectTo: connect() failed due to Unix error = 111 (Connectioo
n refused) on node ODSETL13 using ConnectionInfo object 'TCP, connection Host: localhost.localdomaii
n (127.0.0.1), TCP port number: 11000', RETRYING connect()
APT_Communicator::connectTo: connect() failed due to Unix error = 111 (Connection refused) on
node ODSETL13 using ConnectionInfo object 'TCP, connection Host: localhost.localdomain (127.0.0.1),,
TCP port number: 11000', RETRYING connect()


2007-09-13 02:28:46 FATAL event id: 17536
CCB_Decode,3: Fatal Error: APT_Communicator::pmSendPartitionInfo() failed on node ODSETL13 forr
partition 3 of dataset 0 (write failed to handle 4) Bad file descriptor


2007-09-13 02:28:46 FATAL event id: 17537
node_node0005: Player 1 terminated unexpectedly.


2007-09-13 02:28:46 FATAL event id: 17538
main_program: Unexpected exit status 1
Unexpected exit status 1



2007-09-13 02:28:51 FATAL event id: 17539
main_program: The Section Leader on node node0004 has terminated unexpectedly.


2007-09-13 02:28:51 FATAL event id: 17541
main_program: Step execution finished with status = FAILED.
2007-09-13 02:28:51 STARTED event id: 17543
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

The APT_Communicator class is used mainly (only?) for communication between player processes, which basically means when repartitioning is going on. Perhaps you could try to reduce the amount of repartitioning that is occurring in your job design.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
wbjk
Participant
Posts: 24
Joined: Thu Aug 09, 2007 1:31 am

Post by wbjk »

I am not a premium poster, I cannot see all the content of your post ,can you give me a private message?

:(

This error has happened for some days. So many core files appear in the user's directory!
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

For less than 30c per day you can be a premium member, and help to defray the bandwidth costs needed to keep this site alive. I do not propose to prejudice that mechanism.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply