confusion and request for help setup Windows UNIX datastage

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
dsex100
Premium Member
Premium Member
Posts: 45
Joined: Wed Apr 01, 2009 9:41 am

confusion and request for help setup Windows UNIX datastage

Post by dsex100 »

Hello,

I HAVE THE FOLLOWING SETUP:

DataStage Enterprise Edition 7.5x2 installed running on Windows Server 2003 Enterprise Edition SP2 (both client and server components)

single processor machine
----

I'm only writing to text files right now to learn the Enterprise Edition.

I will install some databases, starting with Oracle or SQL Server, not sure yet.

I WANT TO SETUP THE FOLLOWING:

I have another machine with a dual-core processor.

I want to setup DataStage on that other machine in order to use the parallel configuration capabilities.

I want to install UNIX on that other machine in order to run the jobs on UNIX.

So, I'm in the process of installing CentOS-3.9 which is the redistributable of Red Hat Linux Enterprise Server 3 (formerly known as Red Hat Linux Advanced Server 3 as in the datastage documentation)

I had some problems with the installation of CentOS-3.9 not detecting my monitor because of drivers so it was recommended to me to install CentOS-5 in order to not have a hardware driver problem.

(This would be the same as Red Hat Linux Enterprise Server 5)

So well, I would like to know if anyone has run datastage on CentOS-5 or Red Hat Linux Enterprise Server 5?

I'm a little confused as to how I will be able to or how I can work with Datastage, setting up between Windows and UNIX between 2 machines.

In the documentation and as I have tested, you can run parallel jobs on Windows Server 2003..this is fine.

Will I be able to kick off parallel jobs from the Windows Server 2003 over my network connection where the jobs are actually located on my UNIX machine?

If so, will the jobs appear in Datastage Director even though they actually reside on the UNIX machine?

I was reading in the Datastage documenation that you can install Datastage on a UNIX machine and use the Datastage client components on Windows XP or 2000.

So, would this also be possible from the Windows Server 2003 "client?"

Something else that made me wonder- in one of the videos in the DSXCHANGE LEARNING CENTER called "WINDOWS: Start dssh" by Ken Bland he connects to a Telnet Session on Windows Server 2003.

Is he connecting to a UNIX machine where the Datastage server components are installed or is he using a UNIX server on Windows??

If it would be possible to use a UNIX server environment that's already on Windows that would be amazing.


Well as you might notice, I'm sort of confused and need to figure this out before I can figure out how to setup database connectivity between these two environments.
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

A couple of things... one is that you don't need multiple processors to "use the parallel configuration capabilities". Node <> cpu.

The other is you have a Windows version of the tool, so no UNIX anything for you. Not without a license purchase for your flavor of UNIX, that is.
-craig

"You can never have too many knives" -- Logan Nine Fingers
dsex100
Premium Member
Premium Member
Posts: 45
Joined: Wed Apr 01, 2009 9:41 am

Post by dsex100 »

hmm so node<>cpu ok..

would you say that parallel jobs are faster to run on the server even if there's only 1 processor compared to datastage server jobs?

and what was I thinking it's true I didn't even think that my version of datastage would only install on windows! duh!
dsex100
Premium Member
Premium Member
Posts: 45
Joined: Wed Apr 01, 2009 9:41 am

Post by dsex100 »

thank you for your reply , it really set me straight and on the right track

wow
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

dsex100 wrote:would you say that parallel jobs are faster to run on the server even if there's only 1 processor compared to datastage server jobs?
Probably not for small to medium volumes, because the startup overhead of a parallel job is significant compared to that of a server job.
Possibly not for large volumes, because the machine itself will be more likely to be overloaded, not by the total volume necessarily but by the extra processes' demands for resources.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
dsex100
Premium Member
Premium Member
Posts: 45
Joined: Wed Apr 01, 2009 9:41 am

Post by dsex100 »

thank you for your help :)

i went through the readme file for my version of datastage and I found 2 locations where it specifically stated or answered my question if it would run on UNIX,

1- Build Platform: WINNT
2- Platform - It states that this product is only intended for Windows
clients- windows xp windows 2000
server windows server 2003 standard or enterprise

i will mark this as resolved and continue my questions in another post
kcbland
Participant
Posts: 5208
Joined: Wed Jan 15, 2003 8:56 am
Location: Lutz, FL
Contact:

Post by kcbland »

Just to be clear - you purchase either a Windoze or a Unix version of DataStage. The Clients are always on Windoze, the Server is platform selectable. Everything resides on the Server side. If you write your jobs to be transportable, they can run in either Server environment. Most companies have a single consistent Server platform and don't care about transportability.

As for parallel performance on a single cpu, the answer is usually quite simple. The more tasks you have a single cpu doing the fewer amount of cycles it will spend on those tasks. There's also more overhead of task coordination. The more cpus you have the more you can spread tasks across cpus and the thread coordination overhead becomes distributed.
Kenneth Bland

Rank: Sempai
Belt: First degree black
Fight name: Captain Hook
Signature knockout: right upper cut followed by left hook
Signature submission: Crucifix combined with leg triangle
kcbland
Participant
Posts: 5208
Joined: Wed Jan 15, 2003 8:56 am
Location: Lutz, FL
Contact:

Post by kcbland »

In case I wasn't absolutely clear - you buy a Windoze license and it only installs on Windoze.
Kenneth Bland

Rank: Sempai
Belt: First degree black
Fight name: Captain Hook
Signature knockout: right upper cut followed by left hook
Signature submission: Crucifix combined with leg triangle
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

And if that still wasn't clear - crucifix combined with leg triangle!
-craig

"You can never have too many knives" -- Logan Nine Fingers
JRodriguez
Premium Member
Premium Member
Posts: 425
Joined: Sat Nov 19, 2005 9:26 am
Location: New York City
Contact:

Post by JRodriguez »

Nowdays - IIS 8.xx - the licenses are for all supported OS .... IBM allows you to download the software for any supported OS. The xml license file is the same for all
Julio Rodriguez
ETL Developer by choice

"Sure we have lots of reasons for being rude - But no excuses
Post Reply