Search found 3045 matches

by vmcburney
Thu Mar 02, 2006 6:15 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Clustering and High Availability
Replies: 3
Views: 1139

I've got a book from IBM here called "An Overview of the IBM WebSphere Data Integration Suite", the section on SOA states the following benefits: * Scalable - They provide real-time scalability by distributing request processing and stopping/starting job instances across multiple WebSphere servers. ...
by vmcburney
Tue Feb 28, 2006 4:59 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Parsing URL's in a Web Log file
Replies: 3
Views: 744

You can process it as a normal sequential file and parse the data in a transformer. It is worth installing the click pack routines as they will parse URLs for you without you needing to write the code. I found the log reader stage was very efficient and using the sequential file stage did not give a...
by vmcburney
Tue Feb 28, 2006 4:54 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Clustering and High Availability
Replies: 3
Views: 1139

There was a session on this at AscentialWorld a couple years ago, from memory they were able to provide high availability by running the jobs through the RTI pack. The config files are hard coded to nodes, the RTI pack is more flexible letting jobs run on servers that are free or under utilised. You...
by vmcburney
Tue Feb 28, 2006 2:53 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Calling DS Job through web page
Replies: 5
Views: 1632

If you were doing a lot of these you could go for the RTI pack and call it as a web service.
by vmcburney
Mon Feb 27, 2006 6:39 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Select First Record only
Replies: 12
Views: 3357

There is a FAQ How do I remove duplicate rows from a data source? This will give you a few options.
by vmcburney
Mon Feb 27, 2006 6:35 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Parsing URL's in a Web Log file
Replies: 3
Views: 744

You need to read the PDF and help file that comes with the click pack. The Log Reader stage has a property that defines the type of information to be retrieved from the log file using a set of character codes. There are also a set of routines installed including one that can parse a URL from a log e...
by vmcburney
Mon Feb 27, 2006 2:47 pm
Forum: IBM<sup>®</sup> Infosphere DataStage Server Edition
Topic: Datastage 7.5.1 Release Notes
Replies: 5
Views: 1723

You can access all the release readme notes on the Ascential-IBM e.services web site. You will need your serial number to create a login.
by vmcburney
Mon Feb 27, 2006 2:44 pm
Forum: Data Integration
Topic: If you have enjoyed my blogs please vote for me
Replies: 7
Views: 12645

I've been nominated for Best New Blog! Please head over to m and vote for me on the second voting screen. "An Experts Guide to WebSphere Information Integration" is currently running equal third on 12%. I'm up against some very good bloggers and I would like to thank everyone on the site who nominat...
by vmcburney
Fri Feb 24, 2006 8:17 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Datasets having 30 M worth of data
Replies: 5
Views: 1325

If you use a lookup stage with 30 million rows you will find your job still needs to move most of that reference data into a lookup file. It cannot lookup directly against a dataset. The dataset will still be fast in that it is already partitioned and on the nodes. The lookup fileset will be faster ...
by vmcburney
Fri Feb 24, 2006 8:14 pm
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: User Interface with Datastage
Replies: 4
Views: 1485

Seems like a perfect business case for use Real Time Integration. Currently it is an additional license fee, though in the Hawk release it may be free. It lets you turn any job into a web service. You can then write something that calls the web service and passes it the customer number.
by vmcburney
Thu Feb 23, 2006 4:45 pm
Forum: Site/Forum
Topic: voting for the most interesting topics
Replies: 3
Views: 2660

I have always wanted a voting button on replies to indicate that a reply was useful or popular. Maybe a rating out of five. Similar to the voting on sites such as Yahoo Answers. This would make the search facility far more effective. I've lost count of the number of times I've searched for an answer...
by vmcburney
Wed Feb 22, 2006 8:41 pm
Forum: Data Integration
Topic: If you have enjoyed my blogs please vote for me
Replies: 7
Views: 12645

Thanks for the great feedback, it is one of the things that makes blogging rewarding. You can also nominate one of my entries for best blog entry of 2005 but you need to choose entry from last year. For DataStage tips I recommend you read: DataStage server v enterprise: some performance stats If you...
by vmcburney
Wed Feb 22, 2006 2:20 pm
Forum: Data Integration
Topic: If you have enjoyed my blogs please vote for me
Replies: 7
Views: 12645

If you have enjoyed my blogs please vote for me

The ittoolbox web site is running IT Community Choice Awards for favourite blogs. If you have read and enjoyed my blog An Expert's Guide to WebSphere Information Integration on that site please vote for me at: m I am hoping to sneak into the top 5 "Best New Blog" category. If you haven't seen my blo...
by vmcburney
Sun Feb 19, 2006 3:52 am
Forum: IBM<sup>®</sup> DataStage Enterprise Edition (Formerly Parallel Extender/PX)
Topic: Joining Billions of records with millions
Replies: 9
Views: 2726

Yah, join with the reference data already sorted may be good as 2 billion will never fit into lookup RAM memory. Make sure your job doesn't try to resort the data. The lookup fileset would take a long time to load and may be an option if you use this particular 2 billion dataset in multiple jobs. Wi...
by vmcburney
Sun Feb 19, 2006 12:11 am
Forum: Data Integration
Topic: Level of ELT DataStage can do..
Replies: 4
Views: 12357

It is an interesting topic. I think both products will have a strong future. Informatica have bet on both horses with ELT ability built into the latest PowerCenter. IBM-Ascential have opted to bypass ELT and go with feature rich ETL, Hawk release sees better data quality plugins. If you tried to do ...