ETL of Twitter,LinkedIn,GoogleAnalytics data using Datastage

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
mydsworld
Participant
Posts: 321
Joined: Thu Sep 07, 2006 3:55 am

ETL of Twitter,LinkedIn,GoogleAnalytics data using Datastage

Post by mydsworld »

Please let me know how can I access the following data source for ETL work, using Datastage.

1. Twitter
2. Google Analytics
3. LinkedIn

Thanks.
eostic
Premium Member
Premium Member
Posts: 3838
Joined: Mon Oct 17, 2005 9:34 am

Post by eostic »

Absolutely......but ask first, "how" do you get to all of those data sources? [ie...what technology is supported by those vendors or outlets].... For example, Twitter supports a JSON based REST web service. DataStage can issue such calls....so there's your answer. The others are probably the same, but maybe they have other mechanisms (downloads, other APIs, etc.).

Ernie
Ernie Ostic

blogit!
<a href="https://dsrealtime.wordpress.com/2015/0 ... ere/">Open IGC is Here!</a>
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Right... don't worry about the "from DataStage" part quite yet. Find out what they support, what external access methods / API they have and go from there.
-craig

"You can never have too many knives" -- Logan Nine Fingers
Teej
Participant
Posts: 677
Joined: Fri Aug 08, 2003 9:26 am
Location: USA

Post by Teej »

With knowledge of C/C++, and the ability to build a custom stage, you can access just about anything. The real question is whether or not this will provide you with information that can be useful in a structured data format, or if a different solution is ideal due to unstructured data.
Developer of DataStage Parallel Engine (Orchestrate).
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

With knowledge of Java, and the ability to work with the Java stages, you can access just about anything. The real question is whether or not this will provide you with information that can be useful in a structured data format, or if a different solution is ideal due to unstructured data.

Note also that some environments have security requirements.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply