Hi
I have various binary EBCDIC files which I need to read from Hadoop.
I have read the files via a CFF stage (EBCDIC) when I have just the files. How can I read from Hadoop? Do I need to land the files somewhere and then use the CFF stage?
The problem is they are very large files 100's GB
Or is there another way?
How to read a binary EBCDIC file that is on Hadoop
Moderators: chulett, rschirm, roy
I'm not yet familiar with the connectivity with Hadoop, so others will need to offer advice on that, but there should be two main options for you: read the file from the Hadoop server directly using CFF, or FTP the file to your local DataStage server first.
You can use FTP directly into your processing job. I have dozens of jobs doing that from the mainframe. However, an FTP session is less stable than a local read. CFF is definitely the preferred method.
You can use FTP directly into your processing job. I have dozens of jobs doing that from the mainframe. However, an FTP session is less stable than a local read. CFF is definitely the preferred method.
Franklin Evans
"Shared pain is lessened, shared joy increased. Thus do we refute entropy." -- Spider Robinson
Using mainframe data FAQ: viewtopic.php?t=143596 Using CFF FAQ: viewtopic.php?t=157872
"Shared pain is lessened, shared joy increased. Thus do we refute entropy." -- Spider Robinson
Using mainframe data FAQ: viewtopic.php?t=143596 Using CFF FAQ: viewtopic.php?t=157872