How to read a binary EBCDIC file that is on Hadoop

trenicar · Post by **trenicar** » Mon Nov 24, 2014 4:38 am

Hi

I have various binary EBCDIC files which I need to read from Hadoop.

I have read files like these with a CFF stage (EBCDIC) when they were available as plain files. How can I read them from Hadoop? Do I need to land the files somewhere first and then use the CFF stage?

The problem is that they are very large files, hundreds of GB.

Or is there another way?

FranklinE · Post by **FranklinE** » Mon Nov 24, 2014 9:52 am

I'm not yet familiar with Hadoop connectivity, so others will need to offer advice on that, but there should be two main options for you: read the file directly from the Hadoop server using CFF, or FTP the file to your local DataStage server first.

You can also use FTP directly in your processing job. I have dozens of jobs doing that from the mainframe. However, an FTP session is less stable than a local read. CFF is definitely the preferred method.
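For anyone who wants to check what is in such a file before deciding whether to land it, here is a minimal Python sketch of the general idea of streaming fixed-length EBCDIC records without holding the whole file in memory. It is not a DataStage or CFF feature. Everything about the layout is an assumption made up for illustration: a 20-byte record with a 16-byte text field followed by a 4-byte packed-decimal (COMP-3) field, and code page `cp037`. The real layout has to come from your copybook.

```python
import io
import sys
from typing import BinaryIO, Iterator, Tuple

# Hypothetical layout: replace with the values from your copybook.
RECORD_LENGTH = 20        # assumed fixed record length in bytes
NAME_SLICE = slice(0, 16)    # assumed PIC X(16) text field
AMOUNT_SLICE = slice(16, 20)  # assumed PIC S9(7) COMP-3 field
EBCDIC_CODEC = "cp037"    # assumed code page (US/Canada EBCDIC)


def unpack_comp3(raw: bytes) -> int:
    """Decode a packed-decimal (COMP-3) field into an integer."""
    digits = []
    for byte in raw[:-1]:
        digits.append(byte >> 4)
        digits.append(byte & 0x0F)
    # The last byte holds one digit and the sign nibble.
    digits.append(raw[-1] >> 4)
    sign_nibble = raw[-1] & 0x0F
    value = int("".join(str(d) for d in digits))
    return -value if sign_nibble == 0x0D else value


def read_records(stream: BinaryIO, record_length: int = RECORD_LENGTH) -> Iterator[bytes]:
    """Yield one fixed-length record at a time from a buffered binary stream."""
    while True:
        record = stream.read(record_length)
        if not record:
            return
        if len(record) < record_length:
            raise ValueError("truncated record at end of stream")
        yield record


def parse_record(record: bytes) -> Tuple[str, int]:
    """Split one record into (name, amount) using the assumed layout."""
    name = record[NAME_SLICE].decode(EBCDIC_CODEC).rstrip()
    amount = unpack_comp3(record[AMOUNT_SLICE])
    return name, amount


def main() -> None:
    # Reads binary records from standard input and prints them as text.
    for record in read_records(sys.stdin.buffer):
        name, amount = parse_record(record)
        print(f"{name}\t{amount}")
```

If a script ends with a call to `main()`, the file can be piped into it straight from HDFS, for example `hdfs dfs -cat <path> | python decode_ebcdic.py` (the script name is hypothetical). Only one record is held in memory at a time, so file size matters for run time but not for memory.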