Big integrate - joins
Posted: Fri Jul 13, 2018 4:00 am
Hello,
I have a DataStage job that reads data through the BDFS stage. The underlying Hive HQL currently reads from a single table, and all other lookups and joins are done in the DataStage job.
What would be the advantage of performing the multi-table joins in the Hive HQL and reading the joined result through the BDFS stage, instead of performing the joins in DataStage after reading from a single table?
Thanks.