BDFS Row Column Number
Moderators: chulett, rschirm, roy
-
- Premium Member
- Posts: 39
- Joined: Tue Apr 15, 2014 9:14 am
BDFS Row Column Number
I would like to understand whether the "Row Column Number" option in the BDFS file input step is impacted by partitioning or other parallelism options, or if it will always produce row numbers matching the source file.
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
Keep in mind, too, that "the file" is not necessarily a valid concept in Big Data. Data in what is logically "a file" will more than likely be distributed across nodes in a Hadoop distributed file system or similar. So what can "row number" mean in this context?
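To make the ambiguity concrete, here is a minimal Python sketch of the general idea. It is not a description of how the BDFS stage actually numbers rows. The functions `split_into_blocks`, `local_row_numbers` and `global_row_numbers` are hypothetical names invented for illustration, and the "blocks" are simply contiguous slices of a list standing in for the pieces of a distributed file. The point is only that a reader working on one block can count its own rows, but needs the sizes of all preceding blocks to produce a number that matches the position in the logical source file.

```python
from itertools import accumulate

def split_into_blocks(rows, n_blocks):
    """Split rows into n_blocks contiguous chunks (a stand-in for distributed file blocks)."""
    size, extra = divmod(len(rows), n_blocks)
    blocks, start = [], 0
    for i in range(n_blocks):
        end = start + size + (1 if i < extra else 0)
        blocks.append(rows[start:end])
        start = end
    return blocks

def local_row_numbers(blocks):
    """Number rows independently within each block, restarting at 1 every time."""
    return [[(i, row) for i, row in enumerate(block, start=1)] for block in blocks]

def global_row_numbers(blocks):
    """Number rows by adding each block's starting offset, matching the logical file order."""
    offsets = [0] + list(accumulate(len(b) for b in blocks))[:-1]
    return [
        [(offset + i, row) for i, row in enumerate(block, start=1)]
        for offset, block in zip(offsets, blocks)
    ]

if __name__ == "__main__":
    rows = ["a", "b", "c", "d", "e"]
    blocks = split_into_blocks(rows, 2)
    print(local_row_numbers(blocks))   # numbering restarts in the second block
    print(global_row_numbers(blocks))  # numbering continues across blocks
```

Which of these two behaviours (if either) the "Row Column Number" option gives you under a particular partitioning setup is something to confirm by testing against a file with a known row order.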
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.