Hi,
I have been asked by our development team if there is an alternate/better way to search through a Data Set. I originally pointed them to the Data Set Management utility, and they came back with "searching through millions of rows would take hours with the limited row display". Besides dumping to a text file or stage table, is there a way to easily query a Data Set for debugging purposes?
We are running Version 8.0.1 on Windows and are in the process of upgrading to 8.7 on Windows.
Thanks,
John
Best way to search through a DataSet
-
- Premium Member
- Posts: 306
- Joined: Wed Jun 21, 2006 11:41 am
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
Short answer: no.
Probably the fastest would be a parallel job that reads the Data Set and uses a Transformer stage to effect the search. You can run this with more nodes than exist in the Data Set.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
What Ray said is the best way. You can also use the orchadmin command to dump the data into a text file, then open it in a spreadsheet (.xls) or use grep (MKS Toolkit) to find the value you are looking for in that text file.
Again, it depends on your data volume, so you need to decide.
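If grep is not to hand, the text-file approach above can be sketched in a few lines of Python. This is only a rough illustration, assuming the Data Set has already been dumped to a delimited text file with one record per line. The function name search_dump, the comma delimiter and the sample rows are all made up for the example and are not anything DataStage provides.

```python
import csv
import io
from typing import Iterator, List, Optional, TextIO, Tuple


def search_dump(
    lines: TextIO,
    needle: str,
    column: Optional[int] = None,
    delimiter: str = ",",
) -> Iterator[Tuple[int, List[str]]]:
    """Yield (record_number, fields) for every record that contains needle.

    Records are read one at a time, so the whole dump never has to fit
    in memory. If column is given (zero-based), only that field is checked;
    otherwise every field is checked.
    """
    reader = csv.reader(lines, delimiter=delimiter)
    for record_number, fields in enumerate(reader, start=1):
        if column is None:
            hit = any(needle in field for field in fields)
        else:
            # Short records simply do not match instead of raising an error.
            hit = column < len(fields) and needle in fields[column]
        if hit:
            yield record_number, fields


if __name__ == "__main__":
    # Hypothetical sample standing in for a dumped Data Set.
    sample = io.StringIO("1,SMITH,Sydney\n2,JONES,Perth\n3,SMITHERS,Hobart\n")
    for number, fields in search_dump(sample, "SMITH", column=1):
        print(number, fields)
```

For a real dump you would pass an opened file (for example open(path, newline="")) instead of the in-memory sample. Because matches are yielded as they are found, you can stop at the first hit rather than scanning millions of rows.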
Thanks
Ram
----------------------------------
Revealing your ignorance is fine, because you get a chance to learn.
-
- Premium Member
- Posts: 306
- Joined: Wed Jun 21, 2006 11:41 am
ray.wurlod wrote: Probably the fastest would be a parallel job that reads the Data Set and uses a Transformer stage to effect the search. You can run this with more nodes than exist in the Data Set.
So, Ray, did you mean that we could write the Data Set using one configuration file and read it with another? How is that possible?
Regards,
Praveen
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
Yes, that's what I'm saying. A copy of the configuration file used to write the Data Set is stored in its descriptor file, and this can be used to read the Data Set (the data then have to be automatically re-partitioned onto the nodes of the currently active configuration file). DataStage looks after that for you. If you prefer to use the orchadmin command, specify the -x option.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.