Warning in aggregator stage (Hash table has grown etc..)
Posted: Tue Jan 04, 2011 2:10 pm
Hi, thanks for reading. Getting the following warning.
agg_UWMargin,0: Hash table has grown to 16384 entries. [groupby/hashgroup2.C:966]
So I changed the method to 'sort' instead of 'hash' and the warning went away. However I'm worried that when running on a 4 node config file and method=sort, the data will not be partitioned correctly..thought that was method=hash would do? Do I need to do anything extra in teh aggregator stage to make sure the data is sorted and partitioned (spread accross the nodes) correctly??
Here is how my env variable are set
APT_NO_SORT_INSERTION = FALSE
APT_NO_PART_INSERTION = FALSE
agg_UWMargin,0: Hash table has grown to 16384 entries. [groupby/hashgroup2.C:966]
So I changed the method to 'sort' instead of 'hash' and the warning went away. However I'm worried that when running on a 4 node config file and method=sort, the data will not be partitioned correctly..thought that was method=hash would do? Do I need to do anything extra in teh aggregator stage to make sure the data is sorted and partitioned (spread accross the nodes) correctly??
Here is how my env variable are set
APT_NO_SORT_INSERTION = FALSE
APT_NO_PART_INSERTION = FALSE