The proposed methods would be `shuffle_by_keys`, `sort_by_keys`, and `group_by_keys`. Right now, we only have `shuffle_by_keys`.
@rjzamora says:
exposing a clear space for documentation is probably the best reason to add it. That documentation should also clarify that these global operations (requiring inter-partition data movement) should be avoided unless absolutely necessary 🙂
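To make the cost of "inter-partition data movement" concrete, here is a minimal pure-Python sketch of what a key-based shuffle does conceptually. It is not the library's implementation: the function name `shuffle_by_key_sketch`, the list-of-dicts partitions, and the modulo placement rule (standing in for a hash) are all assumptions made for illustration.

```python
def shuffle_by_key_sketch(partitions, key, npartitions):
    """Redistribute rows so equal keys land in the same output partition.

    Hypothetical helper for illustration only; assumes integer key values.
    Returns the new partitions and the number of rows that changed partition.
    """
    out = [[] for _ in range(npartitions)]
    moved = 0
    for src, part in enumerate(partitions):
        for row in part:
            # Modulo stands in for a real hash of the key.
            dest = row[key] % npartitions
            if dest != src:
                moved += 1  # this row has to cross a partition boundary
            out[dest].append(row)
    return out, moved


partitions = [
    [{"user": 1, "x": 10}, {"user": 2, "x": 20}],
    [{"user": 1, "x": 30}, {"user": 3, "x": 40}],
]
shuffled, moved = shuffle_by_key_sketch(partitions, "user", 2)
```

Every input partition has to be read in full before any output partition is complete, which is why operations of this kind are worth avoiding unless they are really needed.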