Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
None
-
None
-
ghx-label-7
Description
The documentation in "The Underlying Functions for a UDA" doesn't do a good job of explaining the state transitions that a UDA can go through. E.g. when is Serialize() called. It's complicated because data needs to be serialized to go over the network, but sometimes is serialized to spill to disk, which changes the sequence of function calls.
See https://community.cloudera.com/t5/Interactive-Short-cycle-SQL/Function-execution-flow-in-UDAs-and-memory-implications-for/m-p/88892#M5532?eid=1&aid=1 for a user who is trying to understand this.