Details
- Type: Improvement
- Status: Open
- Priority: Major
- Resolution: Unresolved
- Affects Version/s: 0.9.0, 0.10.0
- Fix Version/s: None
- Component/s: None
- Labels: None
Description
As described in PIG-1188, if users define a schema (with or without types), we need to check the number of fields after loading data: if there are fewer fields than the schema declares, we need to pad with null fields, and if there are more, we need to throw the extra fields away.
For a schema with types, Pig used to insert a Foreach after the loader for type casting, which also checks the number of fields. For a schema without types there was no such Foreach, so PIG-1188 inserted one just to check the number of fields. Unfortunately, a Foreach is too expensive for such a check, and ideally we can push it into the loader.
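To make the intended check concrete, here is a minimal sketch of the pad-or-truncate step as a loader might apply it to each record. It does not use Pig's API: the class `FieldCountAdjuster` and the method `adjust` are hypothetical names introduced for illustration, and a record is modeled as a plain `List` rather than a Pig tuple.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of Pig: illustrates the field-count check only.
public class FieldCountAdjuster {

    // Returns a copy of `fields` with exactly `expected` entries:
    // missing trailing fields become null, extra trailing fields are dropped.
    public static List<Object> adjust(List<?> fields, int expected) {
        List<Object> out = new ArrayList<>(expected);
        for (int i = 0; i < expected; i++) {
            out.add(i < fields.size() ? fields.get(i) : null);
        }
        return out;
    }

    public static void main(String[] args) {
        // Schema declares 3 fields; the record has 2, so one null is appended.
        System.out.println(adjust(Arrays.asList("a", "b"), 3));           // [a, b, null]
        // Schema declares 3 fields; the record has 4, so the last is dropped.
        System.out.println(adjust(Arrays.asList("a", "b", "c", "d"), 3)); // [a, b, c]
    }
}
```

Doing this once per record inside the loader is a single linear pass over the fields, which is the kind of cheap check the description suggests in place of a separate Foreach.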
Attachments
Issue Links
- is related to: PIG-2661 Pig uses an extra job for loading data in Pigmix L9 (Open)