[CRUNCH-268] Crunch's internal Avro tuple schemas should have stable names - ASF JIRA

XML

Word

Printable

JSON

Details

Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: 0.7.0
Fix Version/s: 0.8.0
Component/s: Core, IO
Labels:
None

Description

A long time ago, I made a change that used random names for the custom Avro schemas that Crunch generates for processing tuple types (pairs, trips, etc.). I recently hit a use case where that randomization burned me when I was re-running some pipelines over checkpointed data that I serialized using Crunch's Avro schemas (Pair, in particular), so I think that we should change the tuple schemas to have stable names based on their constituent field schemas via an MD5 hash.

Attachments

- Sort By Name
- Sort By Date
- Ascending
- Descending

CRUNCH-268.patch
20/Sep/13 22:12
3 kB
Josh Wills
CRUNCH-268v2.patch
21/Sep/13 20:33
4 kB
Josh Wills

Activity

People

Assignee:: Josh Wills

Reporter:: Josh Wills

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Dates

Created:: 20/Sep/13 22:10

Updated:: 08/Nov/13 21:24

Resolved:: 21/Sep/13 20:37