Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.1.0
    • Component/s: Shuffle
    • Labels: None

      Description

      Encrypted shuffle was introduced in Hadoop 2.6 and makes the shuffle data path safer. This feature is also needed in Spark. AES is a specification for the encryption of electronic data; it has five common modes of operation, of which CTR is one. We use two codecs, JceAesCtrCryptoCodec and OpensslAesCtrCryptoCodec, to enable Spark encrypted shuffle; the same codecs are used in Hadoop encrypted shuffle. JceAesCtrCryptoCodec uses the encryption algorithms the JDK provides, while OpensslAesCtrCryptoCodec uses the encryption algorithms OpenSSL provides.
      Because UGI credential info is used in the encrypted shuffle process, we enable encrypted shuffle on the Spark-on-YARN framework first.
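
      For illustration, here is a minimal, self-contained sketch of AES/CTR through the JDK's JCE provider, the primitive that JceAesCtrCryptoCodec wraps. This is illustrative only, not code from the patch; the object name is made up:

          import java.security.SecureRandom
          import javax.crypto.Cipher
          import javax.crypto.spec.{IvParameterSpec, SecretKeySpec}

          object JceAesCtrSketch {
            def main(args: Array[String]): Unit = {
              val random = new SecureRandom()
              val key = new Array[Byte](16) // 128-bit AES key
              val iv = new Array[Byte](16)  // 128-bit initial counter block
              random.nextBytes(key)
              random.nextBytes(iv)

              val enc = Cipher.getInstance("AES/CTR/NoPadding")
              enc.init(Cipher.ENCRYPT_MODE, new SecretKeySpec(key, "AES"), new IvParameterSpec(iv))
              val ciphertext = enc.doFinal("shuffle block".getBytes("UTF-8"))

              // CTR decryption regenerates the same keystream and XORs it back.
              val dec = Cipher.getInstance("AES/CTR/NoPadding")
              dec.init(Cipher.DECRYPT_MODE, new SecretKeySpec(key, "AES"), new IvParameterSpec(iv))
              println(new String(dec.doFinal(ciphertext), "UTF-8")) // prints: shuffle block
            }
          }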

        Issue Links

          Activity

          apachespark Apache Spark added a comment -

          User 'kellyzly' has created a pull request for this issue:
          https://github.com/apache/spark/pull/4491

          kellyzly liyunzhang added a comment -

          How to test the encrypted shuffle feature on the Spark-on-YARN framework:

          1. build (the pom.xml was modified): mvn package -Pyarn -Dyarn.version=2.6.0 -Phadoop-2.6 -Dhadoop.version=2.6.0 -Phive -DskipTests
          2. to enable encrypted shuffle, add the following to conf/spark-defaults.conf:
            spark.encrypted.shuffle true
          3. start the master and workers: sbin/start-all.sh
          4. run wordcount in yarn-cluster and yarn-client mode:
            1. ./bin/spark-submit --class org.apache.spark.examples.SparkPi --master yarn-cluster --num-executors 3 --driver-memory 1g --executor-memory 1g --executor-cores 1 wordcount.jar
            2. ./bin/spark-submit --class org.apache.spark.examples.SparkPi --master yarn-client --num-executors 3 --driver-memory 1g --executor-memory 1g --executor-cores 1 wordcount.jar
          WangTaoTheTonic WangTaoTheTonic added a comment -

          If we are testing this on YARN mode, why do we need the third step (starting the master and workers)?

          srowen Sean Owen added a comment -

          If this depends on Hadoop 2.6, it can't be used in Spark anytime soon.

          kellyzly liyunzhang added a comment -

          Hi Sean Owen:
          Encrypted shuffle makes the shuffle process safer, and I think it is necessary in Spark. The previous design reused the Hadoop encrypted shuffle algorithm to enable Spark encrypted shuffle. That design has a big problem: it imports many crypto classes, such as CryptoInputStream and CryptoOutputStream, that are marked "private" in Hadoop. My teammates and I have now decided to write the crypto classes in Spark, so there is no dependency on Hadoop 2.6. We are not directly copying Hadoop code into Spark; we only carry over the crypto algorithms, like JCE/AES-NI, that Hadoop uses. Maybe I should rename the JIRA from "Reuse hadoop encrypted shuffle algorithm to enable spark encrypted shuffle" to "Add encrypted shuffle in spark". Any advice is welcome.

          kellyzly liyunzhang added a comment -

          Sorry to reply so late. If Spark runs in YARN mode, is there no need to start the master and workers? I'm a newbie to Spark; any guidance is welcome.

          kellyzly liyunzhang added a comment -

          Sean Owen, I have submitted a new design doc, "Design Document of Encrypted Spark Shuffle_20150318", and pushed the newest code to the pull request. This submission makes the following big changes:

          • Deleted the hadoop-2.6 profile. We no longer depend on Hadoop 2.6, because I added crypto classes such as CryptoInputStream.scala and CryptoOutputStream.scala to the org.apache.spark.crypto package in the core module.
          • AES is a specification for the encryption of electronic data; it has five common modes of operation, of which CTR is one. We use two codecs, JceAesCtrCryptoCodec and OpensslAesCtrCryptoCodec, to enable Spark encrypted shuffle; the same codecs are used in Hadoop encrypted shuffle. JceAesCtrCryptoCodec uses the encryption algorithms the JDK provides, while OpensslAesCtrCryptoCodec uses the encryption algorithms OpenSSL provides. The current code only implements JceAesCtrCryptoCodec; OpensslAesCtrCryptoCodec will be implemented later.

          How to test?

          • download the code from https://github.com/kellyzly/spark
          • build: mvn package -Pyarn -Dyarn.version=2.6.0 -Phadoop-2.4 -Dhadoop.version=2.6.0 -Phive -DskipTests
          • to enable encrypted shuffle, add the following to conf/spark-defaults.conf:
            spark.encrypted.shuffle true
            spark.job.encrypted-intermediate-data true
            spark.security.crypto.cipher.suite AES/CTR/NoPadding
            spark.security.crypto.codec.classes.aes.ctr.nopadding org.apache.spark.crypto.JceAesCtrCryptoCodec
          • start the master and workers: sbin/start-all.sh
          • edit the SparkPi source code into a word count, then run wordcount:
            • ./bin/spark-submit --class org.apache.spark.examples.SparkPi --master yarn-cluster --num-executors 3 --driver-memory 1g --executor-memory 1g --executor-cores 1 examples/target/my.spark-examples_2.10-1.3.0-SNAPSHOT.jar
            • ./bin/spark-submit --class org.apache.spark.examples.SparkPi --master yarn-client --num-executors 3 --driver-memory 1g --executor-memory 1g --executor-cores 1 examples/target/my.spark-examples_2.10-1.3.0-SNAPSHOT.jar
          kellyzly liyunzhang added a comment - edited

          Hi all:
          There are two ways to avoid using the crypto classes, like CryptoInputStream.java, provided in Hadoop 2.6:

          • Isolate code like CryptoInputStream/CryptoOutputStream from the Hadoop source into a separate library, publish it to the Maven repository, and let other projects depend on it.
          • Write CryptoInputStream/CryptoOutputStream and the rest directly in the Spark code.

          Each method has its advantages and disadvantages:

          • Method 1:
            Disadvantage: it needs the Hadoop or Spark community to review the code in the separate library. Only after all the code has been reviewed and the library published to the Maven repository can we introduce it into Spark. That may take a long time.
            Advantage: with the review of the Hadoop or Spark community, we can be confident in the quality of the code. If fixes to the crypto classes are made, someone updates the separate library and we only bump the Maven dependency in Spark.
          • Method 2:
            Disadvantage: we need to keep an eye on later fixes to the crypto classes in future Hadoop releases. If something changes, we need to update the Scala code.
            Advantage: no dependency on another library, and it is convenient for us to make changes if they are really needed in Spark.

          My teammate is working on method 1. For method 2, the code in the pull request is finished and waiting for review. Can anyone give me some advice?

          kellyzly liyunzhang added a comment -

          Hi all, I have a question: is there any API in Spark like getInstance(className: String): AnyRef? I looked at org.apache.spark.sql.hive.thriftserver.ReflectionUtils.scala, but it does not provide a getInstance function.
          For now I wrote a function, org.apache.spark.crypto.CryptoCodec#getInstance: in my getInstance(className: String) I check the class name against "JceAesCtrCryptoCodec" and "OpensslAesCtrCryptoCodec", and if the name equals "JceAesCtrCryptoCodec" I create the instance via the scala.reflect.runtime.universe API. The code could be written more generically, as below, but I do not know how to complete it. If someone knows, please tell me.

             def getInstance1(className: String): AnyRef = {
               val m = universe.runtimeMirror(getClass.getClassLoader)
               val classLoader: ClassLoader = Thread.currentThread.getContextClassLoader
               val aClass: Class[_] = Class.forName(className, true, classLoader)
               val aType: scala.reflect.api.TypeTags.TypeTag = // how to write this line?
               val classCryptoCodec = universe.typeOf[aType].typeSymbol.asClass
               val cm = m.reflectClass(classCryptoCodec)
               val ctor = universe.typeOf[aType].declaration(universe.nme.CONSTRUCTOR).asMethod
               val ctorm = cm.reflectConstructor(ctor)
               val p = ctorm()
               p
             }
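
          A simpler route, sketched below under the assumption that each codec class has a public no-arg constructor, is to skip scala.reflect entirely and use plain Java reflection. This is only an illustrative sketch, not code from the pull request:

             def getInstance(className: String): AnyRef = {
               // Prefer the context classloader, falling back to this class's loader.
               val classLoader = Option(Thread.currentThread.getContextClassLoader)
                 .getOrElse(getClass.getClassLoader)
               val clazz = Class.forName(className, true, classLoader)
               // Instantiate via the public no-arg constructor.
               clazz.getConstructor().newInstance().asInstanceOf[AnyRef]
             }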
          
          apachespark Apache Spark added a comment -

          User 'kellyzly' has created a pull request for this issue:
          https://github.com/apache/spark/pull/5307

          kellyzly liyunzhang added a comment - edited

          Hi all:
          There are now two ways to implement SPARK-5682 (Add encrypted shuffle in spark).
          Method 1: use Chimera (a project that strips the code related to CryptoInputStream/CryptoOutputStream out of Hadoop to facilitate AES-NI based data encryption in other projects) to implement Spark encrypted shuffle. Pull request: https://github.com/apache/spark/pull/5307.
          Method 2: add a crypto package to the spark-core module, containing CryptoInputStream.scala, CryptoOutputStream.scala, and so on. Pull request: https://github.com/apache/spark/pull/4491.
          The latest design doc, "Design Document of Encrypted Spark Shuffle_20150402", has been submitted.
          Which one is better? Any advice/guidance is welcome!

          kellyzly liyunzhang added a comment -

          Design Document of Encrypted Spark Shuffle_20150506.docx is the latest design doc.

          kellyzly liyunzhang added a comment -

          hujiayin: thanks for your comment.

              The solution relied on hadoop API and maybe downgrade the performance.

          Regarding the reliance on the Hadoop API: you mean that I use org.apache.hadoop.io.Text in CommonConfigurationKeys. I see this differently. The declaration of org.apache.hadoop.io.Text is:

              @Stringable
              @InterfaceAudience.Public
              @InterfaceStability.Stable
              public class Text extends BinaryComparable

          This shows that org.apache.hadoop.io.Text is stable, which means the interfaces it provides will not change much in later releases.

          Regarding the performance downgrade: do you have any test results that show this?

          kellyzly liyunzhang added a comment -

          hujiayin: thanks for your comment.

          This feature is not based on Hadoop 2.6; only the original design was. The latest design doc (20150506) shows the two ways we now have to implement encrypted shuffle in Spark; currently we implement it only on the Spark-on-YARN framework. One way is based on Chimera, a project that strips the code related to CryptoInputStream/CryptoOutputStream out of Hadoop to facilitate AES-NI based data encryption in other projects (see https://github.com/apache/spark/pull/5307). The other way implements all the crypto classes, like CryptoInputStream/CryptoOutputStream, in Scala under the core/src/main/scala/org/apache/spark/crypto/ package (see https://github.com/apache/spark/pull/4491).

          As for importing a Hadoop API into Spark: if the interface of a Hadoop class is public and stable, it can be used in Spark. https://hadoop.apache.org/docs/current/api/org/apache/hadoop/classification/InterfaceStability.html says:

              Incompatible changes must not be made to classes marked as stable.

          which means that once a class is marked stable, later releases will not change it incompatibly.

          kellyzly liyunzhang added a comment -

          hujiayin:

              the AES solution is a bit heavy to encode/decode the live steaming data.

          Is there another solution for encrypting/decrypting live streaming data? Please share your suggestion with us.

          yoderme Mike Yoder added a comment -

          One quick question about AES/CTR. This cipher mode has many nice properties, but is only safe to use when the key/IV pair are never reused. What assurances do you have that the key/IV aren't reused in your scheme? (I skimmed the doc, but didn't see an obvious answer; please forgive me if the answer was in there.)

          Ferd Ferdinand Xu added a comment -

          Thank you for your question. The key is generated by a key generator instantiated for the specified keygen algorithm; that part of the work is in the method CryptoConf#initSparkShuffleCredentials. More detailed information is available in the PR (https://github.com/apache/spark/pull/8880). For the IV part, we are using Chimera (https://github.com/intel-hadoop/chimera) as an external library in the latest PR (https://github.com/intel-hadoop/chimera/blob/master/src/main/java/com/intel/chimera/JceAesCtrCryptoCodec.java#L70 and https://github.com/intel-hadoop/chimera/blob/master/src/main/java/com/intel/chimera/OpensslAesCtrCryptoCodec.java#L81). You can also dig into the code for how the IV is calculated from the counter and the initial IV (https://github.com/intel-hadoop/chimera/blob/master/src/main/java/com/intel/chimera/AesCtrCryptoCodec.java#L42). The initial IV is generated by a secure random source.
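
          For reference, here is a rough sketch of that derivation, modeled on the Hadoop-style AesCtrCryptoCodec logic the links above point to (the method name and signature here are illustrative, not the exact Chimera API): the 16-byte IV is the random initial IV plus the stream's block counter, added as a big-endian integer with carry, so each key/IV pair is used for only one position in one stream.

             def calculateIV(initIV: Array[Byte], counter: Long, iv: Array[Byte]): Unit = {
               var i = iv.length // 16 bytes for AES
               var j = 0         // the counter occupies the low 8 bytes
               var sum = 0
               var ctr = counter
               while (i > 0) {
                 i -= 1
                 // Each byte = initIV byte + counter byte + carry from the byte below.
                 sum = (initIV(i) & 0xff) + (sum >>> 8)
                 if (j < 8) {
                   sum += (ctr & 0xff).toInt
                   ctr >>>= 8
                   j += 1
                 }
                 iv(i) = sum.toByte
               }
             }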

          apachespark Apache Spark added a comment -

          User 'winningsix' has created a pull request for this issue:
          https://github.com/apache/spark/pull/8880

          krish.dey Krish Dey added a comment - edited

          The constructor still seems to be unchanged. Doesn't it need to change to accommodate encryption of spills to disk? Moreover, rather than passing DummySerializerInstance, it should be possible to pass any Serializer.

          public UnsafeSorterSpillWriter(
              BlockManager blockManager,
              int fileBufferSize,
              ShuffleWriteMetrics writeMetrics,
              int numRecordsToWrite) throws IOException {
            final Tuple2<TempLocalBlockId, File> spilledFileInfo =
                blockManager.diskBlockManager().createTempLocalBlock();
            this.file = spilledFileInfo._2();
            this.blockId = spilledFileInfo._1();
            this.numRecordsToWrite = numRecordsToWrite;
            // Unfortunately, we need a serializer instance in order to construct a DiskBlockObjectWriter.
            // Our write path doesn't actually use this serializer (since we end up calling the `write()`
            // OutputStream methods), but DiskBlockObjectWriter still calls some methods on it. To work
            // around this, we pass a dummy no-op serializer.
            writer = blockManager.getDiskWriter(
                blockId, file, DummySerializerInstance.INSTANCE, fileBufferSize, writeMetrics);
            // Write the number of records
            writeIntToBuffer(numRecordsToWrite, 0);
            writer.write(writeBuffer, 0, 4);
          }

            People

            • Assignee: Ferd Ferdinand Xu
            • Reporter: kellyzly liyunzhang
            • Votes: 0
            • Watchers: 30
