Description
Improve S3A error handling and reporting.
This includes:
- looking at error codes and translating to more specific exceptions
- better retry logic where present
- adding retry logic where not present
- more diagnostics in exceptions
- docs
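As an illustration of the first and fourth items, here is a minimal sketch of mapping a status code to a more specific exception while keeping diagnostics. It is not the actual S3A code: the class name `ErrorTranslationSketch`, the `translate` signature and the particular status-to-exception mapping are all assumptions made for the example, and only JDK exception types are used.

```java
import java.io.FileNotFoundException;
import java.io.IOException;
import java.nio.file.AccessDeniedException;

/** Hypothetical helper for illustration; not the S3A implementation. */
final class ErrorTranslationSketch {
  private ErrorTranslationSketch() {}

  /**
   * Map an HTTP status code to a more specific IOException.
   * The operation, path, status and original cause are kept as diagnostics.
   * The mapping below is an assumption chosen for the example.
   */
  static IOException translate(String operation, String path,
      int status, Exception cause) {
    String message = operation + " on " + path + ": status " + status
        + (cause != null ? ": " + cause : "");
    IOException translated;
    switch (status) {
      case 401:
      case 403:
        // authentication/authorization failure
        translated = new AccessDeniedException(path, null, message);
        break;
      case 404:
        // object or bucket not found
        translated = new FileNotFoundException(message);
        break;
      default:
        // anything unrecognised stays a generic IOException
        translated = new IOException(message);
    }
    if (cause != null) {
      // preserve the original exception for the stack trace
      translated.initCause(cause);
    }
    return translated;
  }
}
```

A caller would wrap each store operation and rethrow the translated exception, for example `throw ErrorTranslationSketch.translate("getFileStatus", path, status, e);`.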
Overall goals
- transient failures that are likely to clear on retry are retried for a limited time
- failures that will not clear on retry fail fast (302, no auth, unknown host, connection refused)
- meaningful exceptions are built in translateException
- diagnostics are included in exceptions where possible
- our troubleshooting docs are expanded with new failures we encounter
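The first two goals can be sketched as a bounded retry loop with a fail-fast check. This is only an illustration under stated assumptions: `RetrySketch`, `isRetryable` and `retry` are hypothetical names, the fixed sleep stands in for whatever backoff policy is chosen, and treating every other `IOException` as retryable is a simplification, not the policy S3A uses.

```java
import java.io.IOException;
import java.net.ConnectException;
import java.net.UnknownHostException;
import java.nio.file.AccessDeniedException;
import java.util.concurrent.Callable;

/** Hypothetical retry helper for illustration; not the S3A implementation. */
final class RetrySketch {
  private RetrySketch() {}

  /**
   * Assumed classification: unknown host, connection refused and
   * access denied never clear on retry; any other IOException might.
   */
  static boolean isRetryable(Exception e) {
    if (e instanceof UnknownHostException
        || e instanceof ConnectException
        || e instanceof AccessDeniedException) {
      return false;
    }
    return e instanceof IOException;
  }

  /** Run the operation, retrying retryable failures up to maxAttempts times. */
  static <T> T retry(int maxAttempts, long sleepMillis, Callable<T> operation)
      throws Exception {
    if (maxAttempts < 1) {
      throw new IllegalArgumentException("maxAttempts must be >= 1");
    }
    for (int attempt = 1; attempt <= maxAttempts; attempt++) {
      try {
        return operation.call();
      } catch (Exception e) {
        // fail fast on non-retryable errors, or when attempts are used up
        if (!isRetryable(e) || attempt == maxAttempts) {
          throw e;
        }
        Thread.sleep(sleepMillis);
      }
    }
    // not reachable: the loop always returns or throws
    throw new IllegalStateException("retry loop exited unexpectedly");
  }
}
```

Usage would look like `RetrySketch.retry(3, 500L, () -> readObject(path))`, where `readObject` is a placeholder for any store call.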
AWS S3 error codes: http://docs.aws.amazon.com/AmazonS3/latest/API/ErrorResponses.html
Issue Links
- depends upon
  - HADOOP-13811 s3a: getFileStatus fails with com.amazonaws.AmazonClientException: Failed to sanitize XML document destined for handler class (Resolved)
  - HADOOP-14012 Handled dynamo exceptions in translateException (Resolved)
- is depended upon by
  - HADOOP-13271 Intermittent failure of TestS3AContractRootDir.testListEmptyRootDirectory (Resolved)
  - HADOOP-14831 Über-jira: S3a phase IV: Hadoop 3.1 features (Resolved)
  - HADOOP-14825 Über-JIRA: S3Guard Phase II: Hadoop 3.1 features (Resolved)
- is duplicated by
  - HADOOP-14590 Extend and enhance S3A exception handling (Resolved)
- is related to
  - HADOOP-15076 Enhance S3A troubleshooting documents and add a performance document (Resolved)