[COMPRESS-542] Corrupt 7z allocates huge amount of SevenZEntries - ASF JIRA

Details

Type: Bug
Status: Resolved
Priority: Major
Resolution: Fixed
Affects Version/s: 1.20
Fix Version/s: 1.21
Component/s: None
Labels: None

Description

We ran into a problem where opening a corrupt 1.43 GB 7z file tried to allocate about 138 million SevenZArchiveEntry instances, which would use about 12 GB of memory. Sadly, I'm unable to share the file. If enough memory is available, the following exception is thrown:

java.io.IOException: Start header corrupt and unable to guess end Header
	at org.apache.commons.compress.archivers.sevenz.SevenZFile.tryToLocateEndHeader(SevenZFile.java:511)
	at org.apache.commons.compress.archivers.sevenz.SevenZFile.readHeaders(SevenZFile.java:470)
	at org.apache.commons.compress.archivers.sevenz.SevenZFile.<init>(SevenZFile.java:336)
	at org.apache.commons.compress.archivers.sevenz.SevenZFile.<init>(SevenZFile.java:128)
	at org.apache.commons.compress.archivers.sevenz.SevenZFile.<init>(SevenZFile.java:369)

7z itself aborts very quickly when I try to list the contents of the file:

7z l "corrupt.7z"

7-Zip 18.01 (x64) : Copyright (c) 1999-2018 Igor Pavlov : 2018-01-28

Scanning the drive for archives:
1 file, 1537752212 bytes (1467 MiB)

Listing archive: corrupt.7z

ERROR: corrupt.7z : corrupt.7z
Open ERROR: Can not open the file as [7z] archive

ERRORS:
Is not archive

Errors: 1

I hacked together the attached patch, which reduces the memory allocation to about 1 GB. Lazy instantiation of the entries could therefore be a good solution to the problem. Ideally, the entries would only be created once the headers have been parsed successfully.
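The two ideas in the paragraph above (create entries lazily, and only after the header looks plausible) can be sketched roughly as follows. This is a minimal illustration, not the attached patch and not the Commons Compress implementation: `Entry` and `LazyEntryTable` are hypothetical names, and the rule that every entry needs at least one byte of header data is an assumption made for the example.

```java
import java.io.IOException;

/** Hypothetical stand-in for SevenZArchiveEntry; not the Commons Compress class. */
final class Entry {
    final int index;

    Entry(int index) {
        this.index = index;
    }
}

/** Hypothetical table that validates the declared entry count and creates entries on demand. */
final class LazyEntryTable {
    private final Entry[] entries;
    private int instantiated;

    LazyEntryTable(long declaredCount, long headerBytesAvailable) throws IOException {
        // Assumption: each entry occupies at least one byte of header data,
        // so a count larger than the available header bytes indicates corruption.
        if (declaredCount < 0
                || declaredCount > headerBytesAvailable
                || declaredCount > Integer.MAX_VALUE) {
            throw new IOException("Corrupt header: implausible entry count " + declaredCount);
        }
        // Allocates only the reference slots; no Entry objects are created yet.
        entries = new Entry[(int) declaredCount];
    }

    /** Returns the entry at index i, creating it on first access. */
    Entry get(int i) {
        Entry e = entries[i];
        if (e == null) {
            e = new Entry(i);
            entries[i] = e;
            instantiated++;
        }
        return e;
    }

    int size() {
        return entries.length;
    }

    int instantiatedCount() {
        return instantiated;
    }
}
```

With a check like this, a header that declares 138 million entries but has only a few bytes of data behind it would be rejected before any entry objects are allocated.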

Attachments

Reduced_memory_allocation_for_corrupted_7z_archives.patch (4 kB, Robin Schimpf, 08/Jul/20 13:40)
endheadercorrupted2.7z (0.2 kB, A Kelday, 06/Aug/20 08:17)
endheadercorrupted.7z (0.2 kB, A Kelday, 06/Aug/20 08:17)

Issue Links

links to

#120

People

Assignee: Unassigned

Reporter: Robin Schimpf

Votes: 0

Watchers: 3

Dates

Created: 08/Jul/20 13:40

Updated: 13/Jul/21 04:27

Resolved: 27/Jun/21 20:10

Time Tracking

Estimated: Not Specified

Remaining:

Logged: 3h 10m