Loading...

XML

Word

Printable

Type: Enhancement
Resolution: Fixed
Priority: P4
Fix Version/s: 15
Affects Version/s: None
Component/s: core-libs
Labels:
- performance
- startup

Subcomponent:
java.util.jar
Resolved In Build:
b21

Current implementation of ZipFile.getEntryPos takes the encoded byte[] of the String we're looking up. Which means when looking up entries across multiple jar files, we allocate the byte[] over and over for each jar file searched.

By refactoring the ZipFile hash table to save a String-normalized hash value rather than a hash of the encoded value, we can avoid this allocation most of the time when the entry is not found in the jar/zip, while also getting the benefit of the caching of String.hashCode.

For jar files or any zip using UTF-8 encoding, calculating such hashes during open (initCEN) is easy to get as fast for ASCII entries. We could make the cost negligible for non-ASCII.

Experimental patch: http://cr.openjdk.java.net/~redestad/scratch/zipfile_string_hash.01

Instrumenting cost of ZipFile.getEntry shows a ~120ms improvement on Spring PetClinic, roughly 2.5-3% of total.

causes

JDK-8347712 IllegalStateException on multithreaded ZipFile access with non-UTF8 charset

Resolved

relates to

JDK-7009069 ZipFile.getEntry(String name) returns null because it does NOT respect the "language encoding flag"

Open

JDK-8260010 UTF8ZipCoder not thread-safe since JDK-8243469

Closed

Assignee:: Claes Redestad

Reporter:: Claes Redestad

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Created:: 2020-04-23 03:10

Updated:: 2025-03-11 07:02

Resolved:: 2020-04-27 08:31

Details

Description

Attachments

Issue Links

Activity

People

Dates