Loading...

XML

Word

Printable

Type: Bug
Resolution: Duplicate
Priority: P3
Fix Version/s: None
Affects Version/s: 1.2.2_08
Component/s: core-libs
Labels:
None

Subcomponent:
java.net
CPU:

sparc
OS:

solaris_2.6

javadocs say 'All other characters are converted into the 3-character string
%xy, where xy is the two-digit hexadecimal representation of the lower
8-bits of the character.'

Problem was seen trying to round trip Arabic text on a Sol8 system with a
default LANG of 'ar' (8859-6) through UTF-8 to URLEncoded and then all the
way back. The URLEncode decode was mangling the encoded text due to the
interaction with the non-english default charset.

But the code is:

OutputStreamWriter writer = new OutputStreamWriter(buf);

which uses the default encoding of the system property file.encoding to do
the encoding. If the char needs encoding, a sequence of bytes is written to
the outputstream rather than just the low 8 bits. The use of the writer is also unnecessary. See suggested fix for alternate code without the writer.

Will be filing the related bug in URLDecoder.decode() for related problem.

Seen in 1.2.2 and in 1.3.

duplicates

JDK-4257115 URLEncoder and URLDecoder should support target character sets

Resolved

relates to

JDK-4488606 URLDecoder.decode() should not do charset conversion

Closed

Assignee:: Michael McMahon

Reporter:: J. Duke

Votes:: 0 Vote for this issue

Watchers:: 0 Start watching this issue

Created:: 2001-08-06 11:19

Updated:: 2001-08-07 05:54

Resolved:: 2001-08-07 05:54

Imported:: 16/Sep/12 12:27 AM

Indexed:: 17/Jul/12 8:35 PM

Details

Description

Attachments

Issue Links

Activity

People

Dates