An InputStreamReader using the UTF8 encoding produces an erroneous character
stream, as demonstrated by the following program.
--------
import java.io.*;
public class UTF8 {
public static void main(String[] args) {
try {
Reader in = new InputStreamReader(System.in, "UTF8");
char buf[] = new char[10];
int n;
while ((n = in.read(buf)) >= 0) {
System.out.println(new String(buf, 0, n));
}
} catch (Exception x) {
x.printStackTrace();
}
}
}
--------
% java UTF8 <UTF8.java
import ja
tic void m
new InputS
[] = new c
(buf)) >=
n));
race();
}
%
--------
The above command should cause the entire program to be printed.
stream, as demonstrated by the following program.
--------
import java.io.*;
public class UTF8 {
public static void main(String[] args) {
try {
Reader in = new InputStreamReader(System.in, "UTF8");
char buf[] = new char[10];
int n;
while ((n = in.read(buf)) >= 0) {
System.out.println(new String(buf, 0, n));
}
} catch (Exception x) {
x.printStackTrace();
}
}
}
--------
% java UTF8 <UTF8.java
import ja
tic void m
new InputS
[] = new c
(buf)) >=
n));
race();
}
%
--------
The above command should cause the entire program to be printed.
- relates to
-
JDK-4038630 UTF8 converter crashes with large data
-
- Closed
-