The JMH benchmarks 'testArrayStoreCheckRT1' and 'testArrayStoreCheckRT2' show a performance regression on SPARC when run with the OptoScheduling option enabled. On SPARC, running these benchmarks with OptoScheduling disabled gives around 4x the throughput of running them with OptoScheduling enabled. The code generated with OptoScheduling enabled interleaves 8 store word instructions with 8 store byte instructions, while the code generated with OptoScheduling disabled is not interleaved: it groups the store word instructions and the store byte instructions separately.
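For readers who want a feel for the kind of code involved, the following is a minimal sketch and not the actual JMH benchmark source. The class name ArrayStoreCheckSketch, the method fill, and the choice of 8 consecutive stores are assumptions made for illustration only. It shows a run of reference stores into an array whose element type is only known at run time, so each store is subject to an array store check.

```java
// Hypothetical illustration only; this is NOT the source of
// testArrayStoreCheckRT1 / testArrayStoreCheckRT2.
public class ArrayStoreCheckSketch {

    // Stores the same reference into 8 consecutive slots. The static type
    // of dst is Object[], but its runtime element type may be narrower,
    // so the JVM must verify each store (array store check) and throws
    // ArrayStoreException on a mismatch.
    static int fill(Object[] dst, Object value) {
        dst[0] = value;
        dst[1] = value;
        dst[2] = value;
        dst[3] = value;
        dst[4] = value;
        dst[5] = value;
        dst[6] = value;
        dst[7] = value;
        return dst.length;
    }

    public static void main(String[] args) {
        // Runtime element type is String, static type is Object[].
        Object[] dst = new String[8];
        long total = 0;
        // Repeat enough times for the JIT compiler to compile fill().
        for (int i = 0; i < 1_000_000; i++) {
            total += fill(dst, "x");
        }
        System.out.println(total);
    }
}
```

Assuming a HotSpot build that includes the C2 compiler, the two configurations can be compared by running the class once with -XX:+OptoScheduling and once with -XX:-OptoScheduling. A hand-rolled loop like this is not a substitute for the JMH measurements quoted above.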