Improve the various RegMask::is_bound method performance by avoiding Size() and adding a quick drain loop after finding the bit/pair/set we're looking for.
This optimize RegMask::is_bound by ~2x, and the other components are inlined in various places so the improvement adds up to about an 1% improvement on C2Compiler::initialize()
This optimize RegMask::is_bound by ~2x, and the other components are inlined in various places so the improvement adds up to about an 1% improvement on C2Compiler::initialize()