Fixes incorrect buffer sizes in the test. Adds clobbers making the test more robust. Adds no vzeroupper flag, necessary on some platforms that are adding the instruction.
Even with above, the test is not bullet proof and could be broken by an existing or future toolchain. Ideally, some parts should get rewritten in assembly.