[drmemtrace] avoid thread switch gaps after branches
For a tool to infer branch target information we want the final trace to have the subsequent instruction for that thread immediately after the branch. However, for offline traces, we merge thread buffers at buffer dump points, which are often right after branches. We could just tweak the dump point to avoid this problem.