reuse_distance and reuse_time trace analyzers need to compute per thread
The drcachesim trace analyzers for reuse_distance and reuse_time need to perform their reuse calculations on a per-thread basis, and then aggregate across threads, rather than considering the merged multi-thread memory access stream when counting reuses.