track exact number of bytes moved via MPI
Right now we only track the total number of bytes read and written at the posix level. This may be different than the number of bytes read and written at the MPI level due to various MPI-IO optimizations.
For files opened via MPI, we could get more accurate performance estimates by using the bytes transferred at the MPI level rather than at the POSIX level.