Also do you happen to have a perf report to see where the additional time is spent? My bet would be check_mm called on the mm drop path. If those machines have a lot of cpus then there is much more work to be done. We are saving a lot of atomic operations on the accounting side but the cpu iteration might turn out to be quite visible.