Comment # 2 on bug 1209102 from Michal Hocko

Also do you happen to have a perf report to see where the additional time is
spent? My bet would be check_mm called on the mm drop path. If those machines
have a lot of cpus then there is much more work to be done.

We are saving a lot of atomic operations on the accounting side but the cpu
iteration might turn out to be quite visible.