I was under the impression that upstream implemented run-time detection
correctly. Do you know where binary differences are?
The following are supposed to be selected at run-time:
lib/matcher.cpp
lib/matcher_avx2.cpp
lib/matcher_avx512bw.cpp
Also is the following constraint not forcing the correct CPU so that the binary
result is always the same?
> #!BuildConstraint: hardware:cpu:flag x86-64-v3
Does the binary differ even when taking that into account?