Message ID | 20231031200925.3297456-1-adhemerval.zanella@linaro.org |
---|---|
Series | x86: Improve ERMS usage on Zen3+ |
Adhemerval,

We added this to our todo list, and will get back shortly after verifying the patches.

-Sajan
On 15/11/23 16:05, sajan.karumanchi@gmail.com wrote:
> Adhemerval,
>
> We added this to our todo list, and will get back shortly after verifying the patches.
>
> -Sajan

Thanks Sajan, let me know if you need anything else. I only have access to a Zen3 core machine, so if you could also check the BZ30995 [1] it would be helpful (it is related to Zen4 performance for memcpy).

[1] https://sourceware.org/bugzilla/show_bug.cgi?id=30995
Adhemerval,

In our extensive testing, we observed mixed results for rep-movs/stos performance with the ERMS feature enabled.
Henceforth, we approve this patch to avoid the ERMS code path on AMD processors for better performance.

-Sajan
On 05/02/24 16:01, Sajan Karumanchi wrote:
> Adhemerval,
>
> In our extensive testing, we observed mixed results for rep-movs/stos performance with the ERMS feature enabled.
> Henceforth, we approve this patch to avoid the ERMS code path on AMD processors for better performance.
>
> -Sajan

Thanks for checking this out, Sajan. I will rebase with some wording fixes in the comments and double-check that everything is ok. If you can, please send an Acked-by or Reviewed-by in the next comment. I will also check with H.J. and Noah (x86 maintainers) to see if everything is ok.