Replies: 1 comment 1 reply
-
Maybe something like this could be used: https://crates.io/crates/multiversion |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
@julianbraha I saw you made the original PR for using
mul_add
to speed up float operations in chunk gen. Those we ultimately reverted because that function was extremely slow on non-FMA cpus.I was wondering if you had access to a FMA cpu and could run benchmarks to see if it would be worth adding a compile time method to use
mul_add
orx * a + b
based on whether the CPU supports.Beta Was this translation helpful? Give feedback.
All reactions