-
Notifications
You must be signed in to change notification settings - Fork 91
Open
Labels
C-feature-requestCategory: a feature request, i.e. not implemented / a PRCategory: a feature request, i.e. not implemented / a PR
Description
.round()
function is very slow compared to platform-native intrinsic on AVX (https://godbolt.org/z/3sdd9jrvW) because it provides a platform-agnostic behavior. Although there are many use cases when the exact behavior on half-way values or INFs and NaNs doesn't matter.
I think adding somewhat like round_fast
function is reasonable.
Metadata
Metadata
Assignees
Labels
C-feature-requestCategory: a feature request, i.e. not implemented / a PRCategory: a feature request, i.e. not implemented / a PR