[v4,10/18] x86-64: Add vector atan2/atan2f implementation to libmvec

Implement vectorized atan2/atan2f containing SSE, AVX, AVX2 and
AVX512 versions for libmvec as per vector ABI.  It also contains
accuracy and ABI tests for vector atan2/atan2f with regenerated ulps.
---
 bits/libm-simd-decl-stubs.h                   |  11 +
 math/bits/mathcalls.h                         |   2 +-
 .../unix/sysv/linux/x86_64/libmvec.abilist    |   8 +
 sysdeps/x86/fpu/bits/math-vector.h            |   4 +
 .../x86/fpu/finclude/math-vector-fortran.h    |   4 +
 sysdeps/x86_64/fpu/Makeconfig                 |   1 +
 sysdeps/x86_64/fpu/Versions                   |   2 +
 sysdeps/x86_64/fpu/libm-test-ulps             |  20 +
 .../fpu/multiarch/svml_d_atan22_core-sse2.S   |  20 +
 .../x86_64/fpu/multiarch/svml_d_atan22_core.c |  28 ++
 .../fpu/multiarch/svml_d_atan22_core_sse4.S   | 471 +++++++++++++++++
 .../fpu/multiarch/svml_d_atan24_core-sse.S    |  20 +
 .../x86_64/fpu/multiarch/svml_d_atan24_core.c |  28 ++
 .../fpu/multiarch/svml_d_atan24_core_avx2.S   | 451 +++++++++++++++++
 .../fpu/multiarch/svml_d_atan28_core-avx2.S   |  20 +
 .../x86_64/fpu/multiarch/svml_d_atan28_core.c |  28 ++
 .../fpu/multiarch/svml_d_atan28_core_avx512.S | 475 ++++++++++++++++++
 .../fpu/multiarch/svml_s_atan2f16_core-avx2.S |  20 +
 .../fpu/multiarch/svml_s_atan2f16_core.c      |  28 ++
 .../multiarch/svml_s_atan2f16_core_avx512.S   | 399 +++++++++++++++
 .../fpu/multiarch/svml_s_atan2f4_core-sse2.S  |  20 +
 .../fpu/multiarch/svml_s_atan2f4_core.c       |  28 ++
 .../fpu/multiarch/svml_s_atan2f4_core_sse4.S  | 384 ++++++++++++++
 .../fpu/multiarch/svml_s_atan2f8_core-sse.S   |  20 +
 .../fpu/multiarch/svml_s_atan2f8_core.c       |  28 ++
 .../fpu/multiarch/svml_s_atan2f8_core_avx2.S  | 362 +++++++++++++
 sysdeps/x86_64/fpu/svml_d_atan22_core.S       |  29 ++
 sysdeps/x86_64/fpu/svml_d_atan24_core.S       |  29 ++
 sysdeps/x86_64/fpu/svml_d_atan24_core_avx.S   |  25 +
 sysdeps/x86_64/fpu/svml_d_atan28_core.S       |  25 +
 sysdeps/x86_64/fpu/svml_s_atan2f16_core.S     |  25 +
 sysdeps/x86_64/fpu/svml_s_atan2f4_core.S      |  29 ++
 sysdeps/x86_64/fpu/svml_s_atan2f8_core.S      |  29 ++
 sysdeps/x86_64/fpu/svml_s_atan2f8_core_avx.S  |  25 +
 .../fpu/test-double-libmvec-atan2-avx.c       |   1 +
 .../fpu/test-double-libmvec-atan2-avx2.c      |   1 +
 .../fpu/test-double-libmvec-atan2-avx512f.c   |   1 +
 .../x86_64/fpu/test-double-libmvec-atan2.c    |   3 +
 .../x86_64/fpu/test-double-vlen2-wrappers.c   |   1 +
 .../fpu/test-double-vlen4-avx2-wrappers.c     |   1 +
 .../x86_64/fpu/test-double-vlen4-wrappers.c   |   1 +
 .../x86_64/fpu/test-double-vlen8-wrappers.c   |   1 +
 .../fpu/test-float-libmvec-atan2f-avx.c       |   1 +
 .../fpu/test-float-libmvec-atan2f-avx2.c      |   1 +
 .../fpu/test-float-libmvec-atan2f-avx512f.c   |   1 +
 .../x86_64/fpu/test-float-libmvec-atan2f.c    |   3 +
 .../x86_64/fpu/test-float-vlen16-wrappers.c   |   1 +
 .../x86_64/fpu/test-float-vlen4-wrappers.c    |   1 +
 .../fpu/test-float-vlen8-avx2-wrappers.c      |   1 +
 .../x86_64/fpu/test-float-vlen8-wrappers.c    |   1 +
 50 files changed, 3117 insertions(+), 1 deletion(-)
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_atan22_core-sse2.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_atan22_core.c
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_atan22_core_sse4.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_atan24_core-sse.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_atan24_core.c
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_atan24_core_avx2.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_atan28_core-avx2.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_atan28_core.c
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_atan28_core_avx512.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_atan2f16_core-avx2.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_atan2f16_core.c
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_atan2f16_core_avx512.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_atan2f4_core-sse2.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_atan2f4_core.c
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_atan2f4_core_sse4.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_atan2f8_core-sse.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_atan2f8_core.c
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_atan2f8_core_avx2.S
 create mode 100644 sysdeps/x86_64/fpu/svml_d_atan22_core.S
 create mode 100644 sysdeps/x86_64/fpu/svml_d_atan24_core.S
 create mode 100644 sysdeps/x86_64/fpu/svml_d_atan24_core_avx.S
 create mode 100644 sysdeps/x86_64/fpu/svml_d_atan28_core.S
 create mode 100644 sysdeps/x86_64/fpu/svml_s_atan2f16_core.S
 create mode 100644 sysdeps/x86_64/fpu/svml_s_atan2f4_core.S
 create mode 100644 sysdeps/x86_64/fpu/svml_s_atan2f8_core.S
 create mode 100644 sysdeps/x86_64/fpu/svml_s_atan2f8_core_avx.S
 create mode 100644 sysdeps/x86_64/fpu/test-double-libmvec-atan2-avx.c
 create mode 100644 sysdeps/x86_64/fpu/test-double-libmvec-atan2-avx2.c
 create mode 100644 sysdeps/x86_64/fpu/test-double-libmvec-atan2-avx512f.c
 create mode 100644 sysdeps/x86_64/fpu/test-double-libmvec-atan2.c
 create mode 100644 sysdeps/x86_64/fpu/test-float-libmvec-atan2f-avx.c
 create mode 100644 sysdeps/x86_64/fpu/test-float-libmvec-atan2f-avx2.c
 create mode 100644 sysdeps/x86_64/fpu/test-float-libmvec-atan2f-avx512f.c
 create mode 100644 sysdeps/x86_64/fpu/test-float-libmvec-atan2f.c

Message ID	20211228201130.737370-11-skpgkp2@gmail.com
State	New
Headers	show Return-Path: <libc-alpha-bounces+incoming=patchwork.ozlabs.org@sourceware.org> DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org CF6A0385842A To: libc-alpha@sourceware.org Subject: [PATCH v4 10/18] x86-64: Add vector atan2/atan2f implementation to libmvec Date: Tue, 28 Dec 2021 12:11:22 -0800 Message-Id: <20211228201130.737370-11-skpgkp2@gmail.com> In-Reply-To: <20211228201130.737370-1-skpgkp2@gmail.com> References: <CAMe9rOrLdPcoaPxUA5oXqJPsfriPbbo=q4z7v5GQrKBq1LXARw@mail.gmail.com> <20211228201130.737370-1-skpgkp2@gmail.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: list From: Sunil K Pandey via Libc-alpha <libc-alpha@sourceware.org> Reply-To: Sunil K Pandey <skpgkp2@gmail.com> Cc: andrey.kolesov@intel.com, marius.cornea@intel.com Errors-To: libc-alpha-bounces+incoming=patchwork.ozlabs.org@sourceware.org Sender: "Libc-alpha" <libc-alpha-bounces+incoming=patchwork.ozlabs.org@sourceware.org>
Series	x86-64: Add vector math functions to libmvec \| expand [v4,00/18] x86-64: Add vector math functions to libmvec [v4,01/18] x86-64: Add vector atan/atanf implementation to libmvec [v4,02/18] x86-64: Add vector asin/asinf implementation to libmvec [v4,03/18] x86-64: Add vector hypot/hypotf implementation to libmvec [v4,04/18] x86-64: Add vector exp2/exp2f implementation to libmvec [v4,05/18] x86-64: Add vector exp10/exp10f implementation to libmvec [v4,06/18] x86-64: Add vector cosh/coshf implementation to libmvec [v4,07/18] x86-64: Add vector expm1/expm1f implementation to libmvec [v4,08/18] x86-64: Add vector sinh/sinhf implementation to libmvec [v4,09/18] x86-64: Add vector cbrt/cbrtf implementation to libmvec [v4,10/18] x86-64: Add vector atan2/atan2f implementation to libmvec [v4,11/18] x86-64: Add vector log10/log10f implementation to libmvec [v4,12/18] x86-64: Add vector log2/log2f implementation to libmvec [v4,13/18] x86-64: Add vector log1p/log1pf implementation to libmvec [v4,14/18] x86-64: Add vector atanh/atanhf implementation to libmvec [v4,15/18] x86-64: Add vector acosh/acoshf implementation to libmvec [v4,16/18] x86-64: Add vector erf/erff implementation to libmvec [v4,17/18] x86-64: Add vector tanh/tanhf implementation to libmvec [v4,18/18] x86-64: Add vector asinh/asinhf implementation to libmvec

[v4,10/18] x86-64: Add vector atan2/atan2f implementation to libmvec

Commit Message

Patch