[ARM] : don't use NEON floating-point without -ffast-math (PR43703)

This patch fixes PR43703.  To sum up the problem, the autovectorizer currently 
generates NEON instructions for floating-point operations, but this is incorrect 
because the NEON floating-point unit doesn't fully support IEEE arithmetic.  In 
particular, denormalized numbers are rounded to zero, leading to precision loss 
in some cases.  So, the solution is, essentially, "don't do that".  ;-)  This 
patch adds checks for flag_unsafe_math_operations in the places where NEON 
instructions would currently otherwise be generated from canonical RTL.

To give credit where credit was due, this is really Julian's patch -- my 
contribution is only in refactoring it a bit to pull out the part that affected 
the patch I was already preparing to provide canonical RTL for some of the 
affected NEON instructions.  I posted it earlier this evening here:

http://gcc.gnu.org/ml/gcc-patches/2010-06/msg02101.html

I think the patch I'm posting here has some dependencies on the one I've linked 
to above, though.  I tested the two patches together on an arm-none-eabi build 
using a simulator target, with both NEON and non-NEON compilation options. 
Assuming the other one is approved, is this one OK to commit too?

-Sandra

2010-06-21  Julian Brown  <julian@codesourcery.com>
	    Sandra Loosemore <sandra@codesourcery.com>

	PR target/43703

	gcc/
	* config/arm/vec-common.md (add<mode>3, sub<mode>3, smin<mode>3)
	(smax<mode>3): Disable for NEON float modes when
	flag_unsafe_math_optimizations is false.
	* config/arm/neon.md (*add<mode>3_neon, *sub<mode>3_neon)
	(*mul<mode>3_neon)
	(mul<mode>3add<mode>_neon, mul<mode>3neg<mode>add<mode>_neon)
	(reduc_splus_<mode>, reduc_smin_<mode>, reduc_smax_<mode>): Disable
	for NEON float modes when flag_unsafe_math_optimizations is false.
	(quad_halves_<code>v4sf): Only enable if flag_unsafe_math_optimizations
	is true.
	* doc/invoke.texi (ARM Options): Add note about floating point
	vectorization requiring -funsafe-math-optimizations.

	gcc/testsuite/
	* gcc.dg/vect/vect.exp: Add -ffast-math for NEON.
	* gcc.dg/vect/vect-reduc-6.c: Add XFAIL for NEON.

[ARM] : don't use NEON floating-point without -ffast-math (PR43703)

Commit Message

Comments

Patch