Skip to content

Commit

Permalink
Release notes for 3.3.1-beta1
Browse files Browse the repository at this point in the history
  • Loading branch information
matteo-frigo committed Aug 21, 2011
1 parent 7079b52 commit e1b527d
Show file tree
Hide file tree
Showing 4 changed files with 26 additions and 11 deletions.
4 changes: 4 additions & 0 deletions NEWS
Original file line number Diff line number Diff line change
@@ -1,5 +1,9 @@
FFTW 3.3.1

* Added support for the NEON extensions to the ARM ISA. (Note to beta
users: an ARM cycle counter is not yet implemented; please contact
[email protected] if you know how to do it right.)

* MPI code now compiles even if mpicc is a C++ compiler; thanks to
Kyle Spyksma for the bug report.

Expand Down
2 changes: 1 addition & 1 deletion configure.ac
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
dnl Process this file with autoconf to produce a configure script.
AC_INIT(fftw, 3.3.1, [email protected])
AC_INIT(fftw, 3.3.1-beta1, [email protected])
AC_CONFIG_SRCDIR(kernel/ifftw.h)
# fftw-3.1.x was 4:X:1
# fftw-3.2.x was 5:X:2
Expand Down
26 changes: 18 additions & 8 deletions doc/install.texi
Original file line number Diff line number Diff line change
Expand Up @@ -190,14 +190,14 @@ of the time). @xref{Cycle Counters}.

@item
@code{--enable-sse}, @code{--enable-sse2}, @code{--enable-avx},
@code{--enable-altivec}: Enable the compilation of SIMD code for SSE
(Pentium III+), SSE2 (Pentium IV+), AVX (Sandy Bridge, Interlagos),
AltiVec (PowerPC G4+). SSE and AltiVec only work with
@code{--enable-float} (above). SSE2 works in both single and double
precision (and is simply SSE in single precision). The resulting code
will @emph{still work} on earlier CPUs lacking the SIMD extensions
(SIMD is automatically disabled, although the FFTW library is still
larger).
@code{--enable-altivec}, @code{--enable-neon}: Enable the compilation of
SIMD code for SSE (Pentium III+), SSE2 (Pentium IV+), AVX (Sandy Bridge,
Interlagos), AltiVec (PowerPC G4+), NEON (some ARM processors). SSE,
AltiVec, and NEON only work with @code{--enable-float} (above). SSE2
works in both single and double precision (and is simply SSE in single
precision). The resulting code will @emph{still work} on earlier CPUs
lacking the SIMD extensions (SIMD is automatically disabled, although
the FFTW library is still larger).
@itemize @minus
@item
These options require a compiler supporting SIMD extensions, and
Expand All @@ -214,6 +214,16 @@ properly aligns the stack when compiling any code that links to FFTW.
By default, @code{gcc} 2.95 and later versions align the stack as
needed, but you should not compile FFTW with the @code{-Os} option or the
@code{-mpreferred-stack-boundary} option with an argument less than 4.
@item
Because of the large variety of ARM processors and ABIs, FFTW
does not attempt to guess the correct @code{gcc} flags for generating
NEON code. In general, you will have to provide them on the command line.
This command line is known to have worked at least once:
@example
./configure --with-slow-timer --host=arm-linux-gnueabi \
--enable-single --enable-neon \
"CC=arm-linux-gnueabi-gcc -march=armv7-a -mfloat-abi=softfp"
@end example
@end itemize

@end itemize
Expand Down
5 changes: 3 additions & 2 deletions doc/other.texi
Original file line number Diff line number Diff line change
Expand Up @@ -15,8 +15,9 @@ SIMD, which stands for ``Single Instruction Multiple Data,'' is a set of
special operations supported by some processors to perform a single
operation on several numbers (usually 2 or 4) simultaneously. SIMD
floating-point instructions are available on several popular CPUs:
SSE/SSE2/AVX on recent x86/x86-64 processors, AltiVec (single precision) on some PowerPCs (Apple G4 and
higher), and MIPS Paired Single (currently only in FFTW 3.2.x). FFTW can be compiled to support the
SSE/SSE2/AVX on recent x86/x86-64 processors, AltiVec (single precision)
on some PowerPCs (Apple G4 and higher), NEON on some ARM models, and MIPS Paired Single
(currently only in FFTW 3.2.x). FFTW can be compiled to support the
SIMD instructions on any of these systems.
@cindex SIMD
@cindex SSE
Expand Down

0 comments on commit e1b527d

Please sign in to comment.