math-neon

Contributors: 0	Discussions: 0	Code commits: 28

Overview

This project aims to implement high-performance approximations of higher-order math functions (trigonometric, exponential, power, etc.) in C and hand-optimized assembly. Floating-point performance on devices equipped with the ARM Cortex-A8 (OpenPandora, Beagle Board, iPhone 3GS, Palm Pre, etc.) depends heavily on using the pipelined NEON VFPU. By default, GCC targets the fully IEEE 754 compliant VFP-lite, which is an order of magnitude slower. Support for the NEON unit in GCC is limited, so hand-written assembly will be required.
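As a rough illustration of what an "approximation" of a math function can look like in C, here is a minimal sketch that evaluates a degree-7 Taylor polynomial for sine in Horner form. The function name approx_sinf and the choice of polynomial are assumptions made for this example; they are not taken from the math-neon sources, which may use different coefficients and range reduction.

```c
/* Hypothetical helper, not part of math-neon.
 * Approximates sin(x) for x in [-pi/2, pi/2] using the Taylor polynomial
 *   x - x^3/6 + x^5/120 - x^7/5040
 * evaluated in Horner form. No range reduction is performed, so accuracy
 * degrades quickly outside that interval. */
static float approx_sinf(float x)
{
    const float x2 = x * x;
    return x * (1.0f
         + x2 * (-1.0f / 6.0f
         + x2 * (1.0f / 120.0f
         + x2 * (-1.0f / 5040.0f))));
}
```

The body is straight-line code made only of multiplies and adds, with no branches and no integer work, which is the kind of code that is easy to keep on a pipelined floating-point unit.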

To determine whether this project was worthwhile, I investigated a cmath implementation as compiled by Code Sourcery 2009q1. I noticed the following problems:
1. It depends on many floating-point branches. Even simple statements like "if (x < 0) y = y;" generate branches. -> Requires a 20-cycle stall.
2. All integer work on floating-point data is done on the ARM core. -> Requires a 20-cycle stall.
3. Only VFP instructions are produced; there is no vectorization. -> Slower than NEON.
4. It is overly robust, e.g. floorf() uses a complex ARM-based routine instead of a VFP/NEON float->integer->float conversion. -> Very slow.
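Problems 1 and 4 can be sketched in portable C as follows. Both function names are hypothetical and written for this example only. The sign-flip case is an illustrative conditional, not the exact statement quoted above. Whether a given compiler actually emits branch-free VFP/NEON code for these functions is not guaranteed; the sketch only shows the source-level idea.

```c
#include <stdint.h>

/* Hypothetical helper, not part of math-neon.
 * floor() via a float -> integer -> float round trip.
 * Assumes x is finite and fits in int32_t (|x| < 2^31); NaN, infinities
 * and larger magnitudes are deliberately not handled, which is the
 * robustness traded away for speed. */
static float fast_floorf(float x)
{
    int32_t i = (int32_t)x;   /* conversion truncates toward zero */
    float t = (float)i;
    /* For negative non-integers truncation rounded up, so subtract 1.
     * The comparison evaluates to 0 or 1; no if statement is needed. */
    return t - (float)(t > x);
}

/* Hypothetical helper, not part of math-neon.
 * Returns -y when x < 0 and y otherwise, written as arithmetic on the
 * comparison result instead of a conditional statement. */
static float flip_sign_if_negative(float x, float y)
{
    return y * (1.0f - 2.0f * (float)(x < 0.0f));
}
```

For example, fast_floorf(-1.5f) truncates to -1.0f, detects that -1.0f is greater than the input, and returns -2.0f.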

Created: 2014-05-11 09:54

Project source: http://www.ohloh.net/p/math-neon

Tags: beagleboard, cmath, openpandora, mathh, asm, neon, fpu, cortex, arm
