OT : Approximate / fast math libraries ?

Fri Aug 31 19:48:49 UTC 2007

Btw, I usually use -O3 rather than -O2. I agree with the other poster: I am not sure getting rid of precision is a great idea.

Trotter

----- Original Message ----
From: Chris Jones <jonesc at hep.phy.cam.ac.uk>
To: For users of Fedora <fedora-list at redhat.com>
Sent: Friday, August 31, 2007 2:41:56 PM
Subject: Re: OT : Approximate / fast math libraries ?

Hi,

Thanks for your feedback.

> What exactly is your need? Contact me off-list and maybe I can
> help. Have you profiled your code? I have found that people
> often do not actually know where their code is spending its
> time. I once sped up an app which was universally acknowledged
> to be slow "because it uses floating point." I sped it up 3x.

Yes, I have profiled the code, quite extensively, using the valgrind/calltree 
application. From this I know this that I'm know I've tidying up this to the 
point where its hard to find big improvements, the cpu time is fairly well 
spread around, not isolated in a few places. So am now looking a a few places 
where math calls are taking more time than I would hope. I'm not going to get 
factors in speed in the overall application, but I hope in a few places 
things can be improvemed a lot locally.

Also, the project is not small, massive in fact, and I'm only writting one 
small part. If you are interested you can find it here

http://lhcb-release-area.web.cern.ch/LHCb-release-area/DOC/brunel/releases/latest/doxygen/index.html

It also has to be supported on a *lot* of hardware. Basically gcc 3.2.3 based 
Scientific Linux 3 machines, gcc 3.4.6 SL4 machines (32 and 64 bit) and (not 
my decision), windows VC 7.1. I cannot rely on for instance SSE math calls 
etc.

Taking an example from another thread, one place I'm trying to understand is 
where I use atan2 see 

http://www.hep.phy.cam.ac.uk/~jonesc/atan2.png

for the profiler output. atan2 is taking 50% of the time of this method. Not 
here I don't need that much precision on the result - say +- O(2*pi/100). 
Anything you can suggest here - The code is here

http://www.hep.phy.cam.ac.uk/~jonesc/RichPhotonRecoUsingCKEstiFromRadius.cpp

( note though its full of internal classes etc...)

cheers Chris

>
> I modified the parsing routines it used, not the floating point.
>
> Mike
> --
> p="p=%c%s%c;main(){printf(p,34,p,34);}";main(){printf(p,34,p,34);}
> Oppose globalization and One World Governments like the UN.
> This message made from 100% recycled bits.
> You have found the bank of Larn.
> I can explain it for you, but I can't understand it for you.
> I speak only for myself, and I am unanimous in that!

-- 
fedora-list mailing list
fedora-list at redhat.com
To unsubscribe: https://www.redhat.com/mailman/listinfo/fedora-list

____________________________________________________________________________________
Pinpoint customers who are looking for what you sell. 
http://searchmarketing.yahoo.com/