Best Square Root Method (Precision VS Speed) - CodeProject
-
Download SquareRoot_src.zip - 5.17 KB
Introduction
I enjoy Game Programmingwith Directx and I noticed that the most called method through out mostof my games is the standard sqrt method in the Math.hand this made me search for faster functions than the standard sqrt.And after some searching I found lots of functions that were much muchfaster but it's always a compromise between speed and precision. Themain purpose of this article is to help people choose the bestsquare-root method that suits their program.
Background
Inthis article I compare 12 different methods for computing the squareroot with the standard sqrt function as a reference,and for each method I show it's precision and speed compared to the sqrtmethod.
What this article is not about
- Explaining how each method works.
- New ways to compute the square root.
Using the code
The code is simple, it basicallycontains:
1. main.cpp
Calls all themethods and for each one of them it computes the speedand precision relative to the sqrt function.
2.SquareRootmethods.h
This Header contains theimplementation of the functions, and the reference of where I got themfrom.
First I calculate the Speed and Precision of the sqrtmethod which will be my reference.
For computing the Speed Imeasure the time it takes to call the sqrt function (M-1)times and I assign this value to RefSpeed
which will be myrefrence.
And for computing the Precision I add the currentresult to the previous result in RefTotalPrecision
everytime I call the method.
willbe my refrence. RefTotalPrecisi
on
For measuring runtime duration(Speed) of themethods I use the CDuration
class found on this Link, and iuse dur
as an instance of that class.
for(int j=0;jAndfor the other Methods I do the same calculations, but in the endi reference them to the sqrt.
Collapse CopyCodefor(int j=0;jNOTES:
- I Assume that the error in Precision whether larger or smaller than the reference is equal that's why i use "abs".
- The Speed is refrenced as the actual percentage, while the Precision is referenced a decrease percentage.
You can Modify the value of M
as you like,i initially assign it with 10000.
You can Modify AVG
as well, the higher it is the more accurate the results.
Collapse CopyCode#define M 10000#define AVG 10
Pointsof Interest
Precision wise the sqrt standard method is the best,But the other functions can be much faster even 5 times faster, i wouldpersonally choose Method N# 2 as it has high precision and high speedbut I'll leave it for you to choose :)
I took 5 samples andaveraged them and here is the output:
NOTE:The performance of these Methods depends highly on your processor andmay change from a computer to another.
The METHODS
Sqrt1
Reference:http://ilab.usc.edu/wiki/index.php/Fast_Square_Root
Algorithm:BabylonianMethod + some manipulations on IEEE 32 bit floating pointrepresentation
Collapse CopyCode
float sqrt1(const float x){union{int i;float x;} u;u.x = x;u.i = (1<<29) + (u.i >> 1) - (1<<22);// Two Babylonian Steps (simplified from:) // u.x = 0.5f * (u.x + x/u.x); // u.x = 0.5f * (u.x + x/u.x); u.x = u.x + x/u.x;u.x = 0.25f*u.x + x/u.x;return u.x;}
Sqrt2
Reference: http://ilab.usc.edu/wiki/index.php/Fast_Square_Root
Algorithm: The MagicNumber (Quake 3)
Collapse CopyCode#define SQRT_MAGIC_F 0x5f3759dffloat sqrt2(const float x){const float xhalf = 0.5f*x;union // get bits for floating value {float x;int i;} u;u.x = x;u.i = SQRT_MAGIC_F - (u.i >> 1); // gives initial guess y0 return x*u.x*(1.5f - xhalf*u.x*u.x);// Newton step, repeating increases accuracy}
Sqrt3
Reference: http://ilab.usc.edu/wiki/index.php/Fast_Square_Root
Algorithm: Logbase 2 approximation and Newton's Method
Collapse CopyCodefloat sqrt3(const float x){union{int i;float x;} u;u.x = x;u.i = (1<<29) + (u.i >> 1) - (1<<22);return u.x;}
Sqrt4
Reference: I Got it a long time a go from aforum and i forgot, please contact me if you know it's reference
Algorithm: BakhsaliApproximation
Collapse CopyCode
float sqrt4(const float m){int i=0;while( (i*i) <= m )i++;i--;float d = m - i*i;float p=d/(2*i);float a=i+p;return a-(p*p)/(2*a);}
Sqrt5
Reference: http://www.dreamincode.net/code/snippet244.htm
Algorithm: BabylonianMethod
Collapse CopyCode
float sqrt5(const float m){float i=0;float x1,x2;while( (i*i) <= m )i+=0.1f;x1=i;for(int j=0;j<10;j++){x2=m;x2/=x1;x2+=x1;x2/=2;x1=x2;}return x2;}
Sqrt6
Reference: http://www.azillionmonkeys.com/qed/sqroot.html#calcmeth
Algorithm: Dependanton IEEE representation and only works for 32 bits
Collapse CopyCodedouble sqrt6 (double y){double x, z, tempf;unsigned long *tfptr = ((unsigned long *)&tempf) + 1;tempf = y;*tfptr = (0xbfcdd90a - *tfptr)>>1;x = tempf;z = y*0.5;x = (1.5*x) - (x*x)*(x*z); //The more you make replicates of this statment the higher the //accuracy, here only 2 replicates are used x = (1.5*x) - (x*x)*(x*z);return x*y;}
Sqrt7
Reference: http://bits.stephan-brumme.com/squareRoot.html
Algorithm: Dependanton IEEE representation and only works for 32 bits
Collapse CopyCodefloat sqrt7(float x){unsigned int i = *(unsigned int*) &x;// adjust bias i += 127 << 23;// approximation of square root i >>= 1;return *(float*) &i;}
Sqrt8
Reference: http://forums.techarena.in/software-development/1290144.htm
Algorithm: Babylonian Method
Collapse CopyCode
double sqrt9( const double fg){double n = fg / 2.0;double lstX = 0.0;while(n != lstX){lstX = n;n = (n + fg/n) / 2.0;}return n;}
Sqrt9
Reference: http://www.functionx.com/cpp/examples/squareroot.htm
Algorithm: BabylonianMethod
Collapse CopyCode double Abs(double Nbr){if( Nbr >= 0 )return Nbr;elsereturn -Nbr;}double sqrt10(double Nbr){double Number = Nbr / 2;const double Tolerance = 1.0e-7;do{Number = (Number + Nbr / Number) / 2;}while( Abs(Number * Number - Nbr) > Tolerance);return Number;}
Sqrt10
Reference: http://www.cs.uni.edu/~jacobson/C++/newton.html
Algorithm: Newton's Approximation Method
Collapse CopyCodedouble sqrt11(const double number)e{const double ACCURACY=0.001;double lower, upper, guess;if (number < 1){lower = number;upper = 1;}else{lower = 1;upper = number;}while ((upper-lower) > ACCURACY){guess = (lower + upper)/2;if(guess*guess > number)upper =guess;elselower = guess;}return (lower + upper)/2;}
Sqrt11
Reference: http://www.drdobbs.com/184409869;jsessionid=AIDFL0EBECDYLQE1GHOSKH4ATMY32JVN
Algorithm: Newton'sApproximation Method
Collapse CopyCode double sqrt12( unsigned long N ){double n, p, low, high;if( 2 > N )return( N );low = 0;high = N;while( high > low + 1 ){n = (high + low) / 2;p = n * n;if( N < p )high = n;else if( N > p )low = n;elsebreak;}return( N == p ? n : low );}
Sqrt12
Reference: http://cjjscript.q8ieng.com/?p=32
Algorithm: Babylonian Method
Collapse CopyCode
double sqrt13( int n ){// double a = (eventually the main method will plug values into a)double a = (double) n;double x = 1;// For loop to get the square root value of the entered number.for( int i = 0; i < n; i++){x = 0.5 * ( x+a / x );}return x;}