Converting float to double

Question

How expensive is the conversion of a float to a double? Is it as trivial as an int to long conversion?

EDIT: I'm assuming a platform where where float is 4 bytes and double is 8 bytes

Did your profiler indicate that your program is getting bogged down converting floats to doubles? — Michael Kristofik
– Michael Kristofik, Commented Sep 14, 2009 at 14:09
Please, specify the platform. Is this Windows on x86 (Win32) or x64 (Win64)? Or PPC, or perhaps some embedded plarform? The question is not answerable without knowing the platform. — Suma
– Suma, Commented Sep 14, 2009 at 14:40

Suma · Accepted Answer · 2009-09-14 14:38:55Z

Platform considerations

This depends on platform used for float computation. With x87 FPU the conversion is free, as the register content is the same - the only price you may sometimes pay is the memory traffic, but in many cases there is even no traffic, as you can simply use the value without any conversion. x87 is actually a strange beast in this respect - it is hard to properly distinguish between floats and doubles on it, as the instructions and registers used are the same, what is different are load/store instructions and computation precision itself is controlled using status bits. Using mixed float/double computations may result in unexpected results (and there are compiler command line options to control exact behaviour and optimization strategies because of this).

When you use SSE (and sometimes Visual Studio uses SSE by default), it may be different, as you may need to transfer the value in the FPU registers or do something explicit to perform the conversion.

Memory savings performance

As a summary, and answering to your comment elsewhere: if you want to store results of floating computations into 32b storage, the result will be same speed or faster, because:

If you do this on x87, the conversion is free - the only difference will be fstp dword[] will be used instead of fstp qword[].
If you do this with SSE enabled, you may even see some performance gain, as some float computations can be done with SSE once the precision of the computation is only float insteead of default double.
In all cases the memory traffic is lower

Stephen Canon · Accepted Answer · 2009-09-14 14:48:55Z

Float to double conversions happen for free on some platforms (PPC, x86 if your compiler/runtime uses the "to hell with what type you told me to use, i'm going to evaluate everything in long double anyway, nyah nyah" evaluation mode).

On an x86 environment where floating-point evaluation is actually done in the specified type using SSE registers, conversions between float and double are about as expensive as a floating-point add or multiply (i.e., unlikely to be a performance consideration unless you're doing a lot of them).

In an embedded environment that lacks hardware floating-point, they can be somewhat costly.

Dharmateja · Accepted Answer · 2012-04-17 13:26:39Z

I can't imagine it'd be too much more complex. The big difference between converting int to long and converting float to double is that the int types have two components (sign and value) while floating point numbers have three components (sign, mantissa, and exponent).

IEEE 754 single precision is encoded in 32 bits using 1 bit for the sign, 8 bits for the exponent, and 23 bits for the significand. However, it uses a hidden bit, so the significand is 24 bits (p = 24), even though it is encoded using only 23 bits.

-- David Goldberg, What Every Computer Scientist Should Know About Floating-Point Arithmetic

So, converting between float and double is going to keep the same sign bit, set the last 23/24 bits of the float's mantissa to the double's mantissa, and set the last 8 bits of the float's exponent to the double's exponent.

This behavior may even be guaranteed by IEEE 754... I haven't checked it, so I'm not sure.

Vijay Mathew · Accepted Answer · 2009-09-14 14:15:44Z

0

This is specific to the C++ implementation you are using. In C++, the default floating-point type is double. A compiler should issue a warning for the following code:

float a = 3.45;

because the double value 3.45 is being assigned to a float. If you need to use float specifically, suffix the value with f:

float a = 3.45f;

The point is, all floating-point numbers are by default double. It's safe to stick to this default if you are not sure of the implementation details of your compiler and don't have significant understanding of floating point computation. Avoid the cast.

Also see section 4.5 of The C++ Programming Language.

answered Sep 14, 2009 at 14:15

Vijay Mathew

27.3k4 gold badges66 silver badges95 bronze badges

4 Comments

Tony the Pony Over a year ago

I need to store lots of floating point values and don't need double precision and want to cut the required memory in half.

Vijay Mathew Over a year ago

@Jen: As I said, this is not a language issue, but a compiler + floating-point arithmetic implementation issue. You need to look into your compiler + hardware manuals.

Justicle Over a year ago

This does not answer the question at all.

Vijay Mathew Over a year ago

@Justicle There is no ONE answer either! This is a platform specific issue.

vehomzzz · Accepted Answer · 2009-09-14 14:07:00Z

-1

probably a bit slower than converting int to long, as memory required is larger and manipulation is more complex. A good reference about memory alignment issues

answered Sep 14, 2009 at 14:07

vehomzzz

45.1k77 gold badges193 silver badges217 bronze badges

Comments

β.εηοιτ.βε · Accepted Answer · 2016-05-07 17:16:39Z

-1

Maybe this help:

#include <stdlib.h>
#include <stdio.h>
#include <conio.h>

double _ftod(float fValue)
{
  char czDummy[30];
  printf(czDummy,"%9.5f",fValue);
  double dValue = strtod(czDummy,NULL);
  return dValue;
}


int main(int argc, char* argv[])
{
  float fValue(250.84f);
  double dValue = _ftod(fValue);//good conversion
  double dValue2 = fValue;//wrong conversion
  printf("%f\n",dValue);//250.840000
  printf("%f\n",dValue2);//250.839996
  getch();
  return 0;
}

edited May 7, 2016 at 17:16

β.εηοιτ.βε

40.2k14 gold badges81 silver badges104 bronze badges

answered May 7, 2016 at 16:57

Mara Black

1,74918 silver badges24 bronze badges

1 Comment

β.εηοιτ.βε Over a year ago

At least explaining to the person asking the question why your snippet of code actually answer is question would make your answer somehow acceptable

Collectives™ on Stack Overflow

Converting float to double

6 Answers 6

Platform considerations

Memory savings performance

Comments

Comments

Comments

4 Comments

Comments

1 Comment

Hot Network Questions

Collectives™ on Stack Overflow

6 Answers 6

Platform considerations

Memory savings performance

Comments

Comments

Comments

4 Comments

Comments

1 Comment

Related