CUDA Math API
‣ Integer Intrinsics
‣ Type Casting Intrinsics
‣ SIMD Intrinsics
Parameters
a
- half. Is only being read.
Returns
half
Description
Calculates the absolute value of input half number and returns the result.
Description
Performs half addition of inputs a and b, in round-to-nearest-even mode.
Description
Performs half addition of inputs a and b, in round-to-nearest-even mode. Prevents floating-
point contractions of mul+add into fma.
Parameters
a
- half. Is only being read.
b
- half. Is only being read.
Returns
half
Description
Performs half add of inputs a and b, in round-to-nearest-even mode, and clamps the result
to range [0.0, 1.0]. NaN results are flushed to +0.0.
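The saturating behaviour described above can be sketched on the host as a small reference model. This is only an illustration, not the intrinsic itself: the name add_sat_model is hypothetical, and float stands in for half, so the half-precision rounding step is not reproduced.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical host-side model, not a CUDA intrinsic. float stands in for
// half, so only the clamping and the NaN handling are illustrated.
inline float add_sat_model(float a, float b) {
    const float sum = a + b;
    if (std::isnan(sum)) return 0.0f;  // NaN results are flushed to +0.0
    if (sum < 0.0f) return 0.0f;       // clamp to the lower bound
    if (sum > 1.0f) return 1.0f;       // clamp to the upper bound
    return sum;
}

// Small self-check of the cases named in the description.
inline void check_add_sat_model() {
    assert(add_sat_model(0.25f, 0.5f) == 0.75f);  // in range: unchanged
    assert(add_sat_model(0.75f, 0.75f) == 1.0f);  // above range: clamped
    assert(add_sat_model(-2.0f, 0.5f) == 0.0f);   // below range: clamped
    assert(add_sat_model(NAN, 1.0f) == 0.0f);     // NaN: flushed to +0.0
}
```

The same clamp-and-flush step applies to the saturating multiply, subtract and fused multiply-add variants described in this section; only the arithmetic before the clamp differs.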
Description
Divides half input a by input b in round-to-nearest mode.
Description
Performs half multiply on inputs a and b, then performs a half add of the result with c,
rounding the result once in round-to-nearest-even mode.
Parameters
a
- half. Is only being read.
b
- half. Is only being read.
c
- half. Is only being read.
Returns
half
Description
Performs half multiply on inputs a and b, then performs a half add of the result with c,
rounding the result once in round-to-nearest-even mode. A negative result is then clamped to
0, and a NaN result is converted to canonical NaN.
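As a rough illustration of the relu variant, the sketch below models the fused multiply-add followed by the clamp of negative results. It is a host-side approximation under stated assumptions: fma_relu_model is a hypothetical name, float stands in for half, and the quiet NaN returned by the standard library is used in place of the device's canonical NaN.

```cpp
#include <cassert>
#include <cmath>
#include <limits>

// Hypothetical host-side model, not a CUDA intrinsic.
inline float fma_relu_model(float a, float b, float c) {
    const float r = std::fma(a, b, c);  // a*b + c with a single rounding
    if (std::isnan(r)) {
        // Stand-in for "canonical NaN"; the device bit pattern is not modelled.
        return std::numeric_limits<float>::quiet_NaN();
    }
    return r < 0.0f ? 0.0f : r;         // negative results are clamped to 0
}

// Small self-check of the three cases.
inline void check_fma_relu_model() {
    assert(fma_relu_model(2.0f, 3.0f, 1.0f) == 7.0f);     // positive: unchanged
    assert(fma_relu_model(2.0f, 3.0f, -10.0f) == 0.0f);   // negative: clamped
    assert(std::isnan(fma_relu_model(NAN, 1.0f, 1.0f)));  // NaN stays NaN
}
```

Unlike the saturating variant, this form has no upper bound and does not flush NaN to zero.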
Parameters
a
- half. Is only being read.
b
- half. Is only being read.
c
- half. Is only being read.
Returns
half
Description
Performs half multiply on inputs a and b, then performs a half add of the result with c,
rounding the result once in round-to-nearest-even mode, and clamps the result to range [0.0,
1.0]. NaN results are flushed to +0.0.
Description
Performs half multiplication of inputs a and b, in round-to-nearest mode.
Description
Performs half multiplication of inputs a and b, in round-to-nearest mode. Prevents floating-
point contractions of mul+add or sub into fma.
Parameters
a
- half. Is only being read.
b
- half. Is only being read.
Returns
half
Description
Performs half multiplication of inputs a and b, in round-to-nearest mode, and clamps the
result to range [0.0, 1.0]. NaN results are flushed to +0.0.
Description
Negates input half number and returns the result.
Description
Subtracts half input b from input a in round-to-nearest mode.
Description
Subtracts half input b from input a in round-to-nearest mode. Prevents floating-point
contractions of mul+sub into fma.
Parameters
a
- half. Is only being read.
b
- half. Is only being read.
Returns
half
Description
Subtracts half input b from input a in round-to-nearest mode, and clamps the result to
range [0.0, 1.0]. NaN results are flushed to +0.0.
Parameters
address
- half*. An address in global or shared memory.
val
- half. The value to be added.
Returns
half
Description
The location of address must be in global or shared memory. This operation has undefined
behavior otherwise. This operation is only supported by devices of compute capability 7.x and
higher.
Note:
For more details on this function, see the Atomic Functions section in the CUDA C++
Programming Guide.
Description
Divides half2 input vector a by input vector b in round-to-nearest mode.
Parameters
a
- half2. Is only being read.
Returns
half2
Description
Calculates the absolute value of both halves of the input half2 number and returns the
result.
Description
Performs half2 vector add of inputs a and b, in round-to-nearest mode.
Description
Performs half2 vector add of inputs a and b, in round-to-nearest mode. Prevents floating-
point contractions of mul+add into fma.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
half2
Description
Performs half2 vector add of inputs a and b, in round-to-nearest mode, and clamps the
results to range [0.0, 1.0]. NaN results are flushed to +0.0.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
c
- half2. Is only being read.
Returns
half2
Description
Interprets vector half2 input pairs a, b, and c as complex numbers in half precision and
performs the complex multiply-accumulate operation a*b + c.
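The complex multiply-accumulate can be written out component by component. The sketch below is a host-side model with hypothetical names (ComplexPair, cmadd_model); it assumes the first component of each pair is the real part and the second the imaginary part, and it uses float, so the order and number of rounding steps on the device are not reproduced.

```cpp
#include <cassert>

// Hypothetical stand-in for a half2 interpreted as a complex number.
// Assumption: x is the real part, y is the imaginary part.
struct ComplexPair {
    float x;
    float y;
};

// a*b + c for complex a, b, c:
//   real = a.x*b.x - a.y*b.y + c.x
//   imag = a.x*b.y + a.y*b.x + c.y
inline ComplexPair cmadd_model(ComplexPair a, ComplexPair b, ComplexPair c) {
    return ComplexPair{a.x * b.x - a.y * b.y + c.x,
                       a.x * b.y + a.y * b.x + c.y};
}

// (1 + 2i) * (3 + 4i) + (5 + 6i) = (-5 + 10i) + (5 + 6i) = 0 + 16i
inline void check_cmadd_model() {
    const ComplexPair r = cmadd_model({1.0f, 2.0f}, {3.0f, 4.0f}, {5.0f, 6.0f});
    assert(r.x == 0.0f);
    assert(r.y == 16.0f);
}
```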
Description
Performs half2 vector multiply on inputs a and b, then performs a half2 vector add of the
result with c, rounding the result once in round-to-nearest-even mode.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
c
- half2. Is only being read.
Returns
half2
‣ The result of elementwise fused multiply-add operation on vectors a, b, and c with relu
saturation.
Description
Performs half2 vector multiply on inputs a and b, then performs a half2 vector add of the
result with c, rounding the result once in round-to-nearest-even mode. A negative result is then
clamped to 0, and a NaN result is converted to canonical NaN.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
c
- half2. Is only being read.
Returns
half2
Description
Performs half2 vector multiply on inputs a and b, then performs a half2 vector add of
the result with c, rounding the result once in round-to-nearest-even mode, and clamps the
results to range [0.0, 1.0]. NaN results are flushed to +0.0.
Description
Performs half2 vector multiplication of inputs a and b, in round-to-nearest-even mode.
Description
Performs half2 vector multiplication of inputs a and b, in round-to-nearest-even mode.
Prevents floating-point contractions of mul+add or sub into fma.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
half2
Description
Performs half2 vector multiplication of inputs a and b, in round-to-nearest-even mode, and
clamps the results to range [0.0, 1.0]. NaN results are flushed to +0.0.
Description
Negates both halves of the input half2 number a and returns the result.
Description
Subtracts half2 input vector b from input vector a in round-to-nearest-even mode.
Description
Subtracts half2 input vector b from input vector a in round-to-nearest-even mode. Prevents
floating-point contractions of mul+sub into fma.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
half2
Description
Subtracts half2 input vector b from input vector a in round-to-nearest-even mode, and
clamps the results to range [0.0, 1.0]. NaN results are flushed to +0.0.
Parameters
address
- half2*. An address in global or shared memory.
val
- half2. The value to be added.
Returns
half2
Description
The location of address must be in global or shared memory. This operation has undefined
behavior otherwise. This operation is only supported by devices of compute capability 6.x and
higher.
Note:
For more details on this function, see the Atomic Functions section in the CUDA C++
Programming Guide.
Parameters
a
- half. Is only being read.
b
- half. Is only being read.
Returns
bool
Description
Performs half if-equal comparison of inputs a and b. NaN inputs generate false results.
Parameters
a
- half. Is only being read.
b
- half. Is only being read.
Returns
bool
Description
Performs half if-equal comparison of inputs a and b. NaN inputs generate true results.
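The two if-equal descriptions above differ only in how a NaN input is treated: the ordered form yields false, the unordered form yields true. A minimal host-side sketch of that distinction follows; the function names are hypothetical and float stands in for half.

```cpp
#include <cassert>
#include <cmath>

// Ordered comparison: any NaN input makes the result false.
inline bool eq_ordered_model(float a, float b) {
    return a == b;  // IEEE == is already false when either side is NaN
}

// Unordered comparison: any NaN input makes the result true.
inline bool eq_unordered_model(float a, float b) {
    return std::isnan(a) || std::isnan(b) || a == b;
}

// Small self-check of both forms.
inline void check_eq_models() {
    assert(eq_ordered_model(1.0f, 1.0f));
    assert(!eq_ordered_model(NAN, 1.0f));
    assert(eq_unordered_model(NAN, 1.0f));
    assert(!eq_unordered_model(1.0f, 2.0f));
}
```

The greater-equal, greater-than, less-equal, less-than and not-equal pairs in this section follow the same pattern, with the ordinary relational operator replacing ==.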
Parameters
a
- half. Is only being read.
b
- half. Is only being read.
Returns
bool
Description
Performs half greater-equal comparison of inputs a and b. NaN inputs generate false
results.
Parameters
a
- half. Is only being read.
b
- half. Is only being read.
Returns
bool
Description
Performs half greater-equal comparison of inputs a and b. NaN inputs generate true
results.
Parameters
a
- half. Is only being read.
b
- half. Is only being read.
Returns
bool
Description
Performs half greater-than comparison of inputs a and b. NaN inputs generate false results.
Parameters
a
- half. Is only being read.
b
- half. Is only being read.
Returns
bool
Description
Performs half greater-than comparison of inputs a and b. NaN inputs generate true results.
Parameters
a
- half. Is only being read.
Returns
int
Description
Checks if the input half number a is infinite.
Parameters
a
- half. Is only being read.
Returns
bool
Description
Determines whether half value a is a NaN.
Parameters
a
- half. Is only being read.
b
- half. Is only being read.
Returns
bool
Description
Performs half less-equal comparison of inputs a and b. NaN inputs generate false results.
Parameters
a
- half. Is only being read.
b
- half. Is only being read.
Returns
bool
Description
Performs half less-equal comparison of inputs a and b. NaN inputs generate true results.
Parameters
a
- half. Is only being read.
b
- half. Is only being read.
Returns
bool
Description
Performs half less-than comparison of inputs a and b. NaN inputs generate false results.
Parameters
a
- half. Is only being read.
b
- half. Is only being read.
Returns
bool
Description
Performs half less-than comparison of inputs a and b. NaN inputs generate true results.
Description
Calculates half max(a, b) defined as (a > b) ? a : b.
Description
Calculates half max(a, b) defined as (a > b) ? a : b.
Description
Calculates half min(a, b) defined as (a < b) ? a : b.
Description
Calculates half min(a, b) defined as (a < b) ? a : b.
Parameters
a
- half. Is only being read.
b
- half. Is only being read.
Returns
bool
Description
Performs half not-equal comparison of inputs a and b. NaN inputs generate false results.
Parameters
a
- half. Is only being read.
b
- half. Is only being read.
Returns
bool
Description
Performs half not-equal comparison of inputs a and b. NaN inputs generate true results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
bool
‣ true if both half results of if-equal comparison of vectors a and b are true;
‣ false otherwise.
Description
Performs half2 vector if-equal comparison of inputs a and b. The bool result is set to true
only if both half if-equal comparisons evaluate to true, or false otherwise. NaN inputs
generate false results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
bool
‣ true if both half results of unordered if-equal comparison of vectors a and b are true;
‣ false otherwise.
Description
Performs half2 vector if-equal comparison of inputs a and b. The bool result is set to true
only if both half if-equal comparisons evaluate to true, or false otherwise. NaN inputs
generate true results.
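For the half2 comparisons that return a single bool, the per-element results are combined with a logical AND. The sketch below models this for the if-equal case, covering both the ordered and the unordered form through a flag. Half2Model, lane_eq_model and both_eq_model are hypothetical names, and float stands in for each half.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical stand-in for half2: x is the low half, y is the high half.
struct Half2Model {
    float x;
    float y;
};

// One element of the comparison. With a NaN input the result is `unordered`:
// false for the ordered form, true for the unordered form.
inline bool lane_eq_model(float a, float b, bool unordered) {
    if (std::isnan(a) || std::isnan(b)) return unordered;
    return a == b;
}

// True only if both element-wise comparisons are true.
inline bool both_eq_model(Half2Model a, Half2Model b, bool unordered) {
    return lane_eq_model(a.x, b.x, unordered) &&
           lane_eq_model(a.y, b.y, unordered);
}

// Small self-check.
inline void check_both_eq_model() {
    assert(both_eq_model({1.0f, 2.0f}, {1.0f, 2.0f}, false));
    assert(!both_eq_model({1.0f, 2.0f}, {1.0f, 3.0f}, false));
    assert(!both_eq_model({1.0f, NAN}, {1.0f, 2.0f}, false));  // ordered
    assert(both_eq_model({1.0f, NAN}, {1.0f, 2.0f}, true));    // unordered
}
```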
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
bool
‣ true if both half results of greater-equal comparison of vectors a and b are true;
‣ false otherwise.
Description
Performs half2 vector greater-equal comparison of inputs a and b. The bool result is set to
true only if both half greater-equal comparisons evaluate to true, or false otherwise. NaN
inputs generate false results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
bool
‣ true if both half results of unordered greater-equal comparison of vectors a and b are
true;
‣ false otherwise.
Description
Performs half2 vector greater-equal comparison of inputs a and b. The bool result is set to
true only if both half greater-equal comparisons evaluate to true, or false otherwise. NaN
inputs generate true results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
bool
‣ true if both half results of greater-than comparison of vectors a and b are true;
‣ false otherwise.
Description
Performs half2 vector greater-than comparison of inputs a and b. The bool result is set to
true only if both half greater-than comparisons evaluate to true, or false otherwise. NaN
inputs generate false results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
bool
‣ true if both half results of unordered greater-than comparison of vectors a and b are
true;
‣ false otherwise.
Description
Performs half2 vector greater-than comparison of inputs a and b. The bool result is set to
true only if both half greater-than comparisons evaluate to true, or false otherwise. NaN
inputs generate true results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
bool
‣ true if both half results of less-equal comparison of vectors a and b are true;
‣ false otherwise.
Description
Performs half2 vector less-equal comparison of inputs a and b. The bool result is set to true
only if both half less-equal comparisons evaluate to true, or false otherwise. NaN inputs
generate false results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
bool
‣ true if both half results of unordered less-equal comparison of vectors a and b are true;
‣ false otherwise.
Description
Performs half2 vector less-equal comparison of inputs a and b. The bool result is set to true
only if both half less-equal comparisons evaluate to true, or false otherwise. NaN inputs
generate true results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
bool
‣ true if both half results of less-than comparison of vectors a and b are true;
‣ false otherwise.
Description
Performs half2 vector less-than comparison of inputs a and b. The bool result is set to true
only if both half less-than comparisons evaluate to true, or false otherwise. NaN inputs
generate false results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
bool
‣ true if both half results of unordered less-than comparison of vectors a and b are true;
‣ false otherwise.
Description
Performs half2 vector less-than comparison of inputs a and b. The bool result is set to true
only if both half less-than comparisons evaluate to true, or false otherwise. NaN inputs
generate true results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
bool
‣ true if both half results of not-equal comparison of vectors a and b are true;
‣ false otherwise.
Description
Performs half2 vector not-equal comparison of inputs a and b. The bool result is set to true
only if both half not-equal comparisons evaluate to true, or false otherwise. NaN inputs
generate false results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
bool
‣ true if both half results of unordered not-equal comparison of vectors a and b are true;
‣ false otherwise.
Description
Performs half2 vector not-equal comparison of inputs a and b. The bool result is set to true
only if both half not-equal comparisons evaluate to true, or false otherwise. NaN inputs
generate true results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
half2
Description
Performs half2 vector if-equal comparison of inputs a and b. The corresponding half
results are set to 1.0 for true, or 0.0 for false. NaN inputs generate false results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
half2
Description
Performs half2 vector if-equal comparison of inputs a and b. The corresponding half
results are set to 1.0 for true, or 0.0 for false. NaN inputs generate true results.
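The half2 comparisons that return half2 keep the two results separate and encode them as 1.0 or 0.0. A host-side sketch of the if-equal case is given below; the names are hypothetical, float stands in for each half, and the flag selects between the ordered and unordered NaN behaviour.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical stand-in for half2: x is the low half, y is the high half.
struct Half2Model {
    float x;
    float y;
};

// One element of the comparison, encoded as 1.0 (true) or 0.0 (false).
// With a NaN input the outcome is `unordered`.
inline float lane_eq_mask(float a, float b, bool unordered) {
    const bool result = (std::isnan(a) || std::isnan(b)) ? unordered : (a == b);
    return result ? 1.0f : 0.0f;
}

// Element-wise if-equal comparison producing a 1.0 / 0.0 mask per half.
inline Half2Model eq_mask_model(Half2Model a, Half2Model b, bool unordered) {
    return Half2Model{lane_eq_mask(a.x, b.x, unordered),
                      lane_eq_mask(a.y, b.y, unordered)};
}

// Small self-check.
inline void check_eq_mask_model() {
    const Half2Model ordered = eq_mask_model({1.0f, NAN}, {1.0f, 2.0f}, false);
    assert(ordered.x == 1.0f && ordered.y == 0.0f);
    const Half2Model unordered = eq_mask_model({1.0f, NAN}, {1.0f, 2.0f}, true);
    assert(unordered.x == 1.0f && unordered.y == 1.0f);
}
```

Because the results are ordinary numbers, such a mask can be multiplied with another half2 to zero out the elements that failed the comparison.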
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
half2
Description
Performs half2 vector greater-equal comparison of inputs a and b. The corresponding half
results are set to 1.0 for true, or 0.0 for false. NaN inputs generate false results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
half2
Description
Performs half2 vector greater-equal comparison of inputs a and b. The corresponding half
results are set to 1.0 for true, or 0.0 for false. NaN inputs generate true results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
half2
Description
Performs half2 vector greater-than comparison of inputs a and b. The corresponding half
results are set to 1.0 for true, or 0.0 for false. NaN inputs generate false results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
half2
Description
Performs half2 vector greater-than comparison of inputs a and b. The corresponding half
results are set to 1.0 for true, or 0.0 for false. NaN inputs generate true results.
Parameters
a
- half2. Is only being read.
Returns
half2
‣ The half2 with the corresponding half results set to 1.0 for NaN, 0.0 otherwise.
Description
Determines whether each half of input half2 number a is a NaN.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
half2
Description
Performs half2 vector less-equal comparison of inputs a and b. The corresponding half
results are set to 1.0 for true, or 0.0 for false. NaN inputs generate false results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
half2
Description
Performs half2 vector less-equal comparison of inputs a and b. The corresponding half
results are set to 1.0 for true, or 0.0 for false. NaN inputs generate true results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
half2
Description
Performs half2 vector less-than comparison of inputs a and b. The corresponding half
results are set to 1.0 for true, or 0.0 for false. NaN inputs generate false results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
half2
Description
Performs half2 vector less-than comparison of inputs a and b. The corresponding half
results are set to 1.0 for true, or 0.0 for false. NaN inputs generate true results.
Description
Calculates half2 vector max(a, b). Elementwise half operation is defined as (a > b) ? a : b.
Description
Calculates half2 vector max(a, b). Elementwise half operation is defined as (a > b) ? a : b.
Description
Calculates half2 vector min(a, b). Elementwise half operation is defined as (a < b) ? a : b.
Description
Calculates half2 vector min(a, b). Elementwise half operation is defined as (a < b) ? a : b.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
half2
Description
Performs half2 vector not-equal comparison of inputs a and b. The corresponding half
results are set to 1.0 for true, or 0.0 for false. NaN inputs generate false results.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
half2
Description
Performs half2 vector not-equal comparison of inputs a and b. The corresponding half
results are set to 1.0 for true, or 0.0 for false. NaN inputs generate true results.
Parameters
a
- double. Is only being read.
Returns
half
‣ a converted to half.
Description
Converts double number a to half precision in round-to-nearest-even mode.
Parameters
a
- float2. Is only being read.
Returns
half2
‣ The half2 which has corresponding halves equal to the converted float2 components.
Description
Converts both components of float2 to half precision in round-to-nearest mode and combines
the results into one half2 number. Low 16 bits of the return value correspond to a.x and
high 16 bits of the return value correspond to a.y.
Parameters
a
- float. Is only being read.
Returns
half
‣ a converted to half.
Description
Converts float number a to half precision in round-to-nearest-even mode.
Parameters
a
- float. Is only being read.
Returns
half2
‣ The half2 value with both halves equal to the converted half precision number.
Description
Converts input a to half precision in round-to-nearest-even mode and populates both halves of
half2 with converted value.
Parameters
a
- float. Is only being read.
Returns
half
‣ a converted to half.
Description
Converts float number a to half precision in round-down mode.
Parameters
a
- float. Is only being read.
Returns
half
‣ a converted to half.
Description
Converts float number a to half precision in round-to-nearest-even mode.
Parameters
a
- float. Is only being read.
Returns
half
‣ a converted to half.
Description
Converts float number a to half precision in round-up mode.
Parameters
a
- float. Is only being read.
Returns
half
‣ a converted to half.
Description
Converts float number a to half precision in round-towards-zero mode.
Parameters
a
- float. Is only being read.
b
- float. Is only being read.
Returns
half2
‣ The half2 value with corresponding halves equal to the converted input floats.
Description
Converts both input floats to half precision in round-to-nearest-even mode and combines the
results into one half2 number. Low 16 bits of the return value correspond to the input a, high
16 bits correspond to the input b.
Parameters
a
- half2. Is only being read.
Returns
float2
‣ a converted to float2.
Description
Converts both halves of half2 input a to float2 and returns the result.
Parameters
a
- half. Is only being read.
Returns
float
‣ a converted to float.
Description
Converts half number a to float.
Parameters
a
- half. Is only being read.
Returns
half2
‣ The vector which has both its halves equal to the input a.
Description
Returns half2 number with both halves equal to the input a half number.
Parameters
h
- half. Is only being read.
Returns
int
Description
Convert the half-precision floating-point value h to a signed integer in round-down mode. NaN
inputs are converted to 0.
Parameters
h
- half. Is only being read.
Returns
int
Description
Convert the half-precision floating-point value h to a signed integer in round-to-nearest-even
mode. NaN inputs are converted to 0.
Parameters
h
- half. Is only being read.
Returns
int
Description
Convert the half-precision floating-point value h to a signed integer in round-up mode. NaN
inputs are converted to 0.
Parameters
h
- half. Is only being read.
Returns
int
Description
Convert the half-precision floating-point value h to a signed integer in round-towards-zero
mode. NaN inputs are converted to 0.
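The four half-to-int conversions above differ only in rounding mode. The sketch below models them on the host with standard library rounding functions. It is illustrative only: the names are hypothetical, float stands in for half, the input is assumed to be finite or NaN, and std::nearbyint is assumed to run under the default round-to-nearest-even floating-point environment.

```cpp
#include <cassert>
#include <cmath>

enum class Round { Down, Nearest, Up, TowardZero };

// Hypothetical host-side model of converting a half value to a signed int.
// Infinite inputs are not modelled.
inline int to_int_model(float h, Round mode) {
    if (std::isnan(h)) return 0;  // NaN inputs are converted to 0
    switch (mode) {
        case Round::Down:       return static_cast<int>(std::floor(h));
        case Round::Up:         return static_cast<int>(std::ceil(h));
        case Round::TowardZero: return static_cast<int>(std::trunc(h));
        case Round::Nearest:    return static_cast<int>(std::nearbyint(h));
    }
    return 0;  // unreachable; keeps compilers quiet
}

// The modes only disagree when the input has a fractional part.
inline void check_to_int_model() {
    assert(to_int_model(2.5f, Round::Down) == 2);
    assert(to_int_model(2.5f, Round::Nearest) == 2);  // tie goes to even
    assert(to_int_model(2.5f, Round::Up) == 3);
    assert(to_int_model(-2.5f, Round::Down) == -3);
    assert(to_int_model(-2.5f, Round::TowardZero) == -2);
    assert(to_int_model(NAN, Round::Up) == 0);
}
```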
Parameters
h
- half. Is only being read.
Returns
long long int
Description
Convert the half-precision floating-point value h to a signed 64-bit integer in round-down
mode. NaN inputs return a long long int with hex value of 0x8000000000000000.
Parameters
h
- half. Is only being read.
Returns
long long int
Description
Convert the half-precision floating-point value h to a signed 64-bit integer in round-to-
nearest-even mode. NaN inputs return a long long int with hex value of 0x8000000000000000.
Parameters
h
- half. Is only being read.
Returns
long long int
Description
Convert the half-precision floating-point value h to a signed 64-bit integer in round-up mode.
NaN inputs return a long long int with hex value of 0x8000000000000000.
Parameters
h
- half. Is only being read.
Returns
long long int
Description
Convert the half-precision floating-point value h to a signed 64-bit integer in round-towards-
zero mode. NaN inputs return a long long int with hex value of 0x8000000000000000.
Parameters
h
- half. Is only being read.
Returns
short int
Description
Convert the half-precision floating-point value h to a signed short integer in round-down
mode. NaN inputs are converted to 0.
Parameters
h
- half. Is only being read.
Returns
short int
Description
Convert the half-precision floating-point value h to a signed short integer in round-to-nearest-
even mode. NaN inputs are converted to 0.
Parameters
h
- half. Is only being read.
Returns
short int
Description
Convert the half-precision floating-point value h to a signed short integer in round-up mode.
NaN inputs are converted to 0.
Parameters
h
- half. Is only being read.
Returns
short int
Description
Convert the half-precision floating-point value h to a signed short integer in round-towards-
zero mode. NaN inputs are converted to 0.
Parameters
h
- half. Is only being read.
Returns
unsigned int
Description
Convert the half-precision floating-point value h to an unsigned integer in round-down mode.
NaN inputs are converted to 0.
Parameters
h
- half. Is only being read.
Returns
unsigned int
Description
Convert the half-precision floating-point value h to an unsigned integer in round-to-nearest-
even mode. NaN inputs are converted to 0.
Parameters
h
- half. Is only being read.
Returns
unsigned int
Description
Convert the half-precision floating-point value h to an unsigned integer in round-up mode.
NaN inputs are converted to 0.
Parameters
h
- half. Is only being read.
Returns
unsigned int
Description
Convert the half-precision floating-point value h to an unsigned integer in round-towards-zero
mode. NaN inputs are converted to 0.
Parameters
h
- half. Is only being read.
Returns
unsigned long long int
Description
Convert the half-precision floating-point value h to an unsigned 64-bit integer in round-down
mode. NaN inputs return 0x8000000000000000.
Parameters
h
- half. Is only being read.
Returns
unsigned long long int
Description
Convert the half-precision floating-point value h to an unsigned 64-bit integer in round-to-
nearest-even mode. NaN inputs return 0x8000000000000000.
Parameters
h
- half. Is only being read.
Returns
unsigned long long int
Description
Convert the half-precision floating-point value h to an unsigned 64-bit integer in round-up
mode. NaN inputs return 0x8000000000000000.
Parameters
h
- half. Is only being read.
Returns
unsigned long long int
Description
Convert the half-precision floating-point value h to an unsigned 64-bit integer in round-
towards-zero mode. NaN inputs return 0x8000000000000000.
Parameters
h
- half. Is only being read.
Returns
unsigned short int
Description
Convert the half-precision floating-point value h to an unsigned short integer in round-down
mode. NaN inputs are converted to 0.
Parameters
h
- half. Is only being read.
Returns
unsigned short int
Description
Convert the half-precision floating-point value h to an unsigned short integer in round-to-
nearest-even mode. NaN inputs are converted to 0.
Parameters
h
- half. Is only being read.
Returns
unsigned short int
Description
Convert the half-precision floating-point value h to an unsigned short integer in round-up
mode. NaN inputs are converted to 0.
Parameters
h
- half. Is only being read.
Returns
unsigned short int
Description
Convert the half-precision floating-point value h to an unsigned short integer in round-
towards-zero mode. NaN inputs are converted to 0.
Parameters
h
- half. Is only being read.
Returns
short int
Description
Reinterprets the bits in the half-precision floating-point number h as a signed short integer.
Parameters
h
- half. Is only being read.
Returns
unsigned short int
Description
Reinterprets the bits in the half-precision floating-point h as an unsigned short number.
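Reinterpretation leaves all 16 bits untouched and only changes how they are read. The host-side sketch below shows the signed view of a bit pattern and, for comparison, how the same pattern decodes as an IEEE 754 binary16 value (1 sign bit, 5 exponent bits, 10 fraction bits). The function names are hypothetical.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <cstring>

// View a 16-bit pattern as a signed short without changing any bits.
inline std::int16_t bits_as_short(std::uint16_t bits) {
    std::int16_t s;
    std::memcpy(&s, &bits, sizeof s);
    return s;
}

// Decode a binary16 bit pattern into the value it represents.
inline float decode_half_bits(std::uint16_t bits) {
    const int sign = bits >> 15;
    const int exp  = (bits >> 10) & 0x1F;
    const int frac = bits & 0x3FF;
    float mag;
    if (exp == 0) {
        mag = std::ldexp(static_cast<float>(frac), -24);  // zero or subnormal
    } else if (exp == 31) {
        mag = frac ? NAN : INFINITY;                       // NaN or infinity
    } else {
        // Normal: (1024 + frac) * 2^(exp - 25) == (1 + frac/1024) * 2^(exp - 15)
        mag = std::ldexp(static_cast<float>(frac | 0x400), exp - 25);
    }
    return sign ? -mag : mag;
}

// 0xC000 encodes -2.0 as a half; read as a signed short it is -16384.
inline void check_half_bits() {
    assert(bits_as_short(0xC000) == -16384);
    assert(decode_half_bits(0xC000) == -2.0f);
    assert(decode_half_bits(0x3C00) == 1.0f);
    assert(decode_half_bits(0x7BFF) == 65504.0f);  // largest finite half
}
```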
Parameters
a
- half. Is only being read.
b
- half. Is only being read.
Returns
half2
Description
Combines two input half numbers a and b into one half2 number. Input a is stored in the low 16
bits of the return value, and input b is stored in the high 16 bits of the return value.
Parameters
a
- half2. Is only being read.
Returns
float
Description
Converts high 16 bits of half2 input a to 32-bit floating-point number and returns the result.
Parameters
a
- half2. Is only being read.
Returns
half
Description
Returns high 16 bits of half2 input a.
Parameters
a
- half2. Is only being read.
Returns
half2
‣ The half2 with both halves equal to the high 16 bits of the input.
Description
Extracts high 16 bits from half2 input a and returns a new half2 number which has both
halves equal to the extracted bits.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
half2
Description
Extracts the high 16 bits from each of the two half2 inputs and combines them into one half2
number. The high 16 bits of input a are stored in the low 16 bits of the return value, and the high
16 bits of input b are stored in the high 16 bits of the return value.
Parameters
i
- int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the signed integer value i to a half-precision floating-point value in round-down
mode.
Parameters
i
- int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the signed integer value i to a half-precision floating-point value in round-to-nearest-
even mode.
Parameters
i
- int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the signed integer value i to a half-precision floating-point value in round-up mode.
Parameters
i
- int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the signed integer value i to a half-precision floating-point value in round-towards-
zero mode.
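The rounding mode matters for integer inputs because a half has an 11-bit significand, so integers with magnitude above 2048 are not all representable. The sketch below models the rounding on the host by snapping an integer to the grid of representable values. It is an illustration under stated assumptions: the names are hypothetical, the result is returned as a double rather than a half, inputs are assumed to satisfy |i| <= 65504 so overflow is not modelled, and std::nearbyint is assumed to run in the default round-to-nearest-even mode.

```cpp
#include <cassert>
#include <cmath>

enum class Round { Down, Nearest, Up, TowardZero };

// Hypothetical host-side model: rounds i to a value with an 11-bit significand.
inline double int_to_half_grid(int i, Round mode) {
    if (i == 0) return 0.0;
    const double x = static_cast<double>(i);
    const int e = std::ilogb(x);                   // floor(log2(|x|))
    const double step = std::ldexp(1.0, e - 10);   // spacing of halves near x
    double q = x / step;                           // exact: step is a power of two
    switch (mode) {
        case Round::Down:       q = std::floor(q);     break;
        case Round::Up:         q = std::ceil(q);      break;
        case Round::TowardZero: q = std::trunc(q);     break;
        case Round::Nearest:    q = std::nearbyint(q); break;
    }
    return q * step;
}

// Between 2048 and 4096 only even integers are representable.
inline void check_int_to_half_grid() {
    assert(int_to_half_grid(2049, Round::Down) == 2048.0);
    assert(int_to_half_grid(2049, Round::Up) == 2050.0);
    assert(int_to_half_grid(2049, Round::Nearest) == 2048.0);  // tie to even
    assert(int_to_half_grid(-2049, Round::Down) == -2050.0);
    assert(int_to_half_grid(-2049, Round::TowardZero) == -2048.0);
    assert(int_to_half_grid(1000, Round::Up) == 1000.0);       // exact
}
```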
Parameters
ptr
- memory location
Returns
The value pointed to by `ptr`
Parameters
ptr
- memory location
Returns
The value pointed to by `ptr`
Parameters
ptr
- memory location
Returns
The value pointed to by `ptr`
Parameters
ptr
- memory location
Returns
The value pointed to by `ptr`
Parameters
ptr
- memory location
Returns
The value pointed to by `ptr`
Parameters
ptr
- memory location
Returns
The value pointed to by `ptr`
Parameters
ptr
- memory location
Returns
The value pointed to by `ptr`
Parameters
ptr
- memory location
Returns
The value pointed to by `ptr`
Parameters
ptr
- memory location
Returns
The value pointed to by `ptr`
Parameters
ptr
- memory location
Returns
The value pointed to by `ptr`
Parameters
ptr
- memory location
Returns
The value pointed to by `ptr`
Parameters
ptr
- memory location
Returns
The value pointed to by `ptr`
Parameters
i
- long long int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the signed 64-bit integer value i to a half-precision floating-point value in round-
down mode.
Parameters
i
- long long int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the signed 64-bit integer value i to a half-precision floating-point value in round-to-
nearest-even mode.
Parameters
i
- long long int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the signed 64-bit integer value i to a half-precision floating-point value in round-up
mode.
Parameters
i
- long long int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the signed 64-bit integer value i to a half-precision floating-point value in round-
towards-zero mode.
Parameters
a
- half2. Is only being read.
Returns
float
Description
Converts low 16 bits of half2 input a to 32-bit floating-point number and returns the result.
Parameters
a
- half2. Is only being read.
Returns
half
Description
Returns low 16 bits of half2 input a.
Parameters
a
- half2. Is only being read.
Returns
half2
‣ The half2 with both halves equal to the low 16 bits of the input.
Description
Extracts low 16 bits from half2 input a and returns a new half2 number which has both
halves equal to the extracted bits.
Parameters
a
- half2. Is only being read.
Returns
half2
Description
Swaps both halves of the half2 input and returns a new half2 number with swapped halves.
Parameters
a
- half2. Is only being read.
b
- half2. Is only being read.
Returns
half2
Description
Extracts the low 16 bits from each of the two half2 inputs and combines them into one half2
number. The low 16 bits of input a are stored in the low 16 bits of the return value, and the low
16 bits of input b are stored in the high 16 bits of the return value.
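The pack, extract and swap operations in this section act purely on bit positions, so they can be modelled with integer shifts and masks. The host-side sketch below treats a half as a raw 16-bit pattern and a half2 as a 32-bit pattern; all names are hypothetical and no floating-point conversion takes place.

```cpp
#include <cassert>
#include <cstdint>

using Bits16 = std::uint16_t;  // raw bit pattern of one half
using Bits32 = std::uint32_t;  // raw bit pattern of one half2

// First argument goes to the low 16 bits, second to the high 16 bits.
inline Bits32 pack_halves(Bits16 lo, Bits16 hi) {
    return static_cast<Bits32>(lo) | (static_cast<Bits32>(hi) << 16);
}

inline Bits16 low_half(Bits32 v)  { return static_cast<Bits16>(v & 0xFFFFu); }
inline Bits16 high_half(Bits32 v) { return static_cast<Bits16>(v >> 16); }

// Exchange the low and high halves.
inline Bits32 swap_halves(Bits32 v) { return (v >> 16) | (v << 16); }

// Low halves of a and b: a's goes low, b's goes high.
inline Bits32 lows_to_pair(Bits32 a, Bits32 b) {
    return pack_halves(low_half(a), low_half(b));
}

// High halves of a and b: a's goes low, b's goes high.
inline Bits32 highs_to_pair(Bits32 a, Bits32 b) {
    return pack_halves(high_half(a), high_half(b));
}

// Small self-check with easily recognisable patterns.
inline void check_pack_models() {
    assert(pack_halves(0x3C00, 0xC000) == 0xC0003C00u);
    assert(swap_halves(0xC0003C00u) == 0x3C00C000u);
    assert(lows_to_pair(0x11112222u, 0x33334444u) == 0x44442222u);
    assert(highs_to_pair(0x11112222u, 0x33334444u) == 0x33331111u);
}
```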
Parameters
mask
- unsigned int. Is only being read.
var
- half. Is only being read.
delta
- int. Is only being read.
width
- int. Is only being read.
Returns
Returns the 2-byte word referenced by var from the source thread ID as half. If the source
thread ID is out of range or the source thread has exited, the calling thread's own var is
returned.
Description
Calculates a source thread ID by adding delta to the caller's thread ID. The value of var held
by the resulting thread ID is returned: this has the effect of shifting var down the warp by
delta threads. If width is less than warpSize then each subsection of the warp behaves as a
separate entity with a starting logical thread ID of 0. As for __shfl_up_sync(), the ID number of
the source thread will not wrap around the value of width and so the upper delta threads will
remain unchanged.
Note:
For more details on this function, see the Warp Shuffle Functions section in the CUDA C++
Programming Guide.
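The shifting behaviour described above can be emulated on the host by treating the warp as an array with one entry per thread. This is only a sketch: shfl_down_model is a hypothetical name, float stands in for half, all threads are assumed to participate (mask is not modelled), delta is assumed to be non-negative, and the array length is assumed to be a multiple of width.

```cpp
#include <cassert>
#include <vector>

// Hypothetical host-side emulation: var[lane] is the value held by each thread.
inline std::vector<float> shfl_down_model(const std::vector<float>& var,
                                          int delta, int width) {
    const int n = static_cast<int>(var.size());
    std::vector<float> out(var.size());
    for (int lane = 0; lane < n; ++lane) {
        // One past the last lane of the subsection containing this lane.
        const int section_end = lane - lane % width + width;
        const int src = lane + delta;
        // No wrap-around: a source outside the subsection leaves the
        // calling thread's own value in place.
        out[lane] = (src < section_end && src < n) ? var[src] : var[lane];
    }
    return out;
}

// With width 4 the eight threads behave as two independent groups.
inline void check_shfl_down_model() {
    const std::vector<float> v{0, 1, 2, 3, 4, 5, 6, 7};
    const std::vector<float> full{2, 3, 4, 5, 6, 7, 6, 7};
    const std::vector<float> split{2, 3, 2, 3, 6, 7, 6, 7};
    assert(shfl_down_model(v, 2, 8) == full);
    assert(shfl_down_model(v, 2, 4) == split);
}
```

In both cases the upper delta threads of each subsection keep their own value, which matches the "remain unchanged" wording above.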
Parameters
mask
- unsigned int. Is only being read.
var
- half2. Is only being read.
delta
- int. Is only being read.
width
- int. Is only being read.
Returns
Returns the 4-byte word referenced by var from the source thread ID as half2. If the source
thread ID is out of range or the source thread has exited, the calling thread's own var is
returned.
Description
Calculates a source thread ID by adding delta to the caller's thread ID. The value of var held
by the resulting thread ID is returned: this has the effect of shifting var down the warp by
delta threads. If width is less than warpSize then each subsection of the warp behaves as a
separate entity with a starting logical thread ID of 0. As for __shfl_up_sync(), the ID number of
the source thread will not wrap around the value of width and so the upper delta threads will
remain unchanged.
Note:
For more details for this function see the Warp Shuffle Functions section in the CUDA C++
Programming Guide.
Parameters
mask
- unsigned int. Is only being read.
var
- half. Is only being read.
delta
- int. Is only being read.
width
- int. Is only being read.
Returns
Returns the 2-byte word referenced by var from the source thread ID as half. If the source
thread ID is out of range or the source thread has exited, the calling thread's own var is
returned.
Description
Returns the value of var held by the thread whose ID is given by delta. If width is less than
warpSize then each subsection of the warp behaves as a separate entity with a starting logical
thread ID of 0. If delta is outside the range [0:width-1], the value returned is the value of var
held by the thread whose ID is delta modulo width (i.e. within the same subsection). width must have
a value which is a power of 2; results are undefined if width is not a power of 2, or is a number
greater than warpSize.
Note:
For more details for this function see the Warp Shuffle Functions section in the CUDA C++
Programming Guide.
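The indexed read and its modulo rule can be sketched on the host. This is a minimal model under stated assumptions: shfl_model, Warp and kWarpSize are hypothetical names, float stands in for half, the warp size is taken to be 32, all lanes are active, and the source index (the delta parameter above) is non-negative.

```cpp
#include <array>
#include <cassert>

constexpr int kWarpSize = 32;               // assumption: a warp has 32 lanes
using Warp = std::array<float, kWarpSize>;  // float stands in for half on the host

// Hypothetical host-side model of an indexed shuffle with every lane active.
// src_lane plays the role of the delta parameter in the text above.
inline Warp shfl_model(const Warp& in, int src_lane, int width = kWarpSize) {
    // width must be a power of two no larger than the warp size
    assert(width > 0 && width <= kWarpSize && (width & (width - 1)) == 0);
    assert(src_lane >= 0);
    Warp out{};
    for (int lane = 0; lane < kWarpSize; ++lane) {
        int base = lane - (lane % width);           // first lane of the caller's subsection
        out[lane] = in[base + (src_lane % width)];  // index taken modulo width
    }
    return out;
}
```

With width = 8, a source index of 10 behaves like 2 within each subsection, so lane 9 reads from lane 10 (8 + 2).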
Parameters
mask
- unsigned int. Is only being read.
var
- half2. Is only being read.
delta
- int. Is only being read.
width
- int. Is only being read.
Returns
Returns the 4-byte word referenced by var from the source thread ID as half2. If the source
thread ID is out of range or the source thread has exited, the calling thread's own var is
returned.
Description
Returns the value of var held by the thread whose ID is given by delta. If width is less than
warpSize then each subsection of the warp behaves as a separate entity with a starting logical
thread ID of 0. If delta is outside the range [0:width-1], the value returned is the value of var
held by the thread whose ID is delta modulo width (i.e. within the same subsection). width must have
a value which is a power of 2; results are undefined if width is not a power of 2, or is a number
greater than warpSize.
Parameters
mask
- unsigned int. Is only being read.
var
- half. Is only being read.
delta
- int. Is only being read.
width
- int. Is only being read.
Returns
Returns the 2-byte word referenced by var from the source thread ID as half. If the source
thread ID is out of range or the source thread has exited, the calling thread's own var is
returned.
Description
Calculates a source thread ID by subtracting delta from the caller's lane ID. The value of var
held by the resulting lane ID is returned: in effect, var is shifted up the warp by delta threads.
If width is less than warpSize then each subsection of the warp behaves as a separate entity
with a starting logical thread ID of 0. The source thread index will not wrap around the value of
width, so effectively the lower delta threads will be unchanged. width must have a value which
is a power of 2; results are undefined if width is not a power of 2, or is a number greater than
warpSize.
Note:
For more details for this function see the Warp Shuffle Functions section in the CUDA C++
Programming Guide.
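A common use of the shift-up pattern is a warp-wide inclusive prefix sum. The sketch below models it on the host; shfl_up_model, inclusive_scan_model, Warp and kWarpSize are hypothetical names, float stands in for half, and a fully active 32-lane warp is assumed.

```cpp
#include <array>
#include <cassert>

constexpr int kWarpSize = 32;               // assumption: a warp has 32 lanes
using Warp = std::array<float, kWarpSize>;  // float stands in for half on the host

// Hypothetical host-side model of a shuffle-up with every lane active.
inline Warp shfl_up_model(const Warp& in, int delta, int width = kWarpSize) {
    // width must be a power of two no larger than the warp size
    assert(width > 0 && width <= kWarpSize && (width & (width - 1)) == 0);
    assert(delta >= 0 && delta < kWarpSize);
    Warp out{};
    for (int lane = 0; lane < kWarpSize; ++lane) {
        int base = lane - (lane % width);  // first lane of the caller's subsection
        int src = lane - delta;            // source lane = caller - delta
        // No wrap-around: the lower delta lanes keep their own value.
        out[lane] = (src >= base) ? in[src] : in[lane];
    }
    return out;
}

// Inclusive prefix sum over the warp using shuffle-up steps of 1, 2, 4, 8, 16.
inline Warp inclusive_scan_model(Warp v) {
    for (int d = 1; d < kWarpSize; d *= 2) {
        Warp up = shfl_up_model(v, d);
        // Lanes below d received their own value back, so they are skipped.
        for (int lane = d; lane < kWarpSize; ++lane) v[lane] += up[lane];
    }
    return v;
}
```

The explicit lane >= d guard matters because the lower lanes get their own var returned rather than zero.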
Parameters
mask
- unsigned int. Is only being read.
var
- half2. Is only being read.
delta
- int. Is only being read.
width
- int. Is only being read.
Returns
Returns the 4-byte word referenced by var from the source thread ID as half2. If the source
thread ID is out of range or the source thread has exited, the calling thread's own var is
returned.
Description
Calculates a source thread ID by subtracting delta from the caller's lane ID. The value of var
held by the resulting lane ID is returned: in effect, var is shifted up the warp by delta threads.
If width is less than warpSize then each subsection of the warp behaves as a separate entity
with a starting logical thread ID of 0. The source thread index will not wrap around the value of
width, so effectively the lower delta threads will be unchanged. width must have a value which
is a power of 2; results are undefined if width is not a power of 2, or is a number greater than
warpSize.
Note:
For more details for this function see the Warp Shuffle Functions section in the CUDA C++
Programming Guide.
Parameters
mask
- unsigned int. Is only being read.
var
- half. Is only being read.
delta
- int. Is only being read.
width
- int. Is only being read.
Returns
Returns the 2-byte word referenced by var from the source thread ID as half. If the source
thread ID is out of range or the source thread has exited, the calling thread's own var is
returned.
Description
Calculates a source thread ID by performing a bitwise XOR of the caller's thread ID with delta
(mask only selects the participating threads): the value of var held by the resulting thread ID is
returned. If width is less than warpSize then each group of width consecutive threads can access
elements from earlier groups of threads, but if a thread attempts to access elements from later
groups its own value of var is returned. This mode implements a butterfly addressing pattern
such as is used in tree reduction and broadcast.
Note:
For more details for this function see the Warp Shuffle Functions section in the CUDA C++
Programming Guide.
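The butterfly pattern mentioned above can be sketched on the host as an all-lanes sum. This is a minimal model under stated assumptions: shfl_xor_model, all_reduce_sum_model, Warp and kWarpSize are hypothetical names, float stands in for half, the warp has 32 fully active lanes, and the XOR operand is taken to be the delta parameter listed above, with mask only naming the participating lanes.

```cpp
#include <array>
#include <cassert>

constexpr int kWarpSize = 32;               // assumption: a warp has 32 lanes
using Warp = std::array<float, kWarpSize>;  // float stands in for half on the host

// Hypothetical host-side model of a shuffle-xor with every lane active.
// lane_mask is the XOR operand (assumed to be the delta parameter above).
inline Warp shfl_xor_model(const Warp& in, int lane_mask, int width = kWarpSize) {
    // width must be a power of two no larger than the warp size
    assert(width > 0 && width <= kWarpSize && (width & (width - 1)) == 0);
    assert(lane_mask >= 0 && lane_mask < kWarpSize);
    Warp out{};
    for (int lane = 0; lane < kWarpSize; ++lane) {
        int last = lane - (lane % width) + width - 1;  // last lane of the caller's group
        int src = lane ^ lane_mask;                    // butterfly partner
        // Earlier groups are readable; a partner in a later group yields the own value.
        out[lane] = (src <= last) ? in[src] : in[lane];
    }
    return out;
}

// Butterfly reduction: after log2(32) = 5 steps every lane holds the warp-wide sum.
inline Warp all_reduce_sum_model(Warp v) {
    for (int m = kWarpSize / 2; m > 0; m /= 2) {
        Warp other = shfl_xor_model(v, m);
        for (int lane = 0; lane < kWarpSize; ++lane) v[lane] += other[lane];
    }
    return v;
}
```

Unlike a shift-based reduction, the butterfly leaves the full sum in every lane, so no final broadcast is needed.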
Parameters
mask
- unsigned int. Is only being read.
var
- half2. Is only being read.
delta
- int. Is only being read.
width
- int. Is only being read.
Returns
Returns the 4-byte word referenced by var from the source thread ID as half2. If the source
thread ID is out of range or the source thread has exited, the calling thread's own var is
returned.
Description
Calculates a source thread ID by performing a bitwise XOR of the caller's thread ID with delta
(mask only selects the participating threads): the value of var held by the resulting thread ID is
returned. If width is less than warpSize then each group of width consecutive threads can access
elements from earlier groups of threads, but if a thread attempts to access elements from later
groups its own value of var is returned. This mode implements a butterfly addressing pattern
such as is used in tree reduction and broadcast.
Note:
For more details for this function see the Warp Shuffle Functions section in the CUDA C++
Programming Guide.
Parameters
i
- short int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the signed short integer value i to a half-precision floating-point value in round-down
mode.
Parameters
i
- short int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the signed short integer value i to a half-precision floating-point value in round-to-
nearest-even mode.
Parameters
i
- short int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the signed short integer value i to a half-precision floating-point value in round-up
mode.
Parameters
i
- short int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the signed short integer value i to a half-precision floating-point value in round-
towards-zero mode.
Parameters
i
- short int. Is only being read.
Returns
half
Description
Reinterprets the bits in the signed short integer i as a half-precision floating-point number.
Parameters
ptr
- memory location
value
- the value to be stored
Parameters
ptr
- memory location
value
- the value to be stored
Parameters
ptr
- memory location
value
- the value to be stored
Parameters
ptr
- memory location
value
- the value to be stored
Parameters
ptr
- memory location
value
- the value to be stored
Parameters
ptr
- memory location
value
- the value to be stored
Parameters
ptr
- memory location
value
- the value to be stored
Parameters
ptr
- memory location
value
- the value to be stored
Parameters
i
- unsigned int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the unsigned integer value i to a half-precision floating-point value in round-down
mode.
Parameters
i
- unsigned int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the unsigned integer value i to a half-precision floating-point value in round-to-
nearest-even mode.
Parameters
i
- unsigned int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the unsigned integer value i to a half-precision floating-point value in round-up
mode.
Parameters
i
- unsigned int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the unsigned integer value i to a half-precision floating-point value in round-towards-
zero mode.
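The four rounding modes used by the integer-to-half conversions differ only when i is not exactly representable. A half carries 11 significant bits, so this first happens for integers above 2048. The host-side sketch below models the choice for unsigned inputs; to_half_grid and Rounding are hypothetical names, and inputs are restricted to the finite half range so that overflow is not modeled.

```cpp
#include <cassert>

enum class Rounding { Down, Nearest, Up, TowardZero };

// Hypothetical host-side model: returns the integer value of the half that an
// unsigned-to-half conversion would produce for i under the given mode.
// Restricted to i <= 65504 (the largest finite half); overflow is not modeled.
inline unsigned to_half_grid(unsigned i, Rounding mode) {
    assert(i <= 65504u);
    // Spacing between neighbouring halves near i: keep 11 significant bits.
    unsigned ulp = 1;
    while (i / ulp >= 2048u) ulp *= 2;
    unsigned lo = (i / ulp) * ulp;  // nearest representable value at or below i
    if (lo == i) return i;          // already exactly representable
    unsigned hi = lo + ulp;         // nearest representable value above i
    switch (mode) {
        case Rounding::Down:
        case Rounding::TowardZero:  // same as Down for non-negative input
            return lo;
        case Rounding::Up:
            return hi;
        case Rounding::Nearest: {
            unsigned rem = i - lo;
            if (2 * rem < ulp) return lo;
            if (2 * rem > ulp) return hi;
            return ((lo / ulp) % 2 == 0) ? lo : hi;  // tie: pick the even significand
        }
    }
    return lo;  // not reached; keeps compilers quiet
}
```

For example, 2049 lies halfway between the halves 2048 and 2050: round-down and round-towards-zero give 2048, round-up gives 2050, and round-to-nearest-even gives 2048.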
Parameters
i
- unsigned long long int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the unsigned 64-bit integer value i to a half-precision floating-point value in round-
down mode.
Parameters
i
- unsigned long long int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the unsigned 64-bit integer value i to a half-precision floating-point value in round-
to-nearest-even mode.
Parameters
i
- unsigned long long int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the unsigned 64-bit integer value i to a half-precision floating-point value in round-up
mode.
Parameters
i
- unsigned long long int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the unsigned 64-bit integer value i to a half-precision floating-point value in round-
towards-zero mode.
Parameters
i
- unsigned short int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the unsigned short integer value i to a half-precision floating-point value in round-
down mode.
Parameters
i
- unsigned short int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the unsigned short integer value i to a half-precision floating-point value in round-to-
nearest-even mode.
Parameters
i
- unsigned short int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the unsigned short integer value i to a half-precision floating-point value in round-up
mode.
Parameters
i
- unsigned short int. Is only being read.
Returns
half
‣ i converted to half.
Description
Convert the unsigned short integer value i to a half-precision floating-point value in round-
towards-zero mode.
Parameters
i
- unsigned short int. Is only being read.
Returns
half
Description
Reinterprets the bits in the unsigned short integer i as a half-precision floating-point number.
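Reinterpretation differs from value conversion: the bit pattern 0x3C00 is the integer 15360, but read as a half it is 1.0. The host-side sketch below decodes a 16-bit pattern using the IEEE 754 binary16 layout; half_bits_to_float is a hypothetical helper, not part of the CUDA API, and returns the decoded value as a float.

```cpp
#include <cmath>
#include <cstdint>
#include <limits>

// Hypothetical host-side decoder for an IEEE 754 binary16 bit pattern:
// 1 sign bit, 5 exponent bits (bias 15), 10 fraction bits.
inline float half_bits_to_float(std::uint16_t bits) {
    int sign = (bits >> 15) & 0x1;
    int exponent = (bits >> 10) & 0x1F;
    int fraction = bits & 0x3FF;
    float magnitude;
    if (exponent == 0) {
        // zero or subnormal: fraction * 2^-24
        magnitude = std::ldexp(static_cast<float>(fraction), -24);
    } else if (exponent == 31) {
        // all-ones exponent: infinity or NaN
        magnitude = (fraction == 0) ? std::numeric_limits<float>::infinity()
                                    : std::numeric_limits<float>::quiet_NaN();
    } else {
        // normal: (1 + fraction/1024) * 2^(exponent - 15)
        magnitude = std::ldexp(static_cast<float>(1024 + fraction), exponent - 25);
    }
    return sign ? -magnitude : magnitude;
}
```

Every finite half is exactly representable as a float, so the decoded value loses no precision.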
Parameters
h
- half. Is only being read.
Returns
half
Description
Compute the smallest integer value not less than h.
Parameters
a
- half. Is only being read.
Returns
half
‣ The cosine of a.
Description
Calculates half cosine of input a in round-to-nearest-even mode.
Parameters
a
- half. Is only being read.
Returns
half
Description
Calculates half natural exponential function of input a in round-to-nearest-even mode.
Parameters
a
- half. Is only being read.
Returns
half
Description
Calculates half decimal exponential function of input a in round-to-nearest-even mode.
Parameters
a
- half. Is only being read.
Returns
half
Description
Calculates half binary exponential function of input a in round-to-nearest-even mode.
Parameters
h
- half. Is only being read.
Returns
half
Description
Calculate the largest integer value which is less than or equal to h.
Parameters
a
- half. Is only being read.
Returns
half
Description
Calculates half natural logarithm of input a in round-to-nearest-even mode.
Parameters
a
- half. Is only being read.
Returns
half
Description
Calculates half decimal logarithm of input a in round-to-nearest-even mode.
Parameters
a
- half. Is only being read.
Returns
half
Description
Calculates half binary logarithm of input a in round-to-nearest-even mode.
Parameters
a
- half. Is only being read.
Returns
half
‣ The reciprocal of a.
Description
Calculates half reciprocal of input a in round-to-nearest-even mode.
Parameters
h
- half. Is only being read.
Returns
half
Description
Round h to the nearest integer value in half-precision floating-point format, with halfway
cases rounded to the nearest even integer value.
Parameters
a
- half. Is only being read.
Returns
half
Description
Calculates half reciprocal square root of input a in round-to-nearest mode.
Parameters
a
- half. Is only being read.
Returns
half
‣ The sine of a.
Description
Calculates half sine of input a in round-to-nearest-even mode.
Parameters
a
- half. Is only being read.
Returns
half
Description
Calculates half square root of input a in round-to-nearest-even mode.
Parameters
h
- half. Is only being read.
Returns
half
Description
Round h to the nearest integer value that does not exceed h in magnitude.
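The four integer-rounding operations in this group (smallest integer not less than h, largest integer not greater than h, nearest with ties to even, and truncation) have direct analogues in the C++ standard library. The sketch below applies them to a float on the host; round_four_ways and IntegerRoundings are hypothetical names, and std::nearbyint is assumed to run under the default round-to-nearest floating-point environment.

```cpp
#include <cmath>

// Results of the four integer-rounding rules applied to one input.
struct IntegerRoundings {
    float up;            // smallest integer value not less than h
    float down;          // largest integer value not greater than h
    float nearest_even;  // nearest integer, halfway cases to even
    float toward_zero;   // nearest integer not exceeding h in magnitude
};

// Hypothetical host-side analogue; float stands in for half.
inline IntegerRoundings round_four_ways(float h) {
    return { std::ceil(h), std::floor(h), std::nearbyint(h), std::trunc(h) };
}
```

For h = -2.5 the four results are -2, -3, -2 and -2; for h = 3.5 the nearest-even rule gives 4.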
Parameters
h
- half2. Is only being read.
Returns
half2
Description
For each component of vector h compute the smallest integer value not less than h.
Parameters
a
- half2. Is only being read.
Returns
half2
Description
Calculates half2 cosine of input vector a in round-to-nearest-even mode.
Parameters
a
- half2. Is only being read.
Returns
half2
Description
Calculates half2 exponential function of input vector a in round-to-nearest-even mode.
Parameters
a
- half2. Is only being read.
Returns
half2
Description
Calculates half2 decimal exponential function of input vector a in round-to-nearest-even
mode.
Parameters
a
- half2. Is only being read.
Returns
half2
Description
Calculates half2 binary exponential function of input vector a in round-to-nearest-even
mode.
Parameters
h
- half2. Is only being read.
Returns
half2
Description
For each component of vector h calculate the largest integer value which is less than or equal
to h.
Parameters
a
- half2. Is only being read.
Returns
half2
Description
Calculates half2 natural logarithm of input vector a in round-to-nearest-even mode.
Parameters
a
- half2. Is only being read.
Returns
half2
Description
Calculates half2 decimal logarithm of input vector a in round-to-nearest-even mode.
Parameters
a
- half2. Is only being read.
Returns
half2
Description
Calculates half2 binary logarithm of input vector a in round-to-nearest mode.
Parameters
a
- half2. Is only being read.
Returns
half2
Description
Calculates half2 reciprocal of input vector a in round-to-nearest-even mode.
Parameters
h
- half2. Is only being read.
Returns
half2
Description
Round each component of half2 vector h to the nearest integer value in half-precision
floating-point format, with halfway cases rounded to the nearest even integer value.
Parameters
a
- half2. Is only being read.
Returns
half2
Description
Calculates half2 reciprocal square root of input vector a in round-to-nearest-even mode.
Parameters
a
- half2. Is only being read.
Returns
half2
Description
Calculates half2 sine of input vector a in round-to-nearest-even mode.
Parameters
a
- half2. Is only being read.
Returns
half2
Description
Calculates half2 square root of input vector a in round-to-nearest mode.
Parameters
h
- half2. Is only being read.
Returns
half2
‣ The truncated h.
Description
Round each component of vector h to the nearest integer value that does not exceed h in
magnitude.
Description
Divides nv_bfloat162 input vector a by input vector b in round-to-nearest mode.
Parameters
a
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
Description
Calculates the absolute value of input nv_bfloat16 number and returns the result.
Description
Performs nv_bfloat16 addition of inputs a and b, in round-to-nearest-even mode.
Description
Performs nv_bfloat16 addition of inputs a and b, in round-to-nearest-even mode. Prevents
floating-point contractions of mul+add into fma.
Parameters
a
- nv_bfloat16. Is only being read.
b
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
Description
Performs nv_bfloat16 add of inputs a and b, in round-to-nearest-even mode, and clamps
the result to range [0.0, 1.0]. NaN results are flushed to +0.0.
Description
Divides nv_bfloat16 input a by input b in round-to-nearest mode.
Description
Performs nv_bfloat16 multiply on inputs a and b, then performs an nv_bfloat16 add of
the result with c, rounding the result once in round-to-nearest-even mode.
Parameters
a
- nv_bfloat16. Is only being read.
b
- nv_bfloat16. Is only being read.
c
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
Description
Performs nv_bfloat16 multiply on inputs a and b, then performs an nv_bfloat16 add of
the result with c, rounding the result once in round-to-nearest-even mode. A negative
result is then clamped to 0, and a NaN result is converted to canonical NaN.
Parameters
a
- nv_bfloat16. Is only being read.
b
- nv_bfloat16. Is only being read.
c
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
Description
Performs nv_bfloat16 multiply on inputs a and b, then performs an nv_bfloat16 add of
the result with c, rounding the result once in round-to-nearest-even mode, and clamps the
result to range [0.0, 1.0]. NaN results are flushed to +0.0.
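The two post-processing rules for fused multiply-add described above (relu clamping and saturation to [0.0, 1.0]) treat NaN differently. The host-side sketch below shows the contrast; fma_relu_model and fma_sat_model are hypothetical names, and float stands in for nv_bfloat16, so the rounding of the intermediate result differs from the device result.

```cpp
#include <algorithm>
#include <cmath>
#include <limits>

// Hypothetical host-side model of fused multiply-add followed by relu:
// negative results become 0, NaN stays NaN (returned as a quiet NaN).
inline float fma_relu_model(float a, float b, float c) {
    float r = std::fma(a, b, c);  // single rounding of a*b + c
    if (std::isnan(r)) return std::numeric_limits<float>::quiet_NaN();
    return (r < 0.0f) ? 0.0f : r;
}

// Hypothetical host-side model of fused multiply-add with saturation:
// the result is clamped to [0.0, 1.0] and NaN is flushed to +0.0.
inline float fma_sat_model(float a, float b, float c) {
    float r = std::fma(a, b, c);  // single rounding of a*b + c
    if (std::isnan(r)) return 0.0f;
    return std::min(1.0f, std::max(0.0f, r));
}
```

The practical difference is that the relu form propagates NaN to the caller, while the saturating form hides it as +0.0.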
Description
Performs nv_bfloat16 multiplication of inputs a and b, in round-to-nearest mode.
Description
Performs nv_bfloat16 multiplication of inputs a and b, in round-to-nearest mode. Prevents
floating-point contractions of mul+add or sub into fma.
Parameters
a
- nv_bfloat16. Is only being read.
b
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
Description
Performs nv_bfloat16 multiplication of inputs a and b, in round-to-nearest mode, and
clamps the result to range [0.0, 1.0]. NaN results are flushed to +0.0.
Description
Negates input nv_bfloat16 number and returns the result.
Description
Subtracts nv_bfloat16 input b from input a in round-to-nearest mode.
Description
Subtracts nv_bfloat16 input b from input a in round-to-nearest mode. Prevents floating-
point contractions of mul+sub into fma.
Parameters
a
- nv_bfloat16. Is only being read.
b
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
Description
Subtracts nv_bfloat16 input b from input a in round-to-nearest mode, and clamps the
result to range [0.0, 1.0]. NaN results are flushed to +0.0.
Parameters
address
- __nv_bfloat16*. An address in global or shared memory.
val
- __nv_bfloat16. The value to be added.
Returns
__nv_bfloat16
Description
The location of address must be in global or shared memory. This operation has undefined
behavior otherwise. This operation is only supported by devices of compute capability 8.x and
higher.
Note:
For more details for this function see the Atomic Functions section in the CUDA C++
Programming Guide.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Calculates the absolute value of both halves of the input nv_bfloat162 number and returns
the result.
Description
Performs nv_bfloat162 vector add of inputs a and b, in round-to-nearest mode.
Description
Performs nv_bfloat162 vector add of inputs a and b, in round-to-nearest mode. Prevents
floating-point contractions of mul+add into fma.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Performs nv_bfloat162 vector add of inputs a and b, in round-to-nearest mode, and
clamps the results to range [0.0, 1.0]. NaN results are flushed to +0.0.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
c
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Interprets vector nv_bfloat162 input pairs a, b, and c as complex numbers in
nv_bfloat16 precision and performs a complex multiply-accumulate operation: a*b + c.
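The complex multiply-accumulate a*b + c expands into four real multiplies. The host-side sketch below writes it out; Pair and cmadd_model are hypothetical names, float stands in for nv_bfloat16, and it is an assumption of this sketch that the low element (x) holds the real part and the high element (y) the imaginary part.

```cpp
// Stand-in for a two-element vector: x = low element, y = high element.
struct Pair { float x; float y; };

// Hypothetical host-side model of a complex multiply-accumulate a*b + c.
// Assumption: x is the real part and y is the imaginary part.
inline Pair cmadd_model(Pair a, Pair b, Pair c) {
    Pair r;
    r.x = a.x * b.x - a.y * b.y + c.x;  // real part: Re(a)Re(b) - Im(a)Im(b) + Re(c)
    r.y = a.x * b.y + a.y * b.x + c.y;  // imaginary part: Re(a)Im(b) + Im(a)Re(b) + Im(c)
    return r;
}
```

For a = 1 + 2i, b = 3 + 4i and c = 5 + 6i, the product a*b is -5 + 10i, so the result is 0 + 16i.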
Description
Performs nv_bfloat162 vector multiply on inputs a and b, then performs an nv_bfloat162
vector add of the result with c, rounding the result once in round-to-nearest-even mode.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
c
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
‣ The result of elementwise fused multiply-add operation on vectors a, b, and c with relu
saturation.
Description
Performs nv_bfloat162 vector multiply on inputs a and b, then performs an nv_bfloat162
vector add of the result with c, rounding the result once in round-to-nearest-even mode. A
negative result is then clamped to 0, and a NaN result is converted to canonical NaN.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
c
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Performs nv_bfloat162 vector multiply on inputs a and b, then performs an nv_bfloat162
vector add of the result with c, rounding the result once in round-to-nearest-even mode, and
clamps the results to range [0.0, 1.0]. NaN results are flushed to +0.0.
Description
Performs nv_bfloat162 vector multiplication of inputs a and b, in round-to-nearest-even
mode.
Description
Performs nv_bfloat162 vector multiplication of inputs a and b, in round-to-nearest-even
mode. Prevents floating-point contractions of mul+add or sub into fma.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Performs nv_bfloat162 vector multiplication of inputs a and b, in round-to-nearest-even
mode, and clamps the results to range [0.0, 1.0]. NaN results are flushed to +0.0.
Description
Negates both halves of the input nv_bfloat162 number a and returns the result.
Description
Subtracts nv_bfloat162 input vector b from input vector a in round-to-nearest-even mode.
Description
Subtracts nv_bfloat162 input vector b from input vector a in round-to-nearest-even mode.
Prevents floating-point contractions of mul+sub into fma.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Subtracts nv_bfloat162 input vector b from input vector a in round-to-nearest-even mode,
and clamps the results to range [0.0, 1.0]. NaN results are flushed to +0.0.
Parameters
address
- __nv_bfloat162*. An address in global or shared memory.
val
- __nv_bfloat162. The value to be added.
Returns
__nv_bfloat162
Description
The location of address must be in global or shared memory. This operation has undefined
behavior otherwise. This operation is only supported by devices of compute capability 8.x and
higher.
Note:
For more details for this function see the Atomic Functions section in the CUDA C++
Programming Guide.
Parameters
a
- nv_bfloat16. Is only being read.
b
- nv_bfloat16. Is only being read.
Returns
bool
Description
Performs nv_bfloat16 if-equal comparison of inputs a and b. NaN inputs generate false
results.
Parameters
a
- nv_bfloat16. Is only being read.
b
- nv_bfloat16. Is only being read.
Returns
bool
Description
Performs nv_bfloat16 if-equal comparison of inputs a and b. NaN inputs generate true
results.
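The comparison descriptions in this group come in pairs that differ only in NaN handling: one form returns false when either input is NaN (ordered), the other returns true (unordered). The host-side sketch below models both for equality and less-than; the function names are hypothetical and float stands in for nv_bfloat16.

```cpp
#include <cmath>

// Ordered comparisons: any NaN input makes the result false.
inline bool eq_ordered(float a, float b) { return a == b; }
inline bool lt_ordered(float a, float b) { return a < b; }

// Unordered comparisons: any NaN input makes the result true.
inline bool eq_unordered(float a, float b) {
    return std::isnan(a) || std::isnan(b) || a == b;
}
inline bool lt_unordered(float a, float b) {
    return std::isnan(a) || std::isnan(b) || a < b;
}
```

For non-NaN inputs the two forms agree, so the choice only matters where NaN can reach the comparison.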
Parameters
a
- nv_bfloat16. Is only being read.
b
- nv_bfloat16. Is only being read.
Returns
bool
Description
Performs nv_bfloat16 greater-equal comparison of inputs a and b. NaN inputs generate
false results.
Parameters
a
- nv_bfloat16. Is only being read.
b
- nv_bfloat16. Is only being read.
Returns
bool
Description
Performs nv_bfloat16 greater-equal comparison of inputs a and b. NaN inputs generate
true results.
Parameters
a
- nv_bfloat16. Is only being read.
b
- nv_bfloat16. Is only being read.
Returns
bool
Description
Performs nv_bfloat16 greater-than comparison of inputs a and b. NaN inputs generate
false results.
Parameters
a
- nv_bfloat16. Is only being read.
b
- nv_bfloat16. Is only being read.
Returns
bool
Description
Performs nv_bfloat16 greater-than comparison of inputs a and b. NaN inputs generate
true results.
Parameters
a
- nv_bfloat16. Is only being read.
Returns
int
Description
Checks if the input nv_bfloat16 number a is infinite.
Parameters
a
- nv_bfloat16. Is only being read.
Returns
bool
Description
Determine whether nv_bfloat16 value a is a NaN.
Parameters
a
- nv_bfloat16. Is only being read.
b
- nv_bfloat16. Is only being read.
Returns
bool
Description
Performs nv_bfloat16 less-equal comparison of inputs a and b. NaN inputs generate false
results.
Parameters
a
- nv_bfloat16. Is only being read.
b
- nv_bfloat16. Is only being read.
Returns
bool
Description
Performs nv_bfloat16 less-equal comparison of inputs a and b. NaN inputs generate true
results.
Parameters
a
- nv_bfloat16. Is only being read.
b
- nv_bfloat16. Is only being read.
Returns
bool
Description
Performs nv_bfloat16 less-than comparison of inputs a and b. NaN inputs generate false
results.
Parameters
a
- nv_bfloat16. Is only being read.
b
- nv_bfloat16. Is only being read.
Returns
bool
Description
Performs nv_bfloat16 less-than comparison of inputs a and b. NaN inputs generate true
results.
Description
Calculates nv_bfloat16 max(a, b) defined as (a > b) ? a : b.
Description
Calculates nv_bfloat16 max(a, b) defined as (a > b) ? a : b.
Description
Calculates nv_bfloat16 min(a, b) defined as (a < b) ? a : b.
Description
Calculates nv_bfloat16 min(a, b) defined as (a < b) ? a : b.
Parameters
a
- nv_bfloat16. Is only being read.
b
- nv_bfloat16. Is only being read.
Returns
bool
Description
Performs nv_bfloat16 not-equal comparison of inputs a and b. NaN inputs generate false
results.
Parameters
a
- nv_bfloat16. Is only being read.
b
- nv_bfloat16. Is only being read.
Returns
bool
Description
Performs nv_bfloat16 not-equal comparison of inputs a and b. NaN inputs generate true
results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
bool
‣ true if both nv_bfloat16 results of if-equal comparison of vectors a and b are true;
‣ false otherwise.
Description
Performs nv_bfloat162 vector if-equal comparison of inputs a and b. The bool result is set
to true only if both nv_bfloat16 if-equal comparisons evaluate to true, or false otherwise.
NaN inputs generate false results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
bool
‣ true if both nv_bfloat16 results of unordered if-equal comparison of vectors a and b are
true;
‣ false otherwise.
Description
Performs nv_bfloat162 vector if-equal comparison of inputs a and b. The bool result is set
to true only if both nv_bfloat16 if-equal comparisons evaluate to true, or false otherwise.
NaN inputs generate true results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
bool
‣ true if both nv_bfloat16 results of greater-equal comparison of vectors a and b are true;
‣ false otherwise.
Description
Performs nv_bfloat162 vector greater-equal comparison of inputs a and b. The bool result
is set to true only if both nv_bfloat16 greater-equal comparisons evaluate to true, or false
otherwise. NaN inputs generate false results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
bool
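‣ true if both nv_bfloat16 results of unordered greater-equal comparison of vectors a and b are
true;
‣ false otherwise.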
Description
Performs nv_bfloat162 vector greater-equal comparison of inputs a and b. The bool result
is set to true only if both nv_bfloat16 greater-equal comparisons evaluate to true, or false
otherwise. NaN inputs generate true results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
bool
‣ true if both nv_bfloat16 results of greater-than comparison of vectors a and b are true;
‣ false otherwise.
Description
Performs nv_bfloat162 vector greater-than comparison of inputs a and b. The bool result
is set to true only if both nv_bfloat16 greater-than comparisons evaluate to true, or false
otherwise. NaN inputs generate false results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
bool
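‣ true if both nv_bfloat16 results of unordered greater-than comparison of vectors a and b are
true;
‣ false otherwise.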
Description
Performs nv_bfloat162 vector greater-than comparison of inputs a and b. The bool result
is set to true only if both nv_bfloat16 greater-than comparisons evaluate to true, or false
otherwise. NaN inputs generate true results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
bool
‣ true if both nv_bfloat16 results of less-equal comparison of vectors a and b are true;
‣ false otherwise.
Description
Performs nv_bfloat162 vector less-equal comparison of inputs a and b. The bool result
is set to true only if both nv_bfloat16 less-equal comparisons evaluate to true, or false
otherwise. NaN inputs generate false results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
bool
‣ true if both nv_bfloat16 results of unordered less-equal comparison of vectors a and b are
true;
‣ false otherwise.
Description
Performs nv_bfloat162 vector less-equal comparison of inputs a and b. The bool result
is set to true only if both nv_bfloat16 less-equal comparisons evaluate to true, or false
otherwise. NaN inputs generate true results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
bool
‣ true if both nv_bfloat16 results of less-than comparison of vectors a and b are true;
‣ false otherwise.
Description
Performs nv_bfloat162 vector less-than comparison of inputs a and b. The bool result
is set to true only if both nv_bfloat16 less-than comparisons evaluate to true, or false
otherwise. NaN inputs generate false results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
bool
‣ true if both nv_bfloat16 results of unordered less-than comparison of vectors a and b are
true;
‣ false otherwise.
Description
Performs nv_bfloat162 vector less-than comparison of inputs a and b. The bool result
is set to true only if both nv_bfloat16 less-than comparisons evaluate to true, or false
otherwise. NaN inputs generate true results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
bool
‣ true if both nv_bfloat16 results of not-equal comparison of vectors a and b are true;
‣ false otherwise.
Description
Performs nv_bfloat162 vector not-equal comparison of inputs a and b. The bool result
is set to true only if both nv_bfloat16 not-equal comparisons evaluate to true, or false
otherwise. NaN inputs generate false results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
bool
‣ true if both nv_bfloat16 results of unordered not-equal comparison of vectors a and b are
true;
‣ false otherwise.
Description
Performs nv_bfloat162 vector not-equal comparison of inputs a and b. The bool result
is set to true only if both nv_bfloat16 not-equal comparisons evaluate to true, or false
otherwise. NaN inputs generate true results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Performs nv_bfloat162 vector if-equal comparison of inputs a and b. The corresponding
nv_bfloat16 results are set to 1.0 for true, or 0.0 for false. NaN inputs generate false
results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Performs nv_bfloat162 vector if-equal comparison of inputs a and b. The corresponding
nv_bfloat16 results are set to 1.0 for true, or 0.0 for false. NaN inputs generate true
results.
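The vector comparisons come in two result shapes: a per-element form that returns 1.0 or 0.0 in each element, and a single-bool form that is true only if both element comparisons are true. The host-side sketch below models both for if-equal, including the unordered per-element variant; Pair and the function names are hypothetical, and float stands in for nv_bfloat16.

```cpp
#include <cmath>

// Stand-in for a two-element vector: x = low element, y = high element.
struct Pair { float x; float y; };

// Encodes a comparison outcome as 1.0 (true) or 0.0 (false).
inline float as_mask(bool b) { return b ? 1.0f : 0.0f; }

// Per-element ordered if-equal: a NaN in either operand gives 0.0 for that element.
inline Pair eq2_model(Pair a, Pair b) {
    return { as_mask(a.x == b.x), as_mask(a.y == b.y) };
}

// Per-element unordered if-equal: a NaN in either operand gives 1.0 for that element.
inline Pair equ2_model(Pair a, Pair b) {
    bool x = std::isnan(a.x) || std::isnan(b.x) || a.x == b.x;
    bool y = std::isnan(a.y) || std::isnan(b.y) || a.y == b.y;
    return { as_mask(x), as_mask(y) };
}

// Single-bool ordered if-equal: true only if both element comparisons are true.
inline bool beq2_model(Pair a, Pair b) {
    Pair m = eq2_model(a, b);
    return m.x == 1.0f && m.y == 1.0f;
}
```

The per-element form is convenient as a multiplicative mask, while the single-bool form suits branch conditions.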
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Performs nv_bfloat162 vector greater-equal comparison of inputs a and b. The
corresponding nv_bfloat16 results are set to 1.0 for true, or 0.0 for false. NaN inputs
generate false results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Performs nv_bfloat162 vector greater-equal comparison of inputs a and b. The
corresponding nv_bfloat16 results are set to 1.0 for true, or 0.0 for false. NaN inputs
generate true results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Performs nv_bfloat162 vector greater-than comparison of inputs a and b. The
corresponding nv_bfloat16 results are set to 1.0 for true, or 0.0 for false. NaN inputs
generate false results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Performs nv_bfloat162 vector unordered greater-than comparison of inputs a and b. The
corresponding nv_bfloat16 results are set to 1.0 for true, or 0.0 for false. NaN inputs
generate true results.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
‣ The nv_bfloat162 with the corresponding nv_bfloat16 results set to 1.0 for NaN, 0.0
otherwise.
Description
Determine whether each nv_bfloat16 of input nv_bfloat162 number a is a NaN.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Performs nv_bfloat162 vector less-equal comparison of inputs a and b. The corresponding
nv_bfloat16 results are set to 1.0 for true, or 0.0 for false. NaN inputs generate false
results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Performs nv_bfloat162 vector unordered less-equal comparison of inputs a and b. The
corresponding nv_bfloat16 results are set to 1.0 for true, or 0.0 for false. NaN inputs
generate true results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Performs nv_bfloat162 vector less-than comparison of inputs a and b. The corresponding
nv_bfloat16 results are set to 1.0 for true, or 0.0 for false. NaN inputs generate false
results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Performs nv_bfloat162 vector unordered less-than comparison of inputs a and b. The
corresponding nv_bfloat16 results are set to 1.0 for true, or 0.0 for false. NaN inputs
generate true results.
Description
Calculates nv_bfloat162 vector max(a, b). Elementwise nv_bfloat16 operation is defined
as (a > b) ? a : b.
Description
Calculates nv_bfloat162 vector max(a, b). Elementwise nv_bfloat16 operation is defined
as (a > b) ? a : b.
Description
Calculates nv_bfloat162 vector min(a, b). Elementwise nv_bfloat16 operation is defined
as (a < b) ? a : b.
Description
Calculates nv_bfloat162 vector min(a, b). Elementwise nv_bfloat16 operation is defined
as (a < b) ? a : b.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Performs nv_bfloat162 vector not-equal comparison of inputs a and b. The corresponding
nv_bfloat16 results are set to 1.0 for true, or 0.0 for false. NaN inputs generate false
results.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Performs nv_bfloat162 vector unordered not-equal comparison of inputs a and b. The
corresponding nv_bfloat16 results are set to 1.0 for true, or 0.0 for false. NaN inputs
generate true results.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
float2
‣ a converted to float2.
Description
Converts both halves of nv_bfloat162 input a to float2 and returns the result.
Parameters
a
- nv_bfloat16. Is only being read.
Returns
nv_bfloat162
‣ The vector which has both its halves equal to the input a.
Description
Returns an nv_bfloat162 number with both halves equal to the input nv_bfloat16
number a.
Parameters
a
- nv_bfloat16. Is only being read.
Returns
float
‣ a converted to float.
Description
Converts nv_bfloat16 number a to float.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
int
Description
Convert the nv_bfloat16 floating-point value h to a signed integer in round-down mode. NaN
inputs are converted to 0.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
int
Description
Convert the nv_bfloat16 floating-point value h to a signed integer in round-to-nearest-even
mode. NaN inputs are converted to 0.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
int
Description
Convert the nv_bfloat16 floating-point value h to a signed integer in round-up mode. NaN
inputs are converted to 0.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
int
Description
Convert the nv_bfloat16 floating-point value h to a signed integer in round-towards-zero
mode. NaN inputs are converted to 0.
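The four signed-integer conversions above differ only in rounding direction. As a rough host-side illustration (not the device implementation), the sketch below emulates them on a float that is assumed to hold a bfloat16-representable value. The names Rounding and to_int are hypothetical. Behaviour for values outside the int range is not stated in the text above, so this sketch simply rejects them with an assertion.

```cpp
#include <cassert>
#include <climits>
#include <cmath>

enum class Rounding { Down, NearestEven, Up, TowardZero };

// Host-side emulation; h is assumed to hold a bfloat16-representable value.
inline int to_int(float h, Rounding mode) {
    if (std::isnan(h)) return 0;  // NaN inputs are converted to 0
    double r = 0.0;
    switch (mode) {
        case Rounding::Down:        r = std::floor(h);     break;
        // nearbyint rounds ties to even under the default FE_TONEAREST mode.
        case Rounding::NearestEven: r = std::nearbyint(h); break;
        case Rounding::Up:          r = std::ceil(h);      break;
        case Rounding::TowardZero:  r = std::trunc(h);     break;
    }
    // Assumption of this sketch: out-of-range inputs are not handled.
    assert(r >= static_cast<double>(INT_MIN) && r <= static_cast<double>(INT_MAX));
    return static_cast<int>(r);
}
```

For h = -2.5 the four modes give -3 (round-down), -2 (round-to-nearest-even), -2 (round-up) and -2 (round-towards-zero).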
Parameters
h
- nv_bfloat16. Is only being read.
Returns
long long int
Description
Convert the nv_bfloat16 floating-point value h to a signed 64-bit integer in round-down mode.
NaN inputs return a long long int with hex value of 0x8000000000000000.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
long long int
Description
Convert the nv_bfloat16 floating-point value h to a signed 64-bit integer in round-to-nearest-
even mode. NaN inputs return a long long int with hex value of 0x8000000000000000.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
long long int
Description
Convert the nv_bfloat16 floating-point value h to a signed 64-bit integer in round-up mode.
NaN inputs return a long long int with hex value of 0x8000000000000000.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
long long int
Description
Convert the nv_bfloat16 floating-point value h to a signed 64-bit integer in round-towards-zero
mode. NaN inputs return a long long int with hex value of 0x8000000000000000.
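Unlike the 32-bit and 16-bit conversions, the 64-bit conversions map NaN to the bit pattern 0x8000000000000000, which read as a signed long long is the most negative representable value. A minimal host-side sketch of the round-towards-zero case follows. The function name to_ll_rz is hypothetical, and inputs outside the long long range are deliberately left unhandled.

```cpp
#include <cassert>
#include <cmath>
#include <limits>

// Round-towards-zero conversion to a signed 64-bit integer (host-side emulation).
inline long long to_ll_rz(float h) {
    if (std::isnan(h)) {
        // Bit pattern 0x8000000000000000 is the most negative long long.
        return std::numeric_limits<long long>::min();
    }
    const double t = std::trunc(static_cast<double>(h));
    // 2^63 as a double; values outside [-2^63, 2^63) are not handled by this sketch.
    const double two63 = 9223372036854775808.0;
    assert(t >= -two63 && t < two63);
    return static_cast<long long>(t);
}
```

Callers that need to detect NaN should test the input before converting, because the same bit pattern is also a valid (if extreme) integer result.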
Parameters
h
- nv_bfloat16. Is only being read.
Returns
short int
Description
Convert the nv_bfloat16 floating-point value h to a signed short integer in round-down mode.
NaN inputs are converted to 0.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
short int
Description
Convert the nv_bfloat16 floating-point value h to a signed short integer in round-to-nearest-
even mode. NaN inputs are converted to 0.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
short int
Description
Convert the nv_bfloat16 floating-point value h to a signed short integer in round-up mode.
NaN inputs are converted to 0.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
short int
Description
Convert the nv_bfloat16 floating-point value h to a signed short integer in round-towards-zero
mode. NaN inputs are converted to 0.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
unsigned int
Description
Convert the nv_bfloat16 floating-point value h to an unsigned integer in round-down mode.
NaN inputs are converted to 0.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
unsigned int
Description
Convert the nv_bfloat16 floating-point value h to an unsigned integer in round-to-nearest-even
mode. NaN inputs are converted to 0.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
unsigned int
Description
Convert the nv_bfloat16 floating-point value h to an unsigned integer in round-up mode. NaN
inputs are converted to 0.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
unsigned int
Description
Convert the nv_bfloat16 floating-point value h to an unsigned integer in round-towards-zero
mode. NaN inputs are converted to 0.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
unsigned long long int
Description
Convert the nv_bfloat16 floating-point value h to an unsigned 64-bit integer in round-down
mode. NaN inputs return 0x8000000000000000.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
unsigned long long int
Description
Convert the nv_bfloat16 floating-point value h to an unsigned 64-bit integer in round-to-
nearest-even mode. NaN inputs return 0x8000000000000000.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
unsigned long long int
Description
Convert the nv_bfloat16 floating-point value h to an unsigned 64-bit integer in round-up mode.
NaN inputs return 0x8000000000000000.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
unsigned long long int
Description
Convert the nv_bfloat16 floating-point value h to an unsigned 64-bit integer in round-towards-
zero mode. NaN inputs return 0x8000000000000000.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
unsigned short int
Description
Convert the nv_bfloat16 floating-point value h to an unsigned short integer in round-down
mode. NaN inputs are converted to 0.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
unsigned short int
Description
Convert the nv_bfloat16 floating-point value h to an unsigned short integer in round-to-
nearest-even mode. NaN inputs are converted to 0.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
unsigned short int
Description
Convert the nv_bfloat16 floating-point value h to an unsigned short integer in round-up mode.
NaN inputs are converted to 0.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
unsigned short int
Description
Convert the nv_bfloat16 floating-point value h to an unsigned short integer in round-towards-
zero mode. NaN inputs are converted to 0.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
short int
Description
Reinterprets the bits in the nv_bfloat16 floating-point number h as a signed short integer.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
unsigned short int
Description
Reinterprets the bits in the nv_bfloat16 floating-point number h as an unsigned short integer.
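Reinterpretation leaves the 16 bits untouched and only changes how they are read. The bfloat16 format has 1 sign bit, 8 exponent bits and 7 mantissa bits, which is the upper half of an IEEE binary32 encoding. The host-side sketch below relies on that fact to show which integer a given value reinterprets to. All function names are hypothetical.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Bit pattern of a float whose value is exactly representable in bfloat16
// (the low 16 bits of the float encoding are then zero).
inline std::uint16_t bf16_bits(float exact) {
    std::uint32_t u;
    std::memcpy(&u, &exact, sizeof u);
    assert((u & 0xFFFFu) == 0 && "value is not exactly representable in bfloat16");
    return static_cast<std::uint16_t>(u >> 16);
}

// Reinterpret the 16 bits as a signed short without changing them.
inline std::int16_t bits_as_short(std::uint16_t bits) {
    std::int16_t s;
    std::memcpy(&s, &bits, sizeof s);
    return s;
}

// Inverse direction: widen a bfloat16 bit pattern back to float.
inline float float_from_bf16_bits(std::uint16_t bits) {
    const std::uint32_t u = static_cast<std::uint32_t>(bits) << 16;
    float f;
    std::memcpy(&f, &u, sizeof f);
    return f;
}
```

For example, 1.0 has the bit pattern 0x3F80, and -2.0 has 0xC000, which reads as -16384 when interpreted as a signed short.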
Parameters
a
- double. Is only being read.
Returns
nv_bfloat16
‣ a converted to nv_bfloat16.
Description
Converts double number a to nv_bfloat16 precision in round-to-nearest-even mode.
Parameters
a
- float2. Is only being read.
Returns
nv_bfloat162
‣ The nv_bfloat162 which has corresponding halves equal to the converted float2
components.
Description
Converts both components of float2 to nv_bfloat16 precision in round-to-nearest-even mode and
combines the results into one nv_bfloat162 number. Low 16 bits of the return value
correspond to a.x and high 16 bits of the return value correspond to a.y.
Parameters
a
- float. Is only being read.
Returns
nv_bfloat16
‣ a converted to nv_bfloat16.
Description
Converts float number a to nv_bfloat16 precision in round-to-nearest-even mode.
Parameters
a
- float. Is only being read.
Returns
nv_bfloat162
‣ The nv_bfloat162 value with both halves equal to the converted nv_bfloat16 precision
number.
Description
Converts input a to nv_bfloat16 precision in round-to-nearest-even mode and populates both
halves of nv_bfloat162 with converted value.
Parameters
a
- float. Is only being read.
Returns
nv_bfloat16
‣ a converted to nv_bfloat16.
Description
Converts float number a to nv_bfloat16 precision in round-down mode.
Parameters
a
- float. Is only being read.
Returns
nv_bfloat16
‣ a converted to nv_bfloat16.
Description
Converts float number a to nv_bfloat16 precision in round-to-nearest-even mode.
Parameters
a
- float. Is only being read.
Returns
nv_bfloat16
‣ a converted to nv_bfloat16.
Description
Converts float number a to nv_bfloat16 precision in round-up mode.
Parameters
a
- float. Is only being read.
Returns
nv_bfloat16
‣ a converted to nv_bfloat16.
Description
Converts float number a to nv_bfloat16 precision in round-towards-zero mode.
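The three directed modes (round-down, round-up, round-towards-zero) can be emulated on the host by truncating the float encoding and then stepping one unit away from zero when the chosen direction requires it. This is a sketch under the assumption that bfloat16 is the upper half of binary32. The names Direction and float_to_bf16_directed are hypothetical, and the NaN handling is a choice made for this example.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

enum class Direction { Down, Up, TowardZero };

// Narrow a float to bfloat16 bits using a directed rounding mode.
inline std::uint16_t float_to_bf16_directed(float f, Direction d) {
    std::uint32_t u;
    std::memcpy(&u, &f, sizeof u);
    if ((u & 0x7FFFFFFFu) > 0x7F800000u) {
        // NaN: keep the sign and force a quiet NaN (assumption of this sketch).
        return static_cast<std::uint16_t>((u >> 16) | 0x0040u);
    }
    const std::uint32_t kept = u >> 16;             // truncation rounds toward zero
    const bool inexact = (u & 0xFFFFu) != 0;        // some discarded bit was set
    const bool negative = (u >> 31) != 0;
    // Step away from zero only when the direction points away from zero.
    const bool away = inexact && ((d == Direction::Down && negative) ||
                                  (d == Direction::Up && !negative));
    return static_cast<std::uint16_t>(away ? kept + 1u : kept);
}
```

For the input 1 + 2^-8, round-down and round-towards-zero both give 0x3F80 and round-up gives 0x3F81. For the negated input, round-down gives 0xBF81 and the other two give 0xBF80.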
Parameters
a
- float. Is only being read.
b
- float. Is only being read.
Returns
nv_bfloat162
‣ The nv_bfloat162 value with corresponding halves equal to the converted input floats.
Description
Converts both input floats to nv_bfloat16 precision in round-to-nearest-even mode and
combines the results into one nv_bfloat162 number. Low 16 bits of the return value
correspond to the input a, high 16 bits correspond to the input b.
Parameters
a
- nv_bfloat16. Is only being read.
b
- nv_bfloat16. Is only being read.
Returns
nv_bfloat162
Description
Combines two input nv_bfloat16 numbers a and b into one nv_bfloat162 number. Input a
is stored in low 16 bits of the return value, input b is stored in high 16 bits of the return value.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
nv_bfloat16
Description
Returns high 16 bits of nv_bfloat162 input a.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
‣ The nv_bfloat162 with both halves equal to the high 16 bits of the input.
Description
Extracts high 16 bits from nv_bfloat162 input a and returns a new nv_bfloat162 number
which has both halves equal to the extracted bits.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
float
Description
Converts high 16 bits of nv_bfloat162 input a to 32-bit floating-point number and returns
the result.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Extracts high 16 bits from each of the two nv_bfloat162 inputs and combines into one
nv_bfloat162 number. High 16 bits from input a are stored in low 16 bits of the return value,
high 16 bits from input b are stored in high 16 bits of the return value.
Parameters
i
- int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the signed integer value i to a nv_bfloat16 floating-point value in round-down mode.
Parameters
i
- int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the signed integer value i to a nv_bfloat16 floating-point value in round-to-nearest-
even mode.
Parameters
i
- int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the signed integer value i to a nv_bfloat16 floating-point value in round-up mode.
Parameters
i
- int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the signed integer value i to a nv_bfloat16 floating-point value in round-towards-zero
mode.
Parameters
ptr
- memory location
Returns
The value pointed to by ptr.
Parameters
ptr
- memory location
Returns
The value pointed to by ptr.
Parameters
ptr
- memory location
Returns
The value pointed to by ptr.
Parameters
ptr
- memory location
Returns
The value pointed to by ptr.
Parameters
ptr
- memory location
Returns
The value pointed to by ptr.
Parameters
ptr
- memory location
Returns
The value pointed to by ptr.
Parameters
ptr
- memory location
Returns
The value pointed to by ptr.
Parameters
ptr
- memory location
Returns
The value pointed to by ptr.
Parameters
ptr
- memory location
Returns
The value pointed to by ptr.
Parameters
ptr
- memory location
Returns
The value pointed to by ptr.
Parameters
ptr
- memory location
Returns
The value pointed to by ptr.
Parameters
ptr
- memory location
Returns
The value pointed to by ptr.
Parameters
i
- long long int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the signed 64-bit integer value i to a nv_bfloat16 floating-point value in round-down
mode.
Parameters
i
- long long int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the signed 64-bit integer value i to a nv_bfloat16 floating-point value in round-to-
nearest-even mode.
Parameters
i
- long long int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the signed 64-bit integer value i to a nv_bfloat16 floating-point value in round-up
mode.
Parameters
i
- long long int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the signed 64-bit integer value i to a nv_bfloat16 floating-point value in round-
towards-zero mode.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
nv_bfloat16
Description
Returns low 16 bits of nv_bfloat162 input a.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
‣ The nv_bfloat162 with both halves equal to the low 16 bits of the input.
Description
Extracts low 16 bits from nv_bfloat162 input a and returns a new nv_bfloat162 number
which has both halves equal to the extracted bits.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
float
Description
Converts low 16 bits of nv_bfloat162 input a to 32-bit floating-point number and returns
the result.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Swaps both halves of the nv_bfloat162 input and returns a new nv_bfloat162 number
with swapped halves.
Parameters
a
- nv_bfloat162. Is only being read.
b
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Extracts low 16 bits from each of the two nv_bfloat162 inputs and combines into one
nv_bfloat162 number. Low 16 bits from input a are stored in low 16 bits of the return value,
low 16 bits from input b are stored in high 16 bits of the return value.
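The packing, extraction and swapping operations in this group all follow from one layout rule: the first element occupies the low 16 bits and the second the high 16 bits. The host-side sketch below models an nv_bfloat162 as a raw 32-bit word to make that layout concrete. The aliases and function names are hypothetical and only mirror the descriptions above.

```cpp
#include <cassert>
#include <cstdint>

using Half16 = std::uint16_t;  // raw bits of one nv_bfloat16 (hypothetical alias)
using Pair32 = std::uint32_t;  // raw bits of one nv_bfloat162 (hypothetical alias)

// Combine two halves: a goes to the low 16 bits, b to the high 16 bits.
inline Pair32 pack(Half16 a, Half16 b) {
    return static_cast<Pair32>(a) | (static_cast<Pair32>(b) << 16);
}

inline Half16 low(Pair32 v)  { return static_cast<Half16>(v & 0xFFFFu); }
inline Half16 high(Pair32 v) { return static_cast<Half16>(v >> 16); }

// Swap the two halves of one value.
inline Pair32 swap_halves(Pair32 v) { return pack(high(v), low(v)); }

// Take the low (or high) half of each input; a's half lands in the low 16 bits.
inline Pair32 lows(Pair32 a, Pair32 b)  { return pack(low(a), low(b)); }
inline Pair32 highs(Pair32 a, Pair32 b) { return pack(high(a), high(b)); }

// Duplicate one half into both positions.
inline Pair32 broadcast_low(Pair32 v)  { return pack(low(v), low(v)); }
inline Pair32 broadcast_high(Pair32 v) { return pack(high(v), high(v)); }
```

For instance, packing the bit patterns 0x3F80 (1.0) and 0x4000 (2.0) yields the word 0x40003F80, and swapping its halves yields 0x3F804000.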
Parameters
mask
- unsigned int. Is only being read.
var
- nv_bfloat16. Is only being read.
delta
- int. Is only being read.
width
- int. Is only being read.
Returns
Returns the 2-byte word referenced by var from the source thread ID as nv_bfloat16. If the
source thread ID is out of range or the source thread has exited, the calling thread's own var is
returned.
Description
Calculates a source thread ID by adding delta to the caller's thread ID. The value of var held
by the resulting thread ID is returned: this has the effect of shifting var down the warp by
delta threads. If width is less than warpSize then each subsection of the warp behaves as a
separate entity with a starting logical thread ID of 0. As for __shfl_up_sync(), the ID number of
the source thread will not wrap around the value of width and so the upper delta threads will
remain unchanged.
Note:
For more details for this function see the Warp Shuffle Functions section in the CUDA C++
Programming Guide.
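The effect of width on the shift-down pattern can be checked on the host by modelling a warp as an array of 32 lane values. The sketch below is an emulation only: it assumes every lane is active (so the mask argument is omitted), a warp size of 32, and a non-negative delta. The names Warp and shfl_down are hypothetical.

```cpp
#include <array>
#include <cassert>

constexpr int kWarpSize = 32;                  // assumed warp size
using Warp = std::array<float, kWarpSize>;     // one value of var per lane

// Emulates shifting var down the warp by delta lanes within subsections of width.
inline Warp shfl_down(const Warp& var, int delta, int width = kWarpSize) {
    assert(width > 0 && width <= kWarpSize && (width & (width - 1)) == 0);
    assert(delta >= 0);
    Warp out{};
    for (int lane = 0; lane < kWarpSize; ++lane) {
        const int base = lane - (lane % width);    // first lane of this subsection
        const int src = lane + delta;              // source lane, no wrap-around
        // Lanes whose source falls outside the subsection keep their own value.
        out[lane] = (src < base + width) ? var[src] : var[lane];
    }
    return out;
}
```

With lane i holding the value i, delta = 2 and width = 8, lane 0 receives 2 and lane 5 receives 7, while lanes 6 and 7 (the upper delta lanes of the first subsection) keep their own values.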
Parameters
mask
- unsigned int. Is only being read.
var
- nv_bfloat162. Is only being read.
delta
- int. Is only being read.
width
- int. Is only being read.
Returns
Returns the 4-byte word referenced by var from the source thread ID as nv_bfloat162. If the
source thread ID is out of range or the source thread has exited, the calling thread's own var is
returned.
Description
Calculates a source thread ID by adding delta to the caller's thread ID. The value of var held
by the resulting thread ID is returned: this has the effect of shifting var down the warp by
delta threads. If width is less than warpSize then each subsection of the warp behaves as a
separate entity with a starting logical thread ID of 0. As for __shfl_up_sync(), the ID number of
the source thread will not wrap around the value of width and so the upper delta threads will
remain unchanged.
Note:
For more details for this function see the Warp Shuffle Functions section in the CUDA C++
Programming Guide.
Parameters
mask
- unsigned int. Is only being read.
var
- nv_bfloat16. Is only being read.
delta
- int. Is only being read.
width
- int. Is only being read.
Returns
Returns the 2-byte word referenced by var from the source thread ID as nv_bfloat16. If the
source thread ID is out of range or the source thread has exited, the calling thread's own var is
returned.
Description
Returns the value of var held by the thread whose ID is given by delta. If width is less than
warpSize then each subsection of the warp behaves as a separate entity with a starting logical
thread ID of 0. If delta is outside the range [0:width-1], the value returned is the value of var
held by the thread whose ID is delta modulo width (i.e. within the same subsection). width must
have a value which is a power of 2; results are undefined if width is not a power of 2, or is a
number greater than warpSize.
Note:
For more details for this function see the Warp Shuffle Functions section in the CUDA C++
Programming Guide.
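The indexed shuffle, including the modulo rule for an out-of-range source index, can be emulated on the host as follows. This is a sketch that assumes all 32 lanes are active (the mask argument is omitted) and a non-negative source index. The names Warp, shfl and src_lane are hypothetical.

```cpp
#include <array>
#include <cassert>

constexpr int kWarpSize = 32;                  // assumed warp size
using Warp = std::array<float, kWarpSize>;     // one value of var per lane

// Every lane reads var from lane (src_lane modulo width) of its own subsection.
inline Warp shfl(const Warp& var, int src_lane, int width = kWarpSize) {
    assert(width > 0 && width <= kWarpSize && (width & (width - 1)) == 0);
    assert(src_lane >= 0);
    Warp out{};
    for (int lane = 0; lane < kWarpSize; ++lane) {
        const int base = lane - (lane % width);    // first lane of this subsection
        out[lane] = var[base + (src_lane % width)];
    }
    return out;
}
```

With lane i holding the value i and width = 8, a source index of 3 broadcasts lane 3, 11, 19 and 27 within their respective subsections. A source index of 10 is reduced to 2, so lane 20 reads from lane 18.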
Parameters
mask
- unsigned int. Is only being read.
var
- nv_bfloat162. Is only being read.
delta
- int. Is only being read.
width
- int. Is only being read.
Returns
Returns the 4-byte word referenced by var from the source thread ID as nv_bfloat162. If the
source thread ID is out of range or the source thread has exited, the calling thread's own var is
returned.
Description
Returns the value of var held by the thread whose ID is given by delta. If width is less than
warpSize then each subsection of the warp behaves as a separate entity with a starting logical
thread ID of 0. If delta is outside the range [0:width-1], the value returned is the value of var
held by the thread whose ID is delta modulo width (i.e. within the same subsection). width must
have a value which is a power of 2; results are undefined if width is not a power of 2, or is a
number greater than warpSize.
Note:
For more details for this function see the Warp Shuffle Functions section in the CUDA C++
Programming Guide.
Parameters
mask
- unsigned int. Is only being read.
var
- nv_bfloat16. Is only being read.
delta
- int. Is only being read.
width
- int. Is only being read.
Returns
Returns the 2-byte word referenced by var from the source thread ID as nv_bfloat16. If the
source thread ID is out of range or the source thread has exited, the calling thread's own var is
returned.
Description
Calculates a source thread ID by subtracting delta from the caller's lane ID. The value of var
held by the resulting lane ID is returned: in effect, var is shifted up the warp by delta threads.
If width is less than warpSize then each subsection of the warp behaves as a separate entity
with a starting logical thread ID of 0. The source thread index will not wrap around the value of
width, so effectively the lower delta threads will be unchanged. width must have a value which
is a power of 2; results are undefined if width is not a power of 2, or is a number greater than
warpSize.
Note:
For more details for this function see the Warp Shuffle Functions section in the CUDA C++
Programming Guide.
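The shift-up pattern mirrors the shift-down one: the source lane is lower than the caller, and the lowest delta lanes of each subsection are left unchanged. A host-side emulation, assuming all 32 lanes are active (the mask argument is omitted) and a non-negative delta, might look like this. The names Warp and shfl_up are hypothetical.

```cpp
#include <array>
#include <cassert>

constexpr int kWarpSize = 32;                  // assumed warp size
using Warp = std::array<float, kWarpSize>;     // one value of var per lane

// Emulates shifting var up the warp by delta lanes within subsections of width.
inline Warp shfl_up(const Warp& var, int delta, int width = kWarpSize) {
    assert(width > 0 && width <= kWarpSize && (width & (width - 1)) == 0);
    assert(delta >= 0);
    Warp out{};
    for (int lane = 0; lane < kWarpSize; ++lane) {
        const int base = lane - (lane % width);    // first lane of this subsection
        const int src = lane - delta;              // source lane, no wrap-around
        // Lanes whose source falls below the subsection keep their own value.
        out[lane] = (src >= base) ? var[src] : var[lane];
    }
    return out;
}
```

This pattern is the building block of an inclusive prefix sum: repeating it with delta = 1, 2, 4, ... and adding the received value when the source lane is valid accumulates all lower lanes.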
Parameters
mask
- unsigned int. Is only being read.
var
- nv_bfloat162. Is only being read.
delta
- int. Is only being read.
width
- int. Is only being read.
Returns
Returns the 4-byte word referenced by var from the source thread ID as nv_bfloat162. If the
source thread ID is out of range or the source thread has exited, the calling thread's own var is
returned.
Description
Calculates a source thread ID by subtracting delta from the caller's lane ID. The value of var
held by the resulting lane ID is returned: in effect, var is shifted up the warp by delta threads.
If width is less than warpSize then each subsection of the warp behaves as a separate entity
with a starting logical thread ID of 0. The source thread index will not wrap around the value of
width, so effectively the lower delta threads will be unchanged. width must have a value which
is a power of 2; results are undefined if width is not a power of 2, or is a number greater than
warpSize.
Note:
For more details for this function see the Warp Shuffle Functions section in the CUDA C++
Programming Guide.
Parameters
mask
- unsigned int. Is only being read.
var
- nv_bfloat16. Is only being read.
delta
- int. Is only being read.
width
- int. Is only being read.
Returns
Returns the 2-byte word referenced by var from the source thread ID as nv_bfloat16. If the
source thread ID is out of range or the source thread has exited, the calling thread's own var is
returned.
Description
Calculates a source thread ID by performing a bitwise XOR of the caller's thread ID with delta:
the value of var held by the resulting thread ID is returned. If width is less than warpSize then
each group of width consecutive threads is able to access elements from earlier groups of
threads; however, if they attempt to access elements from later groups of threads, their own
value of var will be returned. This mode implements a butterfly addressing pattern such as is
used in tree reduction and broadcast.
Note:
For more details for this function see the Warp Shuffle Functions section in the CUDA C++
Programming Guide.
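The butterfly pattern mentioned above is typically used for a warp-wide reduction in which every lane ends up with the total. The host-side sketch below emulates the XOR shuffle and a sum built on it, assuming all 32 lanes are active (the participation mask is omitted). The names Warp, shfl_xor, lane_mask and butterfly_sum are hypothetical, with lane_mask standing for the value that is XORed with the lane ID.

```cpp
#include <array>
#include <cassert>

constexpr int kWarpSize = 32;                  // assumed warp size
using Warp = std::array<float, kWarpSize>;     // one value of var per lane

// Each lane reads var from lane (lane XOR lane_mask).
inline Warp shfl_xor(const Warp& var, int lane_mask, int width = kWarpSize) {
    assert(width > 0 && width <= kWarpSize && (width & (width - 1)) == 0);
    assert(lane_mask >= 0 && lane_mask < kWarpSize);
    Warp out{};
    for (int lane = 0; lane < kWarpSize; ++lane) {
        const int base = lane - (lane % width);    // first lane of this group
        const int src = lane ^ lane_mask;
        // Earlier groups are readable; a source in a later group yields own value.
        out[lane] = (src < base + width) ? var[src] : var[lane];
    }
    return out;
}

// Butterfly reduction: after log2(32) = 5 steps every lane holds the full sum.
inline Warp butterfly_sum(Warp v) {
    for (int offset = kWarpSize / 2; offset > 0; offset /= 2) {
        const Warp partner = shfl_xor(v, offset);
        for (int lane = 0; lane < kWarpSize; ++lane) v[lane] += partner[lane];
    }
    return v;
}
```

With lane i holding the value i, every lane of the result holds 0 + 1 + ... + 31 = 496.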
Parameters
mask
- unsigned int. Is only being read.
var
- nv_bfloat162. Is only being read.
delta
- int. Is only being read.
width
- int. Is only being read.
Returns
Returns the 4-byte word referenced by var from the source thread ID as nv_bfloat162. If the
source thread ID is out of range or the source thread has exited, the calling thread's own var is
returned.
Description
Calculates a source thread ID by performing a bitwise XOR of the caller's thread ID with delta:
the value of var held by the resulting thread ID is returned. If width is less than warpSize then
each group of width consecutive threads is able to access elements from earlier groups of
threads; however, if they attempt to access elements from later groups of threads, their own
value of var will be returned. This mode implements a butterfly addressing pattern such as is
used in tree reduction and broadcast.
Note:
For more details for this function see the Warp Shuffle Functions section in the CUDA C++
Programming Guide.
Parameters
i
- short int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the signed short integer value i to a nv_bfloat16 floating-point value in round-down
mode.
Parameters
i
- short int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the signed short integer value i to a nv_bfloat16 floating-point value in round-to-
nearest-even mode.
Parameters
i
- short int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the signed short integer value i to a nv_bfloat16 floating-point value in round-up
mode.
Parameters
i
- short int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the signed short integer value i to a nv_bfloat16 floating-point value in round-
towards-zero mode.
Parameters
i
- short int. Is only being read.
Returns
nv_bfloat16
Description
Reinterprets the bits in the signed short integer i as a nv_bfloat16 floating-point number.
Parameters
ptr
- memory location
value
- the value to be stored
Parameters
ptr
- memory location
value
- the value to be stored
Parameters
ptr
- memory location
value
- the value to be stored
Parameters
ptr
- memory location
value
- the value to be stored
Parameters
ptr
- memory location
value
- the value to be stored
Parameters
ptr
- memory location
value
- the value to be stored
Parameters
ptr
- memory location
value
- the value to be stored
Parameters
ptr
- memory location
value
- the value to be stored
Parameters
i
- unsigned int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the unsigned integer value i to a nv_bfloat16 floating-point value in round-down
mode.
Parameters
i
- unsigned int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the unsigned integer value i to a nv_bfloat16 floating-point value in round-to-nearest-
even mode.
Parameters
i
- unsigned int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the unsigned integer value i to a nv_bfloat16 floating-point value in round-up mode.
Parameters
i
- unsigned int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the unsigned integer value i to a nv_bfloat16 floating-point value in round-towards-
zero mode.
Parameters
i
- unsigned long long int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the unsigned 64-bit integer value i to a nv_bfloat16 floating-point value in round-
down mode.
Parameters
i
- unsigned long long int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the unsigned 64-bit integer value i to a nv_bfloat16 floating-point value in round-to-
nearest-even mode.
Parameters
i
- unsigned long long int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the unsigned 64-bit integer value i to a nv_bfloat16 floating-point value in round-up
mode.
Parameters
i
- unsigned long long int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the unsigned 64-bit integer value i to a nv_bfloat16 floating-point value in round-
towards-zero mode.
Parameters
i
- unsigned short int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the unsigned short integer value i to a nv_bfloat16 floating-point value in round-down
mode.
Parameters
i
- unsigned short int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the unsigned short integer value i to a nv_bfloat16 floating-point value in round-to-
nearest-even mode.
Parameters
i
- unsigned short int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the unsigned short integer value i to a nv_bfloat16 floating-point value in round-up
mode.
Parameters
i
- unsigned short int. Is only being read.
Returns
nv_bfloat16
‣ i converted to nv_bfloat16.
Description
Convert the unsigned short integer value i to a nv_bfloat16 floating-point value in round-
towards-zero mode.
Parameters
i
- unsigned short int. Is only being read.
Returns
nv_bfloat16
Description
Reinterprets the bits in the unsigned short integer i as a nv_bfloat16 floating-point number.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
Description
Compute the smallest integer value not less than h.
Parameters
a
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
‣ The cosine of a.
Description
Calculates nv_bfloat16 cosine of input a in round-to-nearest-even mode.
Parameters
a
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
Description
Calculates nv_bfloat16 natural exponential function of input a in round-to-nearest-even
mode.
Parameters
a
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
Description
Calculates nv_bfloat16 decimal exponential function of input a in round-to-nearest-even
mode.
Parameters
a
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
Description
Calculates nv_bfloat16 binary exponential function of input a in round-to-nearest-even
mode.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
Description
Calculate the largest integer value which is less than or equal to h.
Parameters
a
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
Description
Calculates nv_bfloat16 natural logarithm of input a in round-to-nearest-even mode.
Parameters
a
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
Description
Calculates nv_bfloat16 decimal logarithm of input a in round-to-nearest-even mode.
Parameters
a
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
Description
Calculates nv_bfloat16 binary logarithm of input a in round-to-nearest-even mode.
Parameters
a
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
‣ The reciprocal of a.
Description
Calculates nv_bfloat16 reciprocal of input a in round-to-nearest-even mode.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
Description
Round h to the nearest integer value in nv_bfloat16 floating-point format, with halfway
cases rounded to the nearest even integer value.
Parameters
a
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
Description
Calculates nv_bfloat16 reciprocal square root of input a in round-to-nearest-even mode.
Parameters
a
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
‣ The sine of a.
Description
Calculates nv_bfloat16 sine of input a in round-to-nearest-even mode.
Parameters
a
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
Description
Calculates nv_bfloat16 square root of input a in round-to-nearest-even mode.
Parameters
h
- nv_bfloat16. Is only being read.
Returns
nv_bfloat16
Description
Round h to the nearest integer value that does not exceed h in magnitude.
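The ceiling, floor, round-to-integer and truncate operations described in this group all return an integral value in floating-point format and differ only in direction. The host-side sketch below shows the four results side by side using the standard C++ math functions as stand-ins. The struct and function names are hypothetical, and std::nearbyint is assumed to run under the default round-to-nearest rounding mode.

```cpp
#include <cassert>
#include <cmath>

// The four integral roundings of one input, kept in floating-point format.
struct Rounded {
    float up;            // smallest integer value not less than h
    float down;          // largest integer value not greater than h
    float nearest_even;  // nearest integer, halfway cases to even
    float toward_zero;   // nearest integer not exceeding h in magnitude
};

inline Rounded round_all(float h) {
    return {std::ceil(h), std::floor(h), std::nearbyint(h), std::trunc(h)};
}
```

For h = -2.5 the results are -2 (ceiling), -3 (floor), -2 (nearest, ties to even) and -2 (truncate). For h = 3.5 the nearest-even result is 4, and for h = 0.5 it is 0.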
Parameters
h
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
For each component of vector h compute the smallest integer value not less than h.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Calculates nv_bfloat162 cosine of input vector a in round-to-nearest-even mode.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Calculates nv_bfloat162 exponential function of input vector a in round-to-nearest-even
mode.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Calculates nv_bfloat162 decimal exponential function of input vector a in round-to-
nearest-even mode.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Calculates nv_bfloat162 binary exponential function of input vector a in round-to-nearest-
even mode.
Parameters
h
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
For each component of vector h calculate the largest integer value which is less than or equal
to h.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Calculates nv_bfloat162 natural logarithm of input vector a in round-to-nearest-even
mode.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Calculates nv_bfloat162 decimal logarithm of input vector a in round-to-nearest-even
mode.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Calculates nv_bfloat162 binary logarithm of input vector a in round-to-nearest mode.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Calculates nv_bfloat162 reciprocal of input vector a in round-to-nearest-even mode.
Parameters
h
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Round each component of nv_bfloat162 vector h to the nearest integer value in nv_bfloat16
floating-point format, with halfway cases rounded to the nearest even integer value.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Calculates nv_bfloat162 reciprocal square root of input vector a in round-to-nearest-even
mode.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Calculates nv_bfloat162 sine of input vector a in round-to-nearest-even mode.
Parameters
a
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
Description
Calculates nv_bfloat162 square root of input vector a in round-to-nearest mode.
Parameters
h
- nv_bfloat162. Is only being read.
Returns
nv_bfloat162
‣ The truncated h.
Description
Round each component of vector h to the nearest integer value that does not exceed h in
magnitude.
Returns
Result will be in radians, in the interval [0, π] for x inside [-1, +1].
Description
Calculate the principal value of the arc cosine of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Result will be in the interval [0, +∞].
‣ acoshf(1) returns 0.
‣ acoshf(x) returns NaN for x in the interval [-∞, 1).
‣ acoshf(+∞) returns +∞.
Description
Calculate the nonnegative inverse hyperbolic cosine of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Result will be in radians, in the interval [-π/2, +π/2] for x inside [-1, +1].
‣ asinf(±0) returns ±0.
Description
Calculate the principal value of the arc sine of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ asinhf(±0) returns ±0.
‣ asinhf(±∞) returns ±∞.
Description
Calculate the inverse hyperbolic sine of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Result will be in radians, in the interval [-π, +π].
Description
Calculate the principal value of the arc tangent of the ratio of first and second input arguments
y / x. The quadrant of the result is determined by the signs of inputs y and x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Result will be in radians, in the interval [-π/2, +π/2].
‣ atanf(±0) returns ±0.
Description
Calculate the principal value of the arc tangent of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ atanhf(±0) returns ±0.
‣ atanhf(±1) returns ±∞.
Description
Calculate the inverse hyperbolic tangent of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns x^(1/3).
‣ cbrtf(±0) returns ±0.
‣ cbrtf(±∞) returns ±∞.
Description
Calculate the cube root of x, x^(1/3).
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns ⌈x⌉ expressed as a floating-point number.
‣ ceilf(±0) returns ±0.
‣ ceilf(±∞) returns ±∞.
Description
Compute the smallest integer value not less than x.
Returns
Returns a value with the magnitude of x and the sign of y.
Description
Create a floating-point value with the magnitude x and the sign of y.
Returns
‣ cosf(±0) returns 1.
Description
Calculate the cosine of the input argument x (measured in radians).
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
‣ This function is affected by the --use_fast_math compiler flag. See the CUDA C++
Programming Guide, Mathematical Functions Appendix, Intrinsic Functions section for a
complete list of functions affected.
Returns
‣ coshf(±0) returns 1.
‣ coshf(±∞) returns +∞.
Description
Calculate the hyperbolic cosine of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ cospif(±0) returns 1.
Description
Calculate the cosine of x × π (measured in radians), where x is the input argument.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns the value of the regular modified cylindrical Bessel function of order 0.
Description
Calculate the value of the regular modified cylindrical Bessel function of order 0 for the input
argument x, I_0(x).
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns the value of the regular modified cylindrical Bessel function of order 1.
Description
Calculate the value of the regular modified cylindrical Bessel function of order 1 for the input
argument x, I_1(x).
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ erfcf(-∞) returns 2.
Description
Calculate the complementary error function of the input argument x, 1 - erf(x).
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ erfcinvf(±0) returns +∞.
‣ erfcinvf(2) returns -∞.
Description
Calculate the inverse complementary error function erfc^(-1)(x), of the input argument x in the
interval [0, 2].
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ erfcxf(-∞) returns +∞.
Description
Calculate the scaled complementary error function of the input argument x, e^(x²) · erfc(x).
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ erff(±0) returns ±0.
‣ erff(±∞) returns ±1.
Description
Calculate the value of the error function for the input argument x, (2/√π) ∫_0^x e^(-t²) dt.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ erfinvf(±0) returns ±0.
‣ erfinvf(1) returns +∞.
‣ erfinvf(-1) returns -∞.
Description
Calculate the inverse error function erf^(-1)(x), of the input argument x in the interval [-1, 1].
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ exp10f(±0) returns 1.
‣ exp10f(+∞) returns +∞.
Description
Calculate 10^x, the base 10 exponential of the input argument x.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
‣ This function is affected by the --use_fast_math compiler flag. See the CUDA C++
Programming Guide, Mathematical Functions Appendix, Intrinsic Functions section for a
complete list of functions affected.
Returns
‣ exp2f(±0) returns 1.
‣ exp2f(+∞) returns +∞.
Description
Calculate 2^x, the base 2 exponential of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ expf(±0) returns 1.
‣ expf(+∞) returns +∞.
Description
Calculate e^x, the base e exponential of the input argument x.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
‣ This function is affected by the --use_fast_math compiler flag. See the CUDA C++
Programming Guide, Mathematical Functions Appendix, Intrinsic Functions section for a
complete list of functions affected.
Returns
‣ expm1f(±0) returns ±0.
‣ expm1f(+∞) returns +∞.
Description
Calculate e^x - 1, the base e exponential of the input argument x, minus 1.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns the absolute value of its argument.
‣ fabsf(±∞) returns +∞.
Description
Calculate the absolute value of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns the positive difference between x and y.
Description
Compute the positive difference between x and y. The positive difference is x - y when x > y
and +0 otherwise.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns x / y.
Description
Compute x divided by y. If --use_fast_math is specified, use __fdividef() for higher
performance, otherwise use normal division.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
‣ This function is affected by the --use_fast_math compiler flag. See the CUDA C++
Programming Guide, Mathematical Functions Appendix, Intrinsic Functions section for a
complete list of functions affected.
Returns
Returns ⌊x⌋ expressed as a floating-point number.
‣ floorf(±∞) returns ±∞.
‣ floorf(±0) returns ±0.
Description
Calculate the largest integer value which is less than or equal to x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns the rounded value of x × y + z as a single operation.
Description
Compute the value of x × y + z as a single ternary operation. After computing the value to
infinite precision, the value is rounded once.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
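To illustrate the single rounding that fmaf() performs, the following host-side C++ sketch compares it with a multiply followed by a separate add. It assumes the standard <math.h> fmaf() behaves like the device function described above; mul_then_add and fused_mul_add are hypothetical helper names, not part of the CUDA Math API.

```cpp
#include <math.h>

// Hypothetical helper: multiply, round to float, then add (two roundings).
// The volatile store forces the product to be rounded to float before the
// addition, so the compiler cannot fuse the two steps into one operation.
float mul_then_add(float x, float y, float z) {
    volatile float product = x * y;
    return product + z;
}

// Hypothetical helper: x * y + z computed exactly and rounded once.
float fused_mul_add(float x, float y, float z) {
    return fmaf(x, y, z);
}
```

With x = y = 1 + 2^-12 and z = -(1 + 2^-11), the exact product is 1 + 2^-11 + 2^-24. Rounding the product first drops the 2^-24 term, so mul_then_add() yields 0, whereas fused_mul_add() keeps it and yields 2^-24.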
Returns
Returns the maximum numeric value of the arguments x and y.
Description
Determines the maximum numeric value of the arguments x and y. Treats NaN arguments as
missing data. If one argument is a NaN and the other is a legitimate numeric value, the numeric
value is chosen.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns the minimum numeric value of the arguments x and y.
Description
Determines the minimum numeric value of the arguments x and y. Treats NaN arguments as
missing data. If one argument is a NaN and the other is a legitimate numeric value, the numeric
value is chosen.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
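Because fmaxf() and fminf() treat a NaN argument as missing data, they can be used to reduce a buffer that contains gaps. The following host-side C++ sketch shows the idea with the standard <math.h> fminf(), assumed here to match the device behavior described above; min_ignoring_nan is a hypothetical helper.

```cpp
#include <math.h>

// Hypothetical helper: smallest non-NaN entry of a buffer.
// Relies on fminf() returning the numeric argument when the other is NaN.
// Returns NaN only if the buffer is empty or holds nothing but NaNs.
float min_ignoring_nan(const float* data, int n) {
    float result = NAN;  // "missing" until a numeric value is seen
    for (int i = 0; i < n; ++i) {
        result = fminf(result, data[i]);
    }
    return result;
}
```

For the input {3, NaN, -2, 7} the helper returns -2; the NaN entry never replaces a numeric value that has already been seen.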
Returns
‣ Returns the floating-point remainder of x / y.
‣ fmodf(±0, y) returns ±0 if y is not zero.
Description
Calculate the floating-point remainder of x / y. The floating-point remainder of the division
operation x / y calculated by this function is exactly the value x - n*y, where n is x / y
with its fractional part truncated. The computed value will have the same sign as x, and its
magnitude will be less than the magnitude of y.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
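The definition of fmodf() as x - n*y, with n being x / y truncated toward zero, can be checked on small operands. This host-side C++ sketch uses the standard <math.h> fmodf(), assumed to follow the same definition as the device function; fmod_by_definition is a hypothetical reference helper.

```cpp
#include <math.h>

// Hypothetical reference: x - n*y with n = x / y truncated toward zero.
// Only meant for small, exactly representable operands; for large
// quotients this naive formula loses accuracy, while fmodf() does not.
float fmod_by_definition(float x, float y) {
    float n = truncf(x / y);
    return x - n * y;
}
```

For example, fmodf(7.5f, 2.0f) is 1.5 and fmodf(-7.5f, 2.0f) is -1.5: the result takes the sign of x, and its magnitude stays below that of y.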
Returns
Returns the fractional component m.
‣ frexpf(±0, nptr) returns ±0 and stores zero in the location pointed to by nptr.
‣ frexpf(±∞, nptr) returns ±∞ and stores an unspecified value in the location to which
nptr points.
‣ frexpf(NaN, nptr) returns a NaN and stores an unspecified value in the location to which
nptr points.
Description
Decomposes the floating-point value x into a component m for the normalized fraction element
and another term n for the exponent. The absolute value of m will be greater than or equal to
0.5 and less than 1.0 or it will be equal to 0; x = m · 2^n. The integer exponent n will be stored in
the location to which nptr points.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
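A short host-side C++ sketch of the frexpf() decomposition follows. It uses the standard <math.h> frexpf() and ldexpf(), assumed to match the device functions; the Decomposed struct and the split/rebuild helpers are hypothetical names introduced for illustration.

```cpp
#include <math.h>

// Hypothetical container for the two outputs of frexpf().
struct Decomposed {
    float m;  // normalized fraction, 0.5 <= |m| < 1.0, or 0
    int n;    // integer exponent
};

// Split x into fraction and exponent.
Decomposed split(float x) {
    Decomposed d;
    d.m = frexpf(x, &d.n);
    return d;
}

// Recombine the two parts: m scaled by 2 to the power n.
float rebuild(Decomposed d) {
    return ldexpf(d.m, d.n);
}
```

For x = 48 the fraction is 0.75 and the exponent is 6, since 0.75 × 2^6 = 48; rebuild(split(x)) returns x unchanged for finite x.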
Returns
Returns the length of the hypotenuse √(x² + y²).
Description
Calculates the length of the hypotenuse of a right triangle whose two sides have lengths x and
y without undue overflow or underflow.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
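The phrase "without undue overflow or underflow" in the hypotf() description can be made concrete by comparing it with the textbook formula. The host-side C++ sketch below assumes the standard <math.h> hypotf() shares this property with the device function; naive_hypot is a hypothetical helper.

```cpp
#include <math.h>

// Hypothetical helper: textbook formula with the sum of squares held in a float.
// The intermediate overflows to +infinity once |x| or |y| exceeds roughly 1.8e19,
// even though the true hypotenuse is still representable.
float naive_hypot(float x, float y) {
    volatile float sum_of_squares = x * x + y * y;
    return sqrtf(sum_of_squares);
}
```

For x = 3e20 and y = 4e20, naive_hypot() returns +infinity because the squares exceed the float range, while hypotf() returns a finite value close to 5e20.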
Returns
‣ If successful, returns the unbiased exponent of the argument.
‣ ilogbf(±0) returns INT_MIN.
‣ Note: above behavior does not take into account FP_ILOGB0 nor FP_ILOGBNAN.
Description
Calculates the unbiased integer exponent of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ With Visual Studio 2013 host compiler: __RETURN_TYPE is 'bool'. Returns true if and only
if a is a finite value.
‣ With other host compilers: __RETURN_TYPE is 'int'. Returns a nonzero value if and only if
a is a finite value.
Description
Determine whether the floating-point value a is a finite value (zero, subnormal, or normal and
not infinity or NaN).
Returns
‣ With Visual Studio 2013 host compiler: __RETURN_TYPE is 'bool'. Returns true if and only
if a is an infinite value.
‣ With other host compilers: __RETURN_TYPE is 'int'. Returns a nonzero value if and only if
a is an infinite value.
Description
Determine whether the floating-point value a is an infinite value (positive or negative).
Returns
‣ With Visual Studio 2013 host compiler: __RETURN_TYPE is 'bool'. Returns true if and only
if a is a NaN value.
‣ With other host compilers: __RETURN_TYPE is 'int'. Returns a nonzero value if and only if
a is a NaN value.
Description
Determine whether the floating-point value a is a NaN.
Returns
Returns the value of the Bessel function of the first kind of order 0.
Description
Calculate the value of the Bessel function of the first kind of order 0 for the input argument x,
J_0(x).
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns the value of the Bessel function of the first kind of order 1.
‣ j1f(±0) returns ±0.
‣ j1f(±∞) returns ±0.
Description
Calculate the value of the Bessel function of the first kind of order 1 for the input argument x,
J_1(x).
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns the value of the Bessel function of the first kind of order n.
Description
Calculate the value of the Bessel function of the first kind of order n for the input argument x,
J_n(x).
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ ldexpf(x, exp) is equivalent to scalbnf(x, exp).
Description
Calculate the value of x · 2^exp of the input arguments x and exp.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ lgammaf(1) returns +0.
‣ lgammaf(2) returns +0.
‣ lgammaf(x) returns +∞ if x ≤ 0 and x is an integer.
‣ lgammaf( ) returns .
‣ lgammaf( ) returns .
Description
Calculate the natural logarithm of the absolute value of the gamma function of the input
argument x, namely the value of log_e |∫_0^∞ e^(-t) t^(x-1) dt|.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns rounded integer value.
Description
Round x to the nearest integer value, with halfway cases rounded to the nearest even integer
value. If the result is outside the range of the return type, the behavior is undefined.
Returns
Returns rounded integer value.
Description
Round x to the nearest integer value, with halfway cases rounded away from zero. If the result
is outside the range of the return type, the behavior is undefined.
Note:
This function may be slower than alternate rounding methods. See llrintf().
Returns
‣ log10f(±0) returns -∞.
Description
Calculate the base 10 logarithm of the input argument x.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
‣ This function is affected by the --use_fast_math compiler flag. See the CUDA C++
Programming Guide, Mathematical Functions Appendix, Intrinsic Functions section for a
complete list of functions affected.
Returns
‣ log1pf(±0) returns ±0.
‣ log1pf(-1) returns -∞.
Description
Calculate the value of log_e(1 + x) of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ log2f(±0) returns -∞.
Description
Calculate the base 2 logarithm of the input argument x.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
‣ This function is affected by the --use_fast_math compiler flag. See the CUDA C++
Programming Guide, Mathematical Functions Appendix, Intrinsic Functions section for a
complete list of functions affected.
Returns
‣ logbf(±0) returns -∞.
‣ logbf(±∞) returns +∞.
Description
Calculate the floating-point representation of the exponent of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ logf(±0) returns -∞.
Description
Calculate the natural logarithm of the input argument x.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
‣ This function is affected by the --use_fast_math compiler flag. See the CUDA C++
Programming Guide, Mathematical Functions Appendix, Intrinsic Functions section for a
complete list of functions affected.
Returns
Returns rounded integer value.
Description
Round x to the nearest integer value, with halfway cases rounded to the nearest even integer
value. If the result is outside the range of the return type, the behavior is undefined.
Returns
Returns rounded integer value.
Description
Round x to the nearest integer value, with halfway cases rounded away from zero. If the result
is outside the range of the return type, the behavior is undefined.
Note:
This function may be slower than alternate rounding methods. See lrintf().
Description
Calculate the maximum value of the arguments a and b. Behavior is equivalent to fmaxf()
function.
Note: this is different from the std:: specification.
Description
Calculate the minimum value of the arguments a and b. Behavior is equivalent to fminf()
function.
Note: this is different from the std:: specification.
Returns
‣ modff(±x, iptr) returns a result with the same sign as x.
‣ modff(NaN, iptr) stores a NaN in the object pointed to by iptr and returns a NaN.
Description
Break down the argument x into fractional and integral parts. The integral part is stored in the
argument iptr. Fractional and integral parts are given the same sign as the argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
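The sign convention of modff() is easiest to see on a negative input. The following host-side C++ sketch uses the standard <math.h> modff(), assumed to match the device function; the Parts struct and split_parts helper are hypothetical.

```cpp
#include <math.h>

// Hypothetical container for the two outputs of modff().
struct Parts {
    float integral;    // stored through the iptr argument
    float fractional;  // return value of modff()
};

// Break x into integral and fractional parts, both carrying the sign of x.
Parts split_parts(float x) {
    Parts p;
    p.fractional = modff(x, &p.integral);
    return p;
}
```

For x = -3.25 the integral part is -3 and the fractional part is -0.25, so both parts carry the sign of x and their sum gives x back.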
Returns
‣ nanf(tagp) returns NaN.
Description
Return a representation of a quiet NaN. Argument tagp selects one of the possible
representations.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ nearbyintf(±0) returns ±0.
‣ nearbyintf(±∞) returns ±∞.
Description
Round argument x to an integer value in single precision floating-point format. Uses round to
nearest rounding, with ties rounding to even.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ nextafterf(x, y) = y if x equals y.
‣ nextafterf(x, y) = NaN if either x or y is NaN.
Description
Calculate the next representable single-precision floating-point value following x in
the direction of y. For example, if y is greater than x, nextafterf() returns the smallest
representable number greater than x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
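One common use of nextafterf() is measuring the spacing of single-precision values around a given point. The host-side C++ sketch below uses the standard <math.h> nextafterf(), assumed to match the device function; ulp_above is a hypothetical helper.

```cpp
#include <math.h>

// Hypothetical helper: gap between a finite x and the next float above it.
float ulp_above(float x) {
    return nextafterf(x, INFINITY) - x;
}
```

At x = 1 the gap above is 2^-23, while stepping toward 0 with nextafterf(1.0f, 0.0f) gives 1 - 2^-24, because the spacing halves just below a power of two.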
Returns
Returns the length of the 3D vector √(a² + b² + c²).
‣ In the presence of an exactly infinite coordinate +∞ is returned, even if there are NaNs.
Description
Calculates the length of three dimensional vector in Euclidean space without undue overflow
or underflow.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns the length of the 4D vector √(a² + b² + c² + d²).
‣ In the presence of an exactly infinite coordinate +∞ is returned, even if there are NaNs.
Description
Calculates the length of four dimensional vector in Euclidean space without undue overflow or
underflow.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ normcdff(+∞) returns 1.
‣ normcdff(-∞) returns +0.
Description
Calculate the cumulative distribution function of the standard normal distribution for input
argument x, Φ(x).
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ normcdfinvf(±0) returns -∞.
‣ normcdfinvf(1) returns +∞.
Description
Calculate the inverse of the standard normal cumulative distribution function for input
argument x, Φ^(-1)(x). The function is defined for input values in the interval (0, 1).
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns the length of the dim-D vector √(p_0² + p_1² + ... + p_(dim-1)²).
‣ In the presence of an exactly infinite coordinate +∞ is returned, even if there are NaNs.
Description
Calculates the length of a vector p, dimension of which is passed as an argument without
undue overflow or underflow.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ powf(±0, y) returns ±∞ for y an odd integer less than 0.
‣ powf(-1, ±∞) returns 1.
Description
Calculate the value of x to the power of y.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
‣ This function is affected by the --use_fast_math compiler flag. See the CUDA C++
Programming Guide, Mathematical Functions Appendix, Intrinsic Functions section for a
complete list of functions affected.
Returns
‣ rcbrtf(±0) returns ±∞.
‣ rcbrtf(±∞) returns ±0.
Description
Calculate reciprocal cube root function of x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ remainderf(x, ±0) returns NaN.
Description
Compute single-precision floating-point remainder r of dividing x by y for nonzero y. Thus
r = x - n·y. The value n is the integer value nearest x / y. In the case when |n - x / y| = 1/2, the even n
value is chosen.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
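The difference between remainderf() and fmodf() lies in how the integer multiple of y is chosen: remainderf() rounds x / y to the nearest integer, with ties going to the even integer, so its result can be negative for positive operands. The host-side C++ sketch below uses the standard <math.h> functions, assumed to match the device functions; remainder_by_definition is a hypothetical reference helper.

```cpp
#include <math.h>

// Hypothetical reference: x - n*y with n = x / y rounded to the nearest
// integer, ties to even. rintf() provides that rounding under the default
// rounding mode. Only meant for small, exactly representable operands.
float remainder_by_definition(float x, float y) {
    float n = rintf(x / y);
    return x - n * y;
}
```

For x = 7.5 and y = 2, the nearest integer to 3.75 is 4, so remainderf() gives -0.5 where fmodf() gives 1.5. For x = 5 and y = 2 the quotient 2.5 is a tie, the even value 2 is chosen, and the result is 1.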
Returns
Returns the remainder.
‣ remquof(x, ±0, quo) returns NaN and stores an unspecified value in the location to which
quo points.
‣ remquof(±∞, y, quo) returns NaN and stores an unspecified value in the location to
which quo points.
‣ remquof(x, y, quo) returns NaN and stores an unspecified value in the location to which
quo points if either of x or y is NaN.
‣ remquof(x, ±∞, quo) returns x and stores zero in the location to which quo points for
finite x.
Description
Compute a single-precision floating-point remainder in the same way as the remainderf()
function. Argument quo returns part of quotient upon division of x by y. Value quo has the
same sign as x / y and may not be the exact quotient but agrees with the exact quotient in the
low order 3 bits.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
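Since remquof() only guarantees the low-order three bits of the quotient, portable code should mask the stored value rather than use it as the full quotient. The host-side C++ sketch below uses the standard <math.h> remquof(), assumed to match the device function; quotient_low3 is a hypothetical helper.

```cpp
#include <math.h>

// Hypothetical helper: low three bits of the magnitude of the rounded
// quotient x / y, which is the part of quo that remquof() guarantees.
int quotient_low3(float x, float y) {
    int quo = 0;
    (void)remquof(x, y, &quo);  // the remainder itself is not needed here
    unsigned magnitude = quo < 0 ? 0u - (unsigned)quo : (unsigned)quo;
    return (int)(magnitude & 7u);
}
```

For x = 100 and y = 3 the rounded quotient is 33, whose low three bits are 1. For x = 11 and y = 3 the rounded quotient is 4 and the remainder is -1.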
Returns
Returns one over the length of the hypotenuse, 1 / √(x² + y²).
Description
Calculates one over the length of the hypotenuse of a right triangle whose two sides have
lengths x and y without undue overflow or underflow.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns rounded integer value.
‣ rintf(±0) returns ±0.
‣ rintf(±∞) returns ±∞.
Description
Round x to the nearest integer value in floating-point format, with halfway cases rounded to
the nearest even integer value.
Returns
Returns one over the length of the 3D vector, 1 / √(a² + b² + c²).
‣ In the presence of an exactly infinite coordinate +0 is returned, even if there are NaNs.
Description
Calculates one over the length of three dimensional vector in Euclidean space without undue
overflow or underflow.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns one over the length of the 4D vector, 1 / √(a² + b² + c² + d²).
‣ In the presence of an exactly infinite coordinate +0 is returned, even if there are NaNs.
Description
Calculates one over the length of four dimensional vector in Euclidean space without undue
overflow or underflow.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns one over the length of the vector, 1 / √(p_0² + p_1² + ... + p_(dim-1)²).
‣ In the presence of an exactly infinite coordinate +0 is returned, even if there are NaNs.
Description
Calculates one over the length of vector p, dimension of which is passed as an argument, in
Euclidean space without undue overflow or underflow.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns rounded integer value.
‣ roundf(±0) returns ±0.
‣ roundf(±∞) returns ±∞.
Description
Round x to the nearest integer value in floating-point format, with halfway cases rounded
away from zero.
Note:
This function may be slower than alternate rounding methods. See rintf().
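The two rounding functions rintf() and roundf() differ only on halfway cases. The host-side C++ sketch below makes this visible using the standard <math.h> functions, assumed to match the device functions and to run under the default round-to-nearest mode; the Rounded struct and round_both helper are hypothetical.

```cpp
#include <math.h>

// Hypothetical container holding both rounding results for one input.
struct Rounded {
    float half_even;  // rintf(): halfway cases go to the nearest even integer
    float half_away;  // roundf(): halfway cases go away from zero
};

Rounded round_both(float x) {
    Rounded r = { rintf(x), roundf(x) };
    return r;
}
```

For x = 2.5 the results are 2 and 3, and for x = -2.5 they are -2 and -3. For x = 3.5 both functions return 4, because the even neighbor is also the one farther from zero.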
Returns
Returns 1 / √x.
‣ rsqrtf(±0) returns ±∞.
Description
Calculate the reciprocal of the nonnegative square root of x, 1 / √x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns x * 2^n.
‣ scalblnf(±0, n) returns ±0.
‣ scalblnf(x, 0) returns x.
‣ scalblnf(±∞, n) returns ±∞.
Description
Scale x by 2^n by efficient manipulation of the floating-point exponent.
Returns
Returns x * 2^n.
‣ scalbnf(±0, n) returns ±0.
‣ scalbnf(x, 0) returns x.
‣ scalbnf(±∞, n) returns ±∞.
Description
Scale x by 2^n by efficient manipulation of the floating-point exponent.
Returns
Reports the sign bit of all values including infinities, zeros, and NaNs.
‣ With Visual Studio 2013 host compiler: __RETURN_TYPE is 'bool'. Returns true if and only
if a is negative.
‣ With other host compilers: __RETURN_TYPE is 'int'. Returns a nonzero value if and only if
a is negative.
Description
Determine whether the floating-point value a is negative.
Returns
‣ none
Description
Calculate the sine and cosine of the first input argument x (measured in radians). The results
for sine and cosine are written into the second argument, sptr, and, respectively, third
argument, cptr.
See also:
sinf() and cosf().
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
‣ This function is affected by the --use_fast_math compiler flag. See the CUDA C++
Programming Guide, Mathematical Functions Appendix, Intrinsic Functions section for a
complete list of functions affected.
Returns
‣ none
Description
Calculate the sine and cosine of x × π (measured in radians), where x is the first input
argument. The results for sine and cosine are written into the second argument, sptr, and,
respectively, third argument, cptr.
See also:
sinpif() and cospif().
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ sinf(±0) returns ±0.
Description
Calculate the sine of the input argument x (measured in radians).
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
‣ This function is affected by the --use_fast_math compiler flag. See the CUDA C++
Programming Guide, Mathematical Functions Appendix, Intrinsic Functions section for a
complete list of functions affected.
Returns
‣ sinhf(±0) returns ±0.
‣ sinhf(±∞) returns ±∞.
Description
Calculate the hyperbolic sine of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ sinpif(±0) returns ±0.
Description
Calculate the sine of x × π (measured in radians), where x is the input argument.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns √x.
‣ sqrtf(±0) returns ±0.
‣ sqrtf(+∞) returns +∞.
Description
Calculate the nonnegative square root of x, √x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ tanf(±0) returns ±0.
Description
Calculate the tangent of the input argument x (measured in radians).
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
‣ This function is affected by the --use_fast_math compiler flag. See the CUDA C++
Programming Guide, Mathematical Functions Appendix, Intrinsic Functions section for a
complete list of functions affected.
Returns
‣ tanhf(±0) returns ±0.
‣ tanhf(±∞) returns ±1.
Description
Calculate the hyperbolic tangent of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
‣ tgammaf(±0) returns ±∞.
‣ tgammaf(+∞) returns +∞.
Description
Calculate the gamma function of the input argument x, namely the value of ∫_0^∞ e^(-t) t^(x-1) dt.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns truncated integer value.
‣ truncf(±0) returns ±0.
‣ truncf(±∞) returns ±∞.
Description
Round x to the nearest integer value that does not exceed x in magnitude.
Returns
Returns the value of the Bessel function of the second kind of order 0.
‣ y0f( ) returns .
Description
Calculate the value of the Bessel function of the second kind of order 0 for the input argument
x, $Y_0(x)$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns the value of the Bessel function of the second kind of order 1.
‣ y1f( ) returns .
Description
Calculate the value of the Bessel function of the second kind of order 1 for the input argument
x, $Y_1(x)$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns the value of the Bessel function of the second kind of order n.
Description
Calculate the value of the Bessel function of the second kind of order n for the input argument
x, $Y_n(x)$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Result will be in radians, in the interval [0, π] for x inside [-1, +1].
Description
Calculate the principal value of the arc cosine of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Result will be in the interval [0, +∞].
‣ acosh(1) returns 0.
‣ acosh(x) returns NaN for x in the interval [-∞, 1).
‣ acosh(+∞) returns +∞.
Description
Calculate the nonnegative inverse hyperbolic cosine of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Result will be in radians, in the interval [-π/2, +π/2] for x inside [-1, +1].
‣ asin(±0) returns ±0.
Description
Calculate the principal value of the arc sine of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ asinh(±0) returns ±0.
‣ asinh(±∞) returns ±∞.
Description
Calculate the inverse hyperbolic sine of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Result will be in radians, in the interval [-π/2, +π/2].
‣ atan( ) returns .
Description
Calculate the principal value of the arc tangent of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Result will be in radians, in the interval [-π, +π].
Description
Calculate the principal value of the arc tangent of the ratio of first and second input arguments
y / x. The quadrant of the result is determined by the signs of inputs y and x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
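The quadrant rule above can be illustrated with a small host-side sketch. It uses the C++ standard library functions std::atan2() and std::atan() as a stand-in, on the assumption that the CUDA functions of the same name agree for these inputs; polar_angle() and naive_angle() are hypothetical helper names.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical helper: angle of the point (x, y) in radians, in [-pi, +pi].
double polar_angle(double x, double y) {
    return std::atan2(y, x);
}

// Naive variant: atan(y / x) only sees the ratio, so opposite quadrants collide.
double naive_angle(double x, double y) {
    return std::atan(y / x);
}

void check_atan2_quadrants() {
    const double pi = std::acos(-1.0);
    const double tol = 1e-12;
    // (-1, +1) lies in the second quadrant: 3*pi/4.
    assert(std::fabs(polar_angle(-1.0, 1.0) - 3.0 * pi / 4.0) < tol);
    // (+1, -1) lies in the fourth quadrant: -pi/4.
    assert(std::fabs(polar_angle(1.0, -1.0) + pi / 4.0) < tol);
    // Both points have the ratio y / x == -1, so atan() cannot tell them apart.
    assert(naive_angle(-1.0, 1.0) == naive_angle(1.0, -1.0));
}
```

Passing y and x separately is what lets the function recover the quadrant; the single ratio y / x has already discarded it.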
Returns
‣ atanh(±0) returns ±0.
‣ atanh(±1) returns ±∞.
Description
Calculate the inverse hyperbolic tangent of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns $x^{1/3}$.
‣ cbrt(±0) returns ±0.
‣ cbrt(±∞) returns ±∞.
Description
Calculate the cube root of x, $x^{1/3}$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns $\lceil x \rceil$ expressed as a floating-point number.
‣ ceil(±0) returns ±0.
‣ ceil(±∞) returns ±∞.
Description
Compute the smallest integer value not less than x.
Returns
Returns a value with the magnitude of x and the sign of y.
Description
Create a floating-point value with the magnitude x and the sign of y.
Returns
‣ cos(±0) returns 1.
Description
Calculate the cosine of the input argument x (measured in radians).
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ cosh(±0) returns 1.
‣ cosh(±∞) returns +∞.
Description
Calculate the hyperbolic cosine of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ cospi(±0) returns 1.
Description
Calculate the cosine of x × π (measured in radians), where x is the input argument.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns the value of the regular modified cylindrical Bessel function of order 0.
Description
Calculate the value of the regular modified cylindrical Bessel function of order 0 for the input
argument x, $I_0(x)$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns the value of the regular modified cylindrical Bessel function of order 1.
Description
Calculate the value of the regular modified cylindrical Bessel function of order 1 for the input
argument x, $I_1(x)$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ erf(±0) returns ±0.
‣ erf(±∞) returns ±1.
Description
Calculate the value of the error function for the input argument x, $\frac{2}{\sqrt{\pi}} \int_0^x e^{-t^2}\,dt$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ erfc(-∞) returns 2.
Description
Calculate the complementary error function of the input argument x, 1 - erf(x).
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
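Although erfc(x) equals 1 - erf(x) mathematically, the two differ numerically for large x. The sketch below shows this with the host C++ functions std::erfc() and std::erf(), assumed here to behave like their CUDA namesakes for this input; the helper names are hypothetical.

```cpp
#include <cassert>
#include <cmath>

// Upper-tail value computed two ways for comparison.
double tail_via_erfc(double x) { return std::erfc(x); }
double tail_via_subtraction(double x) { return 1.0 - std::erf(x); }

void check_erfc_tail() {
    // erf(10) rounds to exactly 1.0 in double precision, so the subtraction loses everything.
    assert(tail_via_subtraction(10.0) == 0.0);
    // erfc(10) is roughly 2e-45, which is still a normal double.
    assert(tail_via_erfc(10.0) > 1e-45);
    assert(tail_via_erfc(10.0) < 1e-44);
}
```

Calling the complementary function directly avoids the cancellation that the subtraction suffers once erf(x) is within rounding distance of 1.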
Returns
‣ erfcinv(±0) returns +∞.
‣ erfcinv(2) returns -∞.
Description
Calculate the inverse complementary error function $\operatorname{erfc}^{-1}(x)$, of the input argument x in the
interval [0, 2].
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ erfcx( ) returns .
Description
Calculate the scaled complementary error function of the input argument x, $e^{x^2} \cdot \operatorname{erfc}(x)$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ erfinv(±0) returns ±0.
‣ erfinv(1) returns +∞.
‣ erfinv(-1) returns -∞.
Description
Calculate the inverse error function $\operatorname{erf}^{-1}(x)$, of the input argument x in the interval [-1, 1].
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ exp(±0) returns 1.
‣ exp( ) returns .
Description
Calculate $e^x$, the base e exponential of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ exp10(±0) returns 1.
‣ exp10( ) returns .
Description
Calculate $10^x$, the base 10 exponential of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ exp2(±0) returns 1.
‣ exp2( ) returns .
Description
Calculate $2^x$, the base 2 exponential of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ expm1( ) returns .
‣ expm1( ) returns .
Description
Calculate $e^x - 1$, the base e exponential of the input argument x, minus 1.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns the absolute value of the input argument.
‣ fabs( ) returns .
Description
Calculate the absolute value of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns the positive difference between x and y.
Description
Compute the positive difference between x and y. The positive difference is x - y when x > y
and +0 otherwise.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns $\lfloor x \rfloor$ expressed as a floating-point number.
‣ floor(±∞) returns ±∞.
‣ floor(±0) returns ±0.
Description
Calculates the largest integer value which is less than or equal to x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns the rounded value of $x \times y + z$ as a single operation.
Description
Compute the value of $x \times y + z$ as a single ternary operation. After computing the value to
infinite precision, the value is rounded once.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
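The effect of rounding once instead of twice can be seen in a short host-side sketch. It uses std::fma() from the C++ standard library as a stand-in for the CUDA function, and the input values are chosen for illustration only; both helper names are hypothetical.

```cpp
#include <cassert>
#include <cmath>

// Residual a*a - c with the product rounded to double before the subtraction.
// 'volatile' keeps the compiler from contracting the two steps into an fma.
double residual_two_roundings(double a, double c) {
    volatile double product = a * a;
    return product - c;
}

// Same residual with a single rounding at the end.
double residual_fused(double a, double c) {
    return std::fma(a, a, -c);
}

void check_fma_single_rounding() {
    const double a = 1.0 + std::ldexp(1.0, -27);  // 1 + 2^-27
    const double c = 1.0 + std::ldexp(1.0, -26);  // 1 + 2^-26
    // Exact a*a = 1 + 2^-26 + 2^-54; the last term is lost when the product is rounded.
    assert(residual_two_roundings(a, c) == 0.0);
    // The fused form keeps it because the sum is formed before any rounding.
    assert(residual_fused(a, c) == std::ldexp(1.0, -54));
}
```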
Returns
Returns the maximum numeric value of the arguments x and y.
Description
Determines the maximum numeric value of the arguments x and y. Treats NaN arguments as
missing data. If one argument is a NaN and the other is a legitimate numeric value, the numeric
value is chosen.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns the minimum numeric value of the arguments x and y.
Description
Determines the minimum numeric value of the arguments x and y. Treats NaN arguments as
missing data. If one argument is a NaN and the other is a legitimate numeric value, the numeric
value is chosen.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
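The "NaN as missing data" rule for fmax() and fmin() can be sketched as a small reduction. The code runs on the host with std::fmax() and std::fmin(), which follow the same rule; max_ignoring_nan() is a hypothetical helper, not part of the API.

```cpp
#include <cassert>
#include <cmath>
#include <limits>

// Hypothetical reduction: largest value in a buffer where NaN marks a missing sample.
double max_ignoring_nan(const double* data, int n) {
    double best = std::numeric_limits<double>::quiet_NaN();
    for (int i = 0; i < n; ++i) {
        best = std::fmax(best, data[i]);  // a NaN operand yields the other operand
    }
    return best;  // still NaN only if every sample was NaN
}

void check_nan_as_missing() {
    const double nan = std::numeric_limits<double>::quiet_NaN();
    const double samples[] = {nan, 2.0, nan, -1.0};
    assert(max_ignoring_nan(samples, 4) == 2.0);
    assert(std::fmin(nan, 3.0) == 3.0);
    assert(std::isnan(std::fmax(nan, nan)));
}
```

Starting the accumulator at NaN means no sentinel value such as the lowest finite double is needed.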
Returns
‣ Returns the floating-point remainder of x / y.
‣ fmod(±0, y) returns ±0 if y is not zero.
Description
Calculate the double-precision floating-point remainder of x / y. The floating-point remainder
of the division operation x / y calculated by this function is exactly the value x - n*y, where
n is x / y with its fractional part truncated. The computed value will have the same sign as x,
and its magnitude will be less than the magnitude of y.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
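The definition x - n*y with n truncated can be written out directly. The sketch below compares that written-out form against std::fmod() on the host, assumed to match the CUDA function for these exactly representable inputs; fmod_by_definition() is a hypothetical name.

```cpp
#include <cassert>
#include <cmath>

// Written-out form of the definition: x - n*y with n = trunc(x / y).
// Illustration only; it can lose accuracy when x / y is large.
double fmod_by_definition(double x, double y) {
    const double n = std::trunc(x / y);
    return x - n * y;
}

void check_fmod() {
    // -7.5 / 2 = -3.75, truncated to n = -3, so the remainder is -7.5 + 6 = -1.5.
    assert(std::fmod(-7.5, 2.0) == -1.5);
    assert(fmod_by_definition(-7.5, 2.0) == -1.5);
    // The sign follows x, not y.
    assert(std::fmod(7.5, -2.0) == 1.5);
}
```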
Returns
Returns the fractional component m.
‣ frexp(±0, nptr) returns ±0 and stores zero in the location pointed to by nptr.
‣ frexp(±∞, nptr) returns ±∞ and stores an unspecified value in the location to which
nptr points.
‣ frexp(NaN, nptr) returns a NaN and stores an unspecified value in the location to which nptr
points.
Description
Decompose the floating-point value x into a component m for the normalized fraction element
and another term n for the exponent. The absolute value of m will be greater than or equal to
0.5 and less than 1.0 or it will be equal to 0; $x = m \cdot 2^n$. The integer exponent n will be stored in
the location to which nptr points.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
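A brief sketch of the decomposition, using std::frexp() and std::ldexp() on the host as stand-ins for the CUDA functions. The Decomposed struct and decompose() are hypothetical conveniences for returning both outputs together.

```cpp
#include <cassert>
#include <cmath>

struct Decomposed {
    double m;  // normalized fraction, 0.5 <= |m| < 1 (or 0)
    int n;     // integer exponent
};

Decomposed decompose(double x) {
    Decomposed d{};
    d.m = std::frexp(x, &d.n);
    return d;
}

void check_frexp() {
    const Decomposed d = decompose(-12.0);  // -12 = -0.75 * 2^4
    assert(d.m == -0.75);
    assert(d.n == 4);
    // ldexp() puts the two parts back together exactly.
    assert(std::ldexp(d.m, d.n) == -12.0);
    // A zero input yields a zero fraction and a zero exponent.
    assert(decompose(0.0).n == 0);
}
```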
Returns
Returns the length of the hypotenuse $\sqrt{x^2 + y^2}$.
Description
Calculate the length of the hypotenuse of a right triangle whose two sides have lengths x and
y without undue overflow or underflow.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
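What "without undue overflow" means in practice can be shown by comparing against the textbook formula. The sketch uses std::hypot() on the host as a stand-in; naive_hypot() is a hypothetical helper, and the inputs are chosen so that the squares overflow while the true result does not.

```cpp
#include <cassert>
#include <cmath>

// Textbook formula: the intermediate squares can overflow even when the result fits.
double naive_hypot(double x, double y) {
    return std::sqrt(x * x + y * y);
}

void check_hypot_overflow() {
    const double x = std::ldexp(3.0, 600);         // 3 * 2^600
    const double y = std::ldexp(4.0, 600);         // 4 * 2^600
    const double expected = std::ldexp(5.0, 600);  // 5 * 2^600
    // x*x is 9 * 2^1200, far beyond the double range, so the naive result is infinite.
    assert(std::isinf(naive_hypot(x, y)));
    // hypot() returns the finite length.
    assert(std::fabs(std::hypot(x, y) - expected) <= expected * 1e-15);
}
```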
Returns
‣ If successful, returns the unbiased exponent of the argument.
‣ ilogb(±0) returns INT_MIN.
‣ Note: above behavior does not take into account FP_ILOGB0 nor FP_ILOGBNAN.
Description
Calculates the unbiased integer exponent of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ With Visual Studio 2013 host compiler: __RETURN_TYPE is 'bool'. Returns true if and only
if a is a finite value.
‣ With other host compilers: __RETURN_TYPE is 'int'. Returns a nonzero value if and only if
a is a finite value.
Description
Determine whether the floating-point value a is a finite value (zero, subnormal, or normal and
not infinity or NaN).
Returns
‣ With Visual Studio 2013 host compiler: Returns true if and only if a is an infinite value.
‣ With other host compilers: Returns a nonzero value if and only if a is an infinite value.
Description
Determine whether the floating-point value a is an infinite value (positive or negative).
Returns
‣ With Visual Studio 2013 host compiler: __RETURN_TYPE is 'bool'. Returns true if and only
if a is a NaN value.
‣ With other host compilers: __RETURN_TYPE is 'int'. Returns a nonzero value if and only if
a is a NaN value.
Description
Determine whether the floating-point value a is a NaN.
Returns
Returns the value of the Bessel function of the first kind of order 0.
Description
Calculate the value of the Bessel function of the first kind of order 0 for the input argument x,
$J_0(x)$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns the value of the Bessel function of the first kind of order 1.
‣ j1(±0) returns ±0.
‣ j1(±∞) returns ±0.
Description
Calculate the value of the Bessel function of the first kind of order 1 for the input argument x,
$J_1(x)$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns the value of the Bessel function of the first kind of order n.
Description
Calculate the value of the Bessel function of the first kind of order n for the input argument x,
$J_n(x)$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ ldexp(x, exp) is equivalent to scalbn(x, exp).
Description
Calculate the value of $x \cdot 2^{exp}$ of the input arguments x and exp.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ lgamma(1) returns +0.
‣ lgamma(2) returns +0.
‣ lgamma(x) returns +∞ if x ≤ 0 and x is an integer.
‣ lgamma(-∞) returns +∞.
‣ lgamma(+∞) returns +∞.
Description
Calculate the natural logarithm of the absolute value of the gamma function of the input
argument x, namely the value of $\log_e \left| \int_0^\infty e^{-t} t^{x-1}\,dt \right|$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns rounded integer value.
Description
Round x to the nearest integer value, with halfway cases rounded to the nearest even integer
value. If the result is outside the range of the return type, the behavior is undefined.
Returns
Returns rounded integer value.
Description
Round x to the nearest integer value, with halfway cases rounded away from zero. If the result
is outside the range of the return type, the behavior is undefined.
Note:
This function may be slower than alternate rounding methods. See llrint().
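The two halfway-case rules differ only on inputs that lie exactly between two integers. The sketch below contrasts them using std::llrint() and std::llround() on the host, assuming the default round-to-nearest mode so that std::llrint() matches the ties-to-even behavior described for llrint(); Rounded and round_both_ways() are hypothetical names.

```cpp
#include <cassert>
#include <cmath>

struct Rounded {
    long long ties_to_even;  // llrint-style result
    long long ties_away;     // llround-style result
};

Rounded round_both_ways(double x) {
    return Rounded{std::llrint(x), std::llround(x)};
}

void check_halfway_cases() {
    // 2.5 is halfway: nearest even is 2, away from zero is 3.
    assert(round_both_ways(2.5).ties_to_even == 2);
    assert(round_both_ways(2.5).ties_away == 3);
    // 3.5 is halfway: nearest even is 4, away from zero is also 4.
    assert(round_both_ways(3.5).ties_to_even == 4);
    assert(round_both_ways(3.5).ties_away == 4);
    // Negative halfway case.
    assert(round_both_ways(-2.5).ties_to_even == -2);
    assert(round_both_ways(-2.5).ties_away == -3);
}
```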
Returns
‣ log( ) returns .
Description
Calculate the base e logarithm of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ log10( ) returns .
Description
Calculate the base 10 logarithm of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ log1p( ) returns .
‣ log1p(-1) returns -∞.
Description
Calculate the value of $\log_e(1 + x)$ of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
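The reason for a dedicated log1p() shows up for arguments much smaller than 1. The sketch compares it with the direct expression log(1 + x), using the host functions std::log1p() and std::log() as stand-ins; naive_log1p() is a hypothetical helper.

```cpp
#include <cassert>
#include <cmath>

// Direct evaluation: the sum 1 + x is rounded before the logarithm is taken.
double naive_log1p(double x) {
    return std::log(1.0 + x);
}

void check_log1p_small_x() {
    const double x = 1e-18;
    // 1 + 1e-18 rounds to exactly 1, so the direct form returns 0.
    assert(naive_log1p(x) == 0.0);
    // For tiny x, log(1 + x) is approximately x; log1p() preserves that.
    const double accurate = std::log1p(x);
    assert(std::fabs(accurate - x) <= x * 1e-15);
}
```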
Returns
‣ log2( ) returns .
Description
Calculate the base 2 logarithm of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ logb(±0) returns -∞.
‣ logb(±∞) returns +∞.
Description
Calculate the floating-point representation of the exponent of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns rounded integer value.
Description
Round x to the nearest integer value, with halfway cases rounded to the nearest even integer
value. If the result is outside the range of the return type, the behavior is undefined.
Returns
Returns rounded integer value.
Description
Round x to the nearest integer value, with halfway cases rounded away from zero. If the result
is outside the range of the return type, the behavior is undefined.
Note:
This function may be slower than alternate rounding methods. See lrint().
Description
Convert float argument b to double, followed by fmax().
Note, this is different from std:: specification
Description
Convert float argument a to double, followed by fmax().
Note, this is different from std:: specification
Description
Calculate the maximum value of the arguments a and b. Behavior is equivalent to fmax()
function.
Note, this is different from std:: specification
Description
Convert float argument b to double, followed by fmin().
Note, this is different from std:: specification
Description
Convert float argument a to double, followed by fmin().
Note, this is different from std:: specification
Description
Calculate the minimum value of the arguments a and b. Behavior is equivalent to fmin()
function.
Note, this is different from std:: specification
Returns
‣ modf(±x, iptr) returns a result with the same sign as x.
‣ modf(NaN, iptr) stores a NaN in the object pointed to by iptr and returns a NaN.
Description
Break down the argument x into fractional and integral parts. The integral part is stored in the
argument iptr. Fractional and integral parts are given the same sign as the argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
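A short sketch of the split into integral and fractional parts, using std::modf() on the host as a stand-in for the CUDA function. Parts and split() are hypothetical names introduced to return both values together.

```cpp
#include <cassert>
#include <cmath>

struct Parts {
    double integral;    // value stored through iptr
    double fractional;  // value returned by modf()
};

Parts split(double x) {
    Parts p{};
    p.fractional = std::modf(x, &p.integral);
    return p;
}

void check_modf() {
    const Parts p = split(-3.25);
    // Both parts carry the sign of the argument.
    assert(p.integral == -3.0);
    assert(p.fractional == -0.25);
    // The parts sum back to the argument.
    assert(p.integral + p.fractional == -3.25);
}
```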
Returns
‣ nan(tagp) returns NaN.
Description
Return a representation of a quiet NaN. Argument tagp selects one of the possible
representations.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ nearbyint(±0) returns ±0.
‣ nearbyint(±∞) returns ±∞.
Description
Round argument x to an integer value in double precision floating-point format. Uses round to
nearest rounding, with ties rounding to even.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ nextafter(x, y) = y if x equals y.
‣ nextafter(x, y) = NaN if either x or y are NaN.
Description
Calculate the next representable double-precision floating-point value following x in
the direction of y. For example, if y is greater than x, nextafter() returns the smallest
representable number greater than x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
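One common use of nextafter() is measuring the spacing between adjacent doubles. The sketch below does this on the host with std::nextafter(), assumed to match the CUDA function for these inputs; ulp_above() is a hypothetical helper.

```cpp
#include <cassert>
#include <cfloat>
#include <cmath>

// Hypothetical helper: gap between x and the next larger representable double.
double ulp_above(double x) {
    return std::nextafter(x, INFINITY) - x;
}

void check_nextafter() {
    // Just above 1.0 the spacing is DBL_EPSILON (2^-52).
    assert(ulp_above(1.0) == DBL_EPSILON);
    // Just below 1.0 the spacing is half of that.
    assert(std::nextafter(1.0, 0.0) == 1.0 - DBL_EPSILON / 2.0);
    // When x equals y, y is returned.
    assert(std::nextafter(1.0, 1.0) == 1.0);
}
```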
Returns
Returns the length of the dim-D vector $\sqrt{p_0^2 + p_1^2 + \dots + p_{dim-1}^2}$.
‣ In the presence of an exactly infinite coordinate +∞ is returned, even if there are NaNs.
Description
Calculate the length of a vector p, dimension of which is passed as an argument without
undue overflow or underflow.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
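One way such a length can be computed without intermediate overflow or underflow is to scale by the largest coordinate first. The sketch below is a host-side illustration of that idea only; scaled_norm() is a hypothetical function and is not claimed to be how norm() is implemented.

```cpp
#include <cassert>
#include <cmath>
#include <limits>

// Hypothetical host-side sketch of an overflow-safe vector length.
double scaled_norm(int dim, const double* p) {
    double largest = 0.0;
    bool saw_nan = false;
    for (int i = 0; i < dim; ++i) {
        if (std::isinf(p[i])) {
            // An infinite coordinate decides the result, even if NaNs are present.
            return std::numeric_limits<double>::infinity();
        }
        if (std::isnan(p[i])) saw_nan = true;
        largest = std::fmax(largest, std::fabs(p[i]));
    }
    if (saw_nan) return std::numeric_limits<double>::quiet_NaN();
    if (largest == 0.0) return 0.0;
    double sum = 0.0;
    for (int i = 0; i < dim; ++i) {
        const double s = p[i] / largest;  // every |s| <= 1, so s*s cannot overflow
        sum += s * s;
    }
    return largest * std::sqrt(sum);
}

void check_scaled_norm() {
    const double big[] = {std::ldexp(3.0, 600), std::ldexp(4.0, 600)};
    assert(scaled_norm(2, big) == std::ldexp(5.0, 600));
    const double tiny[] = {std::ldexp(3.0, -600), std::ldexp(4.0, -600)};
    assert(scaled_norm(2, tiny) == std::ldexp(5.0, -600));
    const double mixed[] = {std::numeric_limits<double>::quiet_NaN(),
                            std::numeric_limits<double>::infinity()};
    assert(std::isinf(scaled_norm(2, mixed)));
}
```

The scaling costs one extra pass and a division per element, which is the usual trade-off for extending the usable input range.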
Returns
Returns the length of 3D vector $\sqrt{a^2 + b^2 + c^2}$.
‣ In the presence of an exactly infinite coordinate +∞ is returned, even if there are NaNs.
Description
Calculate the length of three dimensional vector in Euclidean space without undue overflow or
underflow.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns the length of 4D vector $\sqrt{a^2 + b^2 + c^2 + d^2}$.
‣ In the presence of an exactly infinite coordinate +∞ is returned, even if there are NaNs.
Description
Calculate the length of four dimensional vector in Euclidean space without undue overflow or
underflow.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ normcdf(+∞) returns 1.
Description
Calculate the cumulative distribution function of the standard normal distribution for input
argument x, $\Phi(x)$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ normcdfinv(±0) returns -∞.
‣ normcdfinv(1) returns +∞.
Description
Calculate the inverse of the standard normal cumulative distribution function for input
argument x, $\Phi^{-1}(x)$. The function is defined for input values in the interval (0, 1).
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ pow(±0, y) returns ±∞ for y an odd integer less than 0.
‣ pow(-1, ±∞) returns 1.
Description
Calculate the value of x to the power of y.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ rcbrt(±0) returns ±∞.
‣ rcbrt(±∞) returns ±0.
Description
Calculate reciprocal cube root function of x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ remainder(x, ±0) returns NaN.
Description
Compute double-precision floating-point remainder r of dividing x by y for nonzero y. Thus
$r = x - n y$. The value n is the integer value nearest $\frac{x}{y}$. In the case when $\left| n - \frac{x}{y} \right| = \frac{1}{2}$, the even n
value is chosen.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
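Because n is the nearest integer rather than the truncated quotient, the result is centered on zero. The sketch below illustrates this, including the tie-to-even rule, with std::remainder() on the host as a stand-in for the CUDA function; wrap_centered() is a hypothetical helper.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical helper: wrap a value into [-period/2, +period/2].
double wrap_centered(double value, double period) {
    return std::remainder(value, period);
}

void check_remainder() {
    // 7.5 / 2 = 3.75, nearest integer n = 4, so r = 7.5 - 8 = -0.5.
    assert(std::remainder(7.5, 2.0) == -0.5);
    // fmod() truncates instead (n = 3) and gives 1.5 for the same inputs.
    assert(std::fmod(7.5, 2.0) == 1.5);
    // Ties pick the even n: 5 / 2 = 2.5 gives n = 2, 7 / 2 = 3.5 gives n = 4.
    assert(std::remainder(5.0, 2.0) == 1.0);
    assert(std::remainder(7.0, 2.0) == -1.0);
    // 350 degrees is the same direction as -10 degrees.
    assert(wrap_centered(350.0, 360.0) == -10.0);
}
```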
Returns
Returns the remainder.
‣ remquo(x, ±0, quo) returns NaN and stores an unspecified value in the location to which
quo points.
‣ remquo(±∞, y, quo) returns NaN and stores an unspecified value in the location to
which quo points.
‣ remquo(x, y, quo) returns NaN and stores an unspecified value in the location to which
quo points if either of x or y is NaN.
‣ remquo(x, ±∞, quo) returns x and stores zero in the location to which quo points for
finite x.
Description
Compute a double-precision floating-point remainder in the same way as the remainder()
function. Argument quo returns part of quotient upon division of x by y. Value quo has the
same sign as $\frac{x}{y}$ and may not be the exact quotient but agrees with the exact quotient in the
low order 3 bits.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns one over the length of the hypotenuse $\frac{1}{\sqrt{x^2 + y^2}}$.
Description
Calculate one over the length of the hypotenuse of a right triangle whose two sides have
lengths x and y without undue overflow or underflow.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns rounded integer value.
‣ rint(±0) returns ±0.
‣ rint(±∞) returns ±∞.
Description
Round x to the nearest integer value in floating-point format, with halfway cases rounded to
the nearest even integer value.
Returns
Returns one over the length of the vector $\frac{1}{\sqrt{p_0^2 + p_1^2 + \dots + p_{dim-1}^2}}$.
‣ In the presence of an exactly infinite coordinate +0 is returned, even if there are NaNs.
Description
Calculates one over the length of vector p, dimension of which is passed as an argument, in
Euclidean space without undue overflow or underflow.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns one over the length of the 3D vector $\frac{1}{\sqrt{a^2 + b^2 + c^2}}$.
‣ In the presence of an exactly infinite coordinate +0 is returned, even if there are NaNs.
Description
Calculate one over the length of three dimensional vector in Euclidean space without undue
overflow or underflow.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns one over the length of the 4D vector $\frac{1}{\sqrt{a^2 + b^2 + c^2 + d^2}}$.
‣ In the presence of an exactly infinite coordinate +0 is returned, even if there are NaNs.
Description
Calculate one over the length of four dimensional vector in Euclidean space without undue
overflow or underflow.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns rounded integer value.
‣ round(±0) returns ±0.
‣ round(±∞) returns ±∞.
Description
Round x to the nearest integer value in floating-point format, with halfway cases rounded
away from zero.
Note:
This function may be slower than alternate rounding methods. See rint().
Returns
Returns $1/\sqrt{x}$.
‣ rsqrt( ) returns .
Description
Calculate the reciprocal of the nonnegative square root of x, $1/\sqrt{x}$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns x * $2^n$.
‣ scalbln(±0, n) returns ±0.
‣ scalbln(x, 0) returns x.
‣ scalbln(±∞, n) returns ±∞.
Description
Scale x by $2^n$ by efficient manipulation of the floating-point exponent.
Returns
Returns x * $2^n$.
‣ scalbn(±0, n) returns ±0.
‣ scalbn(x, 0) returns x.
‣ scalbn(±∞, n) returns ±∞.
Description
Scale x by $2^n$ by efficient manipulation of the floating-point exponent.
Returns
Reports the sign bit of all values including infinities, zeros, and NaNs.
‣ With Visual Studio 2013 host compiler: __RETURN_TYPE is 'bool'. Returns true if and only
if a is negative.
‣ With other host compilers: __RETURN_TYPE is 'int'. Returns a nonzero value if and only if
a is negative.
Description
Determine whether the floating-point value a is negative.
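Because the sign bit is reported for zeros and NaNs as well, signbit() can distinguish values that an ordinary comparison cannot. The sketch below shows this with std::signbit() on the host as a stand-in; is_negative_by_compare() is a hypothetical helper for contrast.

```cpp
#include <cassert>
#include <cmath>
#include <limits>

// Ordinary comparison: treats -0.0 like +0.0 and is always false for NaN.
bool is_negative_by_compare(double a) {
    return a < 0.0;
}

void check_signbit() {
    const double neg_zero = -0.0;
    const double nan = std::numeric_limits<double>::quiet_NaN();
    const double inf = std::numeric_limits<double>::infinity();
    // -0.0 compares equal to zero, yet its sign bit is set.
    assert(!is_negative_by_compare(neg_zero));
    assert(std::signbit(neg_zero));
    assert(!std::signbit(0.0));
    // Infinities and NaNs report their sign bit too.
    assert(std::signbit(-inf));
    assert(std::signbit(std::copysign(nan, -1.0)));
    assert(!is_negative_by_compare(std::copysign(nan, -1.0)));
}
```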
Returns
‣ sin(±0) returns ±0.
Description
Calculate the sine of the input argument x (measured in radians).
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ none
Description
Calculate the sine and cosine of the first input argument x (measured in radians). The results
for sine and cosine are written into the second argument, sptr, and, respectively, third
argument, cptr.
See also:
sin() and cos().
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ none
Description
Calculate the sine and cosine of x × π (measured in radians), where x is the first input argument. The
results for sine and cosine are written into the second argument, sptr, and, respectively,
third argument, cptr.
See also:
sinpi() and cospi().
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ sinh(±0) returns ±0.
‣ sinh(±∞) returns ±∞.
Description
Calculate the hyperbolic sine of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ sinpi(±0) returns ±0.
Description
Calculate the sine of x × π (measured in radians), where x is the input argument.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns $\sqrt{x}$.
‣ sqrt(±0) returns ±0.
‣ sqrt(+∞) returns +∞.
Description
Calculate the nonnegative square root of x, $\sqrt{x}$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ tan(±0) returns ±0.
Description
Calculate the tangent of the input argument x (measured in radians).
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ tanh(±0) returns ±0.
‣ tanh(±∞) returns ±1.
Description
Calculate the hyperbolic tangent of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
‣ tgamma(±0) returns ±∞.
‣ tgamma(+∞) returns +∞.
Description
Calculate the gamma function of the input argument x, namely the value of $\int_0^\infty e^{-t} t^{x-1}\,dt$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns truncated integer value.
‣ trunc(±0) returns ±0.
‣ trunc(±∞) returns ±∞.
Description
Round x to the nearest integer value that does not exceed x in magnitude.
Returns
Returns the value of the Bessel function of the second kind of order 0.
‣ y0( ) returns .
Description
Calculate the value of the Bessel function of the second kind of order 0 for the input argument
x, $Y_0(x)$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns the value of the Bessel function of the second kind of order 1.
‣ y1( ) returns .
Description
Calculate the value of the Bessel function of the second kind of order 1 for the input argument
x, $Y_1(x)$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns the value of the Bessel function of the second kind of order n.
Description
Calculate the value of the Bessel function of the second kind of order n for the input argument
x, $Y_n(x)$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Description
Calculate the absolute value of the input argument a.
Description
Calculate the absolute value of the input argument a.
Description
Calculate the absolute value of the input argument a.
Description
Calculate the maximum value of the arguments a and b.
Description
Calculate the minimum value of the arguments a and b.
Description
Calculate the maximum value of the arguments a and b, perform integer promotion first.
Description
Calculate the maximum value of the arguments a and b, perform integer promotion first.
Description
Calculate the maximum value of the arguments a and b.
Description
Calculate the maximum value of the arguments a and b.
Description
Calculate the maximum value of the arguments a and b, perform integer promotion first.
Description
Calculate the maximum value of the arguments a and b, perform integer promotion first.
Description
Calculate the maximum value of the arguments a and b.
Description
Calculate the maximum value of the arguments a and b.
Description
Calculate the maximum value of the arguments a and b, perform integer promotion first.
Description
Calculate the maximum value of the arguments a and b, perform integer promotion first.
Description
Calculate the maximum value of the arguments a and b.
Description
Calculate the maximum value of the arguments a and b.
Description
Calculate the minimum value of the arguments a and b, performing integer promotion first.
Description
Calculate the minimum value of the arguments a and b, performing integer promotion first.
Description
Calculate the minimum value of the arguments a and b.
Description
Calculate the minimum value of the arguments a and b.
Description
Calculate the minimum value of the arguments a and b, performing integer promotion first.
Description
Calculate the minimum value of the arguments a and b, performing integer promotion first.
Description
Calculate the minimum value of the arguments a and b.
Description
Calculate the minimum value of the arguments a and b.
Description
Calculate the minimum value of the arguments a and b, performing integer promotion first.
Description
Calculate the minimum value of the arguments a and b, performing integer promotion first.
Description
Calculate the minimum value of the arguments a and b.
Description
Calculate the minimum value of the arguments a and b.
Description
Calculate the maximum value of the arguments a and b.
Description
Calculate the minimum value of the arguments a and b.
Description
Calculate the maximum value of the arguments a and b.
Description
Calculate the minimum value of the arguments a and b.
Returns
Returns the approximate cosine of x.
Description
Calculate the fast approximate cosine of the input argument x, measured in radians.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Intrinsic Functions section.
Returns
Returns an approximation to $10^x$.
Description
Calculate the fast approximate base 10 exponential of the input argument x, $10^x$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Intrinsic Functions section.
Returns
Returns an approximation to $e^x$.
Description
Calculate the fast approximate base $e$ exponential of the input argument x, $e^x$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Intrinsic Functions section.
Returns
Returns x + y.
Description
Compute the sum of x and y in round-down (to negative infinity) mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns x + y.
Description
Compute the sum of x and y in round-to-nearest-even rounding mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns x + y.
Description
Compute the sum of x and y in round-up (to positive infinity) mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns x + y.
Description
Compute the sum of x and y in round-towards-zero mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
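The four rounding modes described above can be explored on the host with the standard `<cfenv>` facilities. The following is a minimal sketch, not the device implementation: `add_with_rounding` is a hypothetical helper name, and it assumes the host platform defines the `FE_DOWNWARD`, `FE_UPWARD`, and `FE_TOWARDZERO` macros.

```cpp
#include <cfenv>
#include <cmath>

// Hypothetical host-side model: add two floats under the given <cfenv>
// rounding mode, then restore the mode that was active before the call.
// The volatile variables keep the compiler from folding the addition at
// compile time or moving it across the fesetround calls.
inline float add_with_rounding(float x, float y, int mode) {
    const int saved = std::fegetround();
    std::fesetround(mode);
    volatile float a = x;
    volatile float b = y;
    volatile float sum = a + b;
    std::fesetround(saved);
    return sum;
}
```

With x = 1 and y = 2^-30 the exact sum is not representable in single precision, so `FE_DOWNWARD` yields 1 and `FE_UPWARD` yields the next float above 1, which mirrors the difference between the round-down and round-up variants.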
Returns
Returns x / y.
Description
Divide two floating-point values x by y in round-down (to negative infinity) mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns x / y.
Description
Divide two floating-point values x by y in round-to-nearest-even mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns x / y.
Description
Divide two floating-point values x by y in round-up (to positive infinity) mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns x / y.
Description
Divide two floating-point values x by y in round-towards-zero mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns x / y.
Description
Calculate the fast approximate division of x by y.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Intrinsic Functions section.
Description
Behavior is the same as __fmaf_rd(x, y, z), the difference is in handling denormalized inputs
and outputs: -ftz compiler flag has no effect.
Description
Behavior is the same as __fmaf_rn(x, y, z), the difference is in handling denormalized inputs
and outputs: -ftz compiler flag has no effect.
Description
Behavior is the same as __fmaf_ru(x, y, z), the difference is in handling denormalized inputs
and outputs: -ftz compiler flag has no effect.
Description
Behavior is the same as __fmaf_rz(x, y, z), the difference is in handling denormalized inputs
and outputs: -ftz compiler flag has no effect.
Returns
Returns the rounded value of $x \times y + z$ as a single operation.
Description
Computes the value of $x \times y + z$ as a single ternary operation, rounding the result once in
round-down (to negative infinity) mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns the rounded value of $x \times y + z$ as a single operation.
Description
Computes the value of $x \times y + z$ as a single ternary operation, rounding the result once in
round-to-nearest-even mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns the rounded value of $x \times y + z$ as a single operation.
Description
Computes the value of $x \times y + z$ as a single ternary operation, rounding the result once in
round-up (to positive infinity) mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns the rounded value of $x \times y + z$ as a single operation.
Description
Computes the value of $x \times y + z$ as a single ternary operation, rounding the result once in
round-towards-zero mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
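To see why "rounding the result once" matters, a fused multiply-add can be compared with a separately rounded multiply and add. This is a host-side sketch using `std::fma` from `<cmath>`; the helper names `fused_mul_add` and `separate_mul_add` are hypothetical and only model the rounding behaviour in round-to-nearest-even mode.

```cpp
#include <cmath>

// One rounding: std::fma evaluates x * y + z exactly and rounds once.
inline float fused_mul_add(float x, float y, float z) {
    return std::fma(x, y, z);
}

// Two roundings: the product is rounded to float before the addition.
// The volatile store keeps the compiler from contracting the expression
// into a single fused operation.
inline float separate_mul_add(float x, float y, float z) {
    volatile float product = x * y;
    return product + z;
}
```

For x = 1 + 2^-13, y = 1 - 2^-13, z = -1 the exact product is 1 - 2^-26, which rounds to 1 in single precision. The separate version therefore returns 0, while the fused version keeps the small term and returns -2^-26.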
Returns
Returns x * y.
Description
Compute the product of x and y in round-down (to negative infinity) mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns x * y.
Description
Compute the product of x and y in round-to-nearest-even mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns x * y.
Description
Compute the product of x and y in round-up (to positive infinity) mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns x * y.
Description
Compute the product of x and y in round-towards-zero mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns $\frac{1}{x}$.
Description
Compute the reciprocal of x in round-down (to negative infinity) mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns $\frac{1}{x}$.
Description
Compute the reciprocal of x in round-to-nearest-even mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns $\frac{1}{x}$.
Description
Compute the reciprocal of x in round-up (to positive infinity) mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns $\frac{1}{x}$.
Description
Compute the reciprocal of x in round-towards-zero mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns $\frac{1}{\sqrt{x}}$.
Description
Compute the reciprocal square root of x in round-to-nearest-even mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns $\sqrt{x}$.
Description
Compute the square root of x in round-down (to negative infinity) mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns $\sqrt{x}$.
Description
Compute the square root of x in round-to-nearest-even mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns $\sqrt{x}$.
Description
Compute the square root of x in round-up (to positive infinity) mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns $\sqrt{x}$.
Description
Compute the square root of x in round-towards-zero mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns x - y.
Description
Compute the difference of x and y in round-down (to negative infinity) mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns x - y.
Description
Compute the difference of x and y in round-to-nearest-even rounding mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns x - y.
Description
Compute the difference of x and y in round-up (to positive infinity) mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns x - y.
Description
Compute the difference of x and y in round-towards-zero mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Single-Precision Floating-Point Functions section.
Returns
Returns an approximation to $\log_{10}(x)$.
Description
Calculate the fast approximate base 10 logarithm of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Intrinsic Functions section.
Returns
Returns an approximation to $\log_2(x)$.
Description
Calculate the fast approximate base 2 logarithm of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Intrinsic Functions section.
Returns
Returns an approximation to $\log_e(x)$.
Description
Calculate the fast approximate base $e$ logarithm of the input argument x.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Intrinsic Functions section.
Returns
Returns an approximation to $x^y$.
Description
Calculate the fast approximate value of x, the first input argument, raised to the power of y, the
second input argument, $x^y$.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Intrinsic Functions section.
Returns
‣ __saturatef(x) returns 0 if x < 0.
‣ __saturatef(x) returns 1 if x > 1.
‣ __saturatef(x) returns x if $0 \le x \le 1$.
‣ __saturatef(NaN) returns 0.
Description
Clamp the input argument x to be within the interval [+0.0, 1.0].
Returns
‣ none
Description
Calculate the fast approximate sine and cosine of the first input argument x (measured in
radians). The results for sine and cosine are written into the second argument, sptr, and the
third argument, cptr, respectively.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Intrinsic Functions section.
Returns
Returns the approximate sine of x.
Description
Calculate the fast approximate sine of the input argument x, measured in radians.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Intrinsic Functions section.
Returns
Returns the approximate tangent of x.
Description
Calculate the fast approximate tangent of the input argument x, measured in radians.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Intrinsic Functions section.
‣ The result is computed as the fast divide of __sinf() by __cosf(). Denormal output is flushed
to sign-preserving 0.0.
Returns
Returns x + y.
Description
Adds two floating-point values x and y in round-down (to negative infinity) mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns x + y.
Description
Adds two floating-point values x and y in round-to-nearest-even mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns x + y.
Description
Adds two floating-point values x and y in round-up (to positive infinity) mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns x + y.
Description
Adds two floating-point values x and y in round-towards-zero mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns x / y.
Description
Divides two floating-point values x by y in round-down (to negative infinity) mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns x / y.
Description
Divides two floating-point values x by y in round-to-nearest-even mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns x / y.
Description
Divides two floating-point values x by y in round-up (to positive infinity) mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns x / y.
Description
Divides two floating-point values x by y in round-towards-zero mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns x * y.
Description
Multiplies two floating-point values x and y in round-down (to negative infinity) mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns x * y.
Description
Multiplies two floating-point values x and y in round-to-nearest-even mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns x * y.
Description
Multiplies two floating-point values x and y in round-up (to positive infinity) mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns x * y.
Description
Multiplies two floating-point values x and y in round-towards-zero mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns $\frac{1}{x}$.
Description
Compute the reciprocal of x in round-down (to negative infinity) mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns $\frac{1}{x}$.
Description
Compute the reciprocal of x in round-to-nearest-even mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns $\frac{1}{x}$.
Description
Compute the reciprocal of x in round-up (to positive infinity) mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns $\frac{1}{x}$.
Description
Compute the reciprocal of x in round-towards-zero mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns $\sqrt{x}$.
Description
Compute the square root of x in round-down (to negative infinity) mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns $\sqrt{x}$.
Description
Compute the square root of x in round-to-nearest-even mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns $\sqrt{x}$.
Description
Compute the square root of x in round-up (to positive infinity) mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns $\sqrt{x}$.
Description
Compute the square root of x in round-towards-zero mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns x - y.
Description
Subtracts two floating-point values x and y in round-down (to negative infinity) mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns x - y.
Description
Subtracts two floating-point values x and y in round-to-nearest-even mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns x - y.
Description
Subtracts two floating-point values x and y in round-up (to positive infinity) mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns x - y.
Description
Subtracts two floating-point values x and y in round-towards-zero mode.
Note:
‣ For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns the rounded value of $x \times y + z$ as a single operation.
Description
Computes the value of $x \times y + z$ as a single ternary operation, rounding the result once in
round-down (to negative infinity) mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns the rounded value of $x \times y + z$ as a single operation.
Description
Computes the value of $x \times y + z$ as a single ternary operation, rounding the result once in
round-to-nearest-even mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns the rounded value of $x \times y + z$ as a single operation.
Description
Computes the value of $x \times y + z$ as a single ternary operation, rounding the result once in
round-up (to positive infinity) mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns the rounded value of $x \times y + z$ as a single operation.
Description
Computes the value of $x \times y + z$ as a single ternary operation, rounding the result once in
round-towards-zero mode.
Note:
For accuracy information see the CUDA C++ Programming Guide, Mathematical Functions
Appendix, Double-Precision Floating-Point Functions section.
Returns
Returns the bit-reversed value of x, i.e., bit N of the return value corresponds to bit 31-N of x.
Description
Reverses the bit order of the 32-bit unsigned integer x.
Returns
Returns the bit-reversed value of x, i.e., bit N of the return value corresponds to bit 63-N of x.
Description
Reverses the bit order of the 64-bit unsigned integer x.
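The bit-reversal rule (bit N of the result is bit 31-N or 63-N of the input) can be checked with a simple host-side loop. This is a reference sketch only; `bit_reverse32` and `bit_reverse64` are hypothetical names and make no claim about how the device computes the result.

```cpp
#include <cstdint>

// Shift the input out from the least significant end while shifting the
// result in from the same end, so bit N lands at position 31 - N.
inline std::uint32_t bit_reverse32(std::uint32_t x) {
    std::uint32_t r = 0;
    for (int n = 0; n < 32; ++n) {
        r = (r << 1) | (x & 1u);
        x >>= 1;
    }
    return r;
}

// Same idea for 64 bits: bit N lands at position 63 - N.
inline std::uint64_t bit_reverse64(std::uint64_t x) {
    std::uint64_t r = 0;
    for (int n = 0; n < 64; ++n) {
        r = (r << 1) | (x & 1u);
        x >>= 1;
    }
    return r;
}
```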
Returns
The returned value r is computed to be: result[n] := input[selector[n]] where
result[n] is the nth byte of r.
Description
byte_perm(x,y,s) returns a 32-bit integer consisting of four bytes from eight input bytes
provided in the two input integers x and y, as specified by a selector, s.
The input bytes are indexed as follows:
input[0] = x<7:0>
input[1] = x<15:8>
input[2] = x<23:16>
input[3] = x<31:24>
input[4] = y<7:0>
input[5] = y<15:8>
input[6] = y<23:16>
input[7] = y<31:24>
The selector indices are as follows (the upper 16 bits of the selector are not used):
selector[0] = s<2:0>
selector[1] = s<6:4>
selector[2] = s<10:8>
selector[3] = s<14:12>
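The byte selection rule result[n] := input[selector[n]] can be sketched on the host as follows. `byte_perm_ref` is a hypothetical name; the model follows only the indexing described above (three selector bits per result byte) and ignores every other selector bit.

```cpp
#include <cstdint>

// Hypothetical host-side model of the byte permutation described above.
inline std::uint32_t byte_perm_ref(std::uint32_t x, std::uint32_t y, std::uint32_t s) {
    // input[0..3] are the bytes of x, input[4..7] are the bytes of y.
    const std::uint64_t input = (static_cast<std::uint64_t>(y) << 32) | x;
    std::uint32_t r = 0;
    for (int n = 0; n < 4; ++n) {
        const std::uint32_t sel = (s >> (4 * n)) & 0x7u;  // selector[n]
        const std::uint32_t byte =
            static_cast<std::uint32_t>(input >> (8 * sel)) & 0xFFu;
        r |= byte << (8 * n);                              // result[n]
    }
    return r;
}
```

With x = 0x33221100 and y = 0x77665544, the selector 0x3210 returns x unchanged, 0x7654 returns y, and 0x0123 returns x with its bytes reversed.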
Returns
Returns a value between 0 and 32 inclusive representing the number of zero bits.
Description
Count the number of consecutive leading zero bits, starting at the most significant bit (bit 31)
of x.
Returns
Returns a value between 0 and 64 inclusive representing the number of zero bits.
Description
Count the number of consecutive leading zero bits, starting at the most significant bit (bit 63)
of x.
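A host-side reference for the leading-zero count scans from the most significant bit downwards. This is a sketch with hypothetical names (`count_leading_zeros32`, `count_leading_zeros64`) that operates on the raw bit pattern passed as an unsigned value.

```cpp
#include <cstdint>

// Count zero bits from bit 31 downwards until the first set bit.
// An all-zero input runs through every position and yields 32.
inline int count_leading_zeros32(std::uint32_t x) {
    int n = 0;
    for (std::uint32_t mask = 0x80000000u; mask != 0 && (x & mask) == 0; mask >>= 1) {
        ++n;
    }
    return n;
}

// Same scan starting at bit 63; an all-zero input yields 64.
inline int count_leading_zeros64(std::uint64_t x) {
    int n = 0;
    for (std::uint64_t mask = 0x8000000000000000ull; mask != 0 && (x & mask) == 0; mask >>= 1) {
        ++n;
    }
    return n;
}
```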
Returns
Returns a value between 0 and 32 inclusive representing the position of the first bit set.
‣ __ffs(0) returns 0.
Description
Find the position of the first (least significant) bit set to 1 in x, where the least significant bit
position is 1.
Returns
Returns a value between 0 and 64 inclusive representing the position of the first bit set.
‣ __ffsll(0) returns 0.
Description
Find the position of the first (least significant) bit set to 1 in x, where the least significant bit
position is 1.
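The one-based position convention of __ffs and __ffsll (0 for a zero input, 1 for the least significant bit) is easy to get wrong, so a small host-side model may help. The names `find_first_set32` and `find_first_set64` are hypothetical.

```cpp
#include <cstdint>

// Position of the least significant set bit, counted from 1; 0 if x == 0.
inline int find_first_set64(std::uint64_t x) {
    if (x == 0) return 0;
    int pos = 1;
    while ((x & 1u) == 0) {
        x >>= 1;
        ++pos;
    }
    return pos;
}

// The 32-bit variant is the same scan on a zero-extended value.
inline int find_first_set32(std::uint32_t x) {
    return find_first_set64(x);
}
```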
Returns
Returns the most significant 32 bits of the shifted 64-bit value.
Description
Shift the 64-bit value formed by concatenating arguments lo and hi left by the amount
specified by the argument shift. Argument lo holds bits 31:0 and argument hi holds bits
63:32 of the 64-bit source value. The source is shifted left by the wrapped value of shift
(shift & 31). The most significant 32 bits of the result are returned.
Returns
Returns the most significant 32 bits of the shifted 64-bit value.
Description
Shift the 64-bit value formed by concatenating arguments lo and hi left by the amount
specified by the argument shift. Argument lo holds bits 31:0 and argument hi holds bits
63:32 of the 64-bit source value. The source is shifted left by the clamped value of shift
(min(shift, 32)). The most significant 32 bits of the result are returned.
Returns
Returns the least significant 32 bits of the shifted 64-bit value.
Description
Shift the 64-bit value formed by concatenating arguments lo and hi right by the amount
specified by the argument shift. Argument lo holds bits 31:0 and argument hi holds bits
63:32 of the 64-bit source value. The source is shifted right by the wrapped value of shift
(shift & 31). The least significant 32 bits of the result are returned.
Returns
Returns the least significant 32 bits of the shifted 64-bit value.
Description
Shift the 64-bit value formed by concatenating arguments lo and hi right by the amount
specified by the argument shift. Argument lo holds bits 31:0 and argument hi holds bits
63:32 of the 64-bit source value. The source is shifted right by the clamped value of shift
(min(shift, 32)). The least significant 32 bits of the result are returned.
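The difference between the wrapped (shift & 31) and clamped (min(shift, 32)) funnel shifts shows up only for shift amounts of 32 or more. The following host-side sketch models the four variants with hypothetical `*_ref` names; it follows the descriptions above and is not the device implementation.

```cpp
#include <algorithm>
#include <cstdint>

// lo holds bits 31:0 and hi holds bits 63:32 of the 64-bit source value.
inline std::uint64_t concat_hi_lo(std::uint32_t lo, std::uint32_t hi) {
    return (static_cast<std::uint64_t>(hi) << 32) | lo;
}

// Left shift, wrapped amount, most significant 32 bits returned.
inline std::uint32_t funnelshift_l_ref(std::uint32_t lo, std::uint32_t hi, std::uint32_t shift) {
    return static_cast<std::uint32_t>((concat_hi_lo(lo, hi) << (shift & 31u)) >> 32);
}

// Left shift, clamped amount, most significant 32 bits returned.
inline std::uint32_t funnelshift_lc_ref(std::uint32_t lo, std::uint32_t hi, std::uint32_t shift) {
    return static_cast<std::uint32_t>((concat_hi_lo(lo, hi) << std::min(shift, 32u)) >> 32);
}

// Right shift, wrapped amount, least significant 32 bits returned.
inline std::uint32_t funnelshift_r_ref(std::uint32_t lo, std::uint32_t hi, std::uint32_t shift) {
    return static_cast<std::uint32_t>(concat_hi_lo(lo, hi) >> (shift & 31u));
}

// Right shift, clamped amount, least significant 32 bits returned.
inline std::uint32_t funnelshift_rc_ref(std::uint32_t lo, std::uint32_t hi, std::uint32_t shift) {
    return static_cast<std::uint32_t>(concat_hi_lo(lo, hi) >> std::min(shift, 32u));
}
```

A shift of 33 behaves like a shift of 1 in the wrapped variants and like a shift of 32 in the clamped variants, so the clamped left shift returns lo and the clamped right shift returns hi.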
Returns
Returns a signed integer value representing the signed average value of the two inputs.
Description
Compute average of signed input arguments x and y as ( x + y ) >> 1, avoiding overflow in the
intermediate sum.
Returns
Returns the least significant 32 bits of the product x * y.
Description
Calculate the least significant 32 bits of the product of the least significant 24 bits of x and y.
The high order 8 bits of x and y are ignored.
Returns
Returns the most significant 64 bits of the product x * y.
Description
Calculate the most significant 64 bits of the 128-bit product x * y, where x and y are 64-bit
integers.
Returns
Returns the most significant 32 bits of the product x * y.
Description
Calculate the most significant 32 bits of the 64-bit product x * y, where x and y are 32-bit
integers.
Returns
Returns a value between 0 and 32 inclusive representing the number of set bits.
Description
Count the number of bits that are set to 1 in x.
Returns
Returns a value between 0 and 64 inclusive representing the number of set bits.
Description
Count the number of bits that are set to 1 in x.
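A host-side reference for the set-bit count can use the classic clear-lowest-set-bit loop. This is a sketch with a hypothetical name, `population_count64`; a 32-bit input is simply zero-extended, which does not change the count.

```cpp
#include <cstdint>

// Each iteration clears the lowest set bit, so the loop runs once per
// set bit in x.
inline int population_count64(std::uint64_t x) {
    int count = 0;
    while (x != 0) {
        x &= x - 1;
        ++count;
    }
    return count;
}
```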
Returns
Returns a signed integer value representing the signed rounded average value of the two
inputs.
Description
Compute average of signed input arguments x and y as ( x + y + 1 ) >> 1, avoiding overflow in
the intermediate sum.
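The two signed averages, ( x + y ) >> 1 and ( x + y + 1 ) >> 1, can be modelled on the host by widening to 64 bits so the intermediate sum cannot overflow. In this sketch `hadd_ref`, `rhadd_ref`, and `floor_half` are hypothetical names, and the right shift is assumed to be arithmetic (rounding towards negative infinity), which the helper reproduces with division.

```cpp
#include <cstdint>

// floor(v / 2), the same value an arithmetic right shift by one gives.
inline std::int64_t floor_half(std::int64_t v) {
    return (v >= 0) ? v / 2 : -((-v + 1) / 2);
}

// (x + y) >> 1, computed in 64 bits to avoid overflow.
inline std::int32_t hadd_ref(std::int32_t x, std::int32_t y) {
    return static_cast<std::int32_t>(floor_half(static_cast<std::int64_t>(x) + y));
}

// (x + y + 1) >> 1, computed in 64 bits to avoid overflow.
inline std::int32_t rhadd_ref(std::int32_t x, std::int32_t y) {
    return static_cast<std::int32_t>(floor_half(static_cast<std::int64_t>(x) + y + 1));
}
```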
Returns
Returns $|x - y| + z$.
Description
Calculate $|x - y| + z$, the 32-bit sum of the third argument z and the absolute value of the
difference between the first argument, x, and second argument, y.
Inputs x and y are signed 32-bit integers, input z is a 32-bit unsigned integer.
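The sum of absolute difference with signed inputs can be sketched on the host by taking the difference in 64 bits. `sad_ref` is a hypothetical name, and wrapping of the final sum modulo 2^32 is an assumption drawn from the phrase "32-bit sum".

```cpp
#include <cstdint>

// |x - y| + z with signed x, y and unsigned z.
inline std::uint32_t sad_ref(std::int32_t x, std::int32_t y, std::uint32_t z) {
    // The 64-bit difference cannot overflow for any pair of 32-bit inputs.
    const std::int64_t diff = static_cast<std::int64_t>(x) - y;
    const std::uint32_t abs_diff = static_cast<std::uint32_t>(diff < 0 ? -diff : diff);
    return z + abs_diff;  // unsigned addition wraps modulo 2^32 (assumed)
}
```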
Returns
Returns an unsigned integer value representing the unsigned average value of the two inputs.
Description
Compute average of unsigned input arguments x and y as ( x + y ) >> 1, avoiding overflow in
the intermediate sum.
Returns
Returns the least significant 32 bits of the product x * y.
Description
Calculate the least significant 32 bits of the product of the least significant 24 bits of x and y.
The high order 8 bits of x and y are ignored.
Returns
Returns the most significant 64 bits of the product x * y.
Description
Calculate the most significant 64 bits of the 128-bit product x * y, where x and y are 64-bit
unsigned integers.
Returns
Returns the most significant 32 bits of the product x * y.
Description
Calculate the most significant 32 bits of the 64-bit product x * y, where x and y are 32-bit
unsigned integers.
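The unsigned 24-bit multiply and the unsigned high-half multiply described above can both be modelled on the host with a 64-bit intermediate product. The names `umul24_ref` and `umulhi_ref` are hypothetical; the sketch follows the descriptions only.

```cpp
#include <cstdint>

// Multiply the low 24 bits of x and y and keep the low 32 bits of the
// product; the high-order 8 bits of each input are masked off.
inline std::uint32_t umul24_ref(std::uint32_t x, std::uint32_t y) {
    const std::uint64_t product =
        static_cast<std::uint64_t>(x & 0xFFFFFFu) * (y & 0xFFFFFFu);
    return static_cast<std::uint32_t>(product);
}

// Most significant 32 bits of the full 64-bit product of x and y.
inline std::uint32_t umulhi_ref(std::uint32_t x, std::uint32_t y) {
    return static_cast<std::uint32_t>((static_cast<std::uint64_t>(x) * y) >> 32);
}
```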
Returns
Returns an unsigned integer value representing the unsigned rounded average value of the
two inputs.
Description
Compute average of unsigned input arguments x and y as ( x + y + 1 ) >> 1, avoiding overflow
in the intermediate sum.
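For the unsigned averages, the overflow in x + y can also be avoided without a wider type, using the identities x + y = 2(x & y) + (x ^ y) and x + y = 2(x | y) - (x ^ y). This is a host-side sketch with hypothetical names (`uhadd_ref`, `urhadd_ref`); it illustrates one well-known technique and does not claim to be how the device computes the result.

```cpp
#include <cstdint>

// (x + y) >> 1: bits set in both inputs count fully, bits set in only
// one input count half (rounded down).
inline std::uint32_t uhadd_ref(std::uint32_t x, std::uint32_t y) {
    return (x & y) + ((x ^ y) >> 1);
}

// (x + y + 1) >> 1: start from the bits set in either input and subtract
// half (rounded down) of the bits set in only one input.
inline std::uint32_t urhadd_ref(std::uint32_t x, std::uint32_t y) {
    return (x | y) - ((x ^ y) >> 1);
}
```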
Returns
Returns $|x - y| + z$.
Description
Calculate $|x - y| + z$, the 32-bit sum of the third argument z and the absolute value of the
difference between the first argument, x, and second argument, y.
Inputs x, y, and z are unsigned 32-bit integers.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to a single-precision floating-point value in
round-down (to negative infinity) mode.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to a single-precision floating-point value in
round-to-nearest-even mode.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to a single-precision floating-point value in
round-up (to positive infinity) mode.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to a single-precision floating-point value in
round-towards-zero mode.
Returns
Returns reinterpreted value.
Description
Reinterpret the high 32 bits in the double-precision floating-point value x as a signed integer.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to a signed integer value in round-down (to
negative infinity) mode.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to a signed integer value in round-to-
nearest-even mode.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to a signed integer value in round-up (to
positive infinity) mode.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to a signed integer value in round-
towards-zero mode.
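The four double-to-integer rounding modes can be compared on the host with the standard `<cmath>` rounding functions. This is a sketch for in-range, finite inputs only: the `*_ref` names are hypothetical, `std::nearbyint` is assumed to run under the default round-to-nearest-even floating-point environment, and out-of-range or NaN behaviour is not modelled.

```cpp
#include <cmath>

// Round-down (to negative infinity).
inline int double2int_rd_ref(double x) { return static_cast<int>(std::floor(x)); }

// Round-to-nearest-even (ties go to the even integer in the default
// floating-point environment).
inline int double2int_rn_ref(double x) { return static_cast<int>(std::nearbyint(x)); }

// Round-up (to positive infinity).
inline int double2int_ru_ref(double x) { return static_cast<int>(std::ceil(x)); }

// Round-towards-zero.
inline int double2int_rz_ref(double x) { return static_cast<int>(std::trunc(x)); }
```

The input 2.5 converts to 2, 2, 3, and 2 in the order above, and -2.5 converts to -3, -2, -2, and -2.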
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to a signed 64-bit integer value in round-
down (to negative infinity) mode.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to a signed 64-bit integer value in round-
to-nearest-even mode.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to a signed 64-bit integer value in round-
up (to positive infinity) mode.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to a signed 64-bit integer value in round-
towards-zero mode.
Returns
Returns reinterpreted value.
Description
Reinterpret the low 32 bits in the double-precision floating-point value x as a signed integer.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to an unsigned integer value in round-
down (to negative infinity) mode.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to an unsigned integer value in round-to-
nearest-even mode.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to an unsigned integer value in round-up
(to positive infinity) mode.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to an unsigned integer value in round-
towards-zero mode.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to an unsigned 64-bit integer value in
round-down (to negative infinity) mode.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to an unsigned 64-bit integer value in
round-to-nearest-even mode.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to an unsigned 64-bit integer value in
round-up (to positive infinity) mode.
Returns
Returns converted value.
Description
Convert the double-precision floating-point value x to an unsigned 64-bit integer value in
round-towards-zero mode.
Returns
Returns reinterpreted value.
Description
Reinterpret the bits in the double-precision floating-point value x as a signed 64-bit integer.
Returns
Returns converted value.
Description
Convert the single-precision floating-point value x to a signed integer in round-down (to
negative infinity) mode.
Returns
Returns converted value.
Description
Convert the single-precision floating-point value x to a signed integer in round-to-nearest-
even mode.
Returns
Returns converted value.
Description
Convert the single-precision floating-point value x to a signed integer in round-up (to positive
infinity) mode.
Returns
Returns converted value.
Description
Convert the single-precision floating-point value x to a signed integer in round-towards-zero
mode.
Returns
Returns converted value.
Description
Convert the single-precision floating-point value x to a signed 64-bit integer in round-down (to
negative infinity) mode.
Returns
Returns converted value.
Description
Convert the single-precision floating-point value x to a signed 64-bit integer in round-to-
nearest-even mode.
Returns
Returns converted value.
Description
Convert the single-precision floating-point value x to a signed 64-bit integer in round-up (to
positive infinity) mode.
Returns
Returns converted value.
Description
Convert the single-precision floating-point value x to a signed 64-bit integer in round-
towards-zero mode.
Returns
Returns converted value.
Description
Convert the single-precision floating-point value x to an unsigned integer in round-down (to
negative infinity) mode.
Returns
Returns converted value.
Description
Convert the single-precision floating-point value x to an unsigned integer in round-to-nearest-
even mode.
Returns
Returns converted value.
Description
Convert the single-precision floating-point value x to an unsigned integer in round-up (to
positive infinity) mode.
Returns
Returns converted value.
Description
Convert the single-precision floating-point value x to an unsigned integer in round-towards-
zero mode.
Returns
Returns converted value.
Description
Convert the single-precision floating-point value x to an unsigned 64-bit integer in round-
down (to negative infinity) mode.
Returns
Returns converted value.
Description
Convert the single-precision floating-point value x to an unsigned 64-bit integer in round-to-
nearest-even mode.
Returns
Returns converted value.
Description
Convert the single-precision floating-point value x to an unsigned 64-bit integer in round-up
(to positive infinity) mode.
Returns
Returns converted value.
Description
Convert the single-precision floating-point value x to an unsigned 64-bit integer in round-
towards-zero mode.
Returns
Returns reinterpreted value.
Description
Reinterpret the bits in the single-precision floating-point value x as a signed integer.
Returns
Returns reinterpreted value.
Description
Reinterpret the bits in the single-precision floating-point value x as an unsigned integer.
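Reinterpretation differs from conversion: the 32 bits are copied unchanged, so 1.0f becomes 0x3F800000 rather than 1. A minimal host-side C++ sketch of that behavior follows. The `ref_` names are hypothetical, and the sketch assumes float is the 32-bit IEEE-754 format.

```cpp
#include <cstdint>
#include <cstring>

// Copy the bits of a float into a signed 32-bit integer.
// std::memcpy avoids the undefined behavior of pointer-cast type punning.
std::int32_t ref_float_as_int(float x) {
    std::int32_t bits;
    std::memcpy(&bits, &x, sizeof bits);
    return bits;
}

// Same bit copy, but the result is read as an unsigned 32-bit integer.
std::uint32_t ref_float_as_uint(float x) {
    std::uint32_t bits;
    std::memcpy(&bits, &x, sizeof bits);
    return bits;
}
```

The sign bit of the float lands in the top bit of the integer, so `ref_float_as_int(-1.0f)` is negative while `ref_float_as_uint(-1.0f)` is 0xBF800000.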
Returns
Returns reinterpreted value.
Description
Reinterpret the integer value of hi as the high 32 bits of a double-precision floating-point
value and the integer value of lo as the low 32 bits of the same double-precision floating-point
value.
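The assembly of a double from two 32-bit halves can be sketched on the host as follows. This is an illustrative model under the assumption that double is the 64-bit IEEE-754 format; `ref_hiloint2double` is a hypothetical name, not a CUDA function.

```cpp
#include <cstdint>
#include <cstring>

// Place hi in bits 63..32 and lo in bits 31..0, then read the 64 bits as a double.
double ref_hiloint2double(int hi, int lo) {
    // Go through uint32_t first so a negative lo is not sign-extended into hi.
    const std::uint64_t bits =
        (static_cast<std::uint64_t>(static_cast<std::uint32_t>(hi)) << 32) |
        static_cast<std::uint64_t>(static_cast<std::uint32_t>(lo));
    double d;
    std::memcpy(&d, &bits, sizeof d);
    return d;
}
```

With hi = 0x3FF00000 and lo = 0 the model returns 1.0, because 0x3FF0000000000000 is the bit pattern of 1.0.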
Returns
Returns converted value.
Description
Convert the signed integer value x to a double-precision floating-point value.
Returns
Returns converted value.
Description
Convert the signed integer value x to a single-precision floating-point value in round-down (to
negative infinity) mode.
Returns
Returns converted value.
Description
Convert the signed integer value x to a single-precision floating-point value in round-to-
nearest-even mode.
Returns
Returns converted value.
Description
Convert the signed integer value x to a single-precision floating-point value in round-up (to
positive infinity) mode.
Returns
Returns converted value.
Description
Convert the signed integer value x to a single-precision floating-point value in round-towards-
zero mode.
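The rounding mode matters for integer-to-float conversion only when the magnitude of x exceeds 2^24, because smaller integers are exactly representable in single precision. The sketch below models the four modes on the host without changing the floating-point environment: it rounds to nearest first and then steps one representable value in the required direction. The `ref_` names are hypothetical, and the sketch assumes a 32-bit int, IEEE-754 types, and a host compiler whose double-to-float cast rounds to nearest even.

```cpp
#include <cmath>
#include <limits>

// Round-to-nearest-even via the host's default conversion (assumption).
float ref_int2float_rn(int x) {
    return static_cast<float>(static_cast<double>(x));
}

// Round-down: if nearest rounding went above the exact value, step down.
float ref_int2float_rd(int x) {
    const double exact = static_cast<double>(x);  // every 32-bit int is exact in double
    float f = static_cast<float>(exact);
    if (static_cast<double>(f) > exact) {
        f = std::nextafter(f, -std::numeric_limits<float>::infinity());
    }
    return f;
}

// Round-up: if nearest rounding went below the exact value, step up.
float ref_int2float_ru(int x) {
    const double exact = static_cast<double>(x);
    float f = static_cast<float>(exact);
    if (static_cast<double>(f) < exact) {
        f = std::nextafter(f, std::numeric_limits<float>::infinity());
    }
    return f;
}

// Round-towards-zero: round-up for negative x, round-down otherwise.
float ref_int2float_rz(int x) {
    return x < 0 ? ref_int2float_ru(x) : ref_int2float_rd(x);
}
```

For x = 16777217 (2^24 + 1) the neighbors are 16777216 and 16777218: round-down, round-to-nearest-even and round-towards-zero give 16777216, and round-up gives 16777218.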
Returns
Returns reinterpreted value.
Description
Reinterpret the bits in the signed integer value x as a single-precision floating-point value.
Returns
Returns converted value.
Description
Convert the signed 64-bit integer value x to a double-precision floating-point value in round-
down (to negative infinity) mode.
Returns
Returns converted value.
Description
Convert the signed 64-bit integer value x to a double-precision floating-point value in round-
to-nearest-even mode.
Returns
Returns converted value.
Description
Convert the signed 64-bit integer value x to a double-precision floating-point value in round-
up (to positive infinity) mode.
Returns
Returns converted value.
Description
Convert the signed 64-bit integer value x to a double-precision floating-point value in round-
towards-zero mode.
Returns
Returns converted value.
Description
Convert the signed 64-bit integer value x to a single-precision floating-point value in round-
down (to negative infinity) mode.
Returns
Returns converted value.
Description
Convert the signed 64-bit integer value x to a single-precision floating-point value in round-to-
nearest-even mode.
Returns
Returns converted value.
Description
Convert the signed 64-bit integer value x to a single-precision floating-point value in round-up
(to positive infinity) mode.
Returns
Returns converted value.
Description
Convert the signed 64-bit integer value x to a single-precision floating-point value in round-
towards-zero mode.
Returns
Returns reinterpreted value.
Description
Reinterpret the bits in the 64-bit signed integer value x as a double-precision floating-point
value.
Returns
Returns converted value.
Description
Convert the unsigned integer value x to a double-precision floating-point value.
Returns
Returns converted value.
Description
Convert the unsigned integer value x to a single-precision floating-point value in round-down
(to negative infinity) mode.
Returns
Returns converted value.
Description
Convert the unsigned integer value x to a single-precision floating-point value in round-to-
nearest-even mode.
Returns
Returns converted value.
Description
Convert the unsigned integer value x to a single-precision floating-point value in round-up (to
positive infinity) mode.
Returns
Returns converted value.
Description
Convert the unsigned integer value x to a single-precision floating-point value in round-
towards-zero mode.
Returns
Returns reinterpreted value.
Description
Reinterpret the bits in the unsigned integer value x as a single-precision floating-point value.
Returns
Returns converted value.
Description
Convert the unsigned 64-bit integer value x to a double-precision floating-point value in
round-down (to negative infinity) mode.
Returns
Returns converted value.
Description
Convert the unsigned 64-bit integer value x to a double-precision floating-point value in
round-to-nearest-even mode.
Returns
Returns converted value.
Description
Convert the unsigned 64-bit integer value x to a double-precision floating-point value in
round-up (to positive infinity) mode.
Returns
Returns converted value.
Description
Convert the unsigned 64-bit integer value x to a double-precision floating-point value in
round-towards-zero mode.
Returns
Returns converted value.
Description
Convert the unsigned 64-bit integer value x to a single-precision floating-point value in
round-down (to negative infinity) mode.
Returns
Returns converted value.
Description
Convert the unsigned 64-bit integer value x to a single-precision floating-point value in
round-to-nearest-even mode.
Returns
Returns converted value.
Description
Convert the unsigned 64-bit integer value x to a single-precision floating-point value in
round-up (to positive infinity) mode.
Returns
Returns converted value.
Description
Convert the unsigned 64-bit integer value x to a single-precision floating-point value in
round-towards-zero mode.
Returns
Returns computed value.
Description
Splits 4 bytes of argument into 2 parts, each consisting of 2 bytes, then computes absolute
value for each part. Partial results are recombined and returned as unsigned int.
Returns
Returns computed value.
Description
Splits argument by bytes. Computes absolute value of each byte. Partial results are
recombined and returned as unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts function computes signed absolute difference. Partial results are recombined and
returned as unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts function computes signed absolute difference. Partial results are recombined and
returned as unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts function computes unsigned absolute difference. Partial results are recombined and
returned as unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts function computes unsigned absolute difference. Partial results are recombined and
returned as unsigned int.
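The split, compute, recombine pattern described above can be written out as a loop over the four byte positions. The following host-side C++ sketch is an illustrative model, not the device implementation; `ref_vabsdiffu4` is a hypothetical name, and treating each byte as an unsigned value is an assumption of the sketch.

```cpp
#include <cstdint>

// Per-byte absolute difference, each byte read as an unsigned value 0..255.
std::uint32_t ref_vabsdiffu4(std::uint32_t a, std::uint32_t b) {
    std::uint32_t result = 0;
    for (int shift = 0; shift < 32; shift += 8) {
        const std::uint32_t x = (a >> shift) & 0xFFu;      // part of a
        const std::uint32_t y = (b >> shift) & 0xFFu;      // matching part of b
        const std::uint32_t diff = x > y ? x - y : y - x;  // |x - y|, never negative
        result |= diff << shift;                           // recombine
    }
    return result;
}
```

For example, `ref_vabsdiffu4(0x10FF0005, 0x20000105)` yields 0x10FF0100: the byte pairs (0x10, 0x20), (0xFF, 0x00), (0x00, 0x01) and (0x05, 0x05) differ by 0x10, 0xFF, 0x01 and 0x00.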
Returns
Returns computed value.
Description
Splits 4 bytes of argument into 2 parts, each consisting of 2 bytes, then computes absolute
value with signed saturation for each part. Partial results are recombined and returned as
unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of argument into 4 parts, each consisting of 1 byte, then computes absolute
value with signed saturation for each part. Partial results are recombined and returned as
unsigned int.
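Signed saturation only changes the result for the most negative part value, whose magnitude does not fit in the part. The host-side C++ sketch below contrasts the two behaviors for 2-byte parts. All names are hypothetical, and the sketch assumes that the non-saturating absolute value of -32768 wraps back to 0x8000, which the text above does not state explicitly.

```cpp
#include <cstdint>

// Absolute value of each 16-bit part. -32768 has no positive counterpart in
// 16 bits: it wraps to 0x8000 (saturate == false) or clamps to 0x7FFF (true).
std::uint32_t ref_vabs2_model(std::uint32_t a, bool saturate) {
    std::uint32_t result = 0;
    for (int shift = 0; shift < 32; shift += 16) {
        const std::uint32_t raw = (a >> shift) & 0xFFFFu;
        // Interpret the part as a two's-complement 16-bit value.
        const std::int32_t value = raw >= 0x8000u
            ? static_cast<std::int32_t>(raw) - 0x10000
            : static_cast<std::int32_t>(raw);
        std::int32_t magnitude = value < 0 ? -value : value;  // at most 32768
        if (saturate && magnitude > 0x7FFF) {
            magnitude = 0x7FFF;
        }
        result |= (static_cast<std::uint32_t>(magnitude) & 0xFFFFu) << shift;
    }
    return result;
}

std::uint32_t ref_vabs2(std::uint32_t a)   { return ref_vabs2_model(a, false); }
std::uint32_t ref_vabsss2(std::uint32_t a) { return ref_vabs2_model(a, true); }
```

For the input 0xFFFF8000 (parts -1 and -32768) the plain model returns 0x00018000 and the saturating model returns 0x00017FFF.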
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes, then performs
unsigned addition on corresponding parts. Partial results are recombined and returned as
unsigned int.
Returns
Returns computed value.
Description
Splits 'a' into 4 bytes, then performs unsigned addition on each of these bytes with the
corresponding byte from 'b', ignoring overflow. Partial results are recombined and returned as
unsigned int.
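"Ignoring overflow" means that a carry out of one byte is dropped instead of spilling into the next byte. The host-side C++ sketch below shows two equivalent models of that rule: a plain loop over the bytes and a branch-free bit trick. Both function names are hypothetical, and neither is claimed to be how the device computes the result.

```cpp
#include <cstdint>

// Straightforward model: add each pair of bytes and keep the low 8 bits.
std::uint32_t ref_vadd4(std::uint32_t a, std::uint32_t b) {
    std::uint32_t result = 0;
    for (int shift = 0; shift < 32; shift += 8) {
        const std::uint32_t sum = ((a >> shift) & 0xFFu) + ((b >> shift) & 0xFFu);
        result |= (sum & 0xFFu) << shift;  // drop the carry out of the byte
    }
    return result;
}

// Branch-free variant: add the low 7 bits of every byte, then fold in the
// top bit of each byte with XOR so that no carry crosses a byte boundary.
std::uint32_t ref_vadd4_swar(std::uint32_t a, std::uint32_t b) {
    const std::uint32_t low7 = (a & 0x7F7F7F7Fu) + (b & 0x7F7F7F7Fu);
    return low7 ^ ((a ^ b) & 0x80808080u);
}
```

Both models map (0x01FF7F80, 0x01010180) to 0x02008000: the bytes 0xFF + 0x01 and 0x80 + 0x80 wrap to 0x00 without disturbing their neighbors.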
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes, then performs addition
with signed saturation on corresponding parts. Partial results are recombined and returned as
unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte, then performs addition
with signed saturation on corresponding parts. Partial results are recombined and returned as
unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes, then performs addition
with unsigned saturation on corresponding parts.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte, then performs addition
with unsigned saturation on corresponding parts.
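Saturation clamps a per-part sum to the representable range instead of letting it wrap. The host-side C++ sketch below models per-byte addition with unsigned saturation (range 0 to 255) and with signed saturation (range -128 to 127). The function names are hypothetical, and the code is an illustrative model rather than the device implementation.

```cpp
#include <algorithm>
#include <cstdint>

// Unsigned saturation: sums above 255 clamp to 0xFF.
std::uint32_t ref_vaddus4(std::uint32_t a, std::uint32_t b) {
    std::uint32_t result = 0;
    for (int shift = 0; shift < 32; shift += 8) {
        const std::uint32_t sum = ((a >> shift) & 0xFFu) + ((b >> shift) & 0xFFu);
        result |= std::min<std::uint32_t>(sum, 0xFFu) << shift;
    }
    return result;
}

// Read one byte of a word as a two's-complement value in -128..127.
static std::int32_t ref_signed_byte(std::uint32_t word, int shift) {
    const std::int32_t v = static_cast<std::int32_t>((word >> shift) & 0xFFu);
    return v >= 0x80 ? v - 0x100 : v;
}

// Signed saturation: sums clamp to the range -128..127.
std::uint32_t ref_vaddss4(std::uint32_t a, std::uint32_t b) {
    std::uint32_t result = 0;
    for (int shift = 0; shift < 32; shift += 8) {
        std::int32_t sum = ref_signed_byte(a, shift) + ref_signed_byte(b, shift);
        sum = std::max<std::int32_t>(-128, std::min<std::int32_t>(127, sum));
        result |= (static_cast<std::uint32_t>(sum) & 0xFFu) << shift;
    }
    return result;
}
```

For the inputs (0x7F80FF01, 0x0180FF01) the unsigned model returns 0x80FFFF02 and the signed model returns 0x7F80FE02, so the same bit patterns saturate in different bytes depending on how the parts are interpreted.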
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes, then computes signed
rounded average of corresponding parts. Partial results are recombined and returned as
unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte, then computes signed
rounded average of corresponding parts. Partial results are recombined and returned as
unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes, then computes
unsigned rounded average of corresponding parts. Partial results are recombined and
returned as unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte, then computes
unsigned rounded average of corresponding parts. Partial results are recombined and
returned as unsigned int.
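A rounded average of unsigned parts can be modeled as (x + y + 1) / 2 computed in a wider type. The host-side C++ sketch below does this per byte. `ref_vavgu4` is a hypothetical name, and the sketch assumes that "rounded" means an exact half is rounded upward.

```cpp
#include <cstdint>

// Per-byte unsigned average, with halves rounded up (assumption).
std::uint32_t ref_vavgu4(std::uint32_t a, std::uint32_t b) {
    std::uint32_t result = 0;
    for (int shift = 0; shift < 32; shift += 8) {
        const std::uint32_t x = (a >> shift) & 0xFFu;
        const std::uint32_t y = (b >> shift) & 0xFFu;
        // The sum is formed in 32 bits, so it cannot overflow the byte;
        // the largest result, (255 + 255 + 1) >> 1, is 255.
        result |= ((x + y + 1u) >> 1) << shift;
    }
    return result;
}
```

For example, `ref_vavgu4(0x01FF0003, 0x02FF0104)` yields 0x02FF0104: the pairs (1, 2), (0, 1) and (3, 4) round up to 2, 1 and 4, and (0xFF, 0xFF) stays 0xFF.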
Returns
Returns 0xffff if a = b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts result is ffff if they are equal, and 0000 otherwise. For example __vcmpeq2(0x1234aba5,
0x1234aba6) returns 0xffff0000.
Returns
Returns 0xff if a = b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is ff if they are equal, and 00 otherwise. For example __vcmpeq4(0x1234aba5,
0x1234aba6) returns 0xffffff00.
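The two examples above can be reproduced with a small host-side C++ model of the per-part equality mask. The function names are hypothetical, and the code is an illustrative sketch, not the device implementation.

```cpp
#include <cstdint>

// Compare parts of `bits` width (8 or 16). Equal parts produce an all-ones
// part in the result, unequal parts produce an all-zero part.
std::uint32_t ref_vcmpeq(std::uint32_t a, std::uint32_t b, int bits) {
    const std::uint32_t mask = (1u << bits) - 1u;  // 0xFF or 0xFFFF
    std::uint32_t result = 0;
    for (int shift = 0; shift < 32; shift += bits) {
        if (((a >> shift) & mask) == ((b >> shift) & mask)) {
            result |= mask << shift;
        }
    }
    return result;
}

std::uint32_t ref_vcmpeq2(std::uint32_t a, std::uint32_t b) { return ref_vcmpeq(a, b, 16); }
std::uint32_t ref_vcmpeq4(std::uint32_t a, std::uint32_t b) { return ref_vcmpeq(a, b, 8); }
```

A mask of this form is convenient for branch-free selection, for example `(x & mask) | (y & ~mask)` picks the parts of x where the comparison held and the parts of y elsewhere.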
Returns
Returns 0xffff if a >= b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For
corresponding parts result is ffff if 'a' part >= 'b' part, and 0000 otherwise. For example
__vcmpges2(0x1234aba5, 0x1234aba6) returns 0xffff0000.
Returns
Returns 0xff if a >= b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is ff if 'a' part >= 'b' part, and 00 otherwise. For example __vcmpges4(0x1234aba5,
0x1234aba6) returns 0xffffff00.
Returns
Returns 0xffff if a >= b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For
corresponding parts result is ffff if 'a' part >= 'b' part, and 0000 otherwise. For example
__vcmpgeu2(0x1234aba5, 0x1234aba6) returns 0xffff0000.
Returns
Returns 0xff if a >= b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is ff if 'a' part >= 'b' part, and 00 otherwise. For example __vcmpgeu4(0x1234aba5,
0x1234aba6) returns 0xffffff00.
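For the example inputs above, a signed and an unsigned reading of the bytes give the same mask. The two readings diverge once a byte with its top bit set is compared against one without. The host-side C++ sketch below models both with one hypothetical function, `ref_vcmpge4`, whose `is_signed` flag is an illustration device and not part of any CUDA signature.

```cpp
#include <cstdint>

// Read a byte value 0..255 as two's complement, -128..127.
static std::int32_t ref_as_signed_byte(std::uint32_t v) {
    return v >= 0x80u ? static_cast<std::int32_t>(v) - 0x100
                      : static_cast<std::int32_t>(v);
}

// Per-byte a >= b, producing 0xff where true and 0x00 where false.
std::uint32_t ref_vcmpge4(std::uint32_t a, std::uint32_t b, bool is_signed) {
    std::uint32_t result = 0;
    for (int shift = 0; shift < 32; shift += 8) {
        const std::uint32_t x = (a >> shift) & 0xFFu;
        const std::uint32_t y = (b >> shift) & 0xFFu;
        const bool ge = is_signed
            ? ref_as_signed_byte(x) >= ref_as_signed_byte(y)
            : x >= y;
        if (ge) {
            result |= 0xFFu << shift;
        }
    }
    return result;
}
```

With a = 0x80000000 and b = 0x7F000000 the top byte is -128 against 127 when signed and 128 against 127 when unsigned, so the signed model returns 0x00ffffff and the unsigned model returns 0xffffffff.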
Returns
Returns 0xffff if a > b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For
corresponding parts result is ffff if 'a' part > 'b' part, and 0000 otherwise. For example
__vcmpgts2(0x1234aba5, 0x1234aba6) returns 0x00000000.
Returns
Returns 0xff if a > b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is ff if 'a' part > 'b' part, and 00 otherwise. For example __vcmpgts4(0x1234aba5,
0x1234aba6) returns 0x00000000.
Returns
Returns 0xffff if a > b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For
corresponding parts result is ffff if 'a' part > 'b' part, and 0000 otherwise. For example
__vcmpgtu2(0x1234aba5, 0x1234aba6) returns 0x00000000.
Returns
Returns 0xff if a > b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is ff if 'a' part > 'b' part, and 00 otherwise. For example __vcmpgtu4(0x1234aba5,
0x1234aba6) returns 0x00000000.
Returns
Returns 0xffff if a <= b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For
corresponding parts result is ffff if 'a' part <= 'b' part, and 0000 otherwise. For example
__vcmples2(0x1234aba5, 0x1234aba6) returns 0xffffffff.
Returns
Returns 0xff if a <= b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is ff if 'a' part <= 'b' part, and 00 otherwise. For example __vcmples4(0x1234aba5,
0x1234aba6) returns 0xffffffff.
Returns
Returns 0xffff if a <= b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For
corresponding parts result is ffff if 'a' part <= 'b' part, and 0000 otherwise. For example
__vcmpleu2(0x1234aba5, 0x1234aba6) returns 0xffffffff.
Returns
Returns 0xff if a <= b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is ff if 'a' part <= 'b' part, and 00 otherwise. For example __vcmpleu4(0x1234aba5,
0x1234aba6) returns 0xffffffff.
Returns
Returns 0xffff if a < b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For
corresponding parts result is ffff if 'a' part < 'b' part, and 0000 otherwise. For example
__vcmplts2(0x1234aba5, 0x1234aba6) returns 0x0000ffff.
Returns
Returns 0xff if a < b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is ff if 'a' part < 'b' part, and 00 otherwise. For example __vcmplts4(0x1234aba5,
0x1234aba6) returns 0x000000ff.
Returns
Returns 0xffff if a < b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For
corresponding parts result is ffff if 'a' part < 'b' part, and 0000 otherwise. For example
__vcmpltu2(0x1234aba5, 0x1234aba6) returns 0x0000ffff.
Returns
Returns 0xff if a < b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is ff if 'a' part < 'b' part, and 00 otherwise. For example __vcmpltu4(0x1234aba5,
0x1234aba6) returns 0x000000ff.
Returns
Returns 0xffff if a != b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For
corresponding parts result is ffff if 'a' part != 'b' part, and 0000 otherwise. For example
__vcmpne2(0x1234aba5, 0x1234aba6) returns 0x0000ffff.
Returns
Returns 0xff if a != b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is ff if 'a' part != 'b' part, and 00 otherwise. For example __vcmpne4(0x1234aba5,
0x1234aba6) returns 0x000000ff.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes, then computes
unsigned average of corresponding parts. Partial results are recombined and returned as
unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte, then computes
unsigned average of corresponding parts. Partial results are recombined and returned as
unsigned int.
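An unsigned per-part average without rounding can be computed on the whole word at once, using the identity a + b = 2 * (a AND b) + (a XOR b). The host-side C++ sketch below applies it to both part sizes. The function names are hypothetical, and the sketch assumes that the average here truncates (a half is dropped), in contrast to the rounded averages described earlier.

```cpp
#include <cstdint>

// floor((x + y) / 2) for every byte, with no carry between bytes.
// (a & b) holds the bits set in both inputs (worth double in the sum);
// (a ^ b) >> 1 is half of the bits set in exactly one input. The mask
// clears the bit that the shift drags in from the neighboring byte.
std::uint32_t ref_vhaddu4(std::uint32_t a, std::uint32_t b) {
    return (a & b) + (((a ^ b) >> 1) & 0x7F7F7F7Fu);
}

// Same identity for two 16-bit parts.
std::uint32_t ref_vhaddu2(std::uint32_t a, std::uint32_t b) {
    return (a & b) + (((a ^ b) >> 1) & 0x7FFF7FFFu);
}
```

For example, `ref_vhaddu4(0x01FF0003, 0x02FF0104)` yields 0x01FF0003: the pairs (1, 2), (0, 1) and (3, 4) truncate to 1, 0 and 3.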
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts function computes signed maximum. Partial results are recombined and returned as
unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding parts
function computes signed maximum. Partial results are recombined and returned as unsigned
int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts function computes unsigned maximum. Partial results are recombined and returned as
unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts function computes unsigned maximum. Partial results are recombined and returned as
unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts function computes signed minimum. Partial results are recombined and returned as
unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding parts
function computes signed minimum. Partial results are recombined and returned as unsigned
int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts function computes unsigned minimum. Partial results are recombined and returned as
unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts function computes unsigned minimum. Partial results are recombined and returned as
unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of argument into 2 parts, each consisting of 2 bytes. For each part function
computes negation. Partial results are recombined and returned as unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of argument into 4 parts, each consisting of 1 byte. For each part function
computes negation. Partial results are recombined and returned as unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of argument into 2 parts, each consisting of 2 bytes. For each part function
computes negation with signed saturation. Partial results are recombined and returned as
unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of argument into 4 parts, each consisting of 1 byte. For each part function
computes negation with signed saturation. Partial results are recombined and returned as
unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts function computes absolute differences and returns sum of those differences.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding parts
function computes absolute differences and returns sum of those differences.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts function computes absolute differences and returns sum of those differences.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding parts
function computes absolute differences and returns sum of those differences.
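Unlike the per-part operations, a sum of absolute differences reduces the parts to a single number. The host-side C++ sketch below models the per-byte case. `ref_vsadu4` is a hypothetical name, and reading each byte as unsigned is an assumption of the sketch.

```cpp
#include <cstdint>

// Sum of |x - y| over the four byte pairs, bytes read as unsigned 0..255.
// The largest possible result is 4 * 255 = 1020.
std::uint32_t ref_vsadu4(std::uint32_t a, std::uint32_t b) {
    std::uint32_t sum = 0;
    for (int shift = 0; shift < 32; shift += 8) {
        const std::uint32_t x = (a >> shift) & 0xFFu;
        const std::uint32_t y = (b >> shift) & 0xFFu;
        sum += x > y ? x - y : y - x;
    }
    return sum;
}
```

For example, `ref_vsadu4(0x10FF0005, 0x20000105)` yields 272, which is 16 + 255 + 1 + 0.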
Returns
Returns 1 if a = b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts result is 1 if 'a' part == 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns 1 if a = b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is 1 if 'a' part == 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
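The host-side C++ sketch below models the per-byte equality test under the assumption that every part of the result carries its own 1 or 0, so the returned word can be, for example, 0x01010100 rather than a single 0 or 1. Both function names are hypothetical. The second function shows how such a result relates to an ff/00 mask.

```cpp
#include <cstdint>

// Per-byte equality with a boolean (0 or 1) stored in each result byte.
// Assumption: results are per part, not one flag for the whole word.
std::uint32_t ref_vseteq4(std::uint32_t a, std::uint32_t b) {
    std::uint32_t result = 0;
    for (int shift = 0; shift < 32; shift += 8) {
        if (((a >> shift) & 0xFFu) == ((b >> shift) & 0xFFu)) {
            result |= 1u << shift;
        }
    }
    return result;
}

// Widen each 0/1 byte to 0x00/0xff. Multiplying by 0xFF cannot carry
// between bytes because every byte of `flags` is at most 1.
std::uint32_t ref_bool_to_mask4(std::uint32_t flags) {
    return flags * 0xFFu;
}
```

Under this reading, the inputs (0x1234aba5, 0x1234aba6) give 0x01010100, and widening that value gives the mask 0xffffff00.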
Returns
Returns 1 if a >= b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts result is 1 if 'a' part >= 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns 1 if a >= b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is 1 if 'a' part >= 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns 1 if a >= b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts result is 1 if 'a' part >= 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns 1 if a >= b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is 1 if 'a' part >= 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns 1 if a > b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts result is 1 if 'a' part > 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns 1 if a > b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is 1 if 'a' part > 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns 1 if a > b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts result is 1 if 'a' part > 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns 1 if a > b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is 1 if 'a' part > 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns 1 if a <= b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts result is 1 if 'a' part <= 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns 1 if a <= b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is 1 if 'a' part <= 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns 1 if a <= b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts result is 1 if 'a' part <= 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns 1 if a <= b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is 1 if 'a' part <= 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns 1 if a < b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts result is 1 if 'a' part < 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns 1 if a < b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is 1 if 'a' part < 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns 1 if a < b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts result is 1 if 'a' part < 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns 1 if a < b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is 1 if 'a' part < 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns 1 if a != b, else returns 0.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts result is 1 if 'a' part != 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns 1 if a != b, else returns 0.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding
parts result is 1 if 'a' part != 'b' part, and 0 otherwise. Partial results are recombined and
returned as unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts function performs subtraction. Partial results are recombined and returned as unsigned
int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding parts
function performs subtraction. Partial results are recombined and returned as unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts function performs subtraction with signed saturation. Partial results are recombined
and returned as unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding parts
function performs subtraction with signed saturation. Partial results are recombined and
returned as unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 2 parts, each consisting of 2 bytes. For corresponding
parts function performs subtraction with unsigned saturation. Partial results are recombined
and returned as unsigned int.
Returns
Returns computed value.
Description
Splits 4 bytes of each argument into 4 parts, each consisting of 1 byte. For corresponding parts
function performs subtraction with unsigned saturation. Partial results are recombined and
returned as unsigned int.
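The difference between plain and saturating subtraction shows up whenever a part of the second argument is larger than the matching part of the first. The host-side C++ sketch below contrasts the two for unsigned bytes. Both function names are hypothetical, and the plain variant is assumed to wrap modulo 256.

```cpp
#include <cstdint>

// Plain per-byte subtraction: a borrow wraps within the byte (modulo 256).
std::uint32_t ref_vsub4(std::uint32_t a, std::uint32_t b) {
    std::uint32_t result = 0;
    for (int shift = 0; shift < 32; shift += 8) {
        const std::uint32_t x = (a >> shift) & 0xFFu;
        const std::uint32_t y = (b >> shift) & 0xFFu;
        result |= ((x - y) & 0xFFu) << shift;
    }
    return result;
}

// Unsigned saturation: a difference that would be negative clamps to 0.
std::uint32_t ref_vsubus4(std::uint32_t a, std::uint32_t b) {
    std::uint32_t result = 0;
    for (int shift = 0; shift < 32; shift += 8) {
        const std::uint32_t x = (a >> shift) & 0xFFu;
        const std::uint32_t y = (b >> shift) & 0xFFu;
        result |= (x > y ? x - y : 0u) << shift;
    }
    return result;
}
```

For the inputs (0x50FF0005, 0x20010105) the plain model returns 0x30FEFF00 and the saturating model returns 0x30FE0000: only the byte pair (0x00, 0x01) differs, wrapping to 0xFF in one case and clamping to 0x00 in the other.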
OpenCL
OpenCL is a trademark of Apple Inc. used under license to the Khronos Group Inc.
Trademarks
NVIDIA and the NVIDIA logo are trademarks or registered trademarks of NVIDIA Corporation in the U.S. and other countries. Other company and product names may
be trademarks of the respective companies with which they are associated.
Copyright
© 2007-2022 NVIDIA Corporation & affiliates. All rights reserved.