BLD: Modify cpu detection and printing to get working aarch64 build #11568

ksunden · 2018-07-14T16:32:35Z

Closes #11564

jjhelmus · 2018-07-14T16:43:15Z

Once the build is working again on aarch64 we could use the new ARMv8-A on Shippable for an altarch CI.

charris · 2018-07-14T18:20:18Z

LGTM failure looks unrelated.

charris · 2018-07-14T18:21:32Z

Has this been tested to see if it fixes the build?

charris · 2018-07-14T18:25:54Z

numpy/core/src/multiarray/dragon4.c

@@ -2714,7 +2714,7 @@ Dragon4_PrintFloat_Intel_extended128(
 * becomes more common.
 */
 static npy_uint32
-Dragon4_PrintFloat_IEEE_binary128(
+Dragon4_PrintFloat_IEEE_binary128_le(


Why this change?

This function is for quad precision floats, I doubt that ARM implements that.

Without this change, I got warnings for implicit definition of Dragon4_PrintFloat_IEEE_binary128_le and unused symbol for Dragon4_PrintFloat_IEEE_binary128

Additionally, it failed on import looking for Dragon4_PrintFloat_IEEE_binary128_le

It is very possible that the root cause is actually the identification of what representation it should be using. I've been looking at

numpy/numpy/core/setup_common.py

Line 343 in 20a80e0

def long_double_representation(lines):

to see if there's something incorrect there.

I've created the object file outside of running setup_common, and confirmed that the sequence that identifies IEEE_QUAD_LE is indeed present in the od -b dump

Given the error that was thrown, I tried the naive approach of just editing the function name, not fully expecting it to work, but it did (at least as far as building was concerned, still have some test failures)

Hmm, LLVM does seem to have software support for quad precision, so I think we need to track this down. What is your compiler tool chain? I'm concerned about the endian part. @ahaldane Comment?

Maybe wrong flag, but it is there somewhere, https://llvm.org/docs/LangRef.html#floating-point-types. We should be detecting on the byte representation, so either there is an error in that or the compiler is actually producing quad floats.

OK we need to have two functions, one with the _be extension and one with the _le extension. This function, despite the comment, looks to be big endian with the little endian version in #11570. The difference is only four lines, so it would be nice to have wrappers for the endian specific versions that called into the common code, but the easy fix would be copy the function from # 11570 here with the appropriate name.

See similar wrappers for the various extended precision alignments.

I think all the wrappers need to do is swap the a and b members of buf128, so maybe only one wrapper to do that for the _le version and name this function with the _be extension.

I've pushed the easy fix, cherry-picking the commit from my other PR, I'll work on the wrapper func now

charris · 2018-07-14T20:04:25Z

The availability of half precision is also interesting.

ksunden · 2018-07-14T22:17:48Z

I have indeed built this on my machine, and @jjhelmus has built it as well on a remote machine

charris · 2018-07-15T02:17:38Z

Out of curiosity, what does np.finfo(np.float128) show?

ksunden · 2018-07-15T04:41:31Z

>>> np.finfo(np.float128)
finfo(resolution=1e-33, min=-1.189731495357231765085759326628007e+4932, max=1.189731495357231765085759326628007e+4932, dtype=float128)

charris · 2018-07-15T13:42:04Z

OK, looks like genuine IEEE quad precision long doubles, the finfo function should be making the identification based on the byte representation. It would be interesting to know why they are there, as that seems not to be common news, but the signs and portents do seem to indicate that the long delayed move to quad precision is about to get underway.

[ci skip]

charris · 2018-07-16T01:22:26Z

Thanks @ksunden .

…umpy#11568)

QuLogic · 2018-08-11T05:23:14Z

I think this broke some big-endian systems. Dragon4_PrintFloat_IEEE_binary128 exists if defined(HAVE_LDOUBLE_IEEE_QUAD_LE) || defined(HAVE_LDOUBLE_IEEE_QUAD_BE) and calls LogBase2_128, but that only exists if defined(HAVE_LDOUBLE_IEEE_QUAD_LE), which means it's not available on big-endian systems.

QuLogic · 2018-08-11T06:30:25Z

BigInt_Set_2x_uint64 is also not defined for HAVE_LDOUBLE_IEEE_QUAD_BE and called by this function.

Both these functions are used by `Dragon4_PrintFloat_IEEE_binary128`, which was recently made available on big-endian systems without these in numpy#11568.

Build working on aarch64

2b6a063

charris added 00 - Bug 06 - Regression 09 - Backport-Candidate PRs tagged should be backported component: numpy._core labels Jul 14, 2018

charris added this to the 1.15.0 release milestone Jul 14, 2018

charris reviewed Jul 14, 2018

View reviewed changes

ksunden mentioned this pull request Jul 14, 2018

BUG: Fix printing of float128 on ARM #11570

Closed

ksunden and others added 4 commits July 15, 2018 16:05

Swap a and b in float128 parsing

b7a2f11

Wrappers for IEEE_QUAD be and le

490cbb0

Update comments

4cc8980

MAINT: Fix spelling and remove obsolete comment.

106ce3a

[ci skip]

charris changed the title ~~BLD: Modify cpu detection and function to get build working on aarch64~~ BLD: Modify cpu detection and printing to get working aarch64 build Jul 16, 2018

charris merged commit f07359b into numpy:master Jul 16, 2018

charris pushed a commit to charris/numpy that referenced this pull request Jul 16, 2018

BLD: Modify cpu detection and function to get working aarch64 build (n…

44084fc

…umpy#11568)

charris mentioned this pull request Jul 16, 2018

BLD: Modify cpu detection and printing to get working aarch64 build #11577

Merged

charris added component: build and removed 09 - Backport-Candidate PRs tagged should be backported labels Jul 16, 2018

charris removed this from the 1.15.0 release milestone Jul 16, 2018

QuLogic mentioned this pull request Aug 11, 2018

BUG: Fix undefined functions on big-endian systems. #11711

Merged

charris mentioned this pull request Aug 12, 2018

BUG: Fix undefined functions on big-endian systems. #11719

Merged

Uh oh!

BLD: Modify cpu detection and printing to get working aarch64 build #11568

BLD: Modify cpu detection and printing to get working aarch64 build #11568

Uh oh!

Conversation

ksunden commented Jul 14, 2018

Uh oh!

jjhelmus commented Jul 14, 2018

Uh oh!

charris commented Jul 14, 2018

Uh oh!

charris commented Jul 14, 2018

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

charris Jul 14, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

charris commented Jul 14, 2018

Uh oh!

ksunden commented Jul 14, 2018

Uh oh!

charris commented Jul 15, 2018

Uh oh!

ksunden commented Jul 15, 2018

Uh oh!

charris commented Jul 15, 2018

Uh oh!

charris commented Jul 16, 2018

Uh oh!

QuLogic commented Aug 11, 2018

Uh oh!

QuLogic commented Aug 11, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

charris Jul 14, 2018 •

edited

Loading

QuLogic commented Aug 11, 2018 •

edited

Loading