Make stoip6 return whether the conversion succeeded #72

Taiki-San · 2018-06-29T12:44:15Z

Implement the change suggested by @kjbracey-arm in #71.
stoip6 now returns false if the string is too long, if characters besides hexadecimal characters or ':' are detected, or if it's missing fields.

kjbracey · 2018-07-10T10:24:28Z

source/libip6string/stoip6.c

-        /* Should really report an error if we didn't get 8 fields */
-        memset(addr, 0, 16 - field_no * 2);
+        //Report an error if we didn't get 8 fields
+        return false;


Just in case, for backwards compatibility, keep the memset in. We've already written something to the buffer. May as well fully fill it.

I'd like to minimise the chances of breaking a large body of code using this.

Although we now are being stricter on the is_hex check. Maybe that should carry on but set an "error" flag to return false at the end? Guarantee we don't break anyone?

Ok with memset.

As for soft failing, I'm not totally comfortable with such a lax parser, especially for something as important as parsing IP addresses. If you want to push to avoid breaking third-party code, I'll make that change, but I'd rather see buggy/unsafe code break before being exploited by hackers. Your call.

Tend to agree, but the code currently returns void, and there are lots of users, so even if you make it return false, no existing code is going to be checking the return value.

By returning early, you make current users process uninitialised data rather than the definite value they're getting at the moment.

Once people actually start looking at the return value, then it becomes a strict parser.

Would memset-ing the entire output buffer to 0 every time we return false be an appropriate alternative?

Seems plausible. I'm just nervous about any change to the output values here, as I suspect there's not enough CI on this repo - we wouldn't see any problems until incorporated into upstream projects.

Hmm. Okay, make it clear the buffer when returning false. (Stylistically I'm fine with a goto error for that).
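
For illustration, the pattern agreed on above (a single `error:` label that zeroes the entire output before returning false) could look roughly like the sketch below. This is not the code from the PR: `parse_full_ip6` and `hex_value` are hypothetical names, and the parser is deliberately simplified to the full 8-field form with no "::" support.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: value of one hex digit, or -1 if c is not one. */
static int hex_value(char c)
{
    if (c >= '0' && c <= '9') return c - '0';
    if (c >= 'a' && c <= 'f') return c - 'a' + 10;
    if (c >= 'A' && c <= 'F') return c - 'A' + 10;
    return -1;
}

/* Simplified stand-in for stoip6 (assumption: no "::" handling).
 * Parses exactly 8 colon-separated fields of 1 to 4 hex digits.
 * On any failure the whole 16-byte output is zeroed, so callers that
 * ignore the return value never read uninitialised or partial data. */
bool parse_full_ip6(const char *s, uint8_t out[16])
{
    int field = 0;

    while (field < 8) {
        uint16_t value = 0;
        int digits = 0;
        int v;

        while ((v = hex_value(*s)) >= 0) {
            if (++digits > 4) {
                goto error; /* more than 4 hex digits in one field */
            }
            value = (uint16_t)((value << 4) | v);
            s++;
        }
        if (digits == 0) {
            goto error; /* empty field or non-hex character */
        }
        out[field * 2] = (uint8_t)(value >> 8);
        out[field * 2 + 1] = (uint8_t)(value & 0xff);
        field++;

        if (field < 8) {
            if (*s != ':') {
                goto error; /* missing fields or unexpected character */
            }
            s++;
        }
    }
    if (*s != '\0') {
        goto error; /* trailing characters after the 8th field */
    }
    return true;

error:
    /* Fields already written are discarded along with the rest. */
    memset(out, 0, 16);
    return false;
}
```

With this shape, a caller that still treats the function as returning void sees an all-zero address on bad input instead of whatever happened to be in the buffer.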

kjbracey · 2018-07-10T10:24:38Z

source/libip6string/stoip6.c

    }

    // First go forward the string, until end, noting :: position if any
    for (field_no = 0, p = ip6addr; (len > (size_t)(p - ip6addr)) && *p && field_no < 8; p = q + 1) {
        q = p;
        // Seek for ':' or end
        while (*q && (*q != ':')) {
-            q++;
+            //There must only be hex characters besides ':'


Space after //, also below.

kjbracey · 2018-07-10T10:27:54Z

source/libip6string/stoip6.c

@@ -20,15 +20,17 @@
 #include "ip6string.h"

 static uint16_t hex(const char *p);
+static bool is_hex(const char c);


Not a fan of meaningless const in declaration prototypes - you're passing by value. You're free to retain it in the definition below, I guess, but there's no const size_t or void * const dest going on, so I think this is just randomly picked up from the line above where the const means something different. This is actually analogous to const char * const p.

(Argument variables can be declared const or not inside a function, indicating the constness of the copy. For the parameters, const is meaningless, and doesn't affect type-checking of definition versus declaration.)
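
A small sketch of the point about by-value `const`: the prototype omits the qualifier, the definition keeps it, and the two are still compatible because top-level qualifiers on parameters are ignored when function types are compared. The body of `is_hex` here is an assumption for illustration, not necessarily what the PR contains.

```c
#include <stdbool.h>

/* Prototype: no top-level const on the by-value parameter. */
static bool is_hex(char c);

/* Definition: const only constrains the local copy inside the body.
 * This matches the prototype above and compiles without a diagnostic. */
static bool is_hex(const char c)
{
    return (c >= '0' && c <= '9') ||
           (c >= 'a' && c <= 'f') ||
           (c >= 'A' && c <= 'F');
}
```

By contrast, in `const char *p` the `const` qualifies the pointed-to data and is part of the function's type, which is why it matters for `hex` on the line above.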

kjbracey

If you want to be stricter, should probably reject any >4-digit segments. Could also reject coloncolon being set if already set.
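
As a rough sketch of the two stricter checks suggested here, the hypothetical function below rejects segments longer than 4 hex digits and a second "::". `ip6_shape_ok` and `is_hex_digit` are made-up names, and the function only checks these rules plus the character set; it does not count fields or otherwise fully validate an address.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical helper: true if c is a hexadecimal digit. */
static bool is_hex_digit(char c)
{
    return (c >= '0' && c <= '9') ||
           (c >= 'a' && c <= 'f') ||
           (c >= 'A' && c <= 'F');
}

/* Structural pre-check only (assumption: not the library's code).
 * Returns false on a non-hex, non-':' character, on a segment with
 * more than 4 hex digits, or on a second occurrence of "::". */
bool ip6_shape_ok(const char *s)
{
    bool seen_coloncolon = false;
    size_t digits = 0;

    for (size_t i = 0; s[i] != '\0'; i++) {
        if (s[i] == ':') {
            digits = 0; /* a colon starts a new segment */
            if (s[i + 1] == ':') {
                if (seen_coloncolon) {
                    return false; /* "::" may appear at most once */
                }
                seen_coloncolon = true;
                i++; /* skip the second colon of the pair */
            }
        } else if (is_hex_digit(s[i])) {
            if (++digits > 4) {
                return false; /* at most 4 hex digits per segment */
            }
        } else {
            return false; /* only hex digits and ':' are allowed */
        }
    }
    return true;
}
```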

kjbracey · 2018-07-10T13:48:20Z

source/libip6string/stoip6.c

    }
    return true;
+
+error:
+    // Fill the output buffer with 0 so we stick to the old failure mechanism


I don't think this exactly matches the old failure in all cases (e.g. non-hex digits), so probably best not to claim that.

Is the new comment less ambiguous, or should I drop the mention of the old behavior?

kjbracey

Looks fine now apart from formatting. Make sure you run the unit test, and you should probably add a couple of tests for the new failure modes.

kjbracey · 2018-07-11T08:25:50Z

source/libip6string/stoip6.c

        q = p;
        while (*q && (*q != ':')) { // Seek for ':' or end
            if (!is_hex(*q++)) { // There must only be hex characters besides ':'
                goto error;
            }
        }
+
+        if((q - p) > 4) { // We can't have more than 4 hex digits per segments


Formatting - space after if. "Per segment"

kjbracey · 2018-07-11T08:26:05Z

source/libip6string/stoip6.c

        // Check if we reached "::"
        if ((len > (size_t)(q - ip6addr)) && *q && (q[0] == ':') && (q[1] == ':')) {
+            if(coloncolon != -1) { // We are not supposed to see "::" more than once per address


Formatting - space after if

Taiki-San · 2018-07-11T21:14:09Z

Rebased the patches after fixing the formatting you mentioned.
Also tweaked the unit tests: they now check whether the call succeeded or failed, and a new test validates the failure in a few scenarios.

Taiki-San · 2018-07-12T07:11:16Z

I don't have the toolchain to run the tests locally, but are they run by the CI? 504b8b2 should have failed.

kjbracey · 2018-07-12T08:19:18Z

I fear the CI is not running those unit tests on PRs. This repo is low-traffic enough we've not spent much time on automation. I can run them locally.

kjbracey · 2018-07-12T08:25:03Z