Starlark: optimize StarlarkInt.Big comparison to StarlarkInt.Int{32,64} #12638

stepancheg · 2020-12-06T00:57:00Z

Perform comparison without conversion of smaller integers to
BigInteger. StarlarkInt.compareTo does not allocate now.

For this benchmark:

def test():
    x = 17 << 77
    for i in range(10):
        print(i)
        for j in range(10000000):
            x > 1

test()

A: n=27 mean=4.262 std=0.203 se=0.039 min=4.036 med=4.193
B: n=27 mean=4.113 std=0.172 se=0.033 min=3.859 med=4.049
B/A: 0.965 0.941..0.990 (95% conf)

Speed up is about 7% when comparing to an integer outside of
BigInteger cached range (-16..16).

Finally, StarlarkInt.Big to StarlarkInt.Big comparison performance
seems to stay the same (within 95% confidence interval after 100
test iterations).

alandonovan · 2020-12-06T15:00:54Z

src/main/java/net/starlark/java/eval/StarlarkInt.java

@@ -446,7 +464,13 @@ public static int compare(StarlarkInt x, StarlarkInt y) {
      /* fall through */
    }

-    return x.toBigInteger().compareTo(y.toBigInteger());
+    int xo = x.orderOfMagnitude();


Delete LL457-459? (GitHub won't let me comment on that line.)

The invariant at this point (L467) is that at least one of the operands is Big. Can we do the comparison efficiently without adding the new virtual method, something like this:

public static int compare(StarlarkInt x, StarlarkInt y) { long xl, yl; boolean xbig, ybig; try { xl = x.toLongFast(); } catch (Overflow) { xbig = true; } try { yl = y.toLongFast(); } catch (Overflow) { ybig = true; } // If only one operand is big, its magnitude is greater than the other operand, // which can be ignored (treated as zero). if (xbig) { return ybig ? ((Big) x).compareTo(y) : x.signum(); } else { return ybig ? -y.signum(), Long.compare(x, y); } }

stepancheg · 2020-12-06T19:26:58Z

Updated PR without orderOfMagnitude virtual method.

Removed special case x instanceof Int32 && y instanceof Int32, possible slowdown is very little (within half percent) for test 1 < 2.

alandonovan · 2020-12-06T20:53:06Z

src/main/java/net/starlark/java/eval/StarlarkInt.java

+    // If neither argument is big integer, we compare longs.
+    // If only one argument is big integer, it is bigger than other if positive
+    // and smaller otherwise.
+    if (xbig && ybig) {


FWIW, this version requires 4 branches to reach the common case (Long.compare), whereas the version I proposed used a control tree of depth two. Probably not a big deal, especially given the mystery and unpredictability of measurements using the JVM, but something to bear in mind when optimizing (especially in a language like C++ or Go or assembly).

Perform comparison without conversion of smaller integers to `BigInteger`. `StarlarkInt.compareTo` does not allocate now. For this benchmark: ``` def test(): x = 17 << 77 for i in range(10): print(i) for j in range(10000000): x > 1 test() ``` ``` A: n=27 mean=4.262 std=0.203 se=0.039 min=4.036 med=4.193 B: n=27 mean=4.113 std=0.172 se=0.033 min=3.859 med=4.049 B/A: 0.965 0.941..0.990 (95% conf) ``` Speed up is about 7% when comparing to an integer outside of `BigInteger` cached range (-16..16). Finally, `StarlarkInt.Big` to `StarlarkInt.Big` comparison performance seems to stay the same (within 95% confidence interval after 100 test iterations).

stepancheg · 2020-12-06T21:30:36Z

Go full branchless on the typical case of long-long comparison.

1 < 2 test seems to perform slightly better than previous version (and even master), but so little, that it is impossible to say for sure without running test for several hours in different host configurations.

Let's say, performance is the same.

stepancheg requested review from alandonovan and brandjon as code owners December 6, 2020 00:57

google-cla bot added the cla: yes label Dec 6, 2020

alandonovan reviewed Dec 6, 2020

View reviewed changes

stepancheg force-pushed the int-compare branch from 2cedae9 to 030d81b Compare December 6, 2020 19:16

stepancheg force-pushed the int-compare branch from 030d81b to 5dd4d34 Compare December 6, 2020 19:39

alandonovan approved these changes Dec 6, 2020

View reviewed changes

stepancheg force-pushed the int-compare branch from 5dd4d34 to 1f75e4d Compare December 6, 2020 21:27

stepancheg force-pushed the int-compare branch from 1f75e4d to dfde464 Compare December 6, 2020 21:34

jin added the team-Starlark label Dec 7, 2020

jin assigned alandonovan Dec 7, 2020

bazel-io closed this in 923519f Dec 7, 2020

stepancheg deleted the int-compare branch December 7, 2020 16:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Starlark: optimize StarlarkInt.Big comparison to StarlarkInt.Int{32,64} #12638

Starlark: optimize StarlarkInt.Big comparison to StarlarkInt.Int{32,64} #12638

stepancheg commented Dec 6, 2020

alandonovan Dec 6, 2020

stepancheg commented Dec 6, 2020

alandonovan Dec 6, 2020

stepancheg commented Dec 6, 2020 •

edited

Loading

Starlark: optimize StarlarkInt.Big comparison to StarlarkInt.Int{32,64} #12638

Starlark: optimize StarlarkInt.Big comparison to StarlarkInt.Int{32,64} #12638

Conversation

stepancheg commented Dec 6, 2020

alandonovan Dec 6, 2020

Choose a reason for hiding this comment

stepancheg commented Dec 6, 2020

alandonovan Dec 6, 2020

Choose a reason for hiding this comment

stepancheg commented Dec 6, 2020 • edited Loading

stepancheg commented Dec 6, 2020 •

edited

Loading