perf: replace BW6-761 final exp by a class equivalence check #1155

yelhousni · 2024-06-03T19:44:16Z

Description

Similarly to #1143 we adapt https://eprint.iacr.org/2024/640.pdf to the BW6-761 case with the following parameters:

$h = \frac{p^6-1}{r}=3^2\cdot l$ with $\text{gcd}(l,3)=1$
$\lambda=x^3-x^2+1-(x+1)q$

// magma code ran online: http://magma.maths.usyd.edu.au/calc
QQ := Rationals();
QQx<x> := PolynomialRing(QQ);
rx := (x^6 - 2*x^5 + 2*x^3 + x + 1)/3;
qx := (103*x^12 - 379*x^11 + 250*x^10 + 691*x^9 - 911*x^8 - 79*x^7 + 623*x^6 - 640*x^5 + 274*x^4 + 763*x^3 + 73*x^2 + 254*x + 229)/9;
M := Matrix(QQx, 2, 2, [rx, 0, -qx mod rx, 1]);
R := LLL(M);
print R

[        x + 1 x^3 - x^2 - x]
[x^3 - x^2 + 1        -x - 1]

assert ((R[2][1] + qx*R[2][2]) mod rx) eq 0; // better Hamming weight

$m=\lambda/r$
$d=\text{gcd}(m,h)=1$
$m'=m/d = m$

First, we find the residue witness in a hint:

Compute r-th root: Raising the miller function to $1/r \pmod h$
Compute m′-th root: Raising the result to $1/m' \pmod h$
(no need for the modified Tonelli-Shanks for cube roots here as $d=1$ and no need for scaling.)

Then, we check in-circuit that:

MillerLoop == Witness ^ (u^3-u^2+1-(u+1)q)

with two optimized addition chains, a Frobenius power and a hinted division in Fp6.

Type of change

New feature (non-breaking change which adds functionality)

How has this been tested?

TestPairingCheckTestSolve test passes.

How has this been benchmarked?

This PR saves 2,679,259 scs in the emulated PLONK verifier of BW6-761 in a BN254-PLONK.

Checklist:

I have performed a self-review of my code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
I have added tests that prove my fix is effective or that my feature works
I did not modify files generated from templates
golangci-lint does not output errors locally
New and existing unit tests pass locally with my changes
Any dependent changes have been merged and published in downstream modules

yelhousni · 2024-06-05T11:10:58Z

It is though difficult to include the computation of c^{u^3-u^2+1-(u+1)q} in the Miller loop computation (see #1143 (comment)) for two reasons:
1- we use a different Miller loop for BW6 with loop size 3*l2+l1 where l2=u³-u²-u and l1=u+1 (see Alg.2 in https://eprint.iacr.org/2021/1359.pdf). We can have two separate ate Miller loops of sizes u^3-u^2+1 and u+1 but the current ML is way more efficient constraint-wise.
2- The Miller loop in-circuit is implemented using Fp6 as a direct extension of Fp while out-ciruit it is a quadratic over cubic extension. This makes the witness residue different. We can implement the direct extension in gnark-crypto and Toom-6 arithmetic but this is more of a pain in vain because of 1-.

Adapting the algorithms in/out-circuit to match each other would affect performances and make the trick not worth it.

yelhousni · 2024-06-05T14:21:06Z

It is though difficult to include the computation of c^{u^3-u^2+1-(u+1)q} in the Miller loop computation (see #1143 (comment)) for two reasons: 1- we use a different Miller loop for BW6 with loop size 3*l2+l1 where l2=u³-u²-u and l1=u+1 (see Alg.2 in https://eprint.iacr.org/2021/1359.pdf). 2- The Miller loop in-circuit is implemented using Fp6 as a direct extension of Fp while out-ciruit it is a quadratic over cubic extension. This makes the witness residue different.

Adapting the algorithms in/out-circuit to match each other would affect performances and make the trick not worth it.

Actually, now that this additional trick might not be worth it, it becomes more relevant to push the Miller function to the cyclotomic subgroup by performing the easy part of the final exp only before doing the class equivalence check. This saves an additional 1,390,037 scs making the total cut at 2,679,259 scs.

ivokub

LGTM

perf(bw6-761): eliminate finalexp

53c88f4

yelhousni added perf zk-evm labels Jun 3, 2024

yelhousni requested review from ivokub and ThomasPiellard June 3, 2024 19:44

yelhousni self-assigned this Jun 3, 2024

refactor: clean code

78e5468

perf(bw6-761): push ML to cyclo-group before FE elimination

2d162dd

Merge branch 'master' into perf/eliminate-finalExp-bw6761

be2e5ac

yelhousni mentioned this pull request Jun 12, 2024

perf: replace BN254 final exp by a class equivalence check #1143

Merged

9 tasks

Merge branch 'master' into perf/eliminate-finalExp-bw6761

3eb9a1a

yelhousni mentioned this pull request Jun 14, 2024

perf(bls12-381): eliminate finalexp ~naively #1173

Merged

12 tasks

perf(bw6-761): use Karabina even for 1 square

48cf0a0

ivokub approved these changes Jul 3, 2024

View reviewed changes

yelhousni merged commit 55c05b6 into master Jul 3, 2024
7 checks passed

yelhousni deleted the perf/eliminate-finalExp-bw6761 branch July 3, 2024 12:25

This was referenced Aug 4, 2024

perf: Groth16 verifier #1235

Closed

perf: Groth16 verifier #1238

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: replace BW6-761 final exp by a class equivalence check #1155

perf: replace BW6-761 final exp by a class equivalence check #1155

yelhousni commented Jun 3, 2024 •

edited

Loading

yelhousni commented Jun 5, 2024 •

edited

Loading

yelhousni commented Jun 5, 2024 •

edited

Loading

ivokub left a comment

perf: replace BW6-761 final exp by a class equivalence check #1155

perf: replace BW6-761 final exp by a class equivalence check #1155

Conversation

yelhousni commented Jun 3, 2024 • edited Loading

Description

Type of change

How has this been tested?

How has this been benchmarked?

Checklist:

yelhousni commented Jun 5, 2024 • edited Loading

yelhousni commented Jun 5, 2024 • edited Loading

ivokub left a comment

Choose a reason for hiding this comment

yelhousni commented Jun 3, 2024 •

edited

Loading

yelhousni commented Jun 5, 2024 •

edited

Loading

yelhousni commented Jun 5, 2024 •

edited

Loading