How quantum computers affect zk and blockchains + how to quantum proof Ethereum

Thinking about Quantum x Blockchains

Note: This hasn’t been reviewed by an expert and is just from me skimming through papers, so there are likely some errors in my very simplified mental models. However, I think this model is a good middle ground between the oversimplified, layman-oriented quantum news articles and the hyper-academic quantum computing papers that are hard for a cryptographer to parse. Leave thoughts/comments/corrections on the hackmd draft of this post! This is also a mirror of this blog post; the blog version is the more up-to-date one, with the latest corrections and notes.

What are the powers of a quantum adversary?

  • There are a couple of key algorithms here, most notably Shor’s and Grover’s. The main things they enable are prime factorization and discrete log. They cannot help undo hashes (as far as we know).
  • Specifically, given a public key, a quantum adversary can derive the private key. This is what breaks the back-secrecy of any deterministic function of a secret key, such as any zk ecdsa nullifier scheme (toy sketch below).
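
For intuition, here is a toy sketch (my own, not any particular nullifier standard) of why back-secrecy disappears once discrete log is easy; the tiny group, the nullifier construction, and the brute-force stand-in for Shor’s algorithm are all illustrative assumptions:

```python
# Toy illustration (NOT secp256k1): why a deterministic function of the secret
# key loses back-secrecy once discrete log is easy. Everything here is a
# simplified stand-in chosen for readability.
import hashlib

p, g = 2**61 - 1, 3              # toy multiplicative group standing in for the curve

def keygen(sk: int) -> int:
    return pow(g, sk, p)         # pk = g^sk, published on-chain

def nullifier(sk: int, epoch: str) -> str:
    # some deterministic function of sk, like a zk-ecdsa nullifier scheme exposes
    return hashlib.sha256(f"{sk}|{epoch}".encode()).hexdigest()

def shors_oracle(pk: int) -> int:
    # stand-in for Shor's algorithm: recovers sk from pk
    # (plain brute force works here only because the toy key is tiny)
    sk, acc = 0, 1
    while acc != pk:
        sk, acc = sk + 1, (acc * g) % p
    return sk

sk = 123_457
pk = keygen(sk)
recovered_sk = shors_oracle(pk)
# with sk recovered, every past nullifier can be recomputed and linked to the user:
assert nullifier(recovered_sk, "epoch-1") == nullifier(sk, "epoch-1")
```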

What happens to blockchains?

  • For Bitcoin (and Ethereum), the public key behind an address only becomes known once that address has published at least one signature, since the address itself is a hash of the public key (in Ethereum, address = the last 20 bytes of keccak256(pk)). Bitcoin is secure because UTXOs can simply act as one-time-use accounts that spend all of their money at once – even if someone can later derive the secret key from the revealed public key, there is nothing left in that UTXO to spend.
  • Ethereum can transition to a secure keypair set fairly easily: every account signs the public key of its new account and submits it to, say, a migration smart contract, and a hard fork then moves everyone’s ETH to the more secure keypair set. Smart contracts do not have public keys, only addresses (recall that even a quantum computer cannot undo that hash), so funds are safu. (The address derivation is sketched below.)
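
For concreteness, a minimal sketch of the Ethereum address derivation mentioned above (assuming the pycryptodome package for keccak-256; the example key bytes are made up). The point is that an address alone gives a quantum attacker nothing to take a discrete log of:

```python
# Ethereum address = last 20 bytes of keccak256(x || y) of the public key.
# Assumes `pycryptodome`; the 64 example bytes below are not a real key.
from Crypto.Hash import keccak

def eth_address(pubkey_xy: bytes) -> str:
    assert len(pubkey_xy) == 64                      # 32-byte x || 32-byte y, no 0x04 prefix
    digest = keccak.new(digest_bits=256, data=pubkey_xy).digest()
    return "0x" + digest[-20:].hex()

print(eth_address(bytes(range(64))))                 # only the hash image is public
```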

What parts of zero knowledge exactly are broken?

  • tl;dr almost nothing.
  • There is a key distinction between statistical and computational zero knowledge (and perfect zk, the strongest of the three) – statistical zero knowledge means that no verifier, even one with unbounded compute, can distinguish between the real and simulated proof distributions (beyond a negligible statistical distance), while computational zero knowledge means that no polynomial-time verifier can distinguish them.
  • groth16 (and most proof systems we have in production right now) is perfect zk (paper), a special case of statistical zk. This means that even a quantum adversary with access to several past proofs cannot retroactively break zero knowledge or uncover your secret information.
  • However, because a quantum adversary can take discrete logs, they can derive the toxic waste from just the public output of any trusted setup ceremony. Thus, they can fake proofs for any such ZK-SNARK – we expect that any current verifier deployed on-chain would have time to migrate to a quantum-resistant proof system before such an adversary goes live.
  • Similarly, they can derive the discrete logs relating the generators used to make IPA commitments hiding, and thus break the binding of IPA commitments (and the soundness of proofs built on them). STARKs remain secure, since they rely only on hashing.
  • In fact, this can be generalized – the reason quantum breaks soundness but not secrecy is a fundamental tradeoff between zk and soundness of proofs: this fairly short paper proves you can have either statistical zero knowledge or statistical soundness, but not both. In practice, almost all of our proof systems opt for perfect zk and computational soundness, so quantum computers can fake proofs but past secrets stay secret. (A toy illustration of this asymmetry is below.)
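
To make that asymmetry concrete, here is a toy Pedersen-style commitment over a tiny prime-order group (all parameters are made up; real IPA commitments use an elliptic curve). Knowing the discrete log between the generators lets an attacker open one commitment to two different messages, i.e. binding/soundness breaks, while the commitment still reveals nothing about the message:

```python
# Toy Pedersen commitment: C = g^m * h^r (mod p), exponents mod the group order q.
# If a quantum attacker learns x = dlog_g(h), they can equivocate on openings
# (binding breaks), but hiding is untouched because r is still uniformly random.
q = 1019                                  # prime order of the subgroup (exponents live mod q)
p = 2 * q + 1                             # 2039 is prime, so Z_p* has an order-q subgroup
g = 4                                     # generator of that subgroup
x = 777                                   # "toxic" discrete log; h = g^x
h = pow(g, x, p)

def commit(m: int, r: int) -> int:
    return (pow(g, m, p) * pow(h, r, p)) % p

m, r = 42, 555
C = commit(m, r)

# Equivocation using knowledge of x: open C as a different message m2.
m2 = 7
r2 = (r + (m - m2) * pow(x, -1, q)) % q   # shift randomness by (m - m2)/x  (Python 3.8+ inverse)
assert commit(m2, r2) == C                # same commitment, two valid openings
```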

What is going on with annealing vs qubit computers, the different quantum computing paradigms?

  • [NOTE: this point is not completely correct and needs to be rewritten] There are two major quantum computing paradigms: quantum annealing (analog superposition across all of the qubits, which slowly ‘anneals’ toward an approximate solution) and pure gate-based quantum computers (superposition manipulated through quickly-changing discrete gates, which can therefore compute across all the qubits and support intermediate error correction). It’s a lot easier to get impressive-seeming qubit counts like 5000 on quantum annealing computers (D-Wave, for instance), but they require far more qubits for the same task, are usually less efficient, and cannot be error corrected as easily for hard tasks (no strong theoretical results even exist yet as of 2022).
  • Pure gate-based quantum computers are the ones behind the excitement over recently factored numbers like 15 and 35, and they have huge problems with noise (some think noise imposes an existential upper bound on the number of usable qubits).

What do different algorithms like factorization, discrete log, or un-hashing look like on quantum computers?

Annealing bounds:

  • Quantum annealing can minimize functions. For instance, to solve prime factorization of n, it minimizes (n - pq)^2 over the bits of p and q; this ends up taking about \frac{1}{4} \log^2(n) qubits to prime factorize n: 2018 paper.
  • Discrete log for a modulus n (i.e. \log(n) bits): a 2021 paper shows about 2\log^2(n) qubits are needed on annealing-based systems, although they ran into practical connectivity issues past n = 6 bits.
  • In fact, it’s likely that bigger discrete logs are infeasible on annealers: this 2013 paper shows that the required Hamiltonian makes it very hard to convert physical qubits into logical qubits. (Back-of-envelope numbers for a 256-bit modulus are below.)
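
Plugging a secp256k1-sized modulus into the annealing formulas above, just to get a feel for the scale (my own back-of-envelope arithmetic, not numbers from the cited papers):

```python
# Back-of-envelope: evaluate the quoted annealing qubit estimates for a
# 256-bit modulus. These are my own plug-ins of the formulas above.
bits = 256                                   # log2(n) for a secp256k1-sized n

factoring_qubits = 0.25 * bits**2            # ~ (1/4) * log^2(n)
dlog_qubits = 2 * bits**2                    # ~ 2 * log^2(n)

print(f"annealed factoring:    ~{factoring_qubits:,.0f} qubits")   # ~16,384
print(f"annealed discrete log: ~{dlog_qubits:,.0f} qubits")        # ~131,072
```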

Quantum computer bounds:

  • On actual gate-based quantum computers, the bound for simple prime-field discrete log is around 3n + 0.002n \log n qubits, where n is the number of bits (n = 256 for us): 2021 paper – without considering noise overhead. With noise, they calculate that an n = 2048 bit discrete log would take about 20 million physical qubits.
  • Newer algorithms show that elliptic curve discrete log on a curve like secp256k1 is a bit harder, closer to 9n: 2017 paper. Past bounds closer to 6n don’t explicitly describe how to do arithmetic on elliptic curves and merely provide a lower bound: 2008 paper.
  • Again, these are numbers for signal (logical) qubits without noise; error-correction overhead adds several orders of magnitude more physical qubits on top, so perhaps these initial estimates are not even the relevant figure – perhaps one should just use asymptotic notation and omit the constant factors to better communicate that. (Rough arithmetic is below.)
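
The same back-of-envelope exercise for the gate-model bounds above (again my own arithmetic; these are signal/logical qubit counts, before any error-correction overhead):

```python
# Back-of-envelope logical-qubit counts for the gate-model bounds quoted above.
# Physical-qubit counts with error correction are orders of magnitude larger
# (e.g. the ~20 million figure for 2048-bit discrete log).
import math

def prime_field_dlog_qubits(n: int) -> float:
    return 3 * n + 0.002 * n * math.log2(n)      # bound from the 2021 paper above

def ec_dlog_qubits(n: int) -> int:
    return 9 * n                                 # secp256k1-style bound from the 2017 paper

print(round(prime_field_dlog_qubits(2048)))      # ~6,189 logical qubits
print(ec_dlog_qubits(256))                       # 2,304 logical qubits
```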

Intuitively, why is a hash function hard for any quantum computer?

  • If you write a hash function as a polynomial in the bits of the input, the resulting polynomial has a degree far too high for a quantum adversary to invert. Specifically, root finding on standard quantum computers takes O(n \log(n)) time on \log(n) qubits, where n is the degree of the polynomial: 2015 paper. While the qubit count may be within imagination, that running time is absolutely infeasible (degrees of hash functions expressed as polynomials look like 2^{16000}). Of course, future specialized quantum algorithms might provide some improvement, but this seems like a reasonable first guess. (Rough arithmetic is below.)
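
Rough arithmetic on why that running time is hopeless, assuming the ~2^{16000} degree figure above (the "operations since the Big Bang" comparison is just for scale):

```python
# Even at O(n log n) quantum root finding, a polynomial of degree n ~ 2^16000
# is hopeless. Work in log2 so we never materialize the huge numbers.
import math

log2_degree = 16_000                               # assumed algebraic degree ~ 2^16000
log2_steps = log2_degree + math.log2(log2_degree)  # log2(n * log2(n))
print(f"~2^{log2_steps:.0f} steps")                # ~2^16014 steps

# For scale: a machine doing 2^100 ops/second for the age of the universe
# (~2^59 seconds) performs only ~2^159 operations in total.
```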

What is a reasonable timeline to expect ECDSA on secp256k1 to be broken?

  • Expert consensus seems to vary from 2050 to never (if the theoretical noise problem is never overcome). Some professors I’ve spoken to think 2100 is the fastest possible point, and it may take longer to get there because of the ‘valley of death’ of applications between a few dozen qubits and a few hundred thousand: there is utility on the small end for theoreticians and utility on the high end for cryptography, but very little intermediate use for qubit counts in between, which makes the ROI case for funding much worse.
  • IBM has been surprisingly accurate on its timeline for qubit computers – again, these are signal + noise (physical) qubits, so the actual signal qubit count is substantially less than the number you see, though the extent to which this is the case depends on the specific algorithm.

This is a very rapidly changing field, so these results will likely update year after year.



So far, I’ve only seen solutions on ethresearch to quantum-proof Ethereum via new keypair types. However, I think there’s a more robust way to migrate Ethereum than hard forking to a quantum-resistant keypair, which would break every single wallet and piece of key-related infra. I think there’s a way to quantum-proof Ethereum on the existing ECDSA over secp256k1. The reason it’s not currently quantum-proof is that after sending a tx, your public key is revealed (i.e. the hash preimage of your address), so a quantum computer can take the discrete log efficiently and recover your secret key. If there were a way to send txs that didn’t reveal the public key, existing keypairs might remain quantum secure.

Post-quantum accounts could keep their public keys hidden and only make their addresses public. They would then send all of their txs via a zk proof of knowing a valid signature that corresponds to their address, and that would authorize the transfer, so no one would ever even learn their public key! With account abstraction-type solutions, this could be possible as soon as that is available on any L1 or L2. It wouldn’t work for accounts that have already sent txs today (since those reveal their public keys), but they could easily send all their assets to a new keypair and vow never to reveal the new public key. This would quantum-proof Ethereum in the long term as well (similarly to how unused UTXOs in BTC are safe right now). A sketch of the statement such a proof would attest to is below.
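
To make this concrete, here is a sketch of the statement such a proof would attest to. The ecdsa and pycryptodome packages are used only to stand in for what would actually run inside a quantum-safe circuit (e.g. a STARK); none of this is an existing implementation:

```python
# The relation to prove: "I know a public key and ECDSA signature such that
# (1) the public key hashes to this address and (2) the signature authorizes
# this tx." In the real scheme this check lives inside the zk circuit, so the
# public key and signature never appear on-chain.
from ecdsa import SigningKey, VerifyingKey, SECP256k1
from Crypto.Hash import keccak

def eth_address(pubkey_xy: bytes) -> bytes:
    return keccak.new(digest_bits=256, data=pubkey_xy).digest()[-20:]

def statement(address: bytes, tx_hash: bytes, pubkey_xy: bytes, sig: bytes) -> bool:
    # public inputs: address, tx_hash; private witnesses: pubkey_xy, sig
    vk = VerifyingKey.from_string(pubkey_xy, curve=SECP256k1)
    preimage_ok = eth_address(pubkey_xy) == address      # pk is the address preimage
    sig_ok = vk.verify_digest(sig, tx_hash)              # sig authorizes this exact tx
    return preimage_ok and sig_ok

# toy run: the owner satisfies the statement without ever revealing pk or sig
sk = SigningKey.generate(curve=SECP256k1)
pubkey_xy = sk.get_verifying_key().to_string()           # 64-byte x || y
tx_hash = b"\x11" * 32                                   # placeholder tx hash
assert statement(eth_address(pubkey_xy), tx_hash, pubkey_xy, sk.sign_digest(tx_hash))
```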

You’d have to make this ECDSA verification inside a ZK-STARK super fast to generate and verify, which hasn’t been done yet as far as I can tell.

One issue is that smart contracts need to be special-cased, since we know the pre-image of the address via create2. One easy solution is to hard-code that once a contract has been made by create/create2, transactions that use a secret key for that address are disallowed (i.e. no signatures or EOA-style txs from it will be validated).

Perhaps, for future smart contracts, if we don’t want to special-case them, we could standardize around a new opcode (say create3, or create2 with an optional arg) that, say, just flips the last bit of the create2 output. This keeps the address derivation deterministic, but the address no longer reveals a known hash pre-image. (Sketched below.)
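
A sketch of what that could look like: the standard CREATE2 derivation (per EIP-1014) next to the hypothetical "create3" variant that flips the last bit of the output. The opcode name and the exact tweak are just the strawman from above (assumes pycryptodome for keccak):

```python
# CREATE2 (EIP-1014): address = keccak256(0xff ++ deployer ++ salt ++ keccak256(init_code))[12:]
# Hypothetical "create3": same derivation, then flip the last bit, so the address
# is still deterministic but is no longer the literal keccak image of a known preimage.
from Crypto.Hash import keccak

def keccak256(data: bytes) -> bytes:
    return keccak.new(digest_bits=256, data=data).digest()

def create2_address(deployer: bytes, salt: bytes, init_code: bytes) -> bytes:
    return keccak256(b"\xff" + deployer + salt + keccak256(init_code))[-20:]

def create3_address(deployer: bytes, salt: bytes, init_code: bytes) -> bytes:
    addr = bytearray(create2_address(deployer, salt, init_code))
    addr[-1] ^= 0x01                       # flip the last bit of the CREATE2 output
    return bytes(addr)

deployer, salt, init_code = b"\x01" * 20, b"\x00" * 32, b"\x60\x00"
print(create2_address(deployer, salt, init_code).hex())
print(create3_address(deployer, salt, init_code).hex())
```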


Hi, thanks for your post.
Two minor comments:

  • Groth16, Plonk, STARKs, etc. are perfect zk (which is a “special case” of statistical zk)
  • Also, note that the generalization you mention in the last zk point is not that simple – proof systems that rely on hash functions are also computationally sound, so you can have (plausible) quantum resiliency and perfect zk.

Thanks for the comments! From what I know, statistical ZK means a PPT algorithm cannot distinguish between the zero-knowledge proof distribution and a random draw from the distribution, and perfect ZK means no unbounded adversary can distinguish them. Because the Fiat–Shamir transformation includes a hash computation that would confirm the private data, it seems any non-interactive ZK protocol derived from an interactive one (including FRI-STARKs and most others) can’t be perfect ZK (source: page 74 here). I think you’re probably right about groth16 being perfect ZK though; I don’t remember there being a Fiat–Shamir transformation. Regardless, since this hash function is so hard to crack, the computational ZK label is more a matter of academic rigor than practicality.

On your second point, I’m still unsure whether STARKs are perfect ZK! How does a statement like “plausible quantum resiliency” translate into statements about statistical vs perfect guarantees? As far as practice is concerned though, you’re right that I should specifically mention STARKs and their benefits a bit more in the post.


Hey!
Statistical ZK means that the distributions of real and simulated proofs are close with respect to statistical distance. Perfect ZK means that the distributions are identical (i.e., the statistical distance between them is 0). Note that when we talk about the statistical distance between two distributions, we don’t need to assume that the adversary – the algorithm tasked with distinguishing the distributions – is computationally bounded. Hence, we don’t need to limit the algorithm to be PPT.
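
In symbols (my restatement of the standard definitions, with \mathsf{Real} and \mathsf{Sim} the real and simulated proof distributions):

$$\Delta(\mathsf{Real}, \mathsf{Sim}) = \tfrac{1}{2}\sum_{\pi} \bigl|\Pr[\mathsf{Real} = \pi] - \Pr[\mathsf{Sim} = \pi]\bigr|$$

$$\text{perfect ZK: } \Delta = 0, \qquad \text{statistical ZK: } \Delta \le \mathrm{negl}(\lambda), \qquad \text{computational ZK: only PPT distinguishers are ruled out.}$$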

Regarding hashing and Fiat–Shamir, this doesn’t hurt the ZK property in general. The result you cited seems to apply only to transparent SNARKs, as e.g. Marlin and Plonk use the FS transformation and achieve perfect zk. Also, I am not sure whether there is an impossibility result saying that a non-interactive transparent proof system cannot be perfectly zk, or whether it’s just that the compilers from IOPs we have cause this privacy loss.

Regarding “plausibly quantum secure”: if a proof system is perfect or statistical ZK, then quantum computers cannot break the ZK property. In particular, even though Marlin/Plonk/Groth16 are not sound against quantum computers, they still provide privacy, i.e. the zk property still holds.

Got it, thanks for the details! These are great points – I’ll update the original post. Is it the case then that it’s unclear if STARKs are perfect ZK?

Thanks,
TBH I don’t know whether it’s been proven that (non-interactive) STARKs can only be computationally ZK. I suppose that may be the case.