Dankrad Feist

SUMCHECK Quickie

2023-08-08T00:00:00+00:00

The SUMCHECK protocol

I won’t delve deeply into its introduction here, but the SUMCHECK protocol (Carsten Lund, Lance Fortnow, Howard J. Karloff, and Noam Nisan. “Algebraic Methods for Interactive Proof Systems”, 1992) serves as a foundation for many “verifiable computation” primitives. If you’re reading this, you likely already know its significance.

Let $\mathbb{F}$ be a finite field and $f(X_1, X_2, \ldots, X_n)$ be a multivariate polynomial of degree $d$ over $\mathbb{F}$ . Consider the sum $S$ defined as:

$S = \sum_{x_1\in\{0, 1\}} \sum_{x_2\in\{0, 1\}} \cdots \sum_{x_n\in\{0, 1\}} f(x_1, x_2, \ldots, x_n) = \sum_{(x_1, x_2, \ldots, x_n) \in \{0,1\}^n} f(x_1, x_2, \ldots, x_n)$

The SUMCHECK protocol provides an $n$ round interactive protocol to verify the correctness of $S$ , using only a single evaluation of $f$ (i.e. the verifier only has to evaluate $f(X)$ once, rather than $2^n$ times).

The protocol advances through $n$ rounds. In each round, the prover sends a univariate polynomial to the verifier. In response, the verifier conducts a check and responds by sending a single field element challenge.

Round 1

The prover computes and shares the univariate polynomial:

$f_1(X) = \sum_{(x_2, \ldots, x_n) \in \{0,1\}^{n-1}} f(X, x_2, x_3, \ldots, x_n)$

The verifier ensures that the degree of $f_1$ is less than $d$ , checks if $f_1(0) + f_1(1) = S$ , and issues the challenge $r_1 \in \mathbb{F}$ .

Round 2

The prover computes and shares the univariate polynomial:

$f_2(X) = \sum_{(x_3, \ldots, x_n) \in \{0,1\}^{n-2}} f(r_1, X, x_3, x_4, \ldots, x_n)$

The verifier ensures that the degree of $f_2$ is less than $d$ , checks if $f_2(0) + f_2(1) = f_1(r_1)$ , and issues the challenge $r_2 \in \mathbb{F}$ .

Round $i$

The prover computes and shares the univariate polynomial:

$f_i(X) = \sum_{(x_{i+1}, \ldots, x_n) \in \{0,1\}^{n-i}} f(r_1, \ldots, r_{i-1}, X, x_{i+1}, x_{i+2}, \ldots, x_n)$

The verifier ensures that the degree of $f_i$ is less than $d$ , checks if $f_i(0) + f_i(1) = f_{i-1}(r_{i-1})$ , and issues the challenge $r_i \in \mathbb{F}$ .

Round $n$

The prover computes and shares the univariate polynomial:

$f_n(X) = f(r_1, \ldots, r_{n-1}, X)$

The verifier ensures that the degree of $f_n$ is less than $d$ , checks if $f_n(0) + f_n(1) = f_{n-1}(r_{n-1})$ , and issues the challenge $r_n \in \mathbb{F}$ .

Final check

The verifier evaluates $f(r_1, \ldots, r_n)$ and confirms that it equals $f_n(r_n)$ .

Why does it work

Below, I present an intuitive proof for the efficacy of the SUMCHECK protocol, assuming a large field $\mathbb{F}$ . For this explanation, the Schwartz-Zippel lemma is essential.

The Schwartz-Zippel Lemma

The Schwartz-Zippel lemma states: For a non-zero multivariate polynomial $P(X_1, \ldots, X_n)$ with a total degree of at most $d$ over the field $\mathbb{F}$ , when evaluating $P$ at random points $r_1, \ldots, r_n$ from $\mathbb{F}$ , the likelihood of $P(r_1, \ldots, r_n)$ being zero is less than $\frac{d}{\lvert\mathbb{F}\rvert}$ .

In large fields (like the scalar fields of elliptic curves often, which are cryptographically secure with field sizes around $2^{256}$ ), this probability is minuscule. This is because a polynomial can only have a limited number of zeros. For a univariate polynomial, this is immediately evident from the “factor theorem”: Every zero of the polynomial corresponds to a linear factor, so a degree $d$ polynomial can have up to $d$ zeros.

A common application of the Schwartz-Zippel lemma is comparing two univariate polynomials, $f(X)$ and $g(X)$ , to determine their identity. By generating a random number $r \in \mathbb{F}$ and checking if $f(r) = g(r)$ , we can determine if they are the same. This method is correct, but is it also sound? What’s the likelihood of erroneously claiming $f(X)$ and $g(X)$ are identical when they aren’t?

Claim: If $f(X) \not= g(X)$ , then the probability of the check being successful is at most $\frac{d}{\lvert\mathbb{F}\rvert}$ .

Proof: Let’s set $P(X) = f(X) - g(X)$ . Given that $f(X) \not= g(X)$ , $P(X)$ isn’t the zero polynomial. As per Schwartz-Zippel, the chance of $P(r) = f(r) - g(r) = 0$ is at most $\frac{d}{\lvert\mathbb{F}\rvert}$ .

A succinct takeaway from Schwartz-Zippel is: Over large fields, two “low-degree” polynomials are either identical or different almost everywhere.

SUMCHECK protocol proof

The SUMCHECK protocol harnesses this property to convert a sum with a potentially large number of terms into a singular evaluation. Let’s see why it works, by trying to create a prover that attempts to cheat the protocol, attempting to prove a sum $\tilde S \not= S$ .

Round 1

The prover needs to send a polynomial $\tilde f_1(X)$ with the property $\tilde f_1(0) + \tilde f_1(1) = \tilde S$ . Any other choice would make the first check fail immediately, so there would be no point.

Since $\tilde S \not= S$ , the prover cannot send the honest polynomial $f_1(X)$ . Recall that this means that the polynomial they will be sending is different almost everywhere from $f_1(X)$ ; more precisely, it is the same as $f_1$ on at most $d$ points.

The verifier then sends the challenge $r_1$ .

Round 2

In round 2, the prover needs to send a polynomial $\tilde f_2(X)$ with the property that $\tilde f_2(0) + \tilde f_2(1) = \tilde f_1(r_1)$ .

Now there are two possibilities:

$\tilde f_1(r_1)= f_1(r_1)$ . In this case the prover has hit the jackpot: He can simply send $f_2(X)$ (the “honest” answer) and continue the protocol honestly to the end. He will succeed at cheating. However, the probability for this happening is only $\frac{d}{\mathbb{F}}$ .
$\tilde f_1(r_1) \not= f_1(r_1)$ (which is overwhelmingly likely). In this case, the prover is in the same situation as before: He cannot send the honest $f_2(X)$ , but only a malicious $\tilde f_2(X)$ with $\tilde f_2(0) + \tilde f_2(1) = \tilde f_1(r_1)$ , which can coincide with the honest answer on at most $d$ points.

This repeats every round:

Round $i$

In round $i$ , the prover needs to send a polynomial $\tilde f_i(X)$ with the property that $\tilde f_i(0) + \tilde f_i(1) = \tilde f_{i-1}(r_{i-1})$ .

Again, two possibilities:

$\tilde f_{i-1}(r_{i-1}) = f_{i-1}(r_{i-1})$ . Prover wins (probability of this happening $\frac{d}{\mathbb{F}}$ ).
$\tilde f_{i-1}(r_{i-1}) \not= f_{i-1}(r_{i-1})$ . Prover needs to send $\tilde f_i(X)$ with $\tilde f_i(0) + \tilde f_i(1) = \tilde f_{i-1}(r_{i-1})$

Final check

Finally, in the last round, the verifier checks if $f(r_1, \ldots, r_n)=f_n(r_n)$ . Assuming the prover didn’t “win” in one of the rounds before, he send a malicious $\tilde f_n(X)$ which coincides with the honest $f_n(r_n)$ on at most $d$ points, so the probability that the final check will pass is $\frac{d}{\mathbb{F}}$ .

As you can see from the above analysis, the prover has a small probability of “winning” and cheating the verifier in each round: If his polynomial happens to have the same evaluation on the challenge field element as the honest version, the prover can continue the protocol like an honest prover and the verification will succeed.

However, assuming that the field $\mathbb{F}$ is large, the probability for this is small. Since there are $n$ rounds, at best, the prover can succeed with probability $\frac{nd}{\lvert\mathbb{F}\rvert}$ .

RAI – one of the coolest experiments in crypto

2023-01-31T10:30:00+00:00

Special thanks to Vitalik Buterin and Ameen Soleimani for feedback and review.

RAI – one of the coolest experiments in crypto

I think RAI is one of the coolest experiments in crypto right now. So I thought I’d write my version of an explainer for it, from the perspective that I have introduced in my previous article on Supply and demand for stablecoins. Back when I wrote it, my understanding of RAI was poor. The version of DAI it describes (single-collateral DAI, before the introduction of custodial stablecoins as collateral) is actually very close to RAI, however there is an interesting difference: Instead of applying an interest rate to balances, like DAI does, RAI directly manipulates the redemption price (which is always 1 USD for DAI). I think it’s nice to directly describe this mechanism. If you want to understand more on how Collateralized Debt Positions (CDPs) work to maintain stability, then I still recommend reading my previous article!

Why is RAI floating, and not tracking one currency?

In the past, the goal of creating stablecoins was seen as creating an asset that is always worth 1 USD (or some other currency). But as Vitalik remarked in his thought experiments on automated stablecoins, if you can create a coin that is always worth 1 USD, why can’t you use the same mechanism to create one that is worth 1 USD plus 20% interest per year (i.e. 1.00 USD in year 1, 1.20 USD in year 2, 1.44 USD in year 3, and so on)? After all, the only way the blockchain knows about prices is through oracles, and it’s an easy change to the oracle to make it return the value of the coin priced in this new unit (USD appreciating by 20% per year) instead of USD.

Clearly there is something missing in the picture. As we will see below, in order to balance supply and demand, a fully decentralized stablecoin needs to be able to give incentives to those going long (using the stablecoin) and going short (supplying the stablecoin) in some form. This is true whether it tracks USD, USD+20% interest or USD-5% interest.

One way of doing this is to add a mechanism rate that charges interest on debt (the suppliers of the stablecoin) and credits it to the holders (the users of the stablecoin). The interest rate can however be negative when there is more demand for holding than there is for stablecoin debt.

In March 2020, DAI first depegged upward (the market price increased to more than 1 USD) and only repegged after USDC (a custodial, centralized stablecoin pegged to 1 USD) was added as one of the forms of collateral to mint DAI, otherwise it would have required a negative interest rate. Since its inception, RAI has mostly had negative interest rates. For now, it seems like decentralized stablecoin require negative interest rates most of the time.

When interest rates are negative, instead of having your balance change from 1 to 0.99 to 0.98, RAI keeps the balance the same and changes the actual price target of the stablecoin instead. This means that RAI looks like a floating currency, but with the property that it is much less volatile than cryptos like Ether and Bitcoin.

The stablecoin problem

Cryptocurrencies are volatile. Apart from scaling, this is probably still the largest barrier to adoption. This is why there have been many attempts to create a coin that is less volatile.

Like any commodity, the price of a stablecoin is determined by supply and demand. At any instant, some people want to buy and sell the coin, and these inflows and outflows must be matched, so the price will adjust until they do (the price where they match is the market price). Market makers will try to cover short term spikes in supply or demand, but will adjust their quote prices if they see consistent pressure in one direction.

So if you want to keep a coin stable, you have to be able to somehow manipulate supply and demand such that they cross at a desired price. If the current price is too high, it’s easy to create more supply and push the price down. The trouble comes when the coin falls below the desired price (more outflows than inflows): We need to either decrease supply or increase demand, but how do we do that if the supply comes from independent holders wanting to sell?

There is only one decentralized and sustainable option that I know of. It requires saving during the good times in order to be able to create demand in the bad times: In order for new stablecoins to be created, enough collateral for has to be added to the protocol, so that when demand decreases, this collateral can be used to generate new demand.

Collateralized debt positions

Creating stablecoins by means of Collateralized Debt Positions (CDPs) is a way in which this can be implemented. A CDP is a position where a holder of a volatile currency, such as Ether, takes out a loan in the stablecoin. The CDP represents this position. It can also be seen as a leverage position in the collateral. For example, this graph represents a CDP that has 200 RAI borrowed against 1 ETH, where the value of 1 ETH is currently 1400 USD and 1 RAI is 3 USD; the holder of the CDP gets the “equity” value of this position (currently 1400-600=800 USD, but it can fluctuate with the price), and the RAI holder the debt (which is independent of the current price of Ether).

How can CDPs create demand? Some protocols do this directly by allowing stablecoin holders to redeem against the collateral, this is for example how Liquity works. However, RAI follows MakerDAO’s original DAI in not integrating such a mechanism. But CDPs can still generate demand:

While the CDP is well collateralized, charging an interest rate to the debt holder can incentivize them to take action in relation to their CDP. For example, if the interest rate on debt increases, a CDP holder may decide that holding on to the position is no longer worth it, and that it’s better to repay the debt. When they do this, they have to buy the stablecoin on the market, which creates demand.
Once the CDP gets close to the liquidation ratio, the holder is incentivized to close the position to avoid the liquidation penalty, unless they can add more collateral. If the position gets liquidated, the liquidator will also have to buy stablecoins in order to bid for the collateral.

By only ever issuing new stablecoins in the form of debt when a CDP is created, the protocol has all the collateral in the CDPs to prop up the coin when other demand for stablecoin collapses.

This construction comes with a counterintuitive downside: New coins can only be created when someone is willing to take out a CDP. This requires someone who wants to take a leveraged position in the collateral.

This demand is currently the limiting factor for stablecoins based on this construction. In order to stop the stablecoin from increasing in value due to limited supply of willing CDP holders (in other words, demand for leverage), we will have to do one of two things:

Make the leveraged position more attractive to the CDP holders
Make holding the stablecoin less attractive

What we can do is that we charge the stablecoin holders a negative interest rate, which is paid out to the CDP holders. This actually does both: It increases the attractiveness of leveraged positions, and makes holding the stablecoin less attractive.

Margin exchanges have done this for a while: They too have to find this balance, as every long position has to be matched to a short position, so that the net exposure is equal to the deposited assets. They use the same mechanism to balance the books: The funding rate is paid by the type of position for which there is more demand to the side that for which there is less.

How RAI balances supply and demand

We have just learned that one mechanism to achieve the balance between CDPs (stablecoin shorts) and holders (stablecoin longs) is an interest rate transfer between the two. DAI implements this mechanism using the DAI savings rate: you can put your DAI into the savings contract and you get paid an interest.

Things become more awkward when the interest rate is negative, i.e. DAI holders are paying to CDP holders. In this regime, DAI balances would have to be slowly decreasing. Implementing it in this way has the advantage that your balance always represents the value in USD, and 1 DAI remains worth 1 USD. It’s less good for smart contract developers who now have to deal with the fact that balances in an account can decrease.

Instead, RAI goes a different way: Adjust the “redemption price” to represent the interest rate. What’s the redemption price? It’s the target value of 1 RAI. In particular it is used

To borrow RAI in CDPs and to repay debt, as well as determine whether a position is underwater and should be liquidated
As the value at which all debts and deposits are settled when global settlement is triggered.

Since the interest rate is applied to the redemption price, it is called redemption rate. As an example, if the redemption rate is -3% and the redemption price is currently 1.00 USD, then in 1 year the redemption price will be 0.97 USD (RAI actually started with a redemption price of 3.14 USD).

Now when such a negative redemption rate is applied, two things will happen:

RAI holders will expect to have 3% less value when compared to holding USD after one year
RAI borrowers (who buy RAI back on the market) will expect their debt to decrease in value by 3% after one year

How does RAI determine the redemption rate

Another cool component of RAI is that the redemption rate is actually automatically computed by the protocol. The protocol detects the supply and demand imbalance by tracking the deviation of the market price from the redemption price. If the market price is higher than the redemption price, it means there is more demand for RAI than there is for CDPs – and so a negative redemption rate has to be applied. Conversely, if the market price is lower than the redemption price, the redemption rate needs to be be positive.

So a very simple design could look like this: Find the current difference from between the redemption price and the market price, multiply it by some number – for simplicitly say 1, and make that the redemption rate. Say the current redemption price is 4% under market, then the redemption rate will be -4%. If it is 10% above, the redemption rate will be +10%.

If we did this, it would constitute a P controller (P for proportional), which is actually what RAI did initially. RAI’s adjustment mechanism was later updated to use a PI controller that takes the difference between market and redemption prices (the error) as an input. A PI controller, in addition to the current value, also uses the integral (I), so takes into consideration how much the value has deviated in the past. This makes the system more stable and means interest rates fluctuate a bit less with short term price changes.

The RAI website shows the history of RAI redemption price and market price, as well as the interest rates, which can be a nice demonstration of how this mechanism works

On top, you can see the market (red) and redemption prices (grey). The market price is typically above the redemption price, representing excess demand for RAI over CDPs, which the protocol compensates by applying a negative redemption rate – this is why the redemption price is slowly decreasing.

The lower graph shows how the redemption rate is computed. the blue curve (p_rate) is the P part of the PI controller. It is proportional to the error and indeed, the graph looks like the inverted difference between the red and grey curves in the upper graph. The orange curve (i_rate) is much smoother and represents the I part (integral) of the controller, which reacts to past deviations. The sum of the p_rate and the i_rate is the redemption rate and is how fast the redemption price is going down at any given time.

The higher the market price is above the redemption price, the faster the redemption price thus decreases – rebalancing supply and demand as the expected value of holding RAI decreases (and RAI debt becomes more attractive).

But what pulls RAI back to the redemption price

There’s one thing we skipped over. The redemption price represents the target value of RAI in the protocol, and we’ve just been taking it for granted that lowering the price will make long RAI positions less attractive and short positions more attractive. But this assumes that market participants have some expectation of being able to use RAI at or near the redemption price – which requires some force that pulls the market at least in the direction of the redemption price, so that lowering the redemption price has a meaning.

Of course, we can expect “global settlement” will solve this: There is a mechanism in the protocol, which can be triggered by governance, that settles all deposits and debts according to the current redemption price. It is expected that this mechanism will be triggered when the deviations become too extreme. So maybe that’s the reason why the redemption price matters?

Actually, the global settlement is a cool emergency feature, but it is not necessary to explain why the market price will track the redemption price, assuming (some) rational market participants (with enough capital).

Let’s assume that market participants just ignore the redemption price entirely. What would happen?

The current minimal collateralization for CDPs is 135%. What that means is that if the market price is more than 35% above the redemption price, anyone can just mint RAI for Ether and “forget” about their CDP – just sell the RAI, buy more Ether with it and take the arbitrage profit. RAI can’t trade significantly more than 35% above redemption price for this reason.
There is no strict bound like this from below – but we can do a thought experiment: Let’s say that RAI trades consistently 10% below redemption price. Note that this would lead to an enormous redemption rate of something like 240% per year (in the long term, when the integral term has had enough time to accumulate). CDP holders have to take this redemption rate into account – eventually they will get liquidated, when their collateralization ratio (which is computed using redemption price) reaches 135%. They thus have a strong incentive to buy RAI before this happens.
Similarly, we can find that if RAI trades 10% above the redemption price, the negative interest rate will reach something crazy like -70% (again in the long term, when the integral term has had enough time to accumulate this), which means there is a very strong incentive for RAI holders to get out before this happens. If they don’t, lots of newly minted RAI from new CDPs will eventually be available at the much lower redemption price.

Combined, these forces mean that while the market price can deviate, it cannot deviate too far and too long from the redemption price.

How does tracking another currency change RAI?

An interesting question is: How would RAI be different if instead of tracking USD, it had been set up to track the Euro, the Chinese Yuan, or maybe the 6 months moving average of the Ether price?

To start with, we will do a thought experiment (one proposed by Vitalik): What if RAI was set up to track USD + 20% (a version of the USD that comes with a 20% interest rate)? Let’s call this asset RAI-PONZI.

Obviously holding this asset seems really attractive and having debt in that asset much less so. The price of RAI-PONZI will keep rising as buyers want the high interest rates and there are few people available wanting to take out CDPs in RAI-PONZI.

As RAI-PONZI rises above the redemption price, the redemption rate will get more and more negative. It will reach -20%, which makes RAI-PONZI equivalent to USD. From there, it will likely go even further: Currently RAI’s redemption rate is about -10%, so I would expect RAI-PONZI to settle at -30% in current market conditions. At that point, it becomes equivalent to current RAI, so it makes sense that market participants would behave in this way, assuming the same risk tolerance of market participants.

This is actually nothing else than creating an “offset” in the redemption rate of +20%, and an equivalent price offset.

What can we learn from this? The long-term expected gains or losses of a currency do not impact how RAI behaves. If RAI were pegged to Turkish Lira, which seems to lose about 25% of its value compared to the USD every year, it would probably not behave too differently on long timescales. Let’s call this asset RAI-TRY.

Where RAI-TRY is different is on short timescales and unexpected shocks. If the Lira suddenly drops 20%, due to a black swan, then RAI-TRY will do so, too. The same goes for a sudden increase.

What exact currency is used as an input to the RAI oracles therefore probably does not matter that much. It is likely that most major currencies like EUR or GBP will result in a very similar asset, except that it will react slightly differently under market shocks. This is because any expectation in different performance will just be corrected by market particpiants (so if they expect that GBP will lose 1% per year vs USD, they will just correct for it by picking a different redemption rate).

Why do I think RAI is such a cool experiment?

There have been many attempts to solve the decentralized stablecoin problem. MakerDAO with DAI was probably the first that solved a major part of the puzzle – how to stop it from crashing to zero in a confidence crisis. However, it turned out that they had still missed one part, which is how to stop it from going up.

Finally, RAI came and added this missing piece in a slightly unexpected way – whilst many had been expecting a DAI with negative interest rate, doing it via the redemption rate adjustment is much more elegant. And at the same time, it allows us to learn a lot:

First and foremost, the point of a stablecoin is not to be pegged to USD. It is to provide an asset with low volatility. RAI does indeed solve this task, and has much lower volatility than the underlying collateral, Ether.

RAI therefore is something like a new currency, which is underlined by the fact that it doesn’t really matter that much which fiat currency is used for the oracles, as long as it is reasonably stable. In fact you can change the reference asset while the system is working without much of a problem.

Secondly, the current market structure dictates that if users want a decentralized stablecoin, they have to pay a “price for stability”, in the form of a slowly decaying price of RAI (vs the USD). This is because there is a lot of demand for stablecoins, and limited demand for leverage on decentralized assets like Ether. While it currently feels like this might be an eternal truth, it does not have to be. It could be that this balance tips in the favour of stablecoins again in a bull market, as demand for leverage increases. However, before that happens, it is likely that we would have to see MakerDAO shrug off all of its custodial stablecoin exposure (in order to get their funding rates off zero), which currently seems a long way off.

What I love about RAI is that it is a completely fair way to determine the “price of stability”, and also much cleaner than others. I have posited in my past article on stablecoins that the price of stability isn’t currently fairly determined – and that if it were, it may well be a negative interest rate. Many see inflation as a scourge, but the reality is that having a “guaranteed stable asset” as we implicitly expect currencies to be has to be paid for by someone. If that price were determined using a market like it is in RAI, what would it be?

At least in the decentralized world, we now have an answer, that the price is typically higher than the 2% inflation allowed for by many central banks. Obviously, the number of assets that can be used as collateral in decentralized stablecoins is small, and the result in the real world may well be different. After all, there are 100s of trillions of collateral available, compared to currently less than one trillion in crypto assets.

RAI is, in many ways, central banking in a pure form. We will probably learn many things from this experiment, and that already makes it a worthwhile project.

Ethereum Merge: Run the majority client at your own peril!

2022-03-24T11:00:00+00:00

Special thanks to Vitalik Buterin, Hsiao-Wei Wang and Caspar Schwarz-Schilling for feedback and review.

TL;DR: For reasons of both safety and liveness, Ethereum has chosen a multi-client architecture. In order to encourage stakers to diversify their setups, penalties are higher for correlated failures. A staker running a minority client will thus typically only lose moderate amounts should their client have a bug, but running a majority client can incur a total loss. Responsible stakers should therefore look at the client landscape and choose a less popular client.

Why do we need multiple clients?

There are arguments why a single client architecture would be preferable. Developing multiple clients incurs a substantial overhead, which is the reason why we haven’t seen any other blockchain network seriously pursue the multi-client option.

So why does Ethereum aim to be multi-client? Clients are very complex pieces of code and likely contain bugs. The worst of these are so called “consensus bugs”, bugs in the core state transition logic of the blockchain. One often quoted example of this is the so-called “infinite money supply” bug, in which a buggy client accepts a transaction printing arbitrary amounts of Ether. If someone finds such a bug and isn’t stopped before they get to the exit doors (i.e. making use of the funds by sending them through a mixer or to an exchange), it would massively crash the value of Ether.

If everyone runs the same client, stopping this requires manual intervention, because the chain, all smart contracts and exchanges will keep running as usual. Even a few minutes could be enough to execute a successful attack and sufficiently disperse the funds to make it impossible to roll back only the attacker’s transactions. Depending on the amount of ETH printed, the community would likely coordinate on rolling back the chain to before the exploit (after having identified and fixed the bug).

Now let’s have a look at what happens when we have multiple clients. There are two possible cases:

The client with the bug represents less than 50% of the stake. The client will produce a block with the transaction exploiting the bug, printing ETH. Let’s call this chain A.

However, the majority of stake running a non-faulty client will ignore this block, because it is invalid (to them the printing ETH operation is simply invalid). They will build an alternative chain B that does not contain the invalid block.

Since the correct clients are in the majority, chain B will accumulate more attestations. Hence, even the buggy client will vote for chain B; as a result chain B will accumulate 100% of the votes and chain A will die. The chain will continue as if the bug never happened.
The majority of stake uses the buggy client. In this case, chain A will accumulate the majority of votes. But since B has less than 50% of all attestations, the offending client will never see a reason to switch from chain A to chain B. We will thus see a chain split.

Case 1 is the ideal case. It would most likely lead to a single orphaned block which most users wouldn’t even notice. Devs can debug the client, fix the bug, and everything is great. Case 2 is clearly less than ideal, but still a better outcome than if there’s only a single client – most people would very quickly detect that there is a chain split (you can do this automatically by running several clients), exchanges would quickly suspend deposits, Defi users could tread carefully while the split is resolved. Basically, compared to the single client architecture, this still gives us a big flashing red warning light that allows to protect against the worst outcomes.

Case 2 will be much worse if the buggy client is run by more than 2/3 of the stake, in which case it would be finalizing the invalid chain. More on that later.

Some people think a chain split is so catastrophic that in itself it is an argument for a single-client architecture. But note that the chain split only happened because of a bug in the client. With a single client, if you wanted to fix this and return the chain back to status quo ante, you would have to roll back to the block before the bug happened – that’s just as bad as the chain split! So as bad as a chain split sounds, in the case where there is a critical bug in a client, it’s actually a feature, not a bug. At least you can see that something is seriously wrong.

Incentivising client diversity: anti-correlation penalties

It is clearly good for the network if the stake is split across multiple clients, with the best case being each client owning less than 1/3 of the total stake. This will make it resilient against a bug in any individual client. But why would stakers care? If there aren’t any incentives by the network, it’s unlikely that they will take on the cost of switching to a minority client.

Unfortunately we can’t make rewards directly dependent on what client a validator runs. There is no objective way to measure this that can’t be spoofed.

However, you can’t hide when your client has a bug. And this is where anti-correlation penalties come in: The idea is that if your validator does something bad, then the penalty is higher if more validators make a mistake around the same time. In other words, you get punished for correlated failures.

In Ethereum, you can currently get slashed for two behaviours:

Signing two blocks at the same height
Creating a pair of slashable attestations (surround or double votes)

When you get slashed, you don’t usually lose all your funds. At the time of this writing (Altair fork), the default penalty is actually quite small: You would only lose 0.5 ETH, or about 1.5% of your staked Ether (ultimately this will be increased to 1 ETH or 3%).

However, there is a catch: There is an additional penalty that is dependent on all other slashings that occur during the 4096 epochs (18 days) before and after your validator was slashed. You are further penalized by an amount that is proportional to the total amount slashed during this period.

This can be a much larger penalty than the initial penalty. Currently (Altair fork) it is set so that if more than half of the full staking balance got slashed during this period, then you will lose all your funds. Ultimately this will be set so that you will lose all of your stake if 1/3 of other validators got slashed. 1/3 was chosen because this is the minimum amount of the stake that has to equivocate in order to create a consensus failure.

The other anti-correlation penalty: The quadratic inactivity leak

Another way a validator can fail is by being offline. Again there is a penalty for it, but its mechanism is very different. We do not call it slashing, and it’s usually small: Under normal operation, a validator that is offline is penalized by the same amount that they would be gaining if they were validating perfectly. At the time of this writing, this is 4.8% per year. It is probably not worth breaking a sweat if your validator is offline for a few hours or days, for example due to a temporary internet outage.

It becomes very different when more than 1/3 of all validators are offline. Then the beacon chain cannot finalize, which threatens a fundamental propoerty of the consensus protocol, namely liveness.

To restore liveness in a scenario like this the so-called “quadratic inactivity leak” kicks in. The total penalty amount raises quadratically with time if a validator continues being offline while the chain is not finalizing. Initially it is very low; after ~4.5 days, the offline validators will lose 1% of their stake. However, it increases to 5% after ~10 days and to 20% after ~21 days (these are Altair values, they will be doubled in the future).

This mechanism is designed so that in the case of a catastrophic event that annihilates a large number of validator operations, the chain will eventually be able to finalize again. As the offline validators lose larger and larger parts of their stake, they will make up a smaller and smaller share of the total, and as their stake drops below 1/3, the remaining online validators gain the required 2/3-majority, allowing them to finalize the chain.

However, there is another case where this becomes relevant: In certain cases, validators cannot vote for the valid chain anymore because they accidentally locked themselves into an invalid chain. More on this below.

How bad is it to run the majority client?

In order to understand what the dangers are, let’s take a look at three failure types:

Mass slashing event: Due to a bug, majority-client validators sign slashable attestations
Mass offline event: Due to a bug, all majority-client validators go offline
Invalid block event: Due to a bug, majority-client validators all attest to an invalid block

There are other kinds of mass failures and slashings that can happen, but I’m restricting myself to those related to client bugs (the ones you should consider when choosing which client to run).

Scenario 1: Double signing

This is probably the most feared scenario by most validator operators: A bug leading the validator client to sign slashable attestations. One example would be two attestations voting for the same target epoch, but with different payloads. Because it is a client bug, it’s not just one staker that is concerned, but all stakers that run this particular client. When the equivocations are detected, the slashings will be a bloodbath: All concerned stakers will lose 100% of their staked funds. This is because we are considering a majority client: If the stake of the concerned client were only 10%, then “only” about 20% of their stake would be slashed (in Altair; 30% with the final penalty parameters in place).

The damage in this case is clearly extreme, but I also think it is extremely unlikely. The conditions for slashable attestations are simple, and that’s why validator clients (VCs) were built to enforce them. The validator client is a small, well audited piece of software. A bug of this magnitude is unlikely.

We have seen some slashings so far, but as far as I know all of them where due to operator failures – almost all of them resulting from an operator running the same validator in several locations. Since these aren’t correlated, the slashing amounts are small.

Scenario 2: Mass offline event

For this scenario, we assume that the majority client has a bug, which when triggered, leads to a crash of the client. An offending block has been integrated into the chain, and whenever the client encounters that block, it goes offline, leaving it unable to participate any further in consensus. The majority client is now offline, so the inactivity leak kicks in.

Client developers will scramble to get things back together. Realistically within hours, at most in a few days, they will release a bug fix that will remove the crash.

In the meantime, stakers also have the option to simply switch to another client. As long as enough do this to get more than 2/3 of all validators online, the quadratic inactivity leak will stop. It is not unlikely that this will happen before there is a fix for the buggy client.

This scenario is not unlikely (bugs that lead to crashes are one of the most common types), but the total penalty would probably be less than 1% of the stake affected.

Scenario 3: Invalid block

For this scenario, we consider the case where the majority client has a bug that produces an invalid block, and also accepts it as valid – i.e. when other validators using the same client see the invalid block, they will consider it as valid, and hence attest to it.

Let’s call the chain that includes the invalid block chain A. As soon as the invalid block is produced, two things will happen:

All correctly functioning clients will ignore the invalid block and instead build on the latest valid head producing a separate chain B. All correctly working clients will vote and build on chain B.
The faulty client considers both chain A and B valid. It will thus vote for whichever of the two it currently sees as the heaviest chain.

We need to distinguish three cases:

The buggy client has less than 1/2 of total stake. In this case, all correct clients vote and build on chain B eventually making it the heaviest chain. At this point even the buggy client will switch to chain B. Other than one or a few orphaned blocks, nothing bad will happen. This is the happy case, and why it is great to only have sub-majority clients.
The buggy client has more than 1/2 and less than 2/3 of the stake. In this case, we will see two chains being built – A by the buggy client, and B by all other clients. Neither chain has a 2/3-majority and therefore they cannot finalize. As this happens, developers will scramble to understand why there are two chains. As they figure out that there is an invalid block in chain A, they can proceed to fix the buggy client. Once it is fixed, it will recognize chain A as invalid. It will thus start building on chain B, which will allow it to finalize. This is very disruptive for users. While hopefully the confusion between which chain is valid will be short and less than an hour, the chain probably won’t finalize for many hours, potentially even a day. But for stakers, even the ones running the buggy client, the penalties would still be relatively light. They will receive the “inactivity leak” penalty for not participating in chain B while they were building the invalid chain A. However, since this is likely less than a day, we are talking of a penalty that’s less than 1% of the stake.
The buggy client has more than 2/3 of the stake. In this case, the buggy client will not just build chain A – it will actually have enough stake to “finalize” it. Note that it will be the only client that will think that chain A is finalized. One of the conditions of finalization is that the chain is valid, and to all other correctly operating clients, chain A will be invalid. However, due to how the Casper FFG protocol works, when a validator has finalized chain A, they can never take part in another chain that is in conflict with A without getting slashed, unless that chain is finalized (for anyone interested in the details, see Appendix 2). So once chain A has been finalized, the validators running the buggy client are in a terrible bind: They have committed to chain A, but chain A is invalid. They cannot contribute to B because it hasn’t finalized yet. Even the bugfix to their validator software won’t help them – they have already sent the offending votes. What will happen now is very painful: Chain B, which is not finalizing, will go into the quadratic inactivity leak. Over several weeks, the offending validators will leak their stake until enough has been lost so that B will finalize again. Let’s say they started off with 70% of the stake – then they would lose 79% of their stake, because this is how much they would need to lose in order to represent less than 1/3 of the total stake. At this point, chain B will finalize again and all stakers can switch to it. The chain will be healthy again, but the disruption will have lasted weeks, and millions of ETH were destroyed in the process.

Clearly, case 3 is nothing short of a catastrophe. This is why we are extremely keen not to have any client with more than 2/3 of the stake. Then no invalid block can ever be finalized, and this can never happen.

Risk analysis

So how do we evaluate these scenarios? A typical risk analysis strategy is to evaluate the likelihood of an event happening (1 – extremely unlikely, 5 – quite likely) as well as the impact (1 – very low, 5 – catastrophic). The most important risks to focus on are those that score high on both metrics, represented by the product of impact and likelihood.

Scenario	Likelihood	Impact	Product (Impact * Likelihood)
Scenario 1	1	5	5
Scenario 2	4	2	8
Scenario 3	3	5	15

Looking at this, by far the highest priority is scenario 3. The impact when one client is in a 2/3 supermajority is quite catastrophic, and it is also a relatively likely scenario. To highlight how easily such a bug can happen, a bug of this sort happened recently on the Kiln testnet (see Kiln testnet block proposal failure). In this case, Prysm did detect that the block was faulty after proposing it, and did not attest to it. Had Prysm considered that block as valid, and this had happened on mainnet, then we would be in the catasrophic case described in case 3 of scenario 3 – because Prysm currently has a 2/3 majority in mainnet. So if you are currently running Prysm, there is a very real risk that you could lose all your funds and you should consider switching clients.

Scenario 1, which people are probably most worried about, received a relatively low rating. The reason for this is that I consider the likelihood of it happening to be quite low, because I think that the Validator Client software is very well implemented in all clients and it is unlikely to produce slashable attestations or blocks.

What are my options, if I currently run the majority client and I’m worried about switching?

Switching clients can be a major undertaking. It also comes with some risks. What if the slashing database is not properly migrated to the new setup? There might be a risk of getting slashed, which completely defeats the purpose.

There is another option that I would suggest to anyone who is worried about this. It is also possible to leave your validator setup exactly as it is (no need to take those keys out etc.) and only switch the beacon node. This is extremely low risk because as long as the validator client is working as intended, it will never double sign and thus cannot be slashed. Especially if you have large operations, where changing the validator client (or remote signer) infrastructure would be very expensive and might require audits, this may be a good option. Should the setup perform less well than expected, it can also be easily switched back to the original client or another minority client can be tried.

The nice thing is that you have very little to worry about when switching your beacon node: The worst thing it can do to you is to be temporarily offline. That’s because the beacon node itself can never produce a slashable message on its own. And you can’t end up in scenario 3 if you’re running a minority client, because even if you would vote for an invalid block, that block would not get enough votes to be finalized.

How about the execution clients?

What I have written above applies to the Consensus clients – Prysm, Lighthouse, Nimbus, Lodestar and Teku, of which at the time of writing, Prysm likely has a 2/3 majority on the network.

All of this applies in the same way to the execution client. Should Go-ethereum, likely to be the majority execution client after the merge, produce an invalid block, it could get finalized and thus cause the catastrophic failure described in scenario 3.

Luckily, we now have three other execution clients ready for production – Nethermind, Besu and Erigon. If you are a staker, I highly recommend running one of these. If you are running a minority client, the risks are very low! But if you run the majority client, you are at serious risk of losing all your funds.

Appendix

A1: Why is there no slashing for invalid blocks?

In Scenario 3, we have to rely on the quadratic inactivity leak to punish validators for proposing and voting for an invalid block. That’s strange – why don’t we just punish them directly? It would be faster and less painful to watch.

There are actually two reasons why we don’t do this – one is that we currently can’t, but even if we could, we may well not do it:

Currently, it is practically impossible to introduce a penalty (“slashing”) for invalid blocks. The reason for this is that neither the beacon chain nor the execution chain are currently “stateless” – i.e. in order to check whether a block is valid, you need a context (the “state”) that is 100s of MB (beacon chain) or GB (execution chain) large. This means there is no “concise proof” that a block is invalid. We need such a proof to slash a validator: The block that “slashes” a validator needs to include a proof that the validator has made an offence. There are ways around this without having a stateless consensus, however it would involve much more complex constructions such as multi-round fraud proofs, such as Arbitrum is currently using for their rollup.
The second reason why we might not be that eager to introduce this type of slashing even if we could, is because producing invalid blocks is a much harder thing to protect against than the current slashing conditions. The current conditions are extremely simple and can be validated easily in a few lines of code by validator clients. This is why I consider scenario 1 above so unlikely – slashable messages have so far only been produced by operator failures, and I think that’s likely to remain the case. Adding slashing for producing invalid blocks (or attesting to them) raises the risks for stakers. Now even those running minority clients could risk serious penalties.

In summary, we are unlikely to see direct penalties for invalid blocks and/or attestations to them for the next few years.

A2: Why can’t the buggy client switch to chain B once it has finalized chain A?

This section is for anyone who wants to understand in more detail why the buggy client can’t just switch back and has to suffer the horrendous inactivity leak. For this we have to look how Casper FFG finalization works.

Each attestation contains a source and a target checkpoint. A checkpoint is the first block of an epoch. If there is a link from one epoch to another which has a total of >2/3 of all stake voting for it (i.e., there are this many attestations with the first checkpoint as the “source” and the second checkpoint as the “target”), then we call this a “supermajority link”.

An epoch can be “justified” and “finalized”. These are defined as follows:

Epoch 0 is justified
An epoch is justified if there is a supermajority link from a justified epoch.
An epoch X is finalized if (1) the epoch X is justified and (2) the next epoch is also justified, with the source of the supermajority link being epoch X

Rule 3 is slightly simplified (there are more conditions under which an epoch can be finalized, but they aren’t important for this discussion). Now let’s come to the slashing conditions. There are two rules for slashing attestations. Both compare a pair of attestations V and W:

They are slashable if the target of V and W is the same epoch (i.e. the same height), but they don’t vote for the same checkpoint (double vote)
They are slashable if V “jumps over” W. What this means as that (1) the source of V is earlier than the source of W and (2) the target of V is later than the target of W (surround vote)

The first condition is obvious: It prevents simply voting for two different chains at the same height. But what does the second condition do?

Its function is to slash all validators that take part in finalizing two conflicting chains (which should never happen). To see why, let’s look at our scenario 3 again, in the worst case where the buggy client is in a supermajority (>2/3 of the stake). As it continues voting for the faulty chain, it will finalize the epoch with the invalid block, like this:

The rounded boxes in this picture represent epochs, not blocks. The green arrow is the last supermajority link created by all validators. The red arrows are supermajority links that were only supported by the buggy client. Correctly working clients ignore the epoch with the invalid block (red). The first red arrow will justify the invalid epoch, and the second one finalizes it.

Now let’s assume that the bug has been fixed and the validators that finalized the invalid epoch would like to rejoin the correct chain B. In order to be able to finalize the chain, a first step is to justify epoch X:

However, in order to participate in the justification of epoch X (which needs a supermajority link as indicated by the dashed green arrow), they would have to “jump over” the second red arrow – the one that finalized the invalid epoch. Voting for both of these links is a slashable offense.

This continues to be true for any later epoch. The only way it will get fixed is through the quadratic inactivity leak: As chain B grows, the locked out validators will leak their funds until chain B can be justified and finalized by the correctly working clients.

Exponential EIP-1559

2022-03-16T11:00:00+00:00

Exponential EIP-1559 explainer

In this blog post I will try to help understand how the exponential version of EIP-1559 works – the one that was suggested for the Shard Blob EIP.

I’m not going to try to explain how the EIP-1559 mechanism works – good explainers already exist, for example by Barnabé Monnot.

Linear EIP-1559 mechanics (“original version”)

I will call the current implementation of EIP-1559 the “linear version” of EIP-1559.

In the linear version, we define the constants

$T = 15{,}000{,}000 \text{ (Gas target)}\\ A = 8 \text{ (Max base fee change denominator)}$

Each block $B_i$ has a base fee of $b_i$ and total gas consumed in the block of $g_i$ . There is an update rule for the Basefee:

$b_{i+1} = b_i \cdot \left(1+ \frac{1}{A}\frac{g_i - T}{T}\right)$

There is also a constraint that the maximum amoung of gas per block can’t be more than $2 T$ . However, this limit is not important in the scope of this post so I will ignore it.

Exponential EIP-1559

One way to understand this “linear EIP-1559” better is to compute what happens after many updates. In particular, how does $b_n$ depend on $b_0$ ? By substituting the equation into itself many times, we get

$b_{n} = b_0 \prod_{i=0}^{n-1} \left(1+ \frac{1}{A}\frac{g_i - T}{T}\right)$

Now let’s say that $A$ is a large number, so that all the terms of the form $\frac{1}{A}\frac{g_i - T}{T}$ are small? Let’s call $x_i = \frac{g_i - T}{T}$ .

If we assume this, we can use an approximation: The exponential function $e^x \approx 1+x$ for small $x$ . We use this approximation in the inverse to replace $1+\frac{x_i}{A} = e^{\frac{x_i}{A}}$ to get

$b_{n} = b_0 \prod_{i=0}^{n-1} \left(1+ \frac{x_i}{A}\right) \approx b_0 \prod_{i=0}^{n-1} e^\frac{x_i}{A} = b_0 \exp\left(\frac{1}{A}\sum_{i=0}^{n-1}x_i\right)$

For the last step, we have used the property that $e^x \cdot e^y = e^{x+y}$ . Note that

$\frac{1}{A}\sum_{i=0}^{n-1}x_i = \frac{1}{TA}\sum_{i=0}^{n-1}(g_i - T) = \frac{1}{TA}\left(\sum_{i=0}^{n-1}g_i - nT\right)$

so we get this new formula for $b_n$

$b_{n} = b_0 \exp\left(\frac{1}{TA}\left(\sum_{i=0}^{n-1}g_i - nT\right)\right)= b_0 \exp\left(\frac{1}{TA}\left(G_n - T_n\right)\right)$

where we used $G_n = \sum_{i=0}^{n-1}g_i$ for the total gas used since block 0 and $T_n = nT$ the cumulative gas target.

This is the exponential form of EIP-1559. The only thing we need to keep track of to compute the basefee current $b_n$ is (1) the total gas used, which is $G_n = \sum_{i=0}^{n-1}g_i$ , and the total gas target, which is represented by $T_n = nT$ (so it’s enough to just count the number of blocks).

Analyzing the exponential form

Here is a question I have heard many times about exponential EIP-1559: Isn’t the goal of EIP-1559 that in the long term, the total gas used $G_n$ equals to the gas target $T_n$ ?

And if so, if you evaluate the equation for $G_n = T_n$ , the exponential becomes one and thus $b_n = b_0$ . So if the target is achieved, then the basefee would always be $b_0$ (which is typically very low)?

Here I will explain why this is not the case. Let’s see how this happens in linear EIP-1559 first. Here is a very simple economic model: Let’s assume a there is a “fair price” for gas at price $p$ . For simplicity assume that below $p$ , demand is infinite: All blocks will be filled. Above $p$ , blocks will be empty. And when the base fee is equal to $p$ , then there will be exactly enough demand to fill the blocks to half.

So what will happen in this model in linear EIP-1559?

We assume that the base fee starts of low with $b_0 < p$ .
Then there will be a number of $n_0$ blocks where the basefee is lower than $p$ , which will be completely full by our assumption.
When the basefee reaches $p$ at $n_0$ , all futher blocks will be filled to target.

To keep things simple, say the max block size is exactly $2T$ . Then how much gas has been used at any block $n>n_0$ ? It would be $G_n = 2 n_0 T + (n - n_0) T = n_0 T + n T = T_n + n_0 T$ . So it is actually not exactly the target, despite the EIP-1559 mechanism now being in equilibrium.

What happened? Note that it is still correct to say that EIP-1559 will ensure $G_n \approx T_n$ ; for example, if you take the fraction $\frac{G_n}{T_n}$ , it will tend to $1$ so in asymptotic notation we would write $G_n \sim T_n$ .

But there is a constant difference between $G_n$ and $T_n$ , which is the amount of gas that was needed to shift the basefee from $b_0$ to $p$ .

More generally, the current difference between $G_n$ and $T_n$ is a constant determined by the current basefee. In the linear version, this is approximately true; in the exponential version, it is exactly true (it follows directly from the definition $b_{n} = b_0 \exp\left(\frac{1}{TA}\left(G_n - T_n\right)\right)$ )

Graphical illustration

First let’s look at our simplified example. The following graph illustrates what is happening: Above you see the relation between total gas used and the basefee (an exponential function). Once in equilibrium, the basefee is at $p$ which means that $G_n-T_n = n_0 T$ . Below we see how this adds up. The green shaded area of gas used is the blocks filled up to the target, and the orange area is the gas used above the target that sums up to $n_0 T$ .

Next we look at a generic example, where some blocks are filled with more than target gas and some blocks are below. In this case, we need to sum up all the gas consumed by each block above the target (marked in green) and substract the gas that was below the target in underfull blocks (marked with red and white diagonal stripes). We can then read the sum on the $x$ axis of the basefee relation to determine the basefee $b_n$ .

Appendix: Using differential equations to derive the exponential form

Here is another way of deriving the exponential form, using differential equations:

Let’s start from the update rule of linear EIP-1559

$b_{i+1} = b_i \cdot \left(1+ \frac{1}{A}\frac{g_i - T}{T}\right)$

We can rewrite this as

$b_{i+1} - b_i = b_i \frac{1}{A}\frac{g_i - T}{T}$

Tis is mathematically a difference equation. This is similar to a differential equation, but for finite differences. However, we can “approximate” it as a differential equation, by changing $i$ into a continuous variable and writing

$b'(i) = b(i) \frac{1}{A}\frac{g(i) - T}{T}$

where we use that $b(i)-b(i+1) \approx b'(i)$ . You could say that this form comes about naturally if you assume that we are making the blocks smaller and smaller, scaling the target as well. As the size of the blocks goes towards zero, we get the differential equation.

This is a linear ordinary differential equation of first order that can be solved by moving the terms depending on $b$ to one side:

$\frac{b'(i)}{b(i)} = \frac{1}{A}\frac{g(i) - T}{T}$

Integrating on both sides yields:

$\int\frac{b'(i)}{b(i)} = \ln b(i) = \int_{i=0}^n \frac{1}{A}\frac{g(i) - T}{T} + C$

This is because

$\frac{\mathrm{d}}{\mathrm{d}i} \ln b(i) = \frac{b'(i)}{b(i)}$

So we can exponentiate and get

$b(i) = \exp\left(\int_{i=0}^n \frac{1}{A}\frac{g(i) - T}{T} + C\right)$

Since we want $b(0)=b_0$ we get $C=\ln b_0$ and so

$b(i) = b_0\exp\left(\int_{i=0}^n \frac{1}{A}\frac{g(i) - T}{T}\right)$

This looks almost the same that we derived above – except we now have an integral instead of a sum, because we are working with the continuous form.

The exponential version thus arises naturaly when you consider very small blocks and update the basefee each time. From this perspective, it is the most natural way of implementing EIP-1559.

内积证明

2021-11-18T17:00:00+00:00

原文链接： Inner Product Arguments

翻译：Star.LI @ Trapdoor Tech

介绍

你也许听说过“BulletProofs”：它是一种零知识证明算法，不要求可信设置。比如，门罗币（Monero）就用了这个算法。这种证明系统的核心是内积证明¹，一个能让证明者向验证者证明“内积“正确性的小技巧。内积，即是计算两个向量中每个分量的乘积和：

$\begin{align*} \vec a \cdot \vec b = a_0 b_0 + a_1 b_1 + a_2 b_2 + \cdots + a_{n-1} b_{n-1} \end{align*}$

其中 $\vec a = (a_0, a_1, \ldots, a_{n-1})$ , $\vec b = (b_0, b_1, \ldots, b_{n-1})$ 。

一个比较有趣的例子就是当我们设置向量 $\vec b$ 为某个 $z$ 的幂，即 $\vec b = (1, z, z^2, \ldots, z^{n-1})$ ，那么它的内积就变成了多项式

$\begin{align*} f(X) = \sum_{i=1}^{n-1} a_i X^i \end{align*}$

在 $z$ 点的取值。

内积证明采用Pedersen承诺。我之前写过一篇文章介绍KZG承诺，Pedersen承诺与其类似，承诺值也是在椭圆曲线上的，不同的是它不需要可信设置。下面比较这两个多项式承诺方案(PCS)，KZG承诺方案和 Pedersen承诺与内积证明相结合的方案:

	Pedersen+IPA	KZG
安全假设	离散对数	双线性群
可信设置	否	是
承诺大小	1个群元素	1个群元素
证明大小	2 log n个群元素	1个群元素
验证	O(n) 群运算	1个配对

说到底，和KZG承诺相比，我们这个承诺方案的效率要低一些。证明大小更大（ $O(\log n)$ ），不过对数本身还是挺小的，所以不至于太糟。但可惜的是验证者需要做的计算是线性的, 这失去了简洁性。这些局限使得 Pedersen 承诺对于某些应用来说不太现实，但在一些情况下这些缺点可以被规避。

其中一个例子我在之前的文章多重打开中曾经提到过。这里的诀窍是你可以将多个打开聚合成一个。
Halo2 ², 其中多个打开的线性成本可以被聚合。

在以上这两个例子中，特点就是多个打开的成本被分摊了。如果你只想打开一个多项式，那就比较困难了，你需要承担整个打开的运算成本。

但是，Pedersen 承诺与内积证明结合的方案，很大的好处是较少的安全假设。也就是，不需要配对，并且不需要可信设置。

Pedersen 承诺

在我们讨论内积证明之前，我们要先看一下依赖的结构：Pedersen承诺。为了使用Pedersen承诺，我们需要一个椭圆曲线 $G$ 。让我们首先回顾一下我们使用椭圆曲线可以做到哪些事情（在这里我会使用加法符号表示，看起来更自然一些）：

你可以将两个椭圆曲线点 $g_0 \in G$ 和 $g_1 \in G$ 相加: $h = g_0 + g_1$
你可以将元素 $g \in G$ 与一个标量 $a \in \mathbb F_p$ 相乘，其中 $p$ 是 $G$ 椭圆曲线的阶 (即元素数量): $h=ag$

无法计算两个曲线元素的“乘积”：“ $h * h$ ”运算是未定义的，所以你没办法计算“ $h * h = a g * a g = a^2 g$ ”；与此相反，与标量相乘是很容易计算的，比如 $2 h = 2 a g$ 。

另一个重要的性质就是不存在有效的计算“离散对数”的算法，这意味着对于满足 $h = a g$ 的给定的 $h$ 和 $g$ ，如果你不知道 $a$ ， $a$ 是不可计算的，我们称 $a$ 为 $h$ 对于 $g$ 的离散对数。

Pedersen承诺则利用该不可计算性来构造承诺方案。假设有两个点 $g_0$ 和 $g_1$ ，它们的离散对数(比如存在 $x \in \mathbb F_p$ 使得 $g_1 = x g_0$ )并不可知，那么我们可以向两个数 $a_0, a_1 \in \mathbb F_p$ 提交承诺:

$\begin{align*} C = a_0 g_0 + a_1 g_1 \end{align*}$

$C$ 为椭圆曲线 $G$ 的一个元素。

为打开一个承诺，证明者给验证者 $a_0$ 和 $a_1$ ，然后验证者计算 $C$ ，如果相等的话就被接受。

承诺方案的中心性质在于它是不是绑定（binding）。给定 $C=a_0 g_0 + a_1 g_1$ ，一个试图作弊的证明者是否能够生成 $b_0, b_1 \in \mathbb F_p$ 并使验证者接受它们，即同时满足 $C = b_0 g_0 + b_1 g_1$ 且 $b_0, b_1 \not= a_0, a_1$ 。

如果有人能做到上述行为的话，那么它们也可以找出离散对数。为什么呢？我们知道 $a_0 g_0 + a_1 g_1 = b_0 g_0 + b_1 g_1$ ，整理后可得

$\begin{align*} (a_0 - b_0) g_0 = (b_1 - a_1) g_1 \end{align*}$

所以 $a_0 − b_0$ 和 $b_1 − a_1$ 不可同时为0。假说 $a_0 − b_0$ 不为零，我们得到：

$\begin{align*} g_0 = \frac{b_1 - a_1}{a_0 - b_0} g_1 = x g_1 \end{align*}$

对于 $x = \frac{b_1 - a_1}{a_0 - b_0}$ 。这样我们就找到了 $x$ 的值。我们知道该问题是难题，所以在现实中没有攻击者能够做到。

这意味着对于攻击者来说找到另外的 $b_0 , b_1$ 来打开承诺 $C$ 在计算性上是不可能的。（它们的确存在，只是无法通过计算获得 – 就像哈希碰撞）。

我们可以扩展一个向量的承诺，比如说一个标量列表 $a_0, a_1, \ldots, a_{n-1} \in \mathbb F_p$ 。我们只是需要一个“基”，即一个相等数量的互相未知离散对数的群元素，然后我们就可以计算承诺了：

$\begin{align*} C = a_0 g_0 + a_1 g_1 + a_2 g_2 + \ldots + a_{n-1} g_{n-1} \end{align*}$

这给了我们一个向量承诺，尽管这个承诺相对复杂：为了打开任意元素，必须打开所有的元素。但这里有一个重要的性质：这个承诺方案是加同态的，这意味着如果我们有另外一个承诺 $D = b_0 g_0 + b_1 g_1 + b_2 g_2 + \ldots + b_{n-1} g_{n-1}$ ，那么我们可能可以通过添加两个承诺得到两个向量 $\vec a$ 和 $\vec b$ 的和:

$\begin{align*} C + D = (a_0 + b_0) g_0 + (a_1 + b_1) g_1 + (a_1 + b_1) g_2 + \ldots + (a_{n-1} + b_{n-1}) g_{n-1} \end{align*}$

因为有了加同态的性质，这个向量承诺就变得有用了。

内积证明

内积证明的基本策略是“分治法”：将一个问题规约成多个同类型的子问题，而不是试图一步完全解决它。当子问题规约到一定程度的时候，就可以简单解决。

在这当中每一步，问题的大小都会减半。这保证了 $\log n$ 步后，问题大小会减少到1，可以通过简单证明解决。

假设我们需要证明的承诺 $C$ 具有以下形式：

$\begin{align*} C = \vec a \cdot \vec g + \vec b \cdot \vec h + (\vec a \cdot \vec b) q \end{align*}$

其中 $\vec g = (g_0, g_1, \ldots, g_{n-1}), \vec h = (h_0, h_1, \ldots, h_{n-1})$ ，且 $q$ 是我们的“基”，即：它们是群 $G$ 中的元素，并且它们之间对于任意一方的离散对数都未知。同时介绍一种新的表示方法： $\vec a \cdot \vec g$ ，一个由标量组成的向量（ $\vec a$ ）和另一个由群元素组成的向量（ $\vec g$ ）的乘积，我们将其定义为

$\begin{align*} \vec a \cdot \vec g = a_0 g_0 + a_1 g_1 + \cdots + a_{n-1} g_{n-1} \end{align*}$

也就是说，我们要证明 $C$ 是以下元素的承诺

基为 $\vec g$ 的向量 $\vec a$
基为 $\vec h$ 的向量 $\vec b$
基为 $q$ 的内积 $\vec a \cdot \vec b$ 。

单看本身似乎并不是很有用 – 在大多数应用中我们想让验证者知道 $\vec a \cdot \vec b$ ，而不是仅仅将这个结果藏在一个承诺里。这可以通过我后面要讲到的一个小技巧来解决。

证明

我们想让证明者向验证者证明 $C$ 的确具有形式 $C = \vec a \cdot \vec g + \vec b \cdot \vec h + (\vec a \cdot \vec b) q$ 。就像之前提到的，不是直接证明，而是要把这个问题规约成，如果这一性质对于另一个承诺 $C′$ 成立，则 $C$ 也满足该性质。

接下来证明者就要和验证者玩一个小游戏了。证明者提交一些信息，然后验证者发起一个挑战，从而引出下一个承诺 $C′$ 。将它称作一个游戏不代表这个证明必须是交互的：Fiat-Shamir算法允许我们通过将挑战换成一个承诺的抗碰撞哈希值，从而将交互式的证明转化成非交互式的。

证明描述

承诺 $C$ 符合 $C = \vec a \cdot \vec g + \vec b \cdot \vec h + (\vec a \cdot \vec b) q$ 的形式，并且以 $\vec g, \vec h, q$ 为基。我们将符合这样形式的 $C$ 称为拥有“内积性质”。

规约步骤

假设 $m = \frac{n}{2}$ , 证明者计算

$\begin{align*} z_L = a_m b_0 + a_{m+1} b_1 + \cdots + a_{n-1} b_{m-1} = \vec a_R \cdot \vec b_L \\ z_R = a_0 b_m + a_{1} b_{m+1} + \cdots + a_{m-1} b_{n-1} = \vec a_L \cdot \vec b_R \end{align*}$

这里我们定义 $\vec a_L$ 为向量 $\vec a$ 的“左半部”， $\vec a_R$ 为“右半部”，向量 $\vec b$ 类似。

然后证明者计算如下的承诺：

$\begin{align*} C_L = \vec a_R \cdot \vec g_L + \vec b_L \cdot \vec h_R + z_L q \\ C_R = \vec a_L \cdot \vec g_R + \vec b_R \cdot \vec h_L + z_R q \end{align*}$

并发送给验证者。然后验证者发送挑战 $x \in \mathbb F_p$ (通过使用Fiat-Shamir将它变成非交互式的，这意味着 $x$ 是 $C_L$ 和 $C_R$ 的哈希)。证明者以此来计算更新的向量：

$\begin{align*} \vec a' = \vec a_L + x \vec a_R \\ \vec b' = \vec b_L + x^{-1} \vec b_R \end{align*}$

长度为原向量的一半。

现在，验证者计算新的承诺：

$\begin{align*} C' = x C_L + C + x^{-1} C_R \end{align*}$

还有更新的基：

$\begin{align*} \vec g' = \vec g_L + x^{-1} \vec g_R \\ \vec h' = \vec h_L + x \vec h_R \end{align*}$

现在，如果新的承诺 $C′$ 符合了 $C' = \vec a' \cdot \vec g' +\vec b' \cdot \vec h' + \vec a' \cdot \vec b' q$ 的形式 - 那么承诺 $C$ 就遵从最初的假设，拥有“内积性质”。

所有的向量大小都减半 – 这让我们离成功又近一步。在这里我们替换 $C:=C'$ , $\vec g := \vec g'$ ， $\vec h := \vec h'$ 并重复以上步骤。

接下来我解释这个方法可行的数学原理，同时推荐你们去看看 Vitalik 所做的一个漂亮的可视化展示以得到一些直观的感受。

最终步骤

当我们一直重复以上步骤，n每次降低一半。最终我们会到达 $n=1$ ，然后就可以停下了。这时证明者发送 $\vec a$ 和 $\vec b$ 两个向量，事实上就是两个标量，然后验证者就可以非常直观地计算：

$\begin{align*} D = a g + b h + a b q \end{align*}$

如果等于 $C$ 就接受，反之则拒绝。

正确性(correctness) 及合理性(soundness)

在之前我假设 $C′$ 为我们所需要的形式，然后以此证明 $C$ 也成立。现在我要证明为什么该逻辑成立。我们需要验证以下两点：

正确性 – 即当证明者遵从相应操作的时候，它们可以达到说服验证者该结论为正确的目的；
合理性 – 即试图作弊的证明者不能使用错误的证明通过验证者验证，或者成功率低至可忽略不计。

让我们从正确性证起，假设证明者依照操作进行每一个步骤。既然如此，我们知道给定 $\vec g, \vec h, q$ 为基， $C = \vec a \cdot \vec g + \vec b \cdot \vec h + (\vec a \cdot \vec b) q$ ，同时 $C'= \vec a' \cdot \vec g' +\vec b' \cdot \vec h' + \vec a' \cdot \vec b' q$ 。

验证者计算 $C' = x C_L + C + x^{-1} C_R$ 。

$\begin{eqnarray} C' & = & x C_L + C + x^{-1} C_R \\ & = & x ( \vec a_R \cdot \vec g_L + \vec b_L \cdot \vec h_R + z_L q) \\ & & + \vec a_L \cdot \vec g_L + \vec a_R \cdot \vec g_R + \vec b_L \cdot \vec h_L + \vec b_R \cdot \vec h_R + \vec a \cdot \vec b q \\ & & + x^{-1} (\vec a_L \cdot \vec g_R + \vec b_R \cdot \vec h_L + z_R q) \\ & = & (x \vec a_R + \vec a_L)\cdot(\vec g_L + x^{-1} \vec g_R) \\ & & + (\vec b_L + x^{-1} \vec b_R)\cdot(\vec h_L + x \vec h_R) \\ & & + (x z_L + \vec a \cdot \vec b + x^{-1} z_R) q \\ &=& (x \vec a_R + \vec a_L)\cdot \vec g' + (\vec b_L + x^{-1} \vec b_R)\cdot \vec h' + (x z_L + \vec a \cdot \vec b + x^{-1} z_R) q \end{eqnarray}$

为了使承诺具有内积属性，我们需要验证 $(x \vec a_R + \vec a_L) \cdot (\vec b_L + x^{-1} \vec b_R) = x z_L + \vec a \cdot \vec b + x^{-1} z_R$ 。这个等式成立，因为

$\begin{eqnarray} (x \vec a_R + \vec a_L) \cdot (\vec b_L + x^{-1} \vec b_R) & = & x \vec a_R \cdot \vec b_L + \vec a_L \cdot \vec b_L + \vec a_R \cdot \vec b_R + x^{-1} \vec a_L \cdot \vec b_R \\ & = & x z_L + \vec a \cdot \vec b + x^{-1} z_R \end{eqnarray}$

这样正确性就证明了。为证明合理性，我们需要证明，证明者的初始承诺 $C$ 不具有内积属性，那么通过规约步骤，是无法产生一个具有内积属性的承诺 $C'$ 。

假设证明者提交了 $C=\vec a \cdot \vec g + \vec b \cdot \vec h + r q$ ，其中 $r \neq \vec a \cdot \vec b$ 。如果我们走一遍如上的规约步骤，我们会得到

$\begin{align*} C' = (x \vec a_R + \vec a_L)\cdot \vec g' + (\vec b_L + x^{-1} \vec b_R)\cdot \vec h' + (x z_L + r + x^{-1} z_R) q \end{align*}$

所以现在我们假设证明者成功作弊，那么 $C′$ 就满足了内积属性，则有：

$\begin{align*} (x \vec a_R + \vec a_L) \cdot (\vec b_L + x^{-1} \vec b_R) = x z_L + r + x^{-1} z_R \end{align*}$

展开左侧可得

$\begin{align*} x \vec a_R \cdot \vec b_L + \vec a \cdot \vec b + x^{-1} \vec a_L \cdot \vec b_R = x z_L + r + x^{-1} z_R \end{align*}$

注意证明者可以自由选择 $z_L$ 和 $z_R$ ，所以我们不能直接假设它们会遵从以上定义。

同时乘以 $x$ 并移到同一边，我们得到 $x$ 的二次方程式：

$\begin{align*} x^2 ( \vec a_R \cdot \vec b_L - z_L) + x (\vec a \cdot \vec b - r) + (\vec a_L \cdot \vec b_R - z_R ) \end{align*}$

除非所有项都为零，该等式至多会有两个解 $x \in \mathbb F_p$ ，但是验证者是在证明者已经承诺了他们的 $r$ , $z_L$ 和 $z_R$ 值后选择 $x$ 值，证明者能够成功作弊的概率非常小；我们通常选择域 $\mathbb F_p$ 大小约为 $2^{256}$ 。因此，当证明者不按照协议选择正确值时，验证者选择到一个让等式能够成立的 $x$ 值的概率微乎其微。

这就完成了我们的合理性证明。

仅在最后计算基变化

验证者在每一轮都需要做两件事：计算挑战 $x$ ，并计算更新的基 $\vec g'$ 和 $\vec h'$ 。但是在每一轮都更新 $g$ 效率很低，验证者可以简单地保存他们在 $k$ 轮中遇到的挑战值 $x_1 , x_2 \dots x_k$ 。

假设 $k$ 轮后，这些基为 $\vec g_k, \vec h_k$ 。元素 $g_\ell$ 和 $h_\ell$ 是标量（或者长度为1的向量），因为长度到达1的时候我们会终止协议。通过 $\vec g_0$ 计算 $\vec g_\ell$ ，是一个长度为n的椭圆曲线上的多标量点乘（MSM）。 $\vec g_0$ 的标量因子是下面多项式的系数

$\begin{align*} f_g(X) = \prod_{j=0}^{k-1} \left(1+x^{-1}_{k-j} X^{2^{j}}\right) \end{align*}$

且 $\vec h_0$ 的标量因子由以下多项式给定

$\begin{align*} f_h(X) = \prod_{j=0}^{k-1} \left(1+x_{k-j} X^{2^{j}}\right) \end{align*}$

使用内积证明来验证多项式值

针对我们的主要应用 – 验证 $f(x) = \sum_{i=1}^{n-1} a_i x^i$ 在 $z$ 处的取值 – 我们需要对协议做一些小的扩充。

最重要的一点是，我们想要验证 $f(z) = \vec a \cdot \vec b$ 的结果，而不仅是承诺 $C$ 拥有“内积属性”。
$\vec b = (1, z, z^2, ..., z^{n-1})$ 对于验证者来说是已知的。因此我们可以把这部分从承诺中移除来简化协议。

如何构造承诺

如果我们想要验证多项式 $f(x) = \sum_{i=1}^{n-1} a_i x^i$ ，我们通常需要从承诺 $F = \vec a \cdot \vec g$ 开始进行构造。证明者可以将 $y=f(z)$ 的计算发送给验证者。

那么貌似验证者可以计算最初的承诺 $C=\vec a \cdot \vec g + \vec b \cdot \vec h + \vec a \cdot \vec b q = F + \vec b \cdot \vec h + f(z) q$ ，因为他们已知 $\vec b = (1, z, z^2, ..., z^{n-1})$ ，然后开始证明流程。

但稍等一下。大多数情况下， $F$ 是证明者生成的承诺，一个恶意的证明者可以在这里作弊，比如说提交一个 $F = \vec a \cdot \vec g + tq$ 。在这种情况下，因为证明者生成的承诺有一个偏移，他们能够证明 $f(z) = y - t$ 。

为了避免这种情况，我们需要在证明流程中进行一点改变。收到承诺 $F$ 和计算结果 $y$ 后，验证者生成一个向量 $w$ 并且重新选择基 $q:=wq$ ，之后证明继续。因为证明者不能预判 $w$ 的取值，它们就无法成功操控除了 $f(z)$ 之外的结果（或者说概率极小）。

注意如果想要得到一个通用的内积，我们还要防止证明者操控向量 $\vec b$ – 但在多项式取值的应用中， $\vec b$ 的部分可以完全去掉不用考虑，因此这里略过细节。

如何去掉第二个向量

注意，如果我们想要进行多项式计算，验证者已知向量 $\vec b = (1, z, z^2, ..., z^{n-1})$ 。给定挑战 $x_0, x_1, \ldots, x_\ell$ ，他们可以通过在“仅在最终计算基变化”一节中提到的技巧简单地得到 $b_\ell$ 。

因此，我们可以从所有承诺中移除第二个向量并且只计算 $b_\ell$ 。这意味着验证者必须要能够从初始向量 $\vec b_0 = (1, z, z^2, ..., z^{n-1})$ 中计算最终的 $b_\ell$ 。因为 $\vec b$ 的规约过程与基向量 $\vec g$ 相同，线性组合也由之前定义的多项式 $f_g$ 的系数定义，也就是说 $b_\ell=f_g(z)$ 。

针对点值形式多项式的IPA

目前为止，我们用一个内积证明计算了使用它的系数提交的多项式，即一个由 $f(X) = \sum_{i=0}^{n-1} f_i X^i$ 定义的多项式中的 $f_i$ 。然而，多数情况下我们想要一个在给定定义域 $x_0, x_1, \ldots, x_{n-1}$ 计算值定义的多项式。因为任何阶低于 $n−1$ 的多项式都是由 $f(x_0), f(x_1), \ldots, f(x_{n-1})$ 的计算结果定义的独一无二的多项式，所以这两者是完全相等的。但是这两者之间的转换在计算上非常费时，如果定义域适用快速傅立叶转换的话需要花费 $O(n \log n)$ 次计算，否则就是 $O(n^2)$ 次。

为了避免这项开销，我们尝试避免使用多项式系数形式。这可以通过提交多项式值 $f$ 的承诺的而不是系数的承诺来实现：

$\begin{align*} C = f(x_0) g_0 + f(x_1) g_1 + \cdots + f(x_{n-1}) g_{n-1} \end{align*}$

这表示我们IPA的向量 $\vec a$ 形式为 $\vec a = (f(x_0), f(x_1), \ldots, f(x_{n-1}))$ ：

重心公式使我们现在可以计算这个新的承诺多项式的取值，记作：

$\begin{align*} f(z) = A(z)\sum_{i=0}^{n-1} \frac{f(x_i)}{A'(x_i)} \frac{1}{z-x_i} \end{align*}$

如果我们选择向量 $\vec b$

$\begin{align*} b_i = \frac{A(z)}{A'(x_i)} \frac{1}{z-x_i} \end{align*}$

我们可以得到 $\vec a \cdot \vec b = f(z)$ ，因此采用这种向量的IPA可以被用作证明点值多项式的取值。除了这一点差异之外，其他的证明过程是完全相同的。

Bowe, Grigg, Hopwood: Recursive Proof Composition without a Trusted setup ↩ ↩
Bootle, Cerulli, Chaidos, Groth, Petit: Efficient Zero-Knowledge Arguments forArithmetic Circuits in the Discrete Log Setting ↩ ↩

KZG多项式承诺

2021-10-13T00:00:00+00:00

原文链接： KZG Polynomial Commitments

翻译：Star.LI @ Trapdoor Tech

简介

今天我想向你们介绍一下Kate，Zaverucha和Goldberg发表的多项式承诺方案 ¹。这篇文章并不涉及复杂的数学及密码学理论知识，仅作为一篇简介。

该方案通常被称作卡特（Kate，读作kah-tay）多项式承诺方案。在一个多项式承诺方案中，证明者计算一个多项式的承诺（commitment）, 并可以在多项式的任意一个点进行打开（opening）：该承诺方案能证明多项式在特定位置的值与指定的数值一致。

之所以被称为承诺，是因为当一个承诺值（椭圆曲线上的一个点）发送给某对象（ 验证者），证明者不可以改变当前计算的多项式。它们只能够对一个多项式提供有效的证明；当试图作弊时，它们要不无法提供证明，要不证明被验证者拒绝。

预备知识

如果你对有限域，椭圆曲线和配对这几个话题不是很熟悉的话，非常推荐去读一读Vitalik Buterin的博客：椭圆曲线配对这篇文章。

默克尔树对比

如果你已经熟知默克尔树，我想在此之上和卡特承诺进行对比。默克尔树即是密码学家所说的矢量承诺：运用一个深度为 $d$ 的默克尔树，你可以计算一个矢量的承诺（矢量为一个固定长度的列表 $a_0, \ldots, a_{2^d-1}$ ）。运用熟知的默克尔证明，你可以用 $d$ 个哈希来提供证明元素 $a_i$ 存在于这个矢量的位置 $i$ 。

事实上，我们可以用默克尔树来构造多项式承诺：回忆一下，一个 $n$ 次的多项式 $p(X)$ ，无非是一个函数 $p(X) = \sum_{i=0}^{n} p_i X^i$ ，其中 $p_i$ 是该多项式的系数。

通过设置 $a_i=p_i$ ，我们可以计算这一系列系数的默克尔树根，从而比较容易地对一个 $n=2^{d}-1$ 次的多项式进行承诺。证明一个取值，意味着证明者想要向验证者展示对于某个值z， $p(z)=y$ 。为达到这个目的，证明者可以向验证者发送所有的 $p_i$ ，然后验证者计算p(z)是否等于y。

当然，这是一个极度简单化的多项式承诺，但它能帮助我们理解真实的多项式承诺的益处。让我们一起回顾多项式承诺的性质：

承诺的大小是一个单一哈希（默克尔树根）。一个足够安全的加密散列一般需要256位，即32字节。
为了证明一个取值，证明者需要发送所有的 $p_i$ ，所以证明的大小和多项式次数是线性相关的。同时，验证者需要做同等的线性量级的计算（他们需要计算多项式在 $z$ 点的取值，即计算 $p(z)=\sum_{i=0}^{n} p_i z^i$ ）。
该方案不隐藏多项式的任何部分 - 证明者一个系数接一个系数地发送完整的多项式。

现在让我们一起来看看卡特方案是如何达成以上要求的：

承诺大小是一个支持配对的椭圆曲线群元素。比如说对于BLS12_381曲线，大小应是48字节。
证明大小独立于多项式大小，永远是一个群元素。验证，同样独立于多项式大小，无论多项式次数为多少都只要两次群乘法和两次配对。
大多数时候该方案隐藏多项式 - 事实上，无限多的多项式将会拥有完全一样的卡特承诺。但是这并不是完美隐藏：如果你能猜多项式（比如说该多项式过于简单，或者它存在于一个很小的多项式集合中），你就可以找到这个被承诺的多项式。

还有一点，在一个承诺中合并任意数量的取值证明是可行的。这些性质使得卡特方案对于零知识证明系统来说非常具有吸引力，例如PLONK和SONIC。同时对于一些更日常的目的，或者简单的作为一个矢量承诺来使用也是非常有趣的场景，接下来的文章中我们就会看到。

椭圆曲线以及配对

正如之前所提到的预备知识所说，我强烈推荐Vitalik Buterin的博客：椭圆曲线配对。本文包含了本文所需的背景知识：特别是有限域，椭圆曲线和配对相关知识。

假设 $\mathbb G_1$ 和 $\mathbb G_2$ 是两条满足 $e: \mathbb G_1 \times \mathbb G_2 \rightarrow \mathbb G_T$ 的配对，假设p是 $\mathbb G_1$ 和 $\mathbb G_2$ 的阶，同时G和H是 $\mathbb G_1$ 和 $\mathbb G_2$ 的生成元。接下来，我们定义一个非常有效的速记符号：对于任意 $x \in \mathbb F_p$ $\displaystyle [x]_1 = x G \in \mathbb G_1 \text{ and } [x]_2 = x H \in \mathbb G_2$

可信设置

假设我们已有一个可信设置，使得对于一个秘密s，其子元素 $[s^i]_1$ 和 $[s^i]_2$ 都对于任意 $i=0, \ldots, n-1$ 的证明者和验证者有效。

有一种方法能够达到这种可信设置：我们用离线计算机生成一个随机数 $s$ ，计算所有的群元素 $[s^i]_x$ ，并通过电线传输出去（不包括 $s$ ）,然后烧掉这部计算机。当然这并不是一个好的解决方案，你必须相信计算机的操纵者没有通过其他渠道泄露这个秘密 $s$ 。

在实际应用中，这种设置通常采用安全多方计算（MPC），使用一组计算机来创建这个群元素，而没有任何单一计算机知道秘密s，这样只有挟持了整组计算机才能知道s。

注意这里有一件事是不可能的：你不能仅仅选择一个随机群元素 $[s]_1$ （其中 $s$ 是未知的）然后通过它计算其他的群元素。不知道 $s$ 是无法计算 $[s^2]_1$ 的。

好了，椭圆曲线密码学基础告诉我们通过可信设置的群元素是无法破解 $s$ 的，它是有限域 $\mathbb F_p$ 中的一个数字，但证明者无法找出它的具体数值。他们只能在给定的元素上做一些特定的计算。举个例子，他们可以用椭圆曲线乘法轻易地计算 $c [s^i]_1 = c s^i G = [cs^i]_1$ ，或者说将椭圆曲线点值相加算出 $c [s^i]_1 + d [s^j]_1 = (c s^i + d s^j) G = [cs^i + d s^j]_1$ 。实际上如果 $p(X) = \sum_{i=0}^{n} p_i X^i$ 是一个多项式，证明者可以计算 $\displaystyle [p(s)]_1 = [\sum_{i=0}^{n} p_i s^i]_1 = \sum_{i=0}^{n} p_i [s^i]_1$

这就显得非常有趣 – 通过使用这套可信设置，任何人都可以计算出一个多项式在一个谁也不知道的秘密点s上的值。只是他们得到的输出值不是一个自然数，而是一个椭圆曲线点 $[p(s)]_1 = p(s) G$ ，这已经足够有用。

卡特承诺

在卡特承诺方案中，元素 $C = [p(s)]_1$ 是多项式 $p(X)$ 的承诺。

这样你可能会问了，证明者是不是在不知道 $s$ 的情况下找到另一个有相同承诺的多项式 $q(X) \neq p(X)$ ，使得 $[p(s)]_1 = [q(s)]_1$ ？我们假设这个推理成立，那么就是说 $[p(s) - q(s)]_1=[0]_1$ ，即 $p(s)-q(s)=0$ 。

$r(X) = p(X)-q(X)$ 本身就是一个多项式。我们知道它不是常数，因为 $p(X) \neq q(X)$ 。有一个非常著名的定理，即是任意非常数的 $n$ 次多项式至多可以有 $n$ 个零点，这是因为如果 $r(z)=0$ ， $r(X)$ 就可以被线性因子 $X−z$ 整除；因为每一个零点都意味着可以被一个线性因子整除，同时每经过一次除法会降低一阶，所以推理可知至多存在 $n$ 个零点²。

因为证明者不知道 $s$ ，他们只能通过在尽可能多的地方让 $p(X)−q(X)=0$ 来使得 $p(s)−q(s)=0$ 。如上所证，他们只能在至多 $n$ 个点上使 $p(s)−q(s)=0$ ，那么成功的可能性就很小，因为 $n$ 比起曲线的次数 $p$ 要小很多， $s$ 被选中成为 $p(X)=q(X)$ 成立点的概率是微乎其微的。来感受一下这个概率的大小，假设我们采用现有最大的可信设置，当 $n = 2^{28}$ ，把它来和曲线顺序 $p \approx 2^{256}$ 对比：攻击者设立的多项式 $q(X)$ 来与 $p(X)$ 尽可能多的重合， $n=2^{28}$ 个点，得到相同承诺（p(s)=q(s)）的概率是 $2^{28}/2^{256} = 2^{28-256} \approx 2 \cdot 10^{-69}$ 。这是一个非常低的概率，在现实中意味着攻击者没有办法施行该攻击。

多项式相乘

目前为止我们学习了在一个秘密 $s$ 的多项式取值是可计算的，这就使得我们可以对一个独一无二的多项式进行承诺 - 对于同一个承诺 $C=[p(s)]_1$ 存在多个多项式，但是在实践中它们其实是无法计算的（这就是密码学家所说的绑定（computationally binding））。

但是，我们仍缺少在不发送给验证者完整多项式的情况下“打开”这个承诺的能力。为了达到这个目的，我们需要用到配对。如上所述，我们可以对这个秘密进行线性操作；举个例子，我们可以计算 $p(X)$ 的承诺 $[p(s)]_1$ ，还可以通过两个承诺 $p(X)$ 和 $q(X)$ 来计算 $p(X)+q(X)$ 的联合承诺： $[p(s)]_1+[q(s)]_1=[p(s)+q(s)]_1$ 。

现在我们所缺少的就是两个多项式的乘法。如果我们做到乘法，就能利用多项式的性质打开更多酷炫玩法的大门。尽管椭圆曲线本身不允许作乘法，幸运的事我们可以通过配对解决这个问题：我们有

$\displaystyle e([a]_1, [b]_2) = e(G, H)^{(ab)} = [ab]_T$ 在这里介绍一个新的标识方法： $[x]_T = e(G, H)^x$ 。这样，尽管我们不能直接在椭圆曲线上直接将两个元素相乘得到它们的乘积，一个椭圆曲线元素（这就是所谓全同态加密/FHE的一个性质；椭圆曲线仅是加同态)。如果是在不同的曲线上（比如一个在 $\mathbb G1$ ，另一个在 $\mathbb G2$ 上）提交承诺，我们可以将两个字段元素相乘，这样所得到的输出就是一个 $\mathbb G_T$ 元素。

这里我们就看到了卡特证明的核心。记得我们之前提到的线性因子：如果一个多项式在 $z$ 处有零点，那么它就可以被 $X−z$ 整除。同理反向可证 - 如果多项式可以被 $X−z$ 整除，那么它必在 $z$ 处有零点。可被 $X−z$ 整除，意味着对于某个多项式 $q(X)$ 我们可得 $p(X)=(X−z)⋅q(X)$ ，并且很明显在 $X=z$ 处得到零点。

举个例子，我们想要证明 $p(z)=y$ ，使用多项式 $p(X)−y$ – 明显该多项式在 $z$ 处达到零点，这样我们就可以应用线性因子的知识。取多项式 $q(X)$ ， $p(X)−y$ 被线性因子 $X−z$ 除，即：

$\displaystyle q(X) = \frac{p(X)-y}{X-z}$

这就等同于 $q(X)(X-z) = p(X)-y$ 。

卡特证明

定义 $p(z)=y$ 的卡特证明为 $π=[q(s)]_1$ ，记得多项式 $p(X)$ 的承诺是 $C=[p(s)]_1$ .

验证者用如下等式来确认这个证明：

$\displaystyle e(\pi,[s-z]_2) = e(C-[y]_1, H)$ 注意验证者可以计算 $[s−z]_2$ ，因为这仅是可信设置的元素 $[s]_2$ 和多项式被计算的点 $z$ 的一个组合。同样的，验证者已知了 $y$ 是取值 $p(z)$ ，所以他们也可以计算 $[y]_1$ 。那么为什么上述证明能向验证者证明 $p(z)=y$ ，或者更准确地说， $C$ 所提交的多项式在 $z$ 点的取值是 $y$ ？

这里我们需要考证两个性质：正确性 和 可靠性。 正确性 指的是如果证明者遵循我们定义的步骤，他们就可以产出一个能被验证的证明。这个通常难度不大。还有就是可靠性，这个性质是指证明者不会产出一个“不正确”的证明 – 比如说，他们不会欺骗验证者对于某个 $y′≠y$ ， $p(z)=y′$ 。

接下来我们先写出配对组的对应等式： $\displaystyle [q(s) \cdot (s-z)]_T = [p(s) - y]_T$ 正确性非常一目了然 – 这就是等式 $q(X)(X−z)=p(X)−y$ 在一个没人知道的随机点 $s$ 的取值。

那么，我们怎么才能知道它的可靠性，证明者不会创建假的证明呢？让我们从多项式的角度来看待这个问题。如果证明者想依循我们的方法来构建一个证明，他们就需要用 $X−z$ 来除 $p(X)−y′$ 。但是 $p(z)−y′$ 并不为零，无论怎么除都会有一个余数，所以他们就无法进行这个多项式除法。这样一来，证明者就无法用这个方法进行伪造了。

剩下的就只能直接在椭圆群中想办法了：如果说对于某个承诺 $C$ ，他们可以计算椭圆群元素

$\displaystyle \pi_\text{Fake} = \frac{1}{s-z} (C-[y']_1)$ 一旦成立，那证明者就可以为所欲为了。感觉上这是很难做到的，你必须用和s相关的什么东西来求幂，但s又是未知的。为了严格证明，你需要针对证明和配对的一个密码学假设，即所谓的 $q$ -strong SDH假设 ³。

多重证明

为这里为止我们已经看到了如何在一个单点上证明一个多项式取值，这是已经是非常了不起的一件事：你可以仅靠发送单个的群元素（可以是48字节大小，例如BLS12_381）来证明任何次数的多项式 - 比如说 $2^{28}$ 次 – 在任意点的取值。作为对比，在一个简单的把默克尔树用作多项式承诺的例子中，我们需要发送 $2^{28}$ 个元素，即这个多项式所有的系数。

更进一步，我们来看看如何仅使用一个群元素，来计算并证明一个多项式在任意多个点 的取值。首先我们需要了解一个新概念：插值多项式。有一个包含k个点的列表 $(z_0, y_0), (z_1, y_1), \ldots, (z_{k-1}, y_{k-1})$ ，我们随时都可以找到一个次数小于 $k$ 的多项式来经过这些点。其中一个方法是利用拉格朗日插值，这样我们可以得到该多项式的公式I(X)：

$I(X) = \sum_{i=0}^{k-1} y_i \prod_{j=0 \atop j \neq i}^{k-1} \frac{X-z_j}{z_i-z_j}$ 现在我们假设已知 $p(X)$ 经过了所有的点，那么多项式 $z_0, z_1, \ldots, z_{k-1}$ 都是零点。这就意味着多项式可被所有的线性因子： $(X-z_0), (X-z_1), \ldots (X-z_{k-1})$ 整除，我们将它们组合在一起，称为零多项式：

$\displaystyle Z(X) = (X-z_0) \cdot (X-z_1) \cdots (X-z_{k-1})$ 我们可以计算商值

$\displaystyle q(X) = \frac{p(X) - I(X)}{Z(X)}$ 注意，因为 $p(X)−I(X)$ 能被 $Z(X)$ 所有的线性因子整除，所以它能被 $Z(X)$ 本身整除。

现在我们可以定义这个计算 $(z_0, y_0), (z_1, y_1), \ldots, (z_{k-1}, y_{k-1})$ 的卡特证明： $\pi=[q(s)]_1$ – 这仍然仅是一个群元素。

为了验证这个证明，验证者同样需要计算插值多项式 $I(X)$ 和零多项式 $Z(X)$ ，使用这些结果他们可以计算 $[Z(s)]_2$ 和 $[I(s)]_1$ ，然后就可以确认配对等式：

$\displaystyle e(\pi,[Z(s)]_2) = e(C-[I(s)]_1, H)$ 将该等式写成配对，我们可以像单点上的卡特证明一样简单地确认它是否能够成立：

$\displaystyle [q(s)\cdot Z(s)]_T = [p(s)-I(s)]_T$ 这就非常酷炫了：仅仅提供一个群元素，你就能证明任何数量的计算，甚至是百万个！这相当于通过48个字节来证明海量的计算。

将卡特作为矢量承诺来使用

尽管卡特承诺被设计成多项式承诺，但它作为矢量承诺来使用也大有用处。回忆一下，一个矢量承诺是针对矢量 $a_0, \ldots, a_{n-1}$ 的承诺，并且允许你证明任意位置 $i$ 对应 $a_i$ 。我们可以使用卡特承诺的方案来重现这一场景：使 $p(X)$ 为对所有的 $i$ 计算 $p(i)=a_i$ 的一个多项式，我们知道这样一个多项式存在，并且可以通过拉格朗日插值来计算它：

$\displaystyle p(X) = \sum_{i=0}^{n-1} a_i \prod_{j=0 \atop j \neq i}^{n-1} \frac{X-j}{i-j}$ 使用这个多项式，我们可以就可以利用一个单一群元素来证明这个矢量中任意数量的元素！注意到比起默克尔树（在证明大小方面）这个方案更加高效：仅证明一个元素，默克尔证明就需要花费 $\log n$ 大小的哈希！

Proofs of Custody

2021-09-30T00:00:00+00:00

Thanks to Vitalik Buterin, Chih-Cheng Liang and Alex Stokes for helpful comments

A proof of custody is a construction that helps against the “lazy validator” problem. A lazy validator is a validator that instead of doing the work they are supposed to do – for example, ensuring that some data is available (relevant for data sharding) or that some execution was performed correctly (for execution chains) – they pretend that they’ve done it and sign the result, for example an attestations that claims the data is available anyway.

The proof of custody construction is a cryptoeconomic primitive that changes the game theory so that lazy validating simply isn’t an interesting strategy anymore.

Lazy validators – the game theory

Let’s assume there is a well-running Ethereum 2.0 chain (insert your favourite alternative PoS blockchain if you prefer). We don’t usually expect that bad things – data being withheld, invalid blocks being produced happens. In fact, you are likely to not see them ever happen, because as long as the system is run by a majority of honest validators there is no point in even trying to attack it in one of these ways. Since the attack is pretty much guaranteed to fail, there is no point in even doing it.

Now assume you run a validator. This comes with different kinds of costs – obviously the staking captial, but also hardware costs, electricity and internet bandwidth, which you might pay for directly (your provider charges you per GB) or indirectly (when your validator is running, your netflix lags). The lower you can make this cost, the more net profits you make from running your validator.

One of the tasks you do as a validator in sharded Eth2, is to assure the availability of shard data. Each attestation committee is assigned one blob of data to check, which is around 512 kB to 1 MB. The task of each validator is to download it and store it for around 90 days.

But what happens if you simply sign all attestations for shard blobs, without actually downloading the data? You would still get your full rewards, but your costs have suddently decreased. We are assuming the network is in a good state, so your laziness isn’t going to do anything to the network immediately. Let’s say your profit of running a validator was $1 per attestation, and the cost of downloading all the blocks was $0.10 per year. Now your profit has increased to $1.10.

	Profit per signed attestation
Honest	$1.00
Lazy	$1.10

This problem is called the verifier’s dilemma and was introduced in Demystifying Incentives in the Consensus Computer by Luu et al.

But I would never do this! Who would cheat like that?

It often seems obvious to us that in games like this, surely you would not succumb to bribery and stay with the honest behaviour. But it’s often more subtle than that.

Let’s assume that after having run a validator for years, a new client comes out that claims to be 10% more cost effective. People run it and see that it works, and it seems to be safe. The way it actually does this is by not downloading the shard blocks.

This could even happen by accident. Someone cut some corners in the development process, everything looks normal, it’s just that it doesn’t join the right shard subnet and nobody missed this, because it does not cause any faults in normal operation.

Some people will probably run this client.

Something else that could happen is that a service could step in to do the downloading for you. For $0.01 per shard blob, they will download the data, store it for 90 days, and send you a message that the data is available and you can sign the attestation. How bad is this?

It’s also quite bad. Because as many people start using this service, it becomes a single point of failure. Or even worse, it could be part of an attack. If it can make more than 50% of validators vote for the availability of a shard blob, without ever publishing the blob, that would be a withholding attack.

As it is often the case, dishonesty can come in many disguises, so our best bet is to work on the equilibrium to make the honest strategy rational.

A proof of custody and an update to the game theory

The proof of custody works like this: Imagine we can put a “bomb” in a shard blob: If you sign this blob, you get a large penalty (you get slashed), of $3,000. You definitely don’t want to sign this blob.

Does that make you want to download it? That is certainly one way to avoid signing the bomb. But if anyone can detect the bomb, then someone can simply write a service that warns you before signing an attestation if it’s a bomb. So the bomb needs to be specific to an individual validator, and noone else can compute whether a shard blob is a bomb.

OK, now we have the essential ingredients for the proof of custody. We need

An ephemeral secret, that is recomputed every custody epoch (ca. 90 days), individual to each validator, and then revealed when it has expired (so that other validators have a chance to check the proof of custody)
A function that takes the whole shard blob data, as well as the ephemeral key, and outputs 0 (not a bomb), or, with very small probability, 1 (this blob is a bomb)

It is essential that the ephemeral secret isn’t made available to anyone else, so there are three slashing conditions:

A validator can get slashed if anyone knows its current ephemeral secret
The ephemeral secret has to be published after the custody period, and failing to do so also leads to slashing
Signing a bomb leads to slashing

How can we create this function? A simple construction works like this. Compute a Merkle tree of leaves (data0, secret, data1, secret, data2, secret, ...) as illustrated here:

graph TB A[Root] -->B[Hash] A --> B1[Hash] B --> C[Hash] B --> C1[Hash] C --> D[data0] C --> E[secret] C1 --> D1[data1] C1 --> E1[secret] B1 --> C2[Hash] B1 --> C3[Hash] C2 --> D2[data2] C2 --> E2[secret] C3 --> D3[data3] C3 --> E3[secret]

Then take the logical AND of the first 10 bits. This gives you a single bit that’s 1 in an expected 1 in 1024 times.

This function cannot be computed without knowing both the secret and the data.

(Because we do want to enable secret shared validators, a lot of work has gone into optimizing this function so that it can be efficiently computed in an MPC, which a Merkle tree cannot. For this we are suggesting a construction based on a Universal Hash Function and the Legendre symbol: https://ethresear.ch/t/using-the-legendre-symbol-as-a-prf-for-the-proof-of-custody/5169)

New game theory

All right, so with the proof of custody, any shard blob has a 1/1,024 chance of being a bomb, and you don’t know which one it is without downloading it.

The lazy validator does just fine when the blob is not a bomb. However, when it is a bomb, we see the big difference: The honest validator simply skips this attestation, which is very minor an simply sets the profit to zero. However, the lazy validator signs it and will get slashed, making a huge loss. The payoff matrix now looks like this:

	Profit for non-bomb attestation	Profit for bomb attestation	Average for 1,024 attestations
Honest	$1.00	$0.00	$1,023.00
Lazy	$1.10	$-3,000.00	$-1,873.60

In the third column, we see that the expected profit for the lazy validator is now negative. Since the whole reason for being lazy was increased profits from lower costs, this means that the lazy validator is not an interesting strategy anymore.

Proof of custody for execution

Another task of validators will be verifying the correct execution of blocks. This means verifying that the new stateroot that is part of a block is the correct one that results from applying all the transactions. The proof of custody idea can also be applied to this: The validator will have to compute the proof of custody in the same way as described above, however the data is the execution trace. The execution trace is some output generated by the step by step execution of the block. It does not have to be complete in any sense; what we want from it is just two properties:

It should be difficult to guess the execution trace without actually executing the block.
The total size of the execution trace should be large enough that simply distributing it in addition to normal blocks is unattractive.

There are some easy options of doing this; for example simply outputting every single instruction byte that the EVM executes would probably result in an execution trace of a few MB per execution block. Another option would be to use the top of the stack.

With fraud proofs, do we still need the proof of custody for execution?

When we upgrade the execution chain to statelessness, which means that blocks can be verified without having the current state, fraud proofs become easy. (Without statelessness, they are hard: Fraud proofs always have to be included on a chain different from the one where the fraud happened, and thus the actual pre-state would not be available when they have to be verified.)

This means that it will be possible to slash a validator who has produced an invalid execution block. Furthermore we can also penalize any validator that has attested to this block. Would that mean that the proof of custody is no longer necessary?

It does certainly shift the balance. But even with this penalty present, lazy validation can still be a rational strategy. It would probably be a bad idea for a validator to simply sign every block without verifying execution, as an attacker only needs to sacrifice a single validator of their own to get you slashed.

However, you can employ the following strategy: On each new block, you wait for some small percentage of other validators to sign it before you sign it yourself. Those who sign it first are unlikely to be lazy validators, as they would be employing the same strategy. This would get you quite good protection in most situations, but at a systemic level it would still leave the chain vulnerable in extreme cases.

The case with fraud proofs is thus improved, but a proof of custody remains superior for ensuring that lazy validation can’t be a rational strategy.

How is it different from data availability checks?

I wrote a primer on data availability checks here. It looks like the proof of custody for shard blobs tries to solve a very similar problem: Ensuring that data that is committed to in shard blob headers is actually available on the network.

So we may wonder: Do we need both a proof of custody and data availability checks?

There is an important difference between the two constructions, though:

Data availability checks ensure the availability of the data independent of the honest majority assumption. Even a powerful attacker controlling the entirety of the stake can’t trick full nodes into accepting data is available that is actually withheld
In contrast, a proof of custody does not help if the majority of the stake is performing an attack. The majority can compute the proof of custody without ever releasing the data to anyone else.

So in a theoretical sense, data availability checks are strictly superior to proof of custody for shard data: They hold unconditionally, whereas the latter only serve to keep rational validators honest, making an attack less likely.

Why do we still need a proof of custody for shard blobs? It might not necessarily be needed. There are however some practical problems with data availability checks that make it desirable to have a “first line of defence” against missing data:

The reason for this is that data availability checks work by excluding unavailable blocks from the fork choice rule. However, this cannot be permanent: data availability checks only ensure that eventually, everyone will see the same result, but not immediately.

The reason for this is that publishing a partially available block, might result in some nodes seeing it as available (they are seeing all their samples) and some other nodes as unavailable (missing some of the samples). Data availability checks ensure that in this situation, the data can always be reconstructed. However, this needs some node to first get enough samples to reconstruct the data, and then re-seed the samples so everyone can see them; this process can take a few slots.

In order to avoid a minority attacker (with less than 1/3 of the stake) to cause such a disruption, we only want to apply data availability checks when the chain is finalized and not immediately. In the meantime, the proof of custody can ensure that an honest majority will only ever build an available chain, where the shard data is already seeded in committees; since the committees are ready to re-seed all samples even if the original blob producer doesn’t, an attacker can’t easily force a partially available block.

In this construction, the proof of custody and data availability checks have two orthogonal functions:

The proof of custody for shard data ensures that an honest majority of validators will only ever build a chain in which all shard data is available and well seeded across committees. A minority attacker cannot easily cause disruption to this.
Data availability checks will guarantee that even if the majority of stake is attacking, they will not be able to get the remaining full nodes to consider a chain with withheld data as finalized.

Just because it has a fixed supply doesn’t make it a good store of value

2021-09-27T10:00:00+00:00

What we should really build is productive assets and stablecoins

Special thanks to David Andolfatto, Vitalik Buterin, Chih-Cheng Liang, Barnabé Monnot and Danny Ryan for comments that helped me improve this essay

I think the “store of value” narrative and the misunderstanding of what “fiat” currency really is are a huge problem undermining the whole of the cryptocurrency world. Only when we come to an honest understanding of this will we really be able to build something better.

Here are some core theses of what I believe and which I will try to illustrate in the full article:

The “store of value” narrative doesn’t hold water. There is no such thing as a guaranteed way of transmitting value into the future, and just having an asset with a fixed supply doesn’t fix that.
If you want your best bet on sending the most value possible into the future, what you really need is productive assets (for long term) and stablecoins (if you need you money in the near future).

Why “store of value” does not exist

Here is a common form of the cryptocurrency narrative: “Look at fiat currency. 1 US Dollar from 1950 had about 10 times more purchasing power than one US dollar now. It’s a scam. If you store your value in US dollars, then you are constantly losing due to inflation. This is because the central bank/government can just print more US Dollars. You should instead store value in an asset with predictable supply, such as gold or Bitcoin, which does not have this problem.”

The true part of this statement is that if you stored your money in USD, then you would have lost a large part of your purchasing power over the decades. That is not in question. The question is, is there another way, implied by the term “store of value”, that does not have this property? Store of value proponents claim that there is if you instead used an asset with a predictable supply. And of course, historical data backs this up to some extent: If you had used gold instead of storing your value in USD, then you would have fared better: You could have bought an ounce for $35 in 1950, and it would now be worth around $1765 (price as of June 20 2021 from here). Given that the Dollar is worth 10x less now due to inflation, that’s $176.50 in 1950-Dollars or a 5x increase in value.

But we could have done much better than this: If we put the $35 in an S&P 500 tracker in 1950, then we would now have a staggering $74,418.65, which is a 212x increase after correcting for the 10x loss in purchasing power of the US Dollar (so 7,441.87 1950-Dollars). So clearly, this investment is a much better “store of value” than investing in gold.

Now Bitcoin has fared much better than both gold and the S&P 500 over the last 10 years. However this is a very short timespan, in which Bitcoin went from an absolutely tiny niche to an asset that most people in the world have heard of and some significant minority has invested in. There is no reason to believe that this can be repeated (I don’t think it can). The historical data for gold says, over long periods of time, stores of value that are purely based on “limited supply” do much worse than productive assets.¹

So why do people believe gold, or Bitcoin, would make a better store of value than just investing in productive assets like companies, real estate, etc.? There are two reasons that I can see:

Stock markets clearly have a lot of volatility. So maybe they believe productive assets are a good long-term store of value, but not for the short term.
The people who believe in “limited supply” stores of value have an apocalyptic mindset. So they believe that in the case of a major social collapse, their stores of value will somehow fare better than more productive assets.

Argument number 1 does not convince me at all. That would depend on their preferred store of value having lower volatility than productive assets, which simply does not bear out in reality. Both gold and Bitcoin are much more volatile than holding an S&P 500 tracker fund. If you want low volatility, then you should still go for the productive assets.

Number 2 means that you can simply “send” value into the future even when society collapses. I think that’s a pretty crazy belief – because when society collapses both the value you can buy as well as the demand for the “limited supply asset” will do as well.

Of course, people think companies (and therefore the S&P 500) will probably go down, but other assets don’t fare any better:

Is property a good “store of value” in a catastrophe? Property is mostly valuable because of where it is in relation to valuable economic and social activity. Central Manhattan property is so valuable because it’s in a city where many want to live. A random plot of land in the middle of nowhere usually has very little value. It’s unlikely to fare that well in a major disaster (and might even work out worse than property with a garden to grow your own vegetables)
Similarly the value attributed to gold is a social convention, albeit one that has lasted for an extremely long time. Society could decide on a new asset to value highly, which is indeed what Bitcoiners argue for. But more importantly, your gold isn’t worth anything if there’s nothing of value to buy.

If we accept that value depends on a society that provides valuable goods, we have to accept that there is simply no guaranteed way to send money into the future. You might as well make real investments in productive assets.

What we need – productive assets and stablecoins

Above, I argued why I think “limited supply stores of value” (unproductive assets like gold or Bitcoin, that derive their value simply from being scarce and not utility value) are of no advantage to productive assets like stocks. They have the same or higher volatility, but at least for gold (for which we have a decent amount of history) is outperformed by productive assets in the long term. The same will probably happen to Bitcoin once it’s absorbed the initial demand and has arrived at a stable position like gold (other results, with it largely losing its current value, are certainly also possible). They also don’t necessarily fare better in catastrophes; if this is what you’re afraid of you might want to buy goods that are useful in a catastrophe instead.

This means productive assets should be the better long-term stores of values, as they are better on all dimensions.

But clearly the volatility that comes with them is undesirable for many applications that fiat currency is used for now: I don’t think many people would appreciate their salary fluctuating by 50% month on month; in fact the vast majority of people would struggle to pay for all their expenses if their salaries suddenly fell by 50%. Many people simply need or want much more stability than that.

Similarly, if you keep money around to buy a house in the near future, or run a company that keeps cash reserves to make sure they can pay their employees and suppliers, you need stability.

Even if we assumed that everyone suddenly started using Bitcoin, it would simply not fix this problem. Since its supply can’t be dynamically adjusted, its value would continue to be very volatile due to economic fluctuations.

Luckily, there are mechanisms around to create stablecoins using only volatile assets for these situations. My favourite system is the idea behind MakerDAO and DAI, which I describe in an article here.

So if the current system is so great why do we even need cryptocurrencies?

I think we need to become more nuanced thinkers in the cryptocurrency space, and start seeing the real properties of the systems we are trying to rebuild if we want to be successful. I think fiat currencies as we know them at the moment have been tremendously successful, as long as we see them for what they are: A hedge against short-term volatility rather than maximizing value long term.

I believe that crypto can vastly improve the current financial system, but hopefully not mainly by providing an asset with a limited supply (which won’t solve most of our most important problems). Instead we should make sure our assets are productive to maximize long term value, and create stablecoins for applications where volatility has to be avoided. This system improves on our current financial system because:

It is much more transparent – anyone can verify balance sheets and exposures, not just specialized audit firms. This is pretty important because currently, the detailed exposures of banks are not public, which means depositors simply don’t know enough to about banks to make an informed decision which ones they can trust
We can make it fairer – giving everyone access at the same conditions. For example, why should banks have access to central bank accounts whereas normal people and companies don’t?
Governance can be improved, bringing everyone to the table when big decisions have to be made (like Quantitative Easing after the Global Financial Crisis)
Getting rid of the baggage (for example physical currency) and thus allowing more flexibility of the system; for example there is no technical need for inflation when all balances are electronic (though in practice, it might be required for psychological reasons or “price stickiness”)
And most importantly, creating a permissionless and censorship resistant system that anyone can participate in at all levels

–

Vitalik pointed out that this will overstate the case against Bitcoin somewhat, because gold supply has increased much more (ca. 3x) since 1950 than Bitcoin will over a similar period. I do not think this will make up for the massive difference in returns between gold and the S&P 500, though. ↩

On supply and demand for stablecoins

2021-09-27T09:00:00+00:00

Special thanks to David Andolfatto, Vitalik Buterin, Chih-Cheng Liang, Barnabé Monnot and Danny Ryan for comments that helped me improve this essay

The value of a freely tradable asset is determined by supply and demand. This obviously applies to stocks and cryptocurrencies. But it also applies to any “stablecoin” we are trying to create. It even applies to traditional fiat currencies like the US Dollar or the Euro.

When I talk about stablecoins here, I am referring to decentralized, collateralized stablecoins like MakerDAO’s DAI – not to USDT or USDC, where the supply/demand problem is obvious. So how does MakerDAO balance supply and demand for stablecoins?

And how does this help us learn how central banks do this for fiat currencies?

How to create a stablecoin

Let’s understand how we can create a stablecoin if as a building block we only have assets which are subject to undesirably large volatility. Luckily we have a great example on how to do this by means of collateralized stablecoins, the prime example of which is MakerDAO, the project behind the DAI stablecoin.

The idea behind this project is to create a token, called DAI, that tracks the value of one USD as closely as possible. Note that instead of using USD, we can track any other asset as well – RAI, as an example, tracks a time-averaged version of the Ether price. I suggest that long-term, the Ethereum community should strive to create an Oracle that tracks the prices of consumer goods in Ether, so that we can create a stablecoin that has nothing to do with any currently existing fiat currency and is thus truly global and independent. But as a starting point, using USD which is a denomination that most of the world understands intuitively as relatively stable was probably a very good idea.

How did MakerDAO manage to create this stablecoin, without any cash reserves in the form of bank accounts in USD and only the on-chain assets, which are all highly volatile? The core idea is the so-called Collateralized Debt Position, or CDP. It’s a margin position where someone can lock up a volatile asset – for example Ether – and in return create, or “borrow”, a number of DAI. The CDP essentially splits the value of the locked up Ether into two tranches:

The first tranche is the “debt tranche” – this tranche is fixed in its USD value and belongs to whoever owns the actual DAI stablecoins
The second tranche is the equity tranche – it belongs to the owner of the CDP and is the value that is left once the first tranche is satisfied

Notice I called them “debt” and “equity” here, because that’s the way we call them when we talk about companies doing the same thing: When companies need capital, they can raise “debt” – in the form of bank loans and bonds, typically – which is very predictable and gets preference (as in is paid back first using the remaining assets) when the company runs out of money. That’s why bonds (which are tradable debt) are quite stable in price: As long as the company doesn’t go bust, they will always be paid back. Equity is the value that’s left over once these debt positions are satisfied, and is traded in the form of stocks – which are much more volatile, because their value depends on the profitability of the company, not just it’s solvency.

The elegance of this system is that the equity position can absorb the volatility, so that the debt holder (which is whoever holds the DAI thus created) has a predictable value. As an illustration here, see what happens when the value of the 1 ETH that has been locked up in the above CDP fluctuates: The equity holder gets a position that’s now highly volatile (and in return, if the value of ETH goes up, will get much enhanced returns). The “debt” part of the CDP stays nice and constant and is always worth 1000 USD, as long as the ETH price does not crash too rapidly.

This last part may look scary as if the red “equity” part of the line ever goes to the $1,000 line, the DAI debt could suddenly not be satisfied and thus the value of one DAI would fall below one USD. However, MakerDAO will actually liquidate CDP positions once they get too close to zero equity. This works by auctioning off the collateral to the highest bidder in DAI.

This means that in practice, MakerDAO can deal with extreme falls if they do not happen too rapidly; this has been tested repeatedly, for example in March 2020 when DAI held its peg despite a precipitous fall in crypto asset values.

(This largely describes the old version of DAI, single-collateral DAI (which only accepted ETH as collateral). The current instantiation, multi-collateral DAI, differs in that it also accepts other forms of collateral (which is great), some of which are centralized stablecoins (such as USDC) which is not so good in my opinion.)

Why we need to add interest rates to this

MakerDAO has a simple mechanism to make sure the long-term expected value of DAI should be one USD: In the case of a large deviation, the governance system can trigger global settlement, which will immediately give all DAI holders their current equivalent in ETH by tapping all the CDPs that secure it. However, this event can be far in the future and thus doesn’t guarantee that the instantaneous price is exactly one DAI.

Let us understand the goal that MakerDAO has with DAI: They want 1 DAI to always be worth 1 USD.

One might think: Oh but it would be ok if it sometimes is more than 1 USD right? As a matter of fact, this is also bad: If it costs more than 1 USD to get 1 DAI, then MakerDAO would have failed. Because if I can only get a DAI for 1.10 USD then it means it doesn’t act as a stablecoin for me – it can suddenly fall by 10% and I will lose that value when it goes back to its intended peg of 1 USD. It’s thus essential that the peg is always kept in both directions.

But like any freely traded asset, the value of DAI is determined by supply and demand.

What does it mean for a price to be determined by supply and demand? Let’s say we’re talking about a commodity like wheat with many independent buyers and sellers. The buyers of wheat follow a certain “demand curve”: The higher the price of wheat, the lower the quantity demanded; this is intuitively easy to see: if wheat becomes really expensive I will buy rice instead of flour. If wheat becomes crazy cheap then I will substitute other foods by using more wheat or even buy a few extra bags just in case I need it later. The behaviour of many consumers in aggregate makes this a smooth curve.

The supply curve looks at the other side, the producers of wheat who want to sell it into the market. The suppliers are farmers who grow wheat. They make a similar decision based on the current market price. If the price is low, they won’t grow wheat, or potentially put some of it in storage to sell later at a higher price. If the price is high, they can replace other crops with wheat or even grow it on fields that aren’t currently worthwhile because the yield is lower or it’s harder to harvest.

Conceptually the two curves can be drawn into a graph like this:

Economists traditionally put price on the $y$ -axis (vertical) in this graph, when as the independent variable it would usually be on the $x$ -axis (horizontal).

There is a price at which both curves meet. In equilibrium, this is the expected price for the commodity if there isn’t any interference with the market. This is because if the price is lower, then not all demand can be satisfied, so producers will notice they can be more profitable by increasing their prices, thus raising the overall price. On the other hand, if the current price is higher, then there will be too much supply fighting for the few consumers wanting to buy wheat, and thus the producers who lower their prices will be the ones making a profit (or a lower loss) as the consumers will be turning to them. The only stable point is where the two curves meet.

The same applies to DAI – which can be traded freely on exchanges.

Supply of DAI is given by those people who are happy to take a CDP position, which basically means leveraging their volatile assets, in order to create more DAI, as well as anyone already holding DAI and wanting to sell. Demand comes from those who want the stability of keeping their value in DAI.

These two curves don’t necessarily meet at a price of one USD per DAI.

As an example, if the market for Ether is very bullish and many people think it will go up, then it probably means that there is little demand for the stability that holding DAI provides and a high demand for leveraged Ether positions. People who are very bullish on Ether would be tempted to leverage their positions to profit even more when the price increases. In this kind of environment, so many people want to take out CDPs and create DAI that there are not enough people interested in actualling using all the DAI. The value of DAI would fall below the peg, which is undesirable.

MakerDAO can correct this by adding a positive interest rate (“savings rate”) for holding DAI, rewarding the holders and charging those who take the margin position. This makes it more attractive to hold DAI. You may think ETH is a great investment, but it’s volatile, so maybe DAI with a 5% savings interest would seem attractive. If it’s not 5%, then maybe 10% is. At some value for this interest rate, the demand for DAI will increase enough (and the supply in form of CDP decrease enough) such that the value of DAI will return to the intended peg.

But the reverse is also possible – in an environment where many people prefer the stability (maybe in a “bear market” where holding Ether isn’t as attractive), a negative interest rate makes holding DAI less attractive and thus reduces the demand. On the other hand, taking out a CDP becomes more attractive when you actually get paid for it. You may be scared of taking out a 1000$ loan against your ETH, but what if you got paid 10% or 100$ per year for it?

So we now effectively have another dimension in order to change supply and demand for DAI – the savings interest rate. A lower rate (even negative) rate will decrease demand and increase supply, leading to a lower DAI price. A higher rate does the opposite and increases the price of DAI. In order to move the price to 1 USD, we just have to adjust the interest rate until the prices agree.

Here is a graphic that illustrates how this works:

On the left, we have supply and demand curves at an interest rate of 1%. The curves meet at a price of 0.95 USD, which is the current fair market price of DAI and thus too low. In this situation MakerDAO would need to raise the interest rate. By raising the interest to 2% (on the right), the CDPs become less attractive (shifting supply) and holding DAI becomes more attractive, thus making the curves meet at the desired price 1.00 USD.

In the light of this, I am very happy that MakerDAO has after a long time decided to implement the ability to support negative interest rates. They are essential when a lot of stability is demanded. In fact, not having this in the past has required the very unfortunate decision to use centralized stablecoins such as USDC to back DAI, otherwise the demand could not have been satisfied and it would have shot above the peg. Hopefully, long term, this will be reversed.

To summarize, the interest rate is a mechanism that balances the demand and supply of the stablecoin. An ideal system should simply pick a rate that equates supply and demand – this interest rate would represent the fair market price for keeping value stable. Depending on the overall economic situation, this interest rate can be either positive or negative.

An analogy to fiat currencies

“Fiat” currency is actually a huge misnomer for our state currencies. “Fiat” implies that someone just creates a large amount of (what we in the cryptocurrency ecosystem would call) tokens and – by “fiat” (latin “let it be done”) – tells everyone that this is now money.

However this is not really how fiat currency works. Fiat currencies are actually to some extent “collateralized stablecoins” as described above, with some extra complications. As many commenters have noted in the past, we should be calling them “credit currencies” instead.

To see this, we need to understand that traditional money consists of two different components (there are more but these two will give the idea how it works):

Central bank money, which consists of reserve accounts (that banks have with the central bank) as well as all the physical money (bills, coins) in circulation; this is often denoted M0 (and can properly be called “fiat” currency)
Bank deposits, which is basically the money you have in your bank account, and similar liquid deposits. This is called M1.

But what actually is M1 money? It’s nothing else but “debt” that your bank owes you. This debt is often created by someone taking out a loan from the bank: E.g. when you take a mortgage, two accounts are created: One that says “bank owes you money” and the other one “you owe the bank money” and they cancel each other out. Your bank’s net position hasn’t changed, although it has become riskier (more leveraged) through the process. And new deposits have been created, thus enlarging the M1 quantity.

But that mortage is backed – collateralized – by both your income and the property it’s taken out for. In effect, each loan a bank gives out is very similar to our collateralized debt position above. When you take out your mortgage for 200,000 USD, your CDP is:

You are long 1 house
You are short 200,000 USD

Now it looks much more like a CDP. While central banks and states have other tools to change supply and control inflation, this debt mechanism is a powerful constraint that can dynamically adjust the quantity while keeping the value of the currency more or less unchanged.

So are negative rates and inflation not a scam?

As we have seen previously, DAI sometimes needs negative interest rates to maintain the peg. In the wake of the financial crisis of 2008, people were surprised that interest rates on bank accounts, and indeed even central bank interests, can be negative. But is this really that surprising?

Central banks have more than a single lever to adjust supply and demand for their currencies, but interest rates are still an important one. Negative interest rates send a signal to the market that rebalances the equilibrium towards lower demand for stable currency and higher supply by means of people taking debt in order to invest it in ventures.

Furthermore, inflation is basically a negative interest rate on physical cash (bills and coins), which is necessary because we don’t have any way of applying it directly. If all balances were electronic, we could equivalently also just apply the negative interest rates directly to the balances and not have any inflation. (This ignores price stickiness, which is another problem that probably also favors some form of inflation)

Conclusion

MakerDAO has demonstrated that even if you only have a volatile asset, like ETH, you can build a stable currency on top. For simplicity, a peg to the US Dollar was chosen, but it doesn’t have to be a currency. Any measure of value could be used, as long as we have a way to find a reasonably objective oracle for it.

I don’t believe that assets that are only defined by their limited supply – such as gold or Bitcoin – are very good “stores of value”. Historically speaking, the S&P 500 has vastly outperformed gold ¹ at lower volatility. I don’t think this will be different for Bitcoin and other “limited supply” assets. If what you want to do is maximize value over long timescale, productive assets (which Ethereum will be after EIP1559 and the merge) are a much better bet.

If instead you want stability over short term, you need an explicit mechanism that guarantees that; stablecoins are one, and fiat has similar mechanism. But someone will have to take the other side, and you will probably have to pay for it by having lower or even negative returns. That’s the price for stability.

You can also do something in between, like Reflexer Labs RAI. What I don’t see is how gold or Bitcoin, simply by having a fixed supply, provide something superior. They don’t. They will be strictly inferior by providing less returns at higher volatility than productive assets and the stable synthetix we can build using them. I wrote an essay about this topic: Just because it has a fixed supply doesn’t make it a good store of value

–

Starting in 1950, investing $35 in gold (one ounce) would have yielded $1765 (price as of June 20 2021 from here) vs $74,418.65 for investing in an S&P 500 tracker. Both yield positive returns even after accounting for the ca. 90% inflation of the USD, but the S&P 500 is much better at 212x real returns vs only 5x for gold. Also gold is more volatile than stocks. ↩

Inner Product Arguments

2021-07-27T23:00:00+00:00

中文版本: 内积证明

Introduction

You might have heard of Bulletproofs: It’s a type of zero knowledge proof that is used for example by Monero, and that does not require a trusted setup. The core of this proof system is the Inner Product Argument ¹, a trick that allows a prover to convince a verifier of the correctness of an “inner product”. An inner product is the component by component product of two vectors:

$\vec a \cdot \vec b = a_0 b_0 + a_1 b_1 + a_2 b_2 + \cdots + a_{n-1} b_{n-1}$

where $\vec a = (a_0, a_1, \ldots, a_{n-1})$ and $\vec b = (b_0, b_1, \ldots, b_{n-1})$ .

One interesting case is where we set the vector $\vec b$ to be the powers of some number $z$ , i.e. $\vec b = (1, z, z^2, \ldots, z^{n-1})$ . Then the inner product becomes the evaluation of the polynomial

$f(X) = \sum_{i=1}^{n-1} a_i X^i$

at $z$ .

Inner Product Arguments work on Pedersen Commitments. I have previously written about KZG commitments, and Pedersen commitments are similar in that the commitment is in an elliptic curve. However a difference is that they do not require a trusted setup. Here is a comparison of the KZG commitment scheme and using Pedersen combined with an Inner Product Argument as a Polynomial Commitment Scheme (PCS):

	Pedersen+IPA	KZG
Assumption	Discrete log	Bilinear group
Trusted setup	No	Yes
Commitment size	1 Group element	1 Group element
Proof size	2 log n Group elements	1 Group element
Verification	O(n) group operations	1 Pairing

Basically, compared to KZG commitments, our commitment scheme is less efficient. Proofs are larger ( $O(\log n)$ ), which wouldn’t be the end of the world as logarithmic is still very small. But unfortunately, the verifier has to do a linear amount of work, so they are not succinct. This makes them impractical for some applications. However in some cases this can be worked around.

One example is my writeup on multiopenings. In this case, the trick is that you can aggregate many openings into a single one.
The Halo system ², where the linear cost of many openings is aggregated

In both of these examples, the trick is to amortize many openings. If you only want to open a single polynomial, then it’s tough and you have to incur the full cost, though.

However, the big advantage is that Pedersen and Inner Product Arguments come with much fewer assumptions, in particular a pairing is not needed and they don’t require a trusted setup.

Pedersen commitments

Before we can discuss Inner Product Arguments, we need to discuss the data structure that they operate on: Pedersen commitments. In order to use Pedersen commitments, we need an elliptic curve $G$ . Let’s quickly remind ourselves what you can do in an elliptic curve (I will use additive notation because I think it is the more natural one):

You can add two elliptic curve elements $g_0 \in G$ and $g_1 \in G$ : $h = g_0 + g_1$
You can multiply an element $g \in G$ with a scalar $a \in \mathbb F_p$ , where $p$ is the curve order of $G$ (i.e. the number of elements): $h = a g$

There is no way to compute the “product” of two curve elements: the operation “ $h * h$ ” is not defined, so you cannot compute “ $h * h = a g * a g = a^2 g$ ”; as opposed to multiplying by a scalar; so $2 h = 2 a g$ , for example, is easy to compute.

Another important property is that there is no efficient algorithm to compute “discrete logarithms”. The meaning of this is that given $h$ and $g$ with the property that $h=ag$ , if you don’t know $a$ it is computationally infeasible to find $a$ . We call $a$ the discrete logarithm of $h$ with respect to $g$ .

Pedersen commitments make use of this infeasibility to construct a commitment scheme. Let’s say you have two points $g_0$ and $g_1$ and their discrete logarithm with respect to each other (i.e. the $x \in \mathbb F_p$ such that $g_1 = x g_0$ ) is unknown, then we can commit to two numbers $a_0, a_1 \in \mathbb F_p$ :

$C = a_0 g_0 + a_1 g_1$

$C$ is an element of the elliptic curve $G$ .

To reveal the commitment, the prover gives the verifier the numbers $a_0$ and $a_1$ . The verifier computes $C$ and if it matches will accept.

The central property of a commitment scheme is that it is binding. So given $C=a_0 g_0 + a_1 g_1$ , could a cheating prover come up with $b_0, b_1 \in \mathbb F_p$ such that the verifier will accept them, i.e. such that $C = b_0 g_0 + b_1 g_1$ but with $b_0, b_1 \not= a_0, a_1$ ?

If someone can do this, then they could also find the discrete logarithm. Here is why: We know that $a_0 g_0 + a_1 g_1 = b_0 g_0 + b_1 g_1$ , and by regrouping the terms on both sides of the equation we get

$(a_0 - b_0) g_0 = (b_1 - a_1) g_1$

Either $a_0 - b_0$ or $b_1 - a_1$ have to be not equal to zero. Let’s say it’s $a_0 - b_0$ , then we get:

$g_0 = \frac{b_1 - a_1}{a_0 - b_0} g_1 = x g_1$

for $x = \frac{b_1 - a_1}{a_0 - b_0}$ . Thus we’ve found $x$ . Since we know this is a hard problem, in practice no attacker can perform this.

This means it’s computationally infeasible for an attacker to find alternative $b_0, b_1$ to reveal for the commitment $C$ . (They definitely do exist, they are just computationally infeasible to find – similar to finding a collision for a hash function).

We can generalize this and commit to a vector, i.e. a list of scalars $a_0, a_1, \ldots, a_{n-1} \in \mathbb F_p$ . We just need a “basis”, i.e. an equal number of group elements that don’t have known discrete logarithms between them. Then we can compute the commitment

$C = a_0 g_0 + a_1 g_1 + a_2 g_2 + \ldots + a_{n-1} g_{n-1}$

This gives us a vector commitment, although with quite a bad complexity: In order to reveal any element, all elements of the vector have to be revealed. But there is one redeeming property: The commitment scheme is additively homomorphic. This means that if we have another commitment $D = b_0 g_0 + b_1 g_1 + b_2 g_2 + \ldots + b_{n-1} g_{n-1}$ , then it’s possible to just add the two commitments to get a new commitment to the sum of the two vectors $\vec a$ and $\vec b$ :

$C + D = (a_0 + b_0) g_0 + (a_1 + b_1) g_1 + (a_1 + b_1) g_2 + \ldots + (a_{n-1} + b_{n-1}) g_{n-1}$

Thanks to this additive homomorphic property, this vector commitment actually turns out to be useful.

Inner Product Argument

The basic strategy of the Inner Product Argument is “divide and conquer”: Take the problem and instead of completely solving it, turn it into a smaller one of the same type. At some point, it becomes so small that you can simply reveal everything and prove that the instance is correct.

At each step, the problem size halves. This ensures that after $\log n$ steps, the problem is reduced to size one, so it can be proved trivially.

The idea is that we want to prove that a commitment $C$ is of the form

$C = \vec a \cdot \vec g + \vec b \cdot \vec h + (\vec a \cdot \vec b) q$

where $\vec g = (g_0, g_1, \ldots, g_{n-1})$ and $\vec h = (h_0, h_1, \ldots, h_{n-1})$ as well as $q$ are our “basis”, i.e. they are group elements in $G$ and none of their discrete logarithms with respect to each other are known. We also introduced the new notation $\vec a \cdot \vec g$ for a product between a vector of scalars ( $\vec a$ ) and another vector of group elements ( $\vec g$ ), and it is defined as

$\vec a \cdot \vec g = a_0 g_0 + a_1 g_1 + \cdots + a_{n-1} g_{n-1}$

So essentially, we are proving that $C$ is a commitment to

a vector $\vec a$ with basis $\vec g$
a vector $\vec b$ with basis $\vec h$ and
their inner product $\vec a \cdot \vec b$ with respect to the basis $q$ .

This in itself does not seem very useful – in most applications we want the verifier to know $\vec a \cdot \vec b$ , and not just have it hidden in some commitment. But this can be remedied with a small trick which I will come to below.

The argument

We want the prover to convince the verifier that $C$ is of the form $C = \vec a \cdot \vec g + \vec b \cdot \vec h + (\vec a \cdot \vec b) q$ . As I mentioned before, instead of doing this outright, we will only reduce the problem by computing another commitment $C'$ in such a way that if the property holds for $C'$ , then it also holds for $C$ .

In order to do this, the prover and the verifier play a little game. The prover commits to certain properties, after which the verifier sends a challenge, which leads to the next commitment $C'$ . Describing it as a game does not mean the proof has to be interactive though: The Fiat-Shamir construction allows us to turn interactive proofs into non-interactive one, by replacing the challenge with a collision-resistant hash function of the commitments.

Statement to prove

The commitment $C$ has the form $C = \vec a \cdot \vec g + \vec b \cdot \vec h + (\vec a \cdot \vec b) q$ with respect to the basis given by $\vec g, \vec h, q$ . We call the fact that $C$ has this form the “Inner Product Property”.

Reduction step

Let $m = \frac{n}{2}$

The prover computes

$z_L = a_m b_0 + a_{m+1} b_1 + \cdots + a_{n-1} b_{m-1} = \vec a_R \cdot \vec b_L \\ z_R = a_0 b_m + a_{1} b_{m+1} + \cdots + a_{m-1} b_{n-1} = \vec a_L \cdot \vec b_R$

where we’ve defined $\vec a_L$ as the “left half” of the vector $\vec a$ and $\vec a_R$ the “right half” and analogously for $\vec b$ .

Then the prover computes the following commitments:

$C_L = \vec a_R \cdot \vec g_L + \vec b_L \cdot \vec h_R + z_L q \\ C_R = \vec a_L \cdot \vec g_R + \vec b_R \cdot \vec h_L + z_R q \\$

and send them to the verifier. Then the verifier sends the challenge $x \in \mathbb F_p$ (when using the Fiat-Shamir construction to make this non-interactive, this means that $x$ would be the hash of $C_L$ and $C_R$ ). The prover uses this to compute the updated vectors

$\vec a' = \vec a_L + x \vec a_R \\ \vec b' = \vec b_L + x^{-1} \vec b_R$

which have half the length.

Now the verifier computes the new commitment:

$C' = x C_L + C + x^{-1} C_R$

as well as the updated basis

$\vec g' = \vec g_L + x^{-1} \vec g_R \\ \vec h' = \vec h_L + x \vec h_R$

Now, if the new commitment $C'$ has the property that it is of the form $C' = \vec a' \cdot \vec g' +\vec b' \cdot \vec h' + \vec a' \cdot \vec b' q$ – then the commitment $C$ fulfills the originall claim.

All the vectors have halved in size – so we have achieved something. From here we replace $C:=C'$ , $\vec g := \vec g'$ and $\vec h := \vec h'$ and repeat this step.

I will below go through the maths on why this works, but Vitalik also made a nice visual representation that I recommend to get an intuition.

Final step

When we repeat the step above, we will reduce $n$ by a factor of two each time. At some point, we will encounter $n=1$ . At this point we don’t repeat the step anymore. Instead the prover will send $\vec a$ and $\vec b$ , which in fact are now only a single scalar each. Then the verifier can simply compute

$D = a g + b h + a b q$

and accept the statement if this is indeed equal to $C$ , or reject if it is not.

Correctness and soundness

Above I claimed that if $C'$ has the desired form, then it follows that $C$ also has it. I now want to show why this is the case. In order to do this, we need to look at two things:

Correctness – i.e. given a prover who follows the protocol, can they always convince the verifier that the statement is correct; and
Soundness – i.e. a dishonest prover cannot convince the verifier of an incorrect statement, except with a very small probability.

Let’s start with correctness. This assumes that the prover is doing everything according to the protocol. Since the prover is following the protocol, we know that $C = \vec a \cdot \vec g + \vec b \cdot \vec h + (\vec a \cdot \vec b) q$ with respect to the basis given by $\vec g, \vec h, q$ . We need to show that then $C'= \vec a' \cdot \vec g' +\vec b' \cdot \vec h' + \vec a' \cdot \vec b' q$ .

The verifier computes $C' = x C_L + C + x^{-1} C_R$ .

$C' = x C_L + C + x^{-1} C_R \\ = x ( \vec a_R \cdot \vec g_L + \vec b_L \cdot \vec h_R + z_L q) \\ + \vec a_L \cdot \vec g_L + \vec a_R \cdot \vec g_R + \vec b_L \cdot \vec h_L + \vec b_R \cdot \vec h_R + \vec a \cdot \vec b q \\ + x^{-1} (\vec a_L \cdot \vec g_R + \vec b_R \cdot \vec h_L + z_R q)\\ = (x \vec a_R + \vec a_L)\cdot(\vec g_L + x^{-1} \vec g_R) \\ + (\vec b_L + x^{-1} \vec b_R)\cdot(\vec h_L + x \vec h_R) \\ + (x z_L + \vec a \cdot \vec b + x^{-1} z_R) q \\ = (x \vec a_R + \vec a_L)\cdot \vec g' + (\vec b_L + x^{-1} \vec b_R)\cdot \vec h' + (x z_L + \vec a \cdot \vec b + x^{-1} z_R) q$

So in order for the commitment to have the Inner Product Property, we need to verify that $(x \vec a_R + \vec a_L) \cdot (\vec b_L + x^{-1} \vec b_R) = x z_L + \vec a \cdot \vec b + x^{-1} z_R$ . This is true because

$(x \vec a_R + \vec a_L) \cdot (\vec b_L + x^{-1} \vec b_R) \\ = x \vec a_R \cdot \vec b_L + \vec a_L \cdot \vec b_L + \vec a_R \cdot \vec b_R + x^{-1} \vec a_L \cdot \vec b_R \\ = x z_L + \vec a \cdot \vec b + x^{-1} z_R$

This concludes the proof of correctness. Now in order to prove soundness, we need the property that a prover can’t start with a commitment $C$ that does not fulfill the Inner Product Property and end up with a $C'$ that does by going through the reduction step.

So let’s assume that the prover committed to $C=\vec a \cdot \vec g + \vec b \cdot \vec h + r q$ for some $r \not= \vec a \cdot \vec b$ . If we go through the same process as before, we find

$C' = (x \vec a_R + \vec a_L)\cdot \vec g' + (\vec b_L + x^{-1} \vec b_R)\cdot \vec h' + (x z_L + r + x^{-1} z_R) q$

So now let’s assume that the prover managed to cheat, and thus $C'$ fulfills the Inner Product Property. That means that

$(x \vec a_R + \vec a_L) \cdot (\vec b_L + x^{-1} \vec b_R) = x z_L + r + x^{-1} z_R$

Expanding the left hand side, we get

$x \vec a_R \cdot \vec b_L + \vec a \cdot \vec b + x^{-1} \vec a_L \cdot \vec b_R = x z_L + r + x^{-1} z_R$

Note that the prover can choose $z_L$ and $z_R$ freely, so we cannot assume that they will be according to the above definitions.

Multiplying by $x$ and moving everything to one side we get a quadratic equation in $x$ :

$x^2 ( \vec a_R \cdot \vec b_L - z_L) + x (\vec a \cdot \vec b - r) + (\vec a_L \cdot \vec b_R - z_R )$

Unless all the terms are zero, this equation has at most two solutions $x \in \mathbb F_p$ . But the verifier chooses $x$ after the prover has already committed to their values $r$ , $z_L$ and $z_R$ . The probability that the prover can successfully cheat is thus extremely small; we typically choose the field $\mathbb F_p$ to be of size ca. $2^{256}$ , so the probability that the verifier chooses a value for $x$ such that this equation holds, when the values were not chosen according to the protocol, is vanishingly small.

This concludes the soundness proof.

Only compute basis changes at the end

The verifier has to do two things each round: Compute the challenge $x$ and compute the updates bases $\vec g'$ and $\vec h'$ . However, updating the basis $g$ at every round is inefficient. Instead the verifier can simply keep track of the challenge values $x_1$ , $x_2$ , up to $x_{\ell}$ that they will encounter during the $\ell$ rounds.

Let’s call the basis after round $k$ $\vec g_k, \vec h_k$ . The elements $g_\ell$ and $h_\ell$ are scalars (or vectors of length one) because we end the protocol once our vectors have reached length one. Computing $g_\ell$ from $\vec g_0$ is a multiscalar multiplication (MSM) of length $n$ . The scalar factors for $\vec g_0$ are the coefficients of the polynomial

$f_g(X) = \prod_{i=0}^\ell \left(1+x^{-1}_{\ell-i} X^{2^{i}}\right)$

and the scalar factors for $\vec h_0$ are given by

$f_h(X) = \prod_{i=0}^\ell \left(1+x_{\ell-i} X^{2^{i}}\right)$

Using Inner Product Arguments to evaluate polynomials

For our main application – evaluating a polynomial defined by $f(x) = \sum_{i=1}^{n-1} a_i x^i$ at a point $z$ – we want to make some small additions to this protocol.

Most importantly, we want to know the result $f(z) = \vec a \cdot \vec b$ , and not just that $C$ has the “Inner Product Property”
$\vec b = (1, z, z^2, ..., z^{n-1})$ is known to the verifier. We can thus make things a bit easier by removing it from the commitment

How to construct the commitment

If we want to verify a polynomial evaluation for the polynomial $f(x) = \sum_{i=1}^{n-1} a_i x^i$ , then we are typically working from a commitment $F = \vec a \cdot \vec g$ . The prover would send the verifier the evaluation $y=f(z)$ .

So it seems like the verifier can just compute the initial commitment $C=\vec a \cdot \vec g + \vec b \cdot \vec h + \vec a \cdot \vec b q = F + \vec b \cdot \vec h + f(z) q$ , since they know $\vec b = (1, z, z^2, ..., z^{n-1})$ , and start the protocol.

But not so fast. In most cases, $F$ will be a commitment that is generated by the prover. A malicious prover could cheat by, for example, committing to $F = \vec a \cdot \vec g + tq$ . In this case, they would be able to prove that $f(z) = y - t$ , because they have effectively shifted the result.

To prevent this, we need to do a small change to the protocol. After receiving the commitment $F$ and the evaluation $y$ , the verifier generates a scalar $w$ and rescales the basis $q:=wq$ . Afterwards the protocol can proceed as usual. Because the prover can’t predict what $w$ is going to be, they can’t succeed (except with very small probability) at manipulating the result to be something other then $f(z)$ .

Note that we also need to stop the prover from manipulating the vector $\vec b$ if what we want is a generic inner product – but for a polynomial evaluation, we can simply get rid of that part alltogether so I won’t go into the details.

How to get rid of the second vector

Note that the verifier knows the vector $\vec b = (1, z, z^2, ..., z^{n-1})$ if what we want is to compute a polynomial evaluation. Given the challenges $x_0, x_1, \ldots, x_\ell$ they can simply compute the final result $b_\ell$ using the same technique as demonstrated in “compute the basis change at the end”.

We can thus remove the second vector from all commitments and simply compute $b_\ell$ . This means the verifier has to be able to compute the final version $b_\ell$ from the initial vector $\vec b_0 = (1, z, z^2, ..., z^{n-1})$ . Since the folding process for $\vec b$ is the same as that for the basis vector $\vec g$ , the coefficients of the previously defined polynomial $f_g$ will define the linear combination, in other words $b_\ell=f_g(z)$ .

Creating an IPA for a polynomial in coefficient form

So far, we have used an Inner Product Argument to evaluate a polynomial that is committed to by its coefficients, which are the $f_i$ for a polynomial defined by $f(X) = \sum_{i=0}^{n-1} f_i X^i$ . However, often we want to work with a polynomial that is defined using its evaluations on a domain $x_0, x_1, \ldots, x_{n-1}$ . Since any polynomial of degree less than $n-1$ is uniquely defined by the evaluations $f(x_0), f(x_1), \ldots, f(x_{n-1})$ these two are completely equivalent. However, transforming between the two can be computationally expensive: it costs $O(n \log n)$ operations if the domain admits an efficient Fast Fourier Transform, and otherwise it’s $O(n^2)$ .

To avoid this cost, we try to simply never change to coefficient form. This can be done by changing the commitment to $f$ by committing to the evaluations instead of the coefficients:

$C = f(x_0) g_0 + f(x_1) g_1 + \cdots + f(x_{n-1}) g_{n-1}$

This means that our $\vec a$ vector in the IPA is now given by $\vec a = (f(x_0), f(x_1), \ldots, f(x_{n-1}))$

The barycentric formula allows us now to compute an IPA to evaluate a polynomial using this new commitment. It says that

$f(z) = A(z)\sum_{i=0}^{n-1} \frac{f(x_i)}{A'(x_i)} \frac{1}{z-x_i}$

If we choose the vector $\vec b$ to be

$b_i = \frac{A(z)}{A'(x_i)} \frac{1}{z-x_i}$

we get that $\vec a \cdot \vec b = f(z)$ , and thus an IPA with this vector can be used to prove the evaluation of a polynomial which is itself in evaluation form. Other than this, the strategy is exactly the same.

Bootle, Cerulli, Chaidos, Groth, Petit: Efficient Zero-Knowledge Arguments forArithmetic Circuits in the Discrete Log Setting ↩
Bowe, Grigg, Hopwood: Recursive Proof Composition without a Trusted setup ↩

Dankrad Feist

SUMCHECK Quickie

The SUMCHECK protocol

Why does it work

The Schwartz-Zippel Lemma

SUMCHECK protocol proof

RAI – one of the coolest experiments in crypto

RAI – one of the coolest experiments in crypto

Why is RAI floating, and not tracking one currency?

The stablecoin problem

Collateralized debt positions

How RAI balances supply and demand

How does RAI determine the redemption rate

But what pulls RAI back to the redemption price

How does tracking another currency change RAI?

Why do I think RAI is such a cool experiment?

Ethereum Merge: Run the majority client at your own peril!

Why do we need multiple clients?

Incentivising client diversity: anti-correlation penalties

The other anti-correlation penalty: The quadratic inactivity leak

How bad is it to run the majority client?

Scenario 1: Double signing

Scenario 2: Mass offline event

Scenario 3: Invalid block

Risk analysis

What are my options, if I currently run the majority client and I’m worried about switching?

How about the execution clients?

Appendix

A1: Why is there no slashing for invalid blocks?

A2: Why can’t the buggy client switch to chain B once it has finalized chain A?

Exponential EIP-1559

Exponential EIP-1559 explainer

Linear EIP-1559 mechanics (“original version”)

Exponential EIP-1559

Analyzing the exponential form

Graphical illustration

Appendix: Using differential equations to derive the exponential form

内积证明

介绍

Pedersen 承诺

内积证明

证明

证明描述

规约步骤

最终步骤

正确性(correctness) 及合理性(soundness)

仅在最后计算基变化

使用内积证明来验证多项式值

如何构造承诺

如何去掉第二个向量

针对点值形式多项式的IPA

KZG多项式承诺

简介

预备知识

默克尔树对比

椭圆曲线以及配对

可信设置

卡特承诺

多项式相乘

卡特证明

多重证明

将卡特作为矢量承诺来使用

延伸阅读

Proofs of Custody

Lazy validators – the game theory

But I would never do this! Who would cheat like that?

A proof of custody and an update to the game theory

New game theory

Proof of custody for execution

With fraud proofs, do we still need the proof of custody for execution?

How is it different from data availability checks?

Just because it has a fixed supply doesn’t make it a good store of value

What we should really build is productive assets and stablecoins

Why “store of value” does not exist

What we need – productive assets and stablecoins

So if the current system is so great why do we even need cryptocurrencies?

On supply and demand for stablecoins

How to create a stablecoin

Why we need to add interest rates to this

An analogy to fiat currencies