Implement Express Lane Timeboost #2561

rauljordan · 2024-08-08T16:22:16Z

Background

At the time of writing, the Arbitrum sequencer is centralized and offers a first-come, first-serve transaction ordering policy. Txs have a current delay of approximately 250ms, which is the time the sequencer takes to produce an ordered list of txs to emit in the form of an L2 block. The current policy does not handle MEV that occurs naturally on L2, and leads to latency races offline to get faster access to the sequencer ingress server.

A new policy has been proposed, known as Express Lane Timeboost, which allows participants to bid for the rights of priority sequencing using their funds instead of hardware. In “rounds” that start at each minute mark, participants can submits bids to participate in a sealed, second-price auction for control of the next round’s “express lane”. During a round, all non-express lane txs get their first arrival timestamp delayed by some amount of time (250ms), while the express lane controller does not. The express lane controller can also choose to transfer their rights in a round.

The sequencer itself does not need to manage auctions, but simply needs to know the current round number and the address of the express lane controller for that round. From there, it can delay non-express lane txs by a nominal amount required by the protocol and validate that a tx should go through the express lane.

This PR contains the complete implementation of the system with all its components. The smart contract changes are contained within OffchainLabs/nitro-contracts/tree/express-lane-auction-all-merged.

Basic Readings

To read more about timeboost, see the AIP, the research specification, and design doc although the design doc is not fully updated yet.

Reviewing

Recommend to look at the basic readings, then look at system_tests/timeboost_test.go to understand how it all fits together. Then, look at bid validator and auctioneer. Finally, the sequencer changes.

Features

Bidder client that allows participants to join the auction and submit bids to a bid validator
Bid validator that receives bids over the internet, validates them, and inserts validated items into Redis stream
Auctioneer server that consumes validated bids from Redis stream.
Auctioneer at the 45 second mark, submits the top two bids to a privileged sequencer endpoint
Ability to persist validated bids to a local DB (sqlite) in the auctioneer server
System tests are added that assert express lane txs have an advantage in the emitted sequencer feed

Sequencer Changes

The changes to the sequencer hot path are quite simple. In a nutshell, if a transaction is received, it checks the following:
If timeboost is enabled AND there is an express lane controller set AND it is not coming from the express lane, it delays the tx's first arrival timestamp by some amount (250ms).

To determine if a transaction is a valid express lane tx, the sequencer runs a background thread called the expressLaneService, which is scraping events from the ExpressLaneAuction.sol smart contract. Express lane transactions arrive via a different sequencer endpoint than the normal one, called timeboost_sendExpressLaneTransaction. The message looks as follows:

{
  "type": "object",
  "properties": {
    "chainId": {
      "type": "bigInt",
      "description": "chain id of the target chain"
    },
    "round": {
      "type": "uint64",
      "description": "round number (0-indexed) for the round the bidder wants to become the controller of"
    },
    "auctionContractAddress": {
      "type": "address",
      "description": "hex string of the auction contract address that the bid corresponds to"
    },
    "sequenceNumber": {
      "type": "uint64",
      "description": "the per-round nonce of express lane submissions. Each submission to the express lane during a round increases this sequence number by one, and if submissions are received out of order, the sequencer will queue them for processing in order. This is reset to 0 at each round"
    },
    "transaction": {
      "type": "bytes",
      "description": "hex string of the RLP encoded transaction payload that submitter wishes to be sequenced through the express lane"
    },
    "options": {
      "type": "ArbitrumConditionalOptions",
      "description": "conditional options for Arbitrum transactions, supported by normal sequencer endpoint https:/OffchainLabs/go-ethereum/blob/48de2030c7a6fa8689bc0a0212ebca2a0c73e3ad/arbitrum_types/txoptions.go#L71"
    },
    "signature": {
      "type": "bytes",
      "description": "Ethereum signature over the bytes encoding of (keccak256(TIMEBOOST_BID), padTo32Bytes(chainId), auctionContractAddress, uint64ToBytes(round), uint64ToBytes(sequenceNumber), transaction)"
    }
  },
}

The submission itself contains a tx payload, which MAY not be from the express lane controller. As long as the submission is signed by the controller, that is sufficient. Submissions have a specific nonce, called a sequence, to ensure that submissions are processed in order. This is different from the inner nonce of the payload tx. The sequencer keeps a queue of submissions and ensures it processes them in order. That is, if a submission N is received before N-1, it will get queued for submission once N arrives.

Bid Validator Architecture

Bids are limited to 5 bids per sender, but there are no limits to the number of bidders in a single round. To alleviate potential scaling concerns, we adopt a simple architecture of separating the bid validators from the auctioneer. The bid validators filter out invalid items and publish validated results to a Redis stream. In a simplified diagram, here's what it will look like:

Dependencies Added

github.com/golang-jwt/jwt/v4 for the authenticated endpoint from the auctioneer to the sequencer
github.com/stretchr/testify for testing utilities (will probably have to remove)
github.com/mattn/go-sqlite3 for the bids DB
github.com/jmoiron/sqlx for the bids DB
github.com/DATA-DOG/go-sqlmock for testing the bids DB

Notes

There are several parts of this implementation that are likely not ideal:

Chicken and the egg problem in sequencer
Cannot start sequencer without express lane, but cannot deploy auction for express lane without starting sequencer. To solve this in tests, we have a separate func called StartExpressLaneService in the sequencer. In prod, we don’t have this issue because we can deploy the contracts before we upgrade the sequencer to timeboost, but what to do about tests?

Janky prioritizing of auction resolution txs
The sequencer exposes an authenticated endpoint auctioneer_submitAuctionResolutionTransaction over the JWT Auth RPC for the auctioneer to use. When the auctioneer is ready to resolve an auction, it submits a tx to this endpoint, which the sequencer verifies for integrity. Then, the sequencer does the following:

log.Info("Prioritizing auction resolution transaction from auctioneer", "txHash", tx.Hash().Hex())
s.timeboostAuctionResolutionTx = tx
s.createBlock(ctx)

it immediately tries to put the item in the queue and create block. It also sets the tx as a property of the sequencer struct, and in the createBlock func, if this field is not nil, it gets put at the top of the queue. This is a bit janky in how it works and perhaps inefficient. Is there another way to prioritize a tx in the sequencer?

Sequencer opens an http connection to itself
The sequencer has a thread called expressLaneService which reads events from the auction smart contracts on L2 to determine express lane controllers. Because the sequencer does not have filtersystem API access, we instead open an RPC client against itself so we can create an ethclient to read logs and data from onchain. This doesn't seem ideal

References

Updated auctioneer with research spec

…o into express-lane-timeboost

rauljordan · 2024-08-20T21:10:18Z

Update: implemented the functionality of using a logs subscription from the blockchain struct in the sequencer instead of an ethclient to read information about the express lane auction contract here https:/OffchainLabs/nitro/compare/express-lane-timeboost...sub-logs-express-lane-timeboost?expand=1.
However, the logs are not received over the channel for some reason when running the system test under timeboost_test.go

terencechain · 2024-08-20T20:46:21Z

cmd/autonomous-auctioneer/config.go

+func (c *AutonomousAuctioneerConfig) Validate() error {
+ return nil
+}


What's the intention for this? Is it a todo or should it be removed?

terencechain · 2024-08-20T20:54:30Z

timeboost/types.go

+ Round uint64
+ AuctionContractAddress common.Address
+ Transaction *types.Transaction
+ Options *arbitrum_types.ConditionalOptions `json:"options"`


What is the use case for having options?

terencechain · 2024-08-20T21:27:20Z

execution/gethexec/express_lane_service.go

+ select {
+ case <-ctx.Done():
+ return
+ case <-time.After(time.Millisecond * 250):


I wonder if time.After(time.Millisecond * 250) is sufficient and whether it’s better to stream new blocks directly from the tx feed. Otherwise, there could be a race condition where, at 249.9ms, we check and find that the latest block number hasn’t changed, forcing us to wait another 250ms.

terencechain · 2024-08-20T21:46:09Z

execution/gethexec/express_lane_service.go

+ for {
+ // Get the next message in the sequence.
+ nextMsg, exists := es.messagesBySequenceNumber[control.sequence]
+ if !exists {
+ break
+ }
+ if err := publishTxFn(
+ ctx,
+ nextMsg.Transaction,
+ msg.Options,
+ false, /* no delay, as it should go through express lane */
+ ); err != nil {
+ // If the tx failed, clear it from the sequence map.
+ delete(es.messagesBySequenceNumber, msg.Sequence)
+ return err
+ }
+ // Increase the global round sequence number.
+ control.sequence += 1
+ }


Minor because we break, but we might want to check for context timeout here, and the caller of sequenceExpressLaneSubmission should set a deadline for when the round ends.

execution/gethexec/sequencer.go

Tristan-Wilson

Submitting comments so far

Tristan-Wilson · 2024-08-26T18:55:00Z

execution/gethexec/express_lane_service.go

+ es.LaunchThread(func(ctx context.Context) {
+ log.Info("Watching for new express lane rounds")
+ now := time.Now()
+ waitTime := es.roundDuration - time.Duration(now.Second())*time.Second - time.Duration(now.Nanosecond())


Round duration is configurable, but here and other places we're building in two assumptions:

round duration is a minute

that rounds start exactly on the minute mark.

Maybe it's fine for initial launch with the minute length rounds, but we should acknowledge all the places where we're making these assumptions and would need to change if the round length was to change.

Tristan-Wilson · 2024-08-26T20:06:11Z

execution/gethexec/express_lane_service.go

+ )
+ es.Lock()
+ // Reset the sequence numbers map for the new round.
+ es.messagesBySequenceNumber = make(map[uint64]*timeboost.ExpressLaneSubmission)


Are there any meaningful races between the CurrentRound and messagesBySequenceNumber being reset when the timer ticks, which will necessarily lag CurrentRound? The worst I'm seeing is potentially lost messages when they are submitted at the changeover time.

Ya there is a race here. I think it is safer to use two queues and rotate

execution/gethexec/express_lane_service.go

Tristan's feedback

Tristan-Wilson

I've finished reviewing most of the code except the tests and will go through the tests tomorrow.

Tristan-Wilson · 2024-08-26T23:54:20Z

execution/gethexec/sequencer.go

+ options: nil,
+ resultChan: make(chan error, 1),
+ returnedResult: &atomic.Bool{},
+ ctx: context.TODO(),


TODO context

timeboost/bid_validator.go

execution/gethexec/sequencer.go

terencechain · 2024-08-21T13:42:02Z

execution/gethexec/tx_pre_checker.go

+ if err != nil {
+ return err
+ }
+ err = PreCheckTx(c.bc, c.bc.Config(), block, statedb, arbos, msg.Transaction, msg.Options, c.config())


I think it makes more sense to validateExpressLaneTx and then PreCheckTx. It's a cheaper dos protection but I can see how it would be hard to refactor

execution/gethexec/sequencer.go

execution/gethexec/express_lane_service.go

terencechain · 2024-08-29T16:26:38Z

execution/gethexec/express_lane_service.go

+ )
+ es.Lock()
+ // Reset the sequence numbers map for the new round.
+ es.messagesBySequenceNumber = make(map[uint64]*timeboost.ExpressLaneSubmission)


Ya there is a race here. I think it is safer to use two queues and rotate

execution/gethexec/express_lane_service.go

execution/gethexec/express_lane_service_test.go

execution/gethexec/express_lane_service.go

Tristan-Wilson · 2024-08-29T19:05:40Z

execution/gethexec/express_lane_service.go

+ initialTimestamp := time.Unix(int64(roundTimingInfo.OffsetTimestamp), 0)
+ roundDuration := time.Duration(roundTimingInfo.RoundDurationSeconds) * time.Second
+ auctionClosingDuration := time.Duration(roundTimingInfo.AuctionClosingSeconds) * time.Second


Maybe we can add an assertion here that the roundTimingInfo complies with the assumptions we've got baked into the code currently (eg duration == 1 minute, offset is a time at a minute boundary, closing duration is at least 2s but probably just assert it's 15s)

roundTimingInfo is retrieved from the smart contract, which is the source of truth we should adhere to. Unless we want to add another configuration to check against this single source of truth, I generally prefer less configuration, as it's less error-prone, but I'm open to changing if that's what we prefer!

Agreed that we should rely on the contract here as the source of truth

My point is that the code is broken if it's anything other than 1 minute. The contract is the source of truth so we should check that the settings on the contract matches the assumptions we're currently making in the code, and assert if they are violated because otherwise the code will behave in unexpected ways.

Tristan-Wilson · 2024-08-29T19:16:18Z

system_tests/express_lane_timeboost_test.go

+ ctx, cancel := context.WithCancel(context.Background())
+ defer cancel()
+ redisURL := redisutil.CreateTestRedis(ctx, t)
+ _ = redisURL


This test needs to be filled out.

system_tests/timeboost_test.go

timeboost/bidder_client.go

autonomous-auctioneer on the cli was failing to start becuase we were adding the "auth" config options without having the corresponding field on the AuctioneerConfig. We can add it back in later if needed.

RPC methods can't be registered after the stack is started.

tsahee

initial review of code inside gethexec

tsahee · 2024-10-14T18:48:34Z

execution/gethexec/sequencer.go

@@ -430,6 +481,12 @@ func (s *Sequencer) PublishTransaction(parentCtx context.Context, tx *types.Tran
 return err
 }

+ if s.config().Timeboost.Enable && s.expressLaneService != nil {
+ if delay && s.expressLaneService.currentRoundHasController() {


that tells me "delay" is a misleading name - because it's actually delay only if current round has controller.
Maybe an inverted "fastLane" boolean, or something else?

tsahee · 2024-10-14T18:55:52Z

execution/gethexec/sequencer.go

+ if err := s.expressLaneService.validateExpressLaneTx(msg); err != nil {
+ return err
+ }
+ return s.expressLaneService.sequenceExpressLaneSubmission(ctx, msg, s.publishTransactionImpl)


I really don't like passing member functions, especially private ones, as function pointers if there is a reasonable alternative. Can't the expressLaneService get a pointer to sequencer instead?

tsahee · 2024-10-14T22:19:25Z

execution/gethexec/arb_interface.go

@@ -41,6 +44,18 @@ func (a *ArbInterface) PublishTransaction(ctx context.Context, tx *types.Transac
 return a.txPublisher.PublishTransaction(ctx, tx, options)
 }

+func (a *ArbInterface) PublishExpressLaneTransaction(ctx context.Context, msg *timeboost.JsonExpressLaneSubmission) error {


I really don't like having ArbInterface functions that submit transactions to the chain. Interaction with ArbInterface is via eth_call which makes this quite unexpected.

tsahee · 2024-10-14T22:42:57Z

execution/gethexec/express_lane_service.go

+ return err
+ }
+ // Increase the global round sequence number.
+ control.sequence += 1


so if publishTxnFn fails, control never advances and anything on the timeboost queue is waiting until publisher notices that and replaces the message with that sequence number?
I'm not sure this is how we want to go. Will be problematic, especially if the winner allows transactions from multiple sources.

tsahee · 2024-10-14T22:44:03Z

execution/gethexec/express_lane_service.go

+ return timeboost.ErrNoOnchainController
+ }
+ // Check if the submission nonce is too low.
+ if msg.Sequence < control.sequence {


msg.Sequence is mandatory?
Do't we want to allow sending e.g. with sequence number 0 to be processed FCFS?

tsahee · 2024-10-14T22:44:45Z

execution/gethexec/express_lane_service.go

+ }
+ // Log an informational warning if the message's sequence number is in the future.
+ if msg.Sequence > control.sequence {
+ log.Warn("Received express lane submission with future sequence number", "sequence", msg.Sequence)


Warn seems too harsh. This will happen regularly due to network reordering.
Log.Info or less

tsahee · 2024-10-14T22:47:25Z

execution/gethexec/express_lane_service.go

+ }
+ }
+ })
+ es.LaunchThread(func(ctx context.Context) {


use CCallIteratively to do something every 250mil or until context is cancelled, and move logic to a separate function

tsahee · 2024-10-14T23:09:55Z

execution/gethexec/express_lane_service.go

+ "round", it.Event.Round,
+ "controller", it.Event.FirstPriceExpressLaneController,
+ )
+ es.Lock()


I think locking deserves some overview.
Would suggest trying to split between a lock used while processing incoming transaction in sequenceExpressLaneSubmission (which writes the message sequence and only reads round info of current round) vs lock used while while processing new round info (which doesn't care about incoming transactions and writes - hopefully only next-round). If needed, incoming-Tx may hold biefly the round lock just to check current round info

tsahee · 2024-10-14T23:12:51Z

execution/gethexec/express_lane_service.go

+ return timeboost.ErrNoOnchainController
+ }
+ currentRound := timeboost.CurrentRound(es.initialTimestamp, es.roundDuration)
+ if msg.Round != currentRound {


I think we would want some buffering for next-round messages, to make sure winner can use their entire slot.

tsahee · 2024-10-14T23:34:09Z

execution/gethexec/sequencer.go

+ if !s.config().Timeboost.Enable {
+ log.Crit("Timeboost is not enabled, but StartExpressLane was called")
+ }
+ rpcClient, err := rpc.DialContext(ctx, s.config().Timeboost.SequencerHTTPEndpoint)


I really don't like this.
expresslaneservice is inside execution and has direct access to the blockchain, no reason to go through network.

rauljordan and others added 30 commits July 2, 2024 12:19

added in timeboost items

9925308

add in

4c2c74f

system test

d0ed9bf

Rename auction master to autonomous auctioneer

4ff7410

Sequence express lane transactions follow spec

83e8e00

Merge branch 'rename-auctioneer' into express-lane-timeboost

4b873fa

include master

0a7ba8c

update contracts repo and bindings

81a237b

building once more

0b7819b

builds

b829d9b

autonomous auctioneer bin

94c5653

receive bid test passing

676b89e

edits and fix sig

d7a91a8

passing test

c5fad5c

add to seq test

f54ac50

system test

4b7a910

Updated auctioneer with research spec

9681909

Merge pull request #2521 from OffchainLabs/auctioneer

fc81996

Updated auctioneer with research spec

begin adding sequencer endpoint

2fd3327

Clean up auctioneer part 1

dcb5525

Use init for domain value

32e5ffd

Tie break bids and unit tests

6fd97f4

Test resolve bids

d54f89b

begin adding in endpoints

c3d8c01

Merge branch 'express-lane-timeboost' of github.com:OffchainLabs/nitr…

08cc4f5

…o into express-lane-timeboost

Merge branch 'express-lane-timeboost' into timeboost-endpoints

ea852cd

validate express lane tx submission in sequencer

02bdf45

express lane client send transaction

d7eb164

adding in and building

4ad6fcb

auctioneer binary config

ca24d8b

rauljordan and others added 7 commits August 19, 2024 21:55

edits

f75f242

commit

ffd0766

rem redundant sig verify

a5c8777

Merge branch 'master' into express-lane-timeboost

9d53ff7

Fix tie breaker and better retry

196bdc7

edit

e61c3b2

Merge branch 'master' into express-lane-timeboost

44c8179

terencechain reviewed Aug 20, 2024

View reviewed changes

terencechain added 5 commits August 20, 2024 14:57

Fix duplicated word

981bc6a

Add copyright to express lane service

8db0bd0

Better popped the auction resolution tx log

ec185f5

Fix auction-contract-address doesnt match the field SequencerEndpoint

d15e0c5

Remove unused conversions

f28896d

leeederek reviewed Aug 23, 2024

View reviewed changes

execution/gethexec/sequencer.go Outdated Show resolved Hide resolved

terencechain added 2 commits August 23, 2024 08:36

Fix express lane advantage to 200ms

3eec686

Update reserve price

98326dc

Tristan-Wilson reviewed Aug 26, 2024

View reviewed changes

Filter transfer log

3632c66

Tristan's feedback

Tristan-Wilson reviewed Aug 29, 2024

View reviewed changes

Tristan's feedback

8122a49

terencechain reviewed Aug 29, 2024

View reviewed changes

Tristan-Wilson requested changes Aug 29, 2024

View reviewed changes

terencechain and others added 6 commits September 4, 2024 18:55

Tristan's feedback

5c03cf0

Fix autonomous-auctioner cli startup

fa457b1

autonomous-auctioneer on the cli was failing to start becuase we were adding the "auth" config options without having the corresponding field on the AuctioneerConfig. We can add it back in later if needed.

Start rpc stack after creating bid validator

0ff4f7d

RPC methods can't be registered after the stack is started.

Plumbing to be able to start timeboost in nitro

7a2eb14

Fix various linter and compilation issues

d72fd38

Fix cyclic dependency in test

2cd1ed0

tsahee requested changes Oct 14, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Express Lane Timeboost #2561

Implement Express Lane Timeboost #2561

rauljordan commented Aug 8, 2024 •

edited

Loading

rauljordan commented Aug 20, 2024

terencechain Aug 20, 2024

terencechain Aug 20, 2024

terencechain Aug 20, 2024

terencechain Aug 20, 2024

Tristan-Wilson left a comment

Tristan-Wilson Aug 26, 2024

Tristan-Wilson Aug 26, 2024

terencechain Aug 29, 2024

Tristan-Wilson left a comment

Tristan-Wilson Aug 26, 2024

terencechain Aug 21, 2024

terencechain Aug 29, 2024

Tristan-Wilson Aug 29, 2024

terencechain Sep 5, 2024

rauljordan Sep 5, 2024

Tristan-Wilson Sep 9, 2024

Tristan-Wilson Aug 29, 2024

tsahee left a comment

tsahee Oct 14, 2024

tsahee Oct 14, 2024

tsahee Oct 14, 2024

tsahee Oct 14, 2024

tsahee Oct 14, 2024

tsahee Oct 14, 2024

tsahee Oct 14, 2024

tsahee Oct 14, 2024

tsahee Oct 14, 2024

tsahee Oct 14, 2024

Implement Express Lane Timeboost #2561

Are you sure you want to change the base?

Implement Express Lane Timeboost #2561

Conversation

rauljordan commented Aug 8, 2024 • edited Loading

Background

Basic Readings

Reviewing

Features

Sequencer Changes

Bid Validator Architecture

Dependencies Added

Notes

References

rauljordan commented Aug 20, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Tristan-Wilson left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Tristan-Wilson left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tsahee left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rauljordan commented Aug 8, 2024 •

edited

Loading