Interfaces: More flexible abstract syntax binding instead of `define`? #267

andreasabel · 2019-11-13T10:18:21Z

This is an issue to contemplate a more flexible handling of LBNF rules in the backend. Currently there is a define pragma that allows rule names to be first-order functions over other rule names. However, this could be obsolete if we produced only an interface for abstract syntax (with a standard implementation) that could be implemented in different ways.
The interface would defined methods to construct abstract syntax and a view on abstract syntax for the pretty printer.

For example, consider the (ambiguous) LBNF grammar:

EInt.   Exp ::= Integer ;
EMinus. Exp ::= Exp "-" Exp ;
EStm.   Exp ::= "{" Stm ";" Exp "}" ;

SExp.   Stm ::= Exp ";" ;
SWhile. Stm ::= "while" "(" Exp ")" Stm ;

In Haskell, this could be represented by the following abstract syntax:

{-# LANGUAGE LambdaCase #-}
{-# LANGUAGE MultiParamTypeClasses #-}
{-# LANGUAGE FunctionalDependencies #-}

-- * Model class

class Exp s e | e -> s where  -- FunctionalDependency needed
  eInt   :: Integer -> e
  eMinus :: e -> e -> e
  eStm   :: s -> e -> e

class Stm s e where -- | s -> e where  -- FunctionalDependency not needed
  sExp   :: e -> s
  sWhile :: e -> s -> s

-- ** View

class ViewExp s e where
  viewE  :: e -> ExpView s e

class ViewStm s e where
  viewS  :: s -> StmView s e

data ExpView s e
  = EInt Integer
  | EMinus e e
  | EStm s e

data StmView s e
  = SExp e
  | SWhile e s

-- * Initial syntax

data IExp
  = IEInt Integer
  | IEMinus IExp IExp
  | IEStm IStm IExp

data IStm
  = ISExp IExp
  | ISWhile IExp IStm

-- ** Constructors

instance Exp IStm IExp where
  eInt   = IEInt
  eMinus = IEMinus
  eStm   = IEStm

instance Stm IStm IExp where
  sExp   = ISExp
  sWhile = ISWhile

-- ** Views

instance ViewExp IStm IExp where
  viewE = \case
    IEInt i      -> EInt i
    IEMinus e e' -> EMinus e e'
    IEStm s e    -> EStm s e

instance ViewStm IStm IExp where
  viewS = \case
    ISExp e     -> SExp e
    ISWhile e s -> SWhile e s

One could also make an instance that directly evaluates programs:

instance Exp () Integer where
  eInt   = id
  eMinus = (-)
  eStm _ = id

instance Stm () Integer where
  sExp _ = ()
  sWhile _ _ = ()

Basically, this is generating a standard interface to what are semantic actions of conventional parser generators.

Advantages:

subsumes define (see also Some backends do not implement the define pragma #266)
subsumes the undocumented views feature
allows to plug a custom implementation of parts of the abstract syntax, like the booleans of the host language for a type of booleans (see Bool production #231), or a different implementation of lists, etc.
allows any semantic actions in the parser (would need a monadic interface for effectful actions in Haskell)

Issue:

semantic actions are physically separated from grammar (in other file)

This can be an issue, but also an advantage, since the grammar can be studied by itself without being interleaved with the semantic actions. This is in the spirit of BNFC.

Disadvantage:

If one uses a custom implementation of AST and changes the grammar, one has to replay the changes on ones custom AST implementation. This is not an issue with define.

The text was updated successfully, but these errors were encountered:

ScottFreeCode · 2021-10-19T12:14:26Z

I love when one solution addresses many needs!

Would this be an alternative to the GADT and functor implementations? Or would those become alternative default implementations under this design?

I presume this design will allow for position information, is that correct?

In terms of ease of use (edit: and/or error handling, printing, etc.) how do you think it compares with, say, using the current design and splitting the interpreter/compiler up into two parts:

A function that converts from AST to desired data structure/types.
Another that converts from that structure to…
- (compiler) generated code, or
- (interpreter) function taking input and producing output or actions

andreasabel added the enhancement label Nov 13, 2019

andreasabel mentioned this issue Oct 29, 2020

Bool production #231

Open

andreasabel added the AST Concerning the generated abstract syntax label Jul 13, 2021

andreasabel mentioned this issue Jul 13, 2021

The nonempty macro should result in the Haskell backend using the NonEmpty type? #371

Open

This was referenced Oct 19, 2021

How to get Text String #393

Open

Parse quoted token like String #392

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Interfaces: More flexible abstract syntax binding instead of `define`? #267

Interfaces: More flexible abstract syntax binding instead of `define`? #267

andreasabel commented Nov 13, 2019 •

edited

Loading

ScottFreeCode commented Oct 19, 2021 •

edited

Loading

Interfaces: More flexible abstract syntax binding instead of define? #267

Interfaces: More flexible abstract syntax binding instead of define? #267

Comments

andreasabel commented Nov 13, 2019 • edited Loading

ScottFreeCode commented Oct 19, 2021 • edited Loading

Interfaces: More flexible abstract syntax binding instead of `define`? #267

Interfaces: More flexible abstract syntax binding instead of `define`? #267

andreasabel commented Nov 13, 2019 •

edited

Loading

ScottFreeCode commented Oct 19, 2021 •

edited

Loading