Simple syntax extensions to shorten grammars #545

reverofevil · 2017-12-01T03:38:53Z

While PEG.js is already a great tool for parsing, it requires a lot of boilerplate inside of the actions. As two cases constitute most of the time we need actions, some better syntax probably should be considered.

AST node creation

start  = _ expr:expr1 { return expr; }

expr1  = left:expr2 type:[+-] _ right:expr1 { return {type, left, right}; }
       / expr2

expr2  = left:expr3 type:[*/] _ right:expr2 { return {type, left, right}; }
       / expr3

expr3  = "(" _ expr:expr1 ")" _ { return expr; }
       / intlit

intlit = value:$[0-9]+ _ { return {type: 'int', value: parseInt(value, 10)}; }

_      = [ \t\n\r]*

(Right associative operators used for simplicity.) In this example all the

{ return {type, left, right}; }

are extraneous. When there is a sequence of named expressions without an action to use those names, it would be useful to imply object creation.

The only case when it's not enough while creating AST is when we need to put type: 'smth' into the generated object. While

smth = type:{ return "smth"; } this:this that:that

is totally an option, some syntactic sugar might be useful too. Here's an example, assuming that type: is commonly used as a name of node type tag (type is a bad name though, because it's useful for typed languages. kind would be better, but it's rare), and that users won't create AST nodes in arguments of / choice operator.

smth := this:this that:that

Cherry-picking

Actions like

{ return expr; }

happen a lot, because every lexeme in a programming language should be forwarded with something like _. It's tolerable when it happens on top level, but things like

exprX = (a:[+-] _ { return a; })* exprY

are really annoying. It would be nice to have some better syntax like

exprX = (@[+-] _)* exprY

where @ sign means "leave this as the only scalar result". No more than one expression in a sequence should be modified with @ prefix operator, and it shouldn't be used in same sequence with named expressions.

Alternatively : sign can be used instead to conserve @ for some future use, but it requires making current name:expr syntax whitespace-sensitive, and that's compatibility issue.

The text was updated successfully, but these errors were encountered:

rafaelclp · 2017-12-19T22:42:23Z

+1 for the @ sign. After having written a few dozens of rules, needing to add { return a; } to simple rules just because of the _ has already become quite annoying and makes the grammar harder to read.

Also: #235

reverofevil · 2017-12-20T05:08:25Z

@rafaelclp Thanks for the link! I probably caught the idea there back in 2014 when I've made my own PEG parser generator. (Well, there is no better explanation why I've chosen the same @ sign.)

futagoza · 2018-01-22T14:16:44Z

@polkovnikov-ph Since I'm planning to use @ with annotations and import statements, I was planning to also use it for this feature, but after a while, I've started thinking, wouldn't it be confusing to use the same symbol (@) for all these? Since I plan to use % for external rule calls (see stage 2 of #523), the next best symbol to use that also fits into the current grammar nicely is # or ::

exprX = (#[+-] _)* exprY
exprY = ::expression ![+-]

What do you think?

reverofevil · 2018-01-22T14:51:26Z

@futagoza :: looks more consistent. Even : would work. Since library is in pre-release version, it's still possible to do this non backwards compatible change, so that people don't have to double tap on :. I don't think there is a lot of places where people wrote e :expression anyway.

futagoza · 2018-01-22T15:06:53Z

I chose :: over : for one reason only, to not confuse the two. It's not about backwards compatibility.

Mingun · 2018-01-22T15:11:29Z

Just a note: this problem rises up in #11 and #427. So I think that some of there issues must be closed as duplicates.

reverofevil · 2018-01-22T15:27:37Z

@futagoza I'm fine with pretty much any way to do this. Specific character doesn't matter. It will never be worse than having to write { return ; } :)

reverofevil · 2018-01-22T15:30:54Z

@Mingun I think this (#545) issue should be closed, because I read all the issues in 2013 and compiled them into another project. There's a lot of good comments under those issues.

futagoza · 2018-01-22T15:38:18Z

#235 and #427 are the same as this, but #11 is regarding another matter

Edit: Added note to OP's comment on #235 that references this issue

Resolves #235, #427, #545

reverofevil mentioned this issue Jan 22, 2018

Automatically return match actions that are a single expression #557

Closed

1 task

futagoza closed this as completed Jan 22, 2018

krisnye mentioned this issue Jan 22, 2018

Provide a concise way to indicate a single return value from a sequence #235

Closed

futagoza added a commit that referenced this issue Sep 17, 2018

Implement value plucking

460f0cc

Resolves #235, #427, #545

Mingun referenced this issue Oct 31, 2018

Updated changelog for upcoming 0.11

87dcc13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simple syntax extensions to shorten grammars #545

Simple syntax extensions to shorten grammars #545

reverofevil commented Dec 1, 2017 •

edited

rafaelclp commented Dec 19, 2017 •

edited

reverofevil commented Dec 20, 2017

futagoza commented Jan 22, 2018

reverofevil commented Jan 22, 2018 •

edited

futagoza commented Jan 22, 2018

Mingun commented Jan 22, 2018

reverofevil commented Jan 22, 2018

reverofevil commented Jan 22, 2018

futagoza commented Jan 22, 2018 •

edited

Simple syntax extensions to shorten grammars #545

Simple syntax extensions to shorten grammars #545

Comments

reverofevil commented Dec 1, 2017 • edited

AST node creation

Cherry-picking

rafaelclp commented Dec 19, 2017 • edited

reverofevil commented Dec 20, 2017

futagoza commented Jan 22, 2018

reverofevil commented Jan 22, 2018 • edited

futagoza commented Jan 22, 2018

Mingun commented Jan 22, 2018

reverofevil commented Jan 22, 2018

reverofevil commented Jan 22, 2018

futagoza commented Jan 22, 2018 • edited

reverofevil commented Dec 1, 2017 •

edited

rafaelclp commented Dec 19, 2017 •

edited

reverofevil commented Jan 22, 2018 •

edited

futagoza commented Jan 22, 2018 •

edited