Maximum call stack size exceeded when using many optional groups #775

dhuebner · 2022-11-23T13:17:09Z

Langium version: current master

When many optional groups are used, collectInferredTypes (inferred-types.ts, line 184) creates a large number of distinct types. Currently the count appears to grow as 2ⁿ, where n is the number of optional groups.

With Node v14 and 17 optional groups, a "RangeError" is thrown. Below is an example grammar for testing:

grammar Test

entry Model:
    (title1=INT ';')?
    (title2=INT ';')?
    (title3=INT ';')?
    (title4=INT ';')?
    (title5=INT ';')?
    (title6=INT ';')?
    (title7=INT ';')?
    (title8=INT ';')?
    (title9=INT ';')?
    (title10=INT ';')?
    (title11=INT ';')?
    (title12=INT ';')?
    (title13=INT ';')?
    (title14=INT ';')?
    (title15=INT ';')?
    (title16=INT ';')?
    (title17=INT ';')?
    /* 
    (title18=INT ';')?
    (title19=INT ';')?
    (title20=INT ';')?
    (title21=INT ';')?
    (title22=INT ';')?
    (title23=INT ';')?
    (title24=INT ';')?
    (title25=INT ';')?
    (title26=INT ';')?
    */
;
terminal INT returns number: ('0'..'9')+;
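
To make the 2ⁿ growth concrete, here is a minimal sketch, assuming each optional group simply forks every type candidate into a "with" and a "without" variant. The functions `enumerateAlternatives` and `mergeAsOptional` are hypothetical names for illustration only; this is not Langium's actual `collectInferredTypes` code, and the linear variant is just one possible way to avoid the blowup, not necessarily what any fix does.

```typescript
// Hypothetical model, NOT Langium's implementation.
// One candidate = the list of property names present in one alternative.
type Candidate = string[];

// Naive enumeration: every optional group doubles the number of candidates.
function enumerateAlternatives(optionalProps: string[]): Candidate[] {
    let candidates: Candidate[] = [[]];
    for (const prop of optionalProps) {
        const next: Candidate[] = [];
        for (const candidate of candidates) {
            next.push(candidate);              // group absent
            next.push([...candidate, prop]);   // group present
        }
        candidates = next;
    }
    return candidates;
}

// Linear alternative: record each property once and flag it as optional.
function mergeAsOptional(optionalProps: string[]): Map<string, { optional: boolean }> {
    const merged = new Map<string, { optional: boolean }>();
    for (const prop of optionalProps) {
        merged.set(prop, { optional: true });
    }
    return merged;
}

// 17 optional groups, as in the grammar above.
const titles = Array.from({ length: 17 }, (_, i) => `title${i + 1}`);
console.log(enumerateAlternatives(titles).length); // 131072 (= 2^17)
console.log(mergeAsOptional(titles).size);         // 17
```

Under this assumption, uncommenting the remaining nine groups in the grammar would give 2²⁶ (about 67 million) candidates, while the merged representation would still hold only 26 entries.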


tavoda · 2024-04-25T09:05:29Z

Hello, is somebody working on this? Without this you can't use langium for any serious grammar.

msujew · 2024-04-25T15:12:57Z

@tavoda I've provided a fix. We'll probably soon release a version 3.0.1 that contains this fix.

> Without this you can't use langium for any serious grammar.

Note that I would strongly disagree with this. We're working quite intensively with Langium and designed some pretty "serious grammars", and so far the only time we've encountered this issue was when we migrated a legacy grammar from Xtext to Langium, which we needed to rewrite anyway.

tavoda · 2024-04-26T19:03:31Z

@msujew thanks for the fix. How can I rewrite such rules? This is of course an old Xtext project which I would like to port, but I don't see other options. Please check e.g. DslAttribute:
https://github.com/sculptor/sculptor/blob/f57ce67563f1ffa0a6a52d1ea81ad14dbe31b8e7/sculptor-eclipse/org.sculptor.dsl/src/org/sculptor/dsl/Sculptordsl.xtext#L336
You don't support reordered optional parameters. I can live without that, but how do I migrate this grammar?

msujew · 2024-04-27T20:30:02Z

@tavoda I'd rewrite all these <key> = <value> mappings using a new rule:

DslAttribute :
  (doc=STRING)?
  (visibility=DslVisibility)? (collectionType=DslCollectionType"<")? type=DslType (">")? name=ID
    (properties+=DslAttributeProperty)*
  (";")?;

DslAttributeProperty: NOT ref=[PropertyDefinition:ID] | ref=[PropertyDefinition:ID] ('=' value=Literal)?;
...
PropertyDefinition: name=ID ':' type=Type;

These PropertyDefinition elements are part of a standard library delivered with your language. Everything else is handled via type-based validation rules and cross-references. That's how we design most languages nowadays that contain a lot of structural data.
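
A minimal sketch of what such a type-based validation rule could look like, assuming a plain in-memory list stands in for the standard library. Everything here is hypothetical and written for illustration: the interfaces, the `library` entries and `validateProperty` are not Langium APIs, and a real implementation would hook into Langium's validation and linking services instead.

```typescript
// All names below are hypothetical; they only mirror the grammar sketch above.
type LiteralType = 'string' | 'number' | 'boolean';

interface PropertyDefinition {
    name: string;
    type: LiteralType;
}

// Mirrors DslAttributeProperty: either "NOT ref" or "ref ('=' value)?".
interface AttributeProperty {
    ref: string;                         // name used to look up the definition
    negated: boolean;                    // true for the "NOT ref" alternative
    value?: string | number | boolean;   // present only for "ref = value"
}

// Stand-in for the standard library shipped with the language (invented entries).
const library: PropertyDefinition[] = [
    { name: 'length', type: 'number' },
    { name: 'nullable', type: 'boolean' },
    { name: 'hint', type: 'string' },
];

// Returns a list of human-readable validation errors (empty if the property is fine).
function validateProperty(prop: AttributeProperty, defs: PropertyDefinition[]): string[] {
    const def = defs.find(d => d.name === prop.ref);
    if (!def) {
        return [`Unknown property '${prop.ref}'.`];
    }
    const errors: string[] = [];
    if (prop.negated && def.type !== 'boolean') {
        errors.push(`Property '${def.name}' is of type ${def.type} and cannot be negated.`);
    }
    if (prop.value !== undefined && typeof prop.value !== def.type) {
        errors.push(`Property '${def.name}' expects a ${def.type} value, but got ${typeof prop.value}.`);
    }
    return errors;
}

console.log(validateProperty({ ref: 'length', negated: false, value: 'ten' }, library));
console.log(validateProperty({ ref: 'nullable', negated: true }, library));
```

The point of the design is that adding a new property means adding one library entry, while the grammar rule and the number of inferred types stay the same.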

tavoda · 2024-04-29T07:31:53Z

It's just moving syntactical rules to the semantic layer. For me BNF rules are exactly for this purpose to express as much as possible in syntax. We could rewrite nearly the whole grammar (and all C-like grammars) to:
CLikeGramars: NOT ref=[PropertyDefinition:ID] | ref=[PropertyDefinition:ID] ('=' value=Literal)? | Punct | String;
Punct: /\b[!#$%&()*+,-./:;<=>?@[\]^_{\|}~]\b/
That is not purpose of parser because then you degrade it to tokenizer.
Thanks again for the fix, looking forward to testing it.

msujew · 2024-04-29T15:15:20Z

> For me BNF rules are exactly for this purpose to express as much as possible in syntax.

IMO there's a big difference in BNF purely for parsing (i.e. something like a compiler) compared to editing (e.g. Xtext or Langium). In our experience (i.e. at TypeFox), the parser for an editor should allow a lot of "invalid" input, which is then validated against a set of known rules. This serves two purposes:

You usually get way better error recovery/messages from the parser when being a bit more flexible with your language.
Error messages can be way better compared to auto-generated syntax errors (both in regards to incorrectly typed property names as well as the value assigned to it).

With that in mind, I think my suggestion serves that grammar better than just embedding all that information into the grammar directly. No offense here, that's exactly what I did when I started out with Xtext. But in my experience, making things more flexible usually leads to a better product/end result.
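
As an illustration of the second point, here is a minimal sketch of a hand-written message for a mistyped property name, assuming the validator has access to the list of known property names. The helpers `editDistance` and `describeUnknownProperty`, the threshold of 2 and the sample names are all hypothetical choices for this example, not part of Langium.

```typescript
// Classic Levenshtein distance, computed row by row.
function editDistance(a: string, b: string): number {
    // prev[j] = distance between the first i-1 characters of a and the first j of b
    let prev: number[] = [];
    for (let j = 0; j <= b.length; j++) {
        prev.push(j);
    }
    for (let i = 1; i <= a.length; i++) {
        const curr: number[] = [i];
        for (let j = 1; j <= b.length; j++) {
            const cost = a[i - 1] === b[j - 1] ? 0 : 1;
            curr.push(Math.min(
                prev[j] + 1,        // deletion
                curr[j - 1] + 1,    // insertion
                prev[j - 1] + cost  // substitution or match
            ));
        }
        prev = curr;
    }
    return prev[b.length];
}

// Builds a validation message that suggests the closest known name, if any is close enough.
function describeUnknownProperty(name: string, known: string[]): string {
    let best: string | undefined;
    let bestDistance = Infinity;
    for (const candidate of known) {
        const distance = editDistance(name, candidate);
        if (distance < bestDistance) {
            best = candidate;
            bestDistance = distance;
        }
    }
    // The threshold of 2 is an arbitrary assumption for this sketch.
    if (best !== undefined && bestDistance <= 2) {
        return `Unknown property '${name}'. Did you mean '${best}'?`;
    }
    return `Unknown property '${name}'. Known properties: ${known.join(', ')}.`;
}

console.log(describeUnknownProperty('nullabel', ['length', 'nullable', 'hint']));
// Unknown property 'nullabel'. Did you mean 'nullable'?
```

A grammar that hard-codes every property name as a keyword can typically only report which tokens were expected at that position, whereas a validation-based message like this one can name the likely intended property.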

> That is not purpose of parser because then you degrade it to tokenizer.

It's honestly pretty difficult to tell what the purpose of a parser is. To some, it's just validating some string input. Others want a visitor-like pattern that allows them to tell which rule/part of a rule has been called. This is usually the academic perspective. Some even want a full AST. We serve the last requirement together with all the editing and linking services.

tavoda · 2024-04-29T15:23:21Z

Yes, I'm more on the AST side ;-).
After your explanation I see. For an editor it is better to have your approach with validation on the semantic layer.

snarkipus · 2024-05-05T13:42:25Z

> IMO there's a big difference in BNF purely for parsing (i.e. something like a compiler) compared to editing (e.g. Xtext or Langium). In our experience (i.e. at TypeFox), the parser for an editor should allow a lot of "invalid" input, which is then validated against a set of known rules. This serves two purposes:
>
> You usually get way better error recovery/messages from the parser when being a bit more flexible with your language.
>
> Error messages can be way better compared to auto-generated syntax errors (both in regards to incorrectly typed property names as well as the value assigned to it).

This is such a great take. While immediately obvious to some, this realization took me a while to grasp, but it fundamentally changed how I think about tooling in general.

dhuebner added the bug and grammar labels Nov 23, 2022

pluralia added the types label Nov 23, 2022

msujew added this to the v1.0.0 milestone Dec 7, 2022

spoenemann mentioned this issue Dec 9, 2022

Check the state of optional unordered group handling #803

Open

msujew mentioned this issue Dec 15, 2022

Respect inheritance for actions and unassigned rulecalls #837

Merged

spoenemann removed this from the v1.0.0 milestone Dec 16, 2022

msujew mentioned this issue Apr 25, 2024

Generate from language file is endless #1472

Closed

dhuebner added and removed the types label Apr 25, 2024

msujew mentioned this issue Apr 25, 2024

Fix type computation for many optional properties #1473

Merged

msujew closed this as completed in #1473 May 14, 2024
