Parser syntax sparse bl #76

zbraniecki · 2017-09-27T10:11:25Z

That's more or less how I think it'll have to work if we want to keep the spans and proper tokenizer flow.

I can't say I like it... :(

zbraniecki · 2017-10-02T15:41:03Z

I think this is ready to be reviewed.

I analyzed the change in span ranges and the cause is that we now start the span for message value after we skip the inline whitespace. For example:

key = Value
     ^
     |---- start before

key = Value
      ^
      |---- start after

and:

key =
     ^
     |---- start before
    Value

key =
    Value
    ^
    |---- start after

zbraniecki · 2017-10-02T15:41:55Z

I'm still testing performance but first indications are that it doesn't impact runtime perf, just the fluent-syntax parsing one, which is not significant for us. I tested on node, need to recompile spidermonkey to test against it.

zbraniecki · 2017-10-02T17:00:45Z

Ok, got some more performance tests.

This patch does actually regress on jsshell:

parseFTL: 
  mean:   21812.23 (+24%)
  stdev:  2660.13
  sample: 30
parseFTLEntries: 
  mean:   3006.2 (+8%)
  stdev:  223.84
  sample: 30
format: 
  mean:   2752.9 (+76%)
  stdev:  452.41
  sample: 30

I have no idea why the format test gets such a penalty, since the AST produced with the patch is identical to the one produced on master. @stasm - any ideas?

zbraniecki · 2017-10-02T17:04:26Z

Here's the same data for nodejs:

parseFTL: 
  mean:   8712.83 (+16%)
  stdev:  620.47
  sample: 30
parseFTLEntries: 
  mean:   1983.53 (+13%)
  stdev:  352.69
  sample: 30
format: 
  mean:   744.17 (+7.000000000000001%)
  stdev:  88.91
  sample: 30

So, still a regression but not as significant as on jsshell. I have no idea what's up with jsshell... maybe with the patch we cross some boundary due to, I don't know, number of variables? Lines of code?

stasm

I like this. I'll review this in detail tomorrow. Leaving two quick comments for now.

stasm · 2017-10-02T19:09:58Z

fluent-syntax/src/ftlstream.js

@@ -31,6 +31,21 @@ export class FTLParserStream extends ParserStream {
    }
  }

+  peekSkipBlankLines() {


This name is confusing. Does it peek or does it skip?

stasm · 2017-10-02T19:11:29Z

fluent-syntax/src/parser.js

@@ -242,6 +243,7 @@ export default class FluentParser {

    while (true) {
      ps.expectChar('\n');
+      ps.skipBlankLines();
      ps.skipInlineWS();


This looks like a common pattern. Perhaps it's worth having a function called expectIndent?

stasm

I was intrigued by the fact that neither getTags nor getAttributes didn't need skipBlankLines. It turns out that they currently use skipWS which is too tolerant about indentation. I filed https://bugzilla.mozilla.org/show_bug.cgi?id=1405645.

stasm · 2017-10-04T11:48:45Z

fluent-syntax/src/stream.js

-  resetPeek() {
-    this.peekIndex = this.index;
-    this.peekEnd = this.iterEnd;
+  resetPeek(pos = false) {


Nit: pos doesn't require a default: it will be undefined if nothing is passed.

stasm · 2017-10-04T11:56:32Z

fluent/src/parser.js

@@ -344,6 +367,16 @@ class RuntimeParser {
      // by new line and `|` character at the beginning of the next one.


Would you mind fixing this comment while you're at it, please?

stasm

Looks good to me, thanks! I'm happy about the change to the AST spans, too.

stasm · 2017-10-09T10:20:51Z

I filed bug 1406880 because it looks like the spans of multiline Patterns are slightly different from the ones visualized in your comment above. I'm not yet sure which approach is better, let's decide in the bug and fix this if needed.

- Implement Fluent Syntax 0.5. - Add support for terms. - Add support for `#`, `##` and `###` comments. - Remove support for tags. - Add support for `=` after the identifier in message and term defintions. - Forbid newlines in string expressions. - Allow trailing comma in call expression argument lists. In fluent-syntax 0.6.x the new Syntax 0.5 is supported alongside the old Syntax 0.4. This should make migrations easier. `FluentParser` will correctly parse Syntax 0.4 comments (prefixed with `//`), sections and message definitions without the `=` after the identifier. The one exception are tags which are no longer supported. Please use attributed defined on terms instead. `FluentSerializer` always serializes using the new Syntax 0.5. - Add `AST.Placeable` (#64) Added in Syntax Spec 0.4, `AST.Placeable` provides exact span data about the opening and closing brace of placeables. - Expose `FluentSerializer.serializeExpression`. (#134) - Serialize standalone comments with surrounding white-space. - Allow blank lines inside of messages. (#76) - Trim trailing newline from Comments. (#77)

Zibi Braniecki added 3 commits October 2, 2017 13:40

Handle sparse messages in fluent-syntax

873d693

Fix tests affected by the sparse message parsing change

8bf4cff

Handle sparse messages in fluent runtime

188d717

zbraniecki force-pushed the parser-syntax-sparse-bl branch from 9ce7bf8 to 188d717 Compare October 2, 2017 15:04

Fix linting errors

292361d

zbraniecki requested a review from stasm October 2, 2017 15:38

stasm reviewed Oct 2, 2017

View reviewed changes

stasm reviewed Oct 4, 2017

View reviewed changes

Apply reviewers feedback

0388600

zbraniecki requested a review from stasm October 4, 2017 12:48

stasm approved these changes Oct 4, 2017

View reviewed changes

One more indent

c0f282f

zbraniecki merged commit 7e52517 into projectfluent:master Oct 4, 2017

zbraniecki mentioned this pull request Oct 6, 2017

Bug 1397234 - Allow blank lines before attributes, tags and multiline patterns projectfluent/python-fluent#20

Merged

zbraniecki deleted the parser-syntax-sparse-bl branch December 12, 2017 04:43

stasm mentioned this pull request Jan 31, 2018

fluent-syntax: Allow new lines after { and before tags, attributes and multiline patterns #67

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parser syntax sparse bl #76

Parser syntax sparse bl #76

zbraniecki commented Sep 27, 2017

zbraniecki commented Oct 2, 2017

zbraniecki commented Oct 2, 2017

zbraniecki commented Oct 2, 2017

zbraniecki commented Oct 2, 2017

stasm left a comment

stasm Oct 2, 2017

stasm Oct 2, 2017

stasm left a comment •

edited

Loading

stasm Oct 4, 2017

stasm Oct 4, 2017

stasm left a comment

stasm commented Oct 9, 2017

		@@ -344,6 +367,16 @@ class RuntimeParser {
		// by new line and `\|` character at the beginning of the next one.

Parser syntax sparse bl #76

Parser syntax sparse bl #76

Conversation

zbraniecki commented Sep 27, 2017

zbraniecki commented Oct 2, 2017

zbraniecki commented Oct 2, 2017

zbraniecki commented Oct 2, 2017

zbraniecki commented Oct 2, 2017

stasm left a comment

Choose a reason for hiding this comment

stasm Oct 2, 2017

Choose a reason for hiding this comment

stasm Oct 2, 2017

Choose a reason for hiding this comment

stasm left a comment • edited Loading

Choose a reason for hiding this comment

stasm Oct 4, 2017

Choose a reason for hiding this comment

stasm Oct 4, 2017

Choose a reason for hiding this comment

stasm left a comment

Choose a reason for hiding this comment

stasm commented Oct 9, 2017

stasm left a comment •

edited

Loading