Refactor the escape() function to improve performance 10-20% #975

KostyaTretyak · 2017-12-19T10:18:14Z

No description provided.

Feder1co5oave · 2017-12-27T01:21:26Z

I know this sounds kinda silly, but can we stick to the present coding style? This almost looks like a different language.

KostyaTretyak · 2017-12-27T01:31:28Z

Okey, I changed the style code, it provided me with VS Code through auto formatting. Also, I replaced const to var. Hope it's OK.

Feder1co5oave · 2018-01-02T16:12:55Z

lib/marked.js

+    "'": '&#39;'
+  };
+
+var escapeTestNoEncode = /(?:[<>"']|&(?!#?\w+;))/;


There's no need to use grouping to wrap the whole thing

Thank you, I fixed it.

Y'all are awesome! Thank you.

Feder1co5oave · 2018-01-02T16:13:26Z

lib/marked.js

@@ -1084,13 +1084,33 @@ Parser.prototype.tok = function() {
 * Helpers
 */

+var escapeTest = /[&<>"']/;


These should be declared inside escape() IMO

No, marked do not have to recreate the same RegExp instance every call escape(). This reduces performance and increases memory usage.

Ok, I made them static.

Feder1co5oave · 2018-01-05T17:01:47Z

Actually, I've found there's no advantage in testing before replacing, you still have to scan the whole thing at least once, either by testing or replacing, so the first phase is pretty useless.

# with current changes:
$ node test --bench
marked completed in 8388ms.
marked (gfm) completed in 9380ms.
marked (pedantic) completed in 8315ms.
Could not bench robotskirt.
Could not bench showdown.
Could not bench markdown.js.

# without testing first:
$ node test --bench
marked completed in 8394ms.
marked (gfm) completed in 9286ms.
marked (pedantic) completed in 8045ms.
Could not bench robotskirt.
Could not bench showdown.
Could not bench markdown.js.

And you can spare some line of code:

function escape(html, encode) {
  if (encode) {
    return html.replace(escape.replace, function (ch) {
      return escape.replacements[ch];
    });
  } else {
    return html.replace(escape.replaceNoEncode, function (ch) {
      return escape.replacements[ch];
    });
  }
}

escape.replace = /[&<>"']/g;
escape.replaceNoEncode = /[<>"']|&(?!#?\w+;)/g;
escape.replacements = {
  '&': '&amp;',
  '<': '&lt;',
  '>': '&gt;',
  '"': '&quot;',
  "'": '&#39;'
};

KostyaTretyak · 2018-01-05T17:14:17Z

My first escape() without regexp.test():

function escape(html, encode) {
  if (encode) {
    return html.replace(escape.escapeReplace, function (ch) { return escape.replacements[ch] });
  }
  else {
    return html.replace(escape.escapeReplaceNoEncode, function (ch) { return escape.replacements[ch] });
  }

  return html;
}

I run this code:

node test -t

Three times:

marked completed in 4146ms
marked completed in 4133ms
marked completed in 4125ms

My second escape() with regexp.test():

function escape(html, encode) {
  if (encode) {
    if (escape.escapeTest.test(html)) {
      return html.replace(escape.escapeReplace, function (ch) { return escape.replacements[ch] });
    }
  }
  else {
    if (escape.escapeTestNoEncode.test(html)) {
      return html.replace(escape.escapeReplaceNoEncode, function (ch) { return escape.replacements[ch] });
    }
  }

  return html;
}

Run three times:

marked completed in 3885ms
marked completed in 3902ms
marked completed in 3893ms

UziTech · 2018-01-07T18:25:16Z

Looks like this would make marked faster than markdown-it in our benchmarks

marked completed in 2200ms.
marked (gfm) completed in 2225ms.
marked (pedantic) completed in 1917ms.
showdown (reuse converter) completed in 21693ms.
showdown (new converter) completed in 22802ms.
markdown-it completed in 3275ms.
markdown.js completed in 11650ms.

KostyaTretyak · 2018-01-07T18:36:03Z

In my benchmarks, remarkable is faster and more economical than remarked-it, but both of them can compete with marked when it is necessary to parse large files - more than 2 MB.

joshbruce · 2018-01-07T18:43:05Z

Yeah. @worker8's independent benchmark sample has remarkable at the top as well.

@KostyaTretyak: Just to make sure. They can compete with marked with large files >2mb - versus the can not?

I think if we do what in #746 - we will be able to see areas for optimization easier. Right now we kinda have the large class thing happening.

KostyaTretyak · 2018-01-07T18:43:40Z

In markdown.js and showdown, a very noticeable regress when files get larger than 300 KB.

joshbruce · 2018-01-07T18:46:17Z

Interesting. Of course, if they're (or we're) targeting web developers - most folks aren't going to need to go above that. Maybe marked is the "large file" parser. :)

Thinking of something like LeanPub - parse an entire book in markdown.

KostyaTretyak · 2018-01-07T18:55:35Z

Maybe marked is the "large file" parser. :)

No, it is a favorite when files are smaller than 2MB. If the files are bigger, then remarkable and markdown-it more faster.

Not for the sake of advertising, just for you to see it clearly. Do the following:

git clone https://github.com/KostyaTretyak/marked-ts.git
cd marked-ts
npm install
npm run compile

And then you can:

npm run bench -- -l 1000

Where -l 1000 - bench 1000 KB file with markdown tests. Here two dash from the front are necessary.

joshbruce · 2018-01-07T21:05:35Z

Thanks! That's an interesting trick...might interesting for us to add to the CLI...if I'm understanding correctly:

I can secify how large of a file. Kind of like lipsum https://lipsum.lipsum.com - generate Markdown of X size to run the bench against.

styfle

LGTM 👍

styfle · 2018-09-11T13:23:51Z

Is there a way to force push to invoke travis unit tests?

UziTech · 2018-09-11T13:56:49Z

@KostyaTretyak if you can rebase this PR we should be able to merge it.

git fetch upstream && git rebase upstream/master && git push -f

UziTech · 2018-09-11T13:58:19Z

I rebased and tested locally, and everything worked fine.

styfle · 2018-09-11T14:02:20Z

Nice! I ran benchmarks locally and this is the before and after:

Before

$ node test --bench
marked completed in 9146ms.
marked (gfm) completed in 11382ms.
marked (pedantic) completed in 8912ms.
commonmark completed in 10832ms.
markdown-it completed in 9832ms.
markdown.js completed in 29112ms.

After

$ node test --bench
marked completed in 8033ms.
marked (gfm) completed in 9946ms.
marked (pedantic) completed in 7905ms.
commonmark completed in 10897ms.
markdown-it completed in 9473ms.
markdown.js completed in 28330ms.

KostyaTretyak · 2018-09-11T14:05:55Z

@UziTech, done:

git fetch upstream && git rebase upstream/master && git push -f

styfle · 2018-09-11T14:10:57Z

@KostyaTretyak Thanks! Can you fix this lint error 😄

styfle

Excellent! 🎉

UziTech

Awesome work @KostyaTretyak

This was referenced Dec 21, 2017

Add an option to not espace quotes. #269

Closed

V0.3.9 #976

Merged

joshbruce added the proposal label Dec 23, 2017

joshbruce added this to the 0.5.0 - Architecture and extensibility milestone Dec 25, 2017

Feder1co5oave reviewed Jan 2, 2018

View reviewed changes

Feder1co5oave mentioned this pull request Jan 26, 2018

[Emphasis/strong] Consolidate issues #1036

Closed

8 tasks

joshbruce removed this from the 0.5.0 - Architecture and extensibility milestone Apr 4, 2018

styfle approved these changes Sep 11, 2018

View reviewed changes

styfle changed the title ~~Refactoring the old escape() function improved performance on 30-40%~~ Refactor the escape() function to improve performance 30-40% Sep 11, 2018

Refactoring the old escape() function improved performance on 30-40%

eb9f08c

styfle approved these changes Sep 11, 2018

View reviewed changes

styfle removed the proposal label Sep 11, 2018

UziTech approved these changes Sep 11, 2018

View reviewed changes

moamed-syed mentioned this pull request Oct 27, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 moamed-syed/DSOF-Patch-Chat-App#2

Open

ManPrivate mentioned this pull request Oct 27, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 ManPrivate/DSOF-Patch-Chat-App#2

Open

tdsnyk mentioned this pull request Oct 28, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 tdsnyk/nodejs-goof#6

Open

sri-sankari mentioned this pull request Oct 28, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 sri-sankari/DSOF-Patch-Chat-App#3

Open

mohankumar931 mentioned this pull request Oct 28, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 mohankumar931/DSOF-Patch-Chat-App#2

Open

sound25 mentioned this pull request Oct 29, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 sound25/DSOF-Patch-Chat-App#3

Open

Latha-lgtm mentioned this pull request Oct 29, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 Latha-lgtm/DSOF-Patch-Chat-App#3

Open

GOWTHAMAKURINJIVEANDAN mentioned this pull request Oct 29, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 GOWTHAMAKURINJIVEANDAN/DSOF-Patch-Chat-App#3

Open

Dejah2024 mentioned this pull request Oct 30, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 Dejah2024/vuln-app#5

Open

ThiruVaithiya mentioned this pull request Oct 30, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 ThiruVaithiya/DSOF-Patch-Chat-App#1

Open

Upender5119 mentioned this pull request Oct 30, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 Upender5119/DSOF-Patch-Chat-App#2

Open

Suriya0802 mentioned this pull request Oct 30, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 Suriya0802/DSOF-Patch-Chat-App#3

Open

Shriv12 mentioned this pull request Oct 30, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 Shriv12/DSOF-Patch-Chat-App#1

Open

Magali0007 mentioned this pull request Oct 31, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 Magali0007/vuln-app#5

Open

sureshrtech1 mentioned this pull request Oct 31, 2024

[Snyk] Upgrade marked from 0.3.19 to 0.8.2 sureshrtech1/DSOF-Patch-Chat-App#3

Open

Praganya03 mentioned this pull request Oct 31, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 Praganya03/DSOF-Patch-Chat-App-idp#3

Open

Ganesh13349 mentioned this pull request Oct 31, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 Ganesh13349/DSOF-Patch-Chat-App#3

Open

robertnorrie mentioned this pull request Oct 31, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 robertnorrie/DSOF-Patch-Chat-App#3

Open

sinnakkrishnan mentioned this pull request Nov 1, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 sinnakkrishnan/DSOF-Patch-Chat-App#3

Open

krithika-manohar mentioned this pull request Nov 1, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 krithika-manohar/DSOF-Patch-Chat-App#3

Open

SAGAYANELSON55 mentioned this pull request Nov 1, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 SAGAYANELSON55/DSOF-Patch-Chat-App#2

Open

swathi-ra mentioned this pull request Nov 2, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 swathi-ra/DSOF-Patch-Chat-App#1

Open

austinskou mentioned this pull request Nov 3, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 austinskou/austin-kou-nov-01-2024snyk-goof#6

Open

BD-HUGO mentioned this pull request Nov 3, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 BD-HUGO/hugo-demo-nodegoat-javascript#6

Open

danielsb74 mentioned this pull request Nov 5, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 danielsb74/vuln-app#5

Open

selvaganesh26 mentioned this pull request Nov 5, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 selvaganesh26/EH-Patch-Chat-App#3

Open

merkeldev mentioned this pull request Nov 5, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 merkeldev/vuln-app#5

Open

joedean-git mentioned this pull request Nov 9, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 joedean-org/NodeGoat#1

Open

MimiDas-Snyk mentioned this pull request Nov 10, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 MimiDas-Snyk/snyk-chat-goof#3

Open

BenDrKo mentioned this pull request Nov 11, 2024

[Snyk] Upgrade marked from 0.3.5 to 0.8.2 BenDrKo/NodeGoat#1

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor the escape() function to improve performance 10-20% #975

Refactor the escape() function to improve performance 10-20% #975

KostyaTretyak commented Dec 19, 2017

Feder1co5oave commented Dec 27, 2017

KostyaTretyak commented Dec 27, 2017 •

edited

Loading

Feder1co5oave Jan 2, 2018

KostyaTretyak Jan 3, 2018

joshbruce Jan 5, 2018

Feder1co5oave Jan 2, 2018

KostyaTretyak Jan 3, 2018 •

edited

Loading

KostyaTretyak Jan 3, 2018

Feder1co5oave commented Jan 5, 2018

KostyaTretyak commented Jan 5, 2018

UziTech commented Jan 7, 2018

KostyaTretyak commented Jan 7, 2018 •

edited

Loading

joshbruce commented Jan 7, 2018

KostyaTretyak commented Jan 7, 2018

joshbruce commented Jan 7, 2018

KostyaTretyak commented Jan 7, 2018

joshbruce commented Jan 7, 2018

styfle left a comment

styfle commented Sep 11, 2018

UziTech commented Sep 11, 2018

UziTech commented Sep 11, 2018

styfle commented Sep 11, 2018

KostyaTretyak commented Sep 11, 2018

styfle commented Sep 11, 2018

styfle left a comment

UziTech left a comment

Refactor the escape() function to improve performance 10-20% #975

Refactor the escape() function to improve performance 10-20% #975

Conversation

KostyaTretyak commented Dec 19, 2017

Feder1co5oave commented Dec 27, 2017

KostyaTretyak commented Dec 27, 2017 • edited Loading

Feder1co5oave Jan 2, 2018

Choose a reason for hiding this comment

KostyaTretyak Jan 3, 2018

Choose a reason for hiding this comment

joshbruce Jan 5, 2018

Choose a reason for hiding this comment

Feder1co5oave Jan 2, 2018

Choose a reason for hiding this comment

KostyaTretyak Jan 3, 2018 • edited Loading

Choose a reason for hiding this comment

KostyaTretyak Jan 3, 2018

Choose a reason for hiding this comment

Feder1co5oave commented Jan 5, 2018

KostyaTretyak commented Jan 5, 2018

UziTech commented Jan 7, 2018

KostyaTretyak commented Jan 7, 2018 • edited Loading

joshbruce commented Jan 7, 2018

KostyaTretyak commented Jan 7, 2018

joshbruce commented Jan 7, 2018

KostyaTretyak commented Jan 7, 2018

joshbruce commented Jan 7, 2018

styfle left a comment

Choose a reason for hiding this comment

styfle commented Sep 11, 2018

UziTech commented Sep 11, 2018

UziTech commented Sep 11, 2018

styfle commented Sep 11, 2018

Before

After

KostyaTretyak commented Sep 11, 2018

styfle commented Sep 11, 2018

styfle left a comment

Choose a reason for hiding this comment

UziTech left a comment

Choose a reason for hiding this comment

KostyaTretyak commented Dec 27, 2017 •

edited

Loading

KostyaTretyak Jan 3, 2018 •

edited

Loading

KostyaTretyak commented Jan 7, 2018 •

edited

Loading