Comply with CommonMark 0.31.2 #416

Merged
merged 15 commits into from
Apr 1, 2024

Conversation

lostenderman
Collaborator

Closes #412

Tests that needed parser changes:

  1. raw_html/014.test
    • `--` is now allowed inside inline HTML comments
  2. emphasis_and_strong_emphasis/132.test
    • changes to the definition of Unicode punctuation that prevent emphasis from being created
    • punctuation is now specified explicitly
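
The first change can be illustrated with a rough sketch. It is written in Python rather than the project's Lua, the names `COMMENT_0_30`, `COMMENT_0_31_2`, and `match_comment` are hypothetical, and the regular expressions only approximate the HTML comment rules of CommonMark 0.30 and 0.31.2; this is not the parser code from `markdown.dtx`.

```python
import re

# CommonMark 0.30 (approximation): "<!--" + text + "-->", where the text
# does not start with ">" or "->", does not end with "-", and has no "--".
COMMENT_0_30 = re.compile(r"<!--(?!>|->)(?:[^-]|-[^-])*-->")

# CommonMark 0.31.2 (approximation): "<!-->", "<!--->", or "<!--" followed
# by the shortest text that ends at the first "-->".
COMMENT_0_31_2 = re.compile(r"<!-->|<!--->|<!--[\s\S]*?-->")


def match_comment(pattern, text):
    """Return the HTML comment matched at the start of `text`, or None."""
    found = pattern.match(text)
    return found.group() if found else None


sample = "<!-- a -- b -->"
print(match_comment(COMMENT_0_30, sample))    # rejected under the 0.30 rule
print(match_comment(COMMENT_0_31_2, sample))  # accepted under the 0.31.2 rule
```

Under these assumptions, a comment containing `--` is rejected by the older pattern and accepted by the newer one, which is the kind of difference the first test exercises.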

@Witiko Witiko added the commonmark Related to making the syntax of markdown follow the CommonMark spec label Mar 13, 2024
@Witiko Witiko added this to the 3.5.0 milestone Mar 13, 2024
@Witiko Witiko self-assigned this Mar 28, 2024
@Witiko Witiko force-pushed the update-commonmark branch from 93f1820 to 1522923 Compare April 1, 2024 11:41
@Witiko Witiko marked this pull request as ready for review April 1, 2024 13:25
@Witiko Witiko added the automerge This pull request will be automatically merged after continuous integration has succeeded label Apr 1, 2024
Witiko and others added 2 commits April 1, 2024 15:38
This reduces the memory footprint of `markdown.lua`
This better documents the changes between different CommonMark versions
and shows that all test files from CommonMark 0.30 still pass.
@Witiko Witiko force-pushed the update-commonmark branch from b5cc60b to 9087fae Compare April 1, 2024 13:49
This reduces the memory footprint of `markdown.lua`
@github-actions github-actions bot merged commit d8a1d2f into Witiko:main Apr 1, 2024
9 of 12 checks passed
@Witiko Witiko deleted the update-commonmark branch April 1, 2024 15:06
Witiko added a commit that referenced this pull request Apr 3, 2024
Since #416, we have first read all Unicode punctuation characters into a
table `punctuation`, then defined `parsers.punctuation` using the table
`punctuation`, and then we deleted the table `punctuation`. Since #416,
we have also been experiencing steady out-of-memory issues with our
capybara runner, as discussed with @TeXhackse earlier today.

I have disabled capybara, since it's been having intermittent
out-of-memory issues ever since Markdown 3.0.0 and its speed has also
lately been an issue. Nevertheless, this indicates a potential cost of
the current approach, which may eventually impact our users as well.

This PR removes the table `punctuation` and defines `parsers.punctuation`
directly from the file `UnicodeData.txt` without any intermediate data
structure. This should alleviate any memory issues caused by #416.
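
The idea of consuming `UnicodeData.txt` without an intermediate table can be sketched as follows. This is a Python illustration, not the Lua code in `markdown.lua`; the function name `iter_punctuation` and the sample data are hypothetical. It assumes the CommonMark 0.31.2 definition of a Unicode punctuation character (general category `P*` or `S*`) and ignores the `First`/`Last` range entries in the file.

```python
from typing import Iterable, Iterator


def iter_punctuation(lines: Iterable[str]) -> Iterator[int]:
    """Yield punctuation code points one line at a time.

    Each line is expected in UnicodeData.txt format: semicolon-separated
    fields with the code point (hex) first and the general category third.
    Nothing is collected in a table; the caller consumes the stream.
    """
    for line in lines:
        fields = line.split(";")
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        category = fields[2]
        # Assumption: punctuation means general category P* or S*.
        if category[:1] in ("P", "S"):
            yield int(fields[0], 16)


# A few sample lines in UnicodeData.txt format (illustrative input only).
SAMPLE = [
    "0021;EXCLAMATION MARK;Po;0;ON;;;;;N;;;;;",
    "0041;LATIN CAPITAL LETTER A;Lu;0;L;;;;;N;;;;0061;",
    "20AC;EURO SIGN;Sc;0;ET;;;;;N;;;;;",
]

for code_point in iter_punctuation(SAMPLE):
    print(f"U+{code_point:04X}")
```

Because the generator also accepts an open file object, a consumer could build its parser incrementally from the stream, so peak memory would depend on the consumer rather than on a full list of code points.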