EmbedAPIv3: updates after QA with sphinx-hoverxref #8521

humitos · 2021-09-22T15:17:39Z

Updates required by readthedocs/sphinx-hoverxref#146 found while doing QA.

Add extra domains for external intersphinx.

ericholscher

Looks good, just a few small suggestions.

ericholscher · 2021-09-27T15:52:07Z

readthedocs/embed/v3/views.py

@@ -71,12 +70,15 @@ def _download_page_content(self, url):

        response = requests.get(url, timeout=settings.RTD_EMBED_API_DEFAULT_REQUEST_TIMEOUT)
        if response.ok:
+            # NOTE: we use ``response.content`` to get its binary
+            # representation. Then ``selectolax`` is in charge to auto-detect
+            # its encoding. We trust more in selectolax for this than in requests.


Why do we trust it more? Is there a specific issue we've seen? I'm guessing that requests has a lot more eyes on it, but perhaps we're doing something different?

I don't think we have tested enough data yet. However, I've already found myself applying the fix from this SO answer: https://stackoverflow.com/a/52615216

    r = requests.get("https://martin.slouf.name/")
    # override encoding by real educated guess as provided by chardet
    r.encoding = r.apparent_encoding
    # access the data
    r.text

Then I read that selectolax also uses the HTML meta tags (https://selectolax.readthedocs.io/en/latest/parser.html#selectolax.parser.HTMLParser) to guess the encoding, and so far it has always worked without the manual intervention that requests needed.

Anyways, I don't have strong opinions here. I suppose whatever we choose here, it will fail in some case that the other library wouldn't 🙃
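To make the trade-off above concrete, here is a minimal, stdlib-only sketch of what "guess the encoding from the meta tags" can mean. It is not selectolax's actual detection algorithm and not part of this PR; `decode_html` and `_META_CHARSET` are hypothetical names used only for illustration, and the sample page is made up.

```python
import re

# Hypothetical helper for illustration; NOT selectolax's or requests' logic.
# Matches both <meta charset="..."> and
# <meta http-equiv="Content-Type" content="text/html; charset=...">.
_META_CHARSET = re.compile(
    rb'<meta[^>]+charset=["\']?([A-Za-z0-9_\-]+)', re.IGNORECASE
)


def decode_html(raw: bytes, fallback: str = "utf-8") -> str:
    """Decode raw HTML bytes using the charset declared in a <meta> tag, if any."""
    # Only scan the beginning of the document, where <head> normally lives.
    match = _META_CHARSET.search(raw[:1024])
    encoding = match.group(1).decode("ascii") if match else fallback
    try:
        return raw.decode(encoding)
    except (LookupError, UnicodeDecodeError):
        # Unknown or wrong declaration: fall back and replace undecodable bytes.
        return raw.decode(fallback, errors="replace")


# A made-up page served as ISO-8859-2 bytes, similar to what
# ``response.content`` would return for a non-UTF-8 site.
page = (
    '<html><head><meta charset="iso-8859-2"></head>'
    "<body>Příliš</body></html>"
).encode("iso-8859-2")

text = decode_html(page)
```

Decoding the same bytes blindly as UTF-8 would garble the accented characters, which is the kind of failure the `r.encoding = r.apparent_encoding` workaround is meant to avoid on the requests side.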

readthedocs/embed/v3/views.py

ericholscher · 2021-09-27T15:56:09Z

readthedocs/settings/base.py

@@ -818,6 +818,8 @@ def DOCKER_LIMITS(self):
        r'docs\.python\.org',
        r'docs\.scipy\.org',
        r'docs\.sympy\.org',
+        r'www.sphinx-doc.org',
+        r'numpy\.org',


This is likely something that should be defined in the DB, or somewhere else that doesn't require a deploy to modify. It's definitely something we're going to need to iterate on pretty frequently I'm guessing, and perhaps even support customization at a project or organization level.

I agree. We do have an open issue for storing Django settings in the DB at https://github.com/readthedocs/readthedocs-ops/issues/1004 but I don't think it will make much progress. If that's the case, we can create a simple EmbedDomain model that we can relate to a Project and/or Organization as well.

Opened #8530 to continue the conversation there.
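As a rough sketch of the idea discussed above (global defaults plus per-project additions), the check could look something like the following. Everything here is an assumption for illustration: `DEFAULT_ALLOWED_DOMAINS`, `PROJECT_ALLOWED_DOMAINS`, `is_allowed`, and the `my-project` slug are hypothetical, the dictionary stands in for whatever a database-backed model would return, and the real EmbedAPI may match domains differently.

```python
import re
from typing import Optional
from urllib.parse import urlparse

# Hypothetical global defaults, written in the same regex style as the
# settings list in the diff above.
DEFAULT_ALLOWED_DOMAINS = [
    r"docs\.python\.org",
    r"numpy\.org",
    r".*\.readthedocs-hosted\.com",
]

# Hypothetical per-project additions; a stand-in for rows that would be
# loaded from the database instead of requiring a deploy.
PROJECT_ALLOWED_DOMAINS = {
    "my-project": [r"docs\.example\.com"],
}


def is_allowed(url: str, project_slug: Optional[str] = None) -> bool:
    """Return True if the URL's hostname matches a global or project pattern."""
    domain = urlparse(url).hostname or ""
    patterns = DEFAULT_ALLOWED_DOMAINS + PROJECT_ALLOWED_DOMAINS.get(project_slug, [])
    # fullmatch anchors the pattern at both ends, so "numpy.org.evil.test"
    # does not pass just because it starts with an allowed domain.
    return any(re.fullmatch(pattern, domain) for pattern in patterns)
```

With this shape, adding a domain for one project only touches that project's entry, while the global list stays small. Escaping the dots in each pattern matters, because an unescaped `.` matches any character.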

readthedocs/settings/docker_compose.py

tox.embedapi.ini

Co-authored-by: Eric Holscher <[email protected]>

humitos added 2 commits September 22, 2021 17:16

EmbedAPIv3: updates after QA with sphinx-hoverxref

349d772

Add extra domains for external intersphinx.

Remove import not used and add some notes about encoding

c31f260

humitos mentioned this pull request Sep 22, 2021

Embed APIv3: use latest embed API version readthedocs/sphinx-hoverxref#146

Merged

9 tasks

humitos added 5 commits September 23, 2021 13:02

Sanitize CSS selector

7b91b12

Clarify comment

0baeed1

Remove old comment

a6f3eda

Add numpy.org as supported external domain

9b80ad7

Move custom domains to local Docker instance only

d5aab3d

humitos requested a review from a team September 27, 2021 09:06

humitos marked this pull request as ready for review September 27, 2021 09:06

humitos added 3 commits September 27, 2021 13:08

Add Sphinx 4.2 to our tox envlist

550ad0b

Make glossary and citation work properly

39c8829

Lint

ae212b5

ericholscher approved these changes Sep 27, 2021

humitos and others added 2 commits September 27, 2021 20:31

Add *.readthedocs-hosted.com as valid domain

23f9520

Apply suggestions from code review

cc45e39

Co-authored-by: Eric Holscher <[email protected]>

humitos enabled auto-merge September 27, 2021 18:40

humitos merged commit d8651c1 into master Sep 27, 2021

humitos deleted the humitos/embed-api-v3-updates branch September 27, 2021 18:47
