Extend Zayo parser to handle additional examples #109

glennmatthews · 2021-11-11T15:11:32Z

Handle some additional Zayo email notifications that were failing the current parser.

I'm marking this as a draft for now as I have some questions about how to handle information that may be omitted from certain Zayo notification emails (see TODO comments in zayo.py).

circuit_maintenance_parser/parsers/zayo.py

chadell

LGTM

chadell · 2021-11-12T17:11:47Z

circuit_maintenance_parser/data.py

@@ -77,7 +77,8 @@ def init_from_emailmessage(cls: Type["NotificationData"], email_message) -> Opti
            # Adding extra headers that are interesting to be parsed
            data_parts.add(DataPart(EMAIL_HEADER_SUBJECT, email_message["Subject"].encode()))
            data_parts.add(DataPart(EMAIL_HEADER_DATE, email_message["Date"].encode()))
-            return cls(data_parts=list(data_parts))
+            # Ensure the data parts are processed in a consistent order
+            return cls(data_parts=sorted(data_parts, key=lambda part: part.type))


Why is the order important?

It matters when different data parts provide information of differing accuracy or precision. For example, with Zayo we can always pull an (approximate) stamp from the email date, but some notifications also state a more precise stamp explicitly in the email body. Without this change, it was unpredictable which of these would be processed last and therefore win out; see the CI failures on 7970ecc above for an example.

I understand, good catch.
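
For readers following this thread, here is a minimal sketch of the ordering issue, not the library's actual code. `DataPart` below is a simplified stand-in, the type labels `"email-header-date"` and `"text/html"` are assumed for illustration, and `merge_stamps` is a hypothetical helper that mimics "the last processed part wins".

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataPart:
    """Simplified stand-in for the library's DataPart (hypothetical fields)."""

    type: str
    content: bytes


def merge_stamps(parts):
    """Process parts in the given order; a later part overwrites an earlier stamp."""
    result = {}
    for part in parts:
        result["stamp"] = part.content.decode()
    return result


# A set has no guaranteed iteration order, so without sorting, the part that
# is processed last (and therefore wins) could differ between runs.
parts = {
    DataPart("text/html", b"precise stamp from body"),
    DataPart("email-header-date", b"approximate stamp from Date header"),
}

# Sorting by type gives a stable order; with these assumed labels the
# header-derived part comes first and the body-derived part is processed last.
ordered = sorted(parts, key=lambda part: part.type)
merged = merge_stamps(ordered)
```

With these assumed labels the body-derived value is always the one that survives, which is the consistency the change is after.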

circuit_maintenance_parser/parser.py

chadell · 2021-11-12T17:13:35Z

circuit_maintenance_parser/parsers/zayo.py

@@ -24,6 +43,14 @@ def parse_html(self, soup):
        self.parse_bs(soup.find_all("b"), data)
        self.parse_tables(soup.find_all("table"), data)

+        if data:


Just wondering, why do we need this check?

It covers the case where we decide that there's no relevant content in the email (and hence return an empty data dict). I wouldn't want to fill in these additional fields in an otherwise empty dict.

I see. Otherwise you could just rely on Pydantic validation once the Maintenance output is instantiated.
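
The guard discussed in this thread can be sketched roughly as follows. This is an illustration under assumptions, not the code from zayo.py: the function name, the field names, and the choice of `setdefault` (so body values are never overwritten) are all hypothetical.

```python
def add_header_fields(body_data, header_fields):
    """Fill in header-derived fields only when the body produced some data.

    Both arguments are plain dicts here; the names are hypothetical.
    """
    data = dict(body_data)
    if data:
        # Only enrich a non-empty result, so an email with no relevant
        # content still yields an empty dict.
        for key, value in header_fields.items():
            data.setdefault(key, value)
    return data


headers = {"account": "Example Account", "maintenance_id": "TTN-0000000000"}

empty_result = add_header_fields({}, headers)
filled_result = add_header_fields({"status": "CONFIRMED"}, headers)
```

Without the `if data:` check, the first call would return a dict holding only header-derived fields, which could make an irrelevant email look like a partial maintenance notification.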

WIP - extend Zayo parser to handle additional examples

db6c65c

retnuh reviewed Nov 11, 2021

circuit_maintenance_parser/parsers/zayo.py (outdated)

chadell reviewed Nov 12, 2021

circuit_maintenance_parser/parsers/zayo.py (outdated)

circuit_maintenance_parser/parsers/zayo.py (outdated)

circuit_maintenance_parser/parsers/zayo.py (outdated)

glennmatthews added 2 commits November 12, 2021 10:22

Extract account, maintenance ID, and stamp from Zayo email headers

53e8c2c

Combine multiple windows in a single Zayo notification

2187262

glennmatthews marked this pull request as ready for review November 12, 2021 15:32

glennmatthews requested a review from pke11y as a code owner November 12, 2021 15:32

glennmatthews added 2 commits November 12, 2021 10:35

Forgot to make mypy happy

7970ecc

Ensure consistent ordering of data_parts

79382bc

chadell approved these changes Nov 12, 2021

glennmatthews merged commit 9d1c6b7 into develop Nov 15, 2021

glennmatthews deleted the gfm-improve-zayo branch November 15, 2021 17:01

glennmatthews added a commit that referenced this pull request Nov 17, 2021

Add CHANGELOG entries for #109 and #110

b5306a1
