Initial version of PINS P4Orch HLD #825

mint570 · 2021-07-29T20:56:49Z

High Level Design document for PINS P4Orch.

reshmaintel

------- The P4Orch is a new orchagent which lives inside the OrchAgent process and picks up the entries to make the corresponding SAI API calls to program the switch hardware. ------

This may be old info ? P4OrchAgent will write the entries to ASIC DB. The demux does not happen here but after ASICDB in libsai?

mint570 · 2021-08-23T18:38:26Z

------- The P4Orch is a new orchagent which lives inside the OrchAgent process and picks up the entries to make the corresponding SAI API calls to program the switch hardware. ------

This may be old info ? P4OrchAgent will write the entries to ASIC DB. The demux does not happen here but after ASICDB in libsai?

All orchs make SAI API calls, but they don't directly program the hardware. Sairedis is the library that orchagent uses to make the SAI call, and it will write to ASIC DB. It is done in the sairedis library.
Updated the doc to clarify that P4Orch writes into ASIC DB.

Not sure what "demux" means. Does it mean to separate P4 requests from existing SONiC requests? If so, there is no demux after ASIC DB. We define new table in APPL DB, but the SAI API and ASIC DB are unchanged.

prsunny · 2021-09-08T20:10:54Z

For Copp Traps, you can refer this section to disable a trap via config_db entry. So its not required to edit/modify copp_cfg.json. Just have the entry with empty/null attributes.

For creating a hostif, can you refer this flow? We can have a discussion if needed.

prsunny · 2021-09-15T15:58:29Z

doc/pins/p4orch_hld.md

+
+## Restrictions/Limitations
+
+The P4Orch is designed to meet the SDN requirements. And hence there are some differences from the other SONiC orchagents:


Also specify how the re-programming happens if lets say there is an swss crash/restart and APP_DB is cleared during init. Call out if there are any limitations.

Thanks.

Updated the "Restrictions/Limitations" section that P4Orch does not support warmboot and orchagent restart in the initial phase.

For my understanding, orchagent restart is similar to warmboot. In both case, the APPL DB tables won't be cleared. And orchagent will re-program the APPL DB tables in "init view" mode. This is not specific to P4Orch. It is a common feature for all orchs. Our main challenge for supporting warmboot now is that P4Orch doesn't do retry. During warmboot init, all table requests will come in a single batch. Since P4Orch doesn't do retry, dependency on SONiC table (such as port & vrf) can not be satisfied. We do not support warmboot in the initial phase.

orchagent restart is not similar to warmboot. APP_DB shall be flushed during init. In regular cases, the configs are reprogrammed by managers. In case of p4rt, does this leave the system in such a state? What actions to reprogram the device?

Sorry about the late reply. (I didn't use my primary email to setup my github account. Need to do a better job in that.)

In our use case, we don't expect the system to recover after orchagent crashes. The system will go to "critical state". The controller will be notified and starts to drain and reboot the switch. Currently, we don't have special handling in recovering P4Orch after crashing.

I would like to understand more on how SONiC handles the crashing. This will help us improve this in the future. I have a few questions on orchagent restarts in the existing SONiC:

You mentioned that the "configs are reprogrammed by managers". Are those the config managers that reads from CONFIG DB and writes to APPL DB? But for L3 forwarding, there is no CONFIG DB. Does anyone read the host routing and program them in APPL DB?

When swss container restarts, will syncd container also restarts and wipes the ASIC? If not, how will syncd handle the "duplicate" L3 forwarding requests after orchanger restarts?

Thanks.

Yes and for BGP, it re-learns the route and program APP_DB. orchagent restart will also restart bgp

Yes, syncd container restarts when orchagent restarts

Thanks. That's very helpful.

From my understanding, the existing SONiC behavior does not meet our requirement. Especially in syncd restart, which will wipe the ASIC. It will cause packet drop even if we re-program the rules. In our usage, we do not restart syncd and orchagent. If any of them crash, the ASIC will still function (but no new rules can be programmed). The system will be in a "critical state". The controller will be notified and starts to drain and reboot the switch.

I updated the HLD a little bit to clarify that we don't support orchagent restart for P4RT table yet.

mint570 · 2021-10-13T22:38:17Z

@prsunny @reshmaintel
PTAL
Thanks

mint570 · 2022-04-08T17:23:13Z

@prsunny
Is this ready to merged? Once this HLD is merged, it will fill the P4Orch reference doc in the PIN HLD: https://github.com/Azure/SONiC/blob/master/doc/pins/pins_hld.md#p4-orchagent

This doc also have reference to other un-merged HLDs:
#840
#846
They should be ready to merged as well. Thanks.

zhangyanzhao · 2023-02-28T05:05:50Z

@mint570 can you please sign the EasyCLA which is required to merge this PR? Thanks.

mint570 · 2023-02-28T19:48:28Z

Done. Just need to re-run the check.

mint570 marked this pull request as draft July 29, 2021 21:01

bocon13 mentioned this pull request Aug 13, 2021

PINS Upstream Tracking for MVP #841

Open

reshmaintel reviewed Aug 18, 2021

View reviewed changes

prsunny reviewed Sep 15, 2021

View reviewed changes

mint570 marked this pull request as ready for review October 13, 2021 22:37

reshmaintel approved these changes Nov 7, 2021

View reviewed changes

yxieca force-pushed the master branch 2 times, most recently from 8498931 to 8837dc2 Compare April 15, 2022 16:51

reshmaintel approved these changes Feb 28, 2023

View reviewed changes

PINS P4Orch HLD

d744045

mint570 force-pushed the master branch from a96aed2 to d744045 Compare May 16, 2024 18:45

prsunny approved these changes May 16, 2024

View reviewed changes

prsunny merged commit ebf4d61 into sonic-net:master May 16, 2024
1 check passed

kishanps mentioned this pull request May 18, 2024

SWSS Upstream of P4Orch changes #1614

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial version of PINS P4Orch HLD #825

Initial version of PINS P4Orch HLD #825

mint570 commented Jul 29, 2021 •

edited

Loading

reshmaintel left a comment

mint570 commented Aug 23, 2021

prsunny commented Sep 8, 2021

prsunny Sep 15, 2021

mint570 Sep 20, 2021

prsunny Sep 20, 2021

mint570 Sep 24, 2021

prsunny Sep 24, 2021 •

edited

Loading

mint570 Sep 24, 2021

mint570 commented Oct 13, 2021

mint570 commented Apr 8, 2022

zhangyanzhao commented Feb 28, 2023

mint570 commented Feb 28, 2023


		## Restrictions/Limitations

		The P4Orch is designed to meet the SDN requirements. And hence there are some differences from the other SONiC orchagents:

Initial version of PINS P4Orch HLD #825

Initial version of PINS P4Orch HLD #825

Conversation

mint570 commented Jul 29, 2021 • edited Loading

reshmaintel left a comment

Choose a reason for hiding this comment

mint570 commented Aug 23, 2021

prsunny commented Sep 8, 2021

prsunny Sep 15, 2021

Choose a reason for hiding this comment

mint570 Sep 20, 2021

Choose a reason for hiding this comment

prsunny Sep 20, 2021

Choose a reason for hiding this comment

mint570 Sep 24, 2021

Choose a reason for hiding this comment

prsunny Sep 24, 2021 • edited Loading

Choose a reason for hiding this comment

mint570 Sep 24, 2021

Choose a reason for hiding this comment

mint570 commented Oct 13, 2021

mint570 commented Apr 8, 2022

zhangyanzhao commented Feb 28, 2023

mint570 commented Feb 28, 2023

mint570 commented Jul 29, 2021 •

edited

Loading

prsunny Sep 24, 2021 •

edited

Loading