Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

watchfrr is not running #3196

Open
nikos-github opened this issue Jul 22, 2019 · 6 comments
Open

watchfrr is not running #3196

nikos-github opened this issue Jul 22, 2019 · 6 comments
Assignees

Comments

@nikos-github
Copy link
Collaborator

nikos-github commented Jul 22, 2019

This breaks existing functionality.

@xinliu-seattle
Copy link
Contributor

@nikos-github can you please add the repro steps ? thanks.

@nikos-github
Copy link
Collaborator Author

nikos-github commented Jul 23, 2019

@xinliu-seattle No repro steps needed. watchfrr must always run in frr and currently it's not running. That has been a day 0 requirement.

Before Guohan's changes:
root@sonic:/home/admin# docker exec -it bgp ps axf
PID TTY STAT TIME COMMAND
528 pts/3 Rs+ 0:00 ps axf
497 pts/2 Ss+ 0:00 bash
490 pts/1 Ss+ 0:00 bash
1 pts/0 Ss 0:00 /bin/sh -c /usr/bin/config.sh && /usr/bin/s
33 ? Ssl 0:41 /usr/sbin/rsyslogd
65 ? Ss 20:35 /usr/lib/frr/watchfrr -d -r /usr/lib/frr/watchfrr.sh
80 ? Ssl 6:31 /usr/lib/frr/zebra -d -A 127.0.0.1 -s 90000000 -M fpm
83 ? Ssl 12:22 /usr/lib/frr/bgpd -d -A 127.0.0.1
90 ? Ss 3:51 /usr/lib/frr/staticd -d -A 127.0.0.1
112 pts/0 Sl 0:00 fpmsyncd
113 pts/0 S+ 0:00 /bin/bash

After Guohan's changes:
root@sonic:/home/admin# docker exec -it bgp ps axf
PID TTY STAT TIME COMMAND
235 pts/1 Rs+ 0:00 ps axf
1 pts/0 Ss+ 2:00 /usr/bin/python /usr/bin/supervisord
59 pts/0 Sl 0:01 /usr/sbin/rsyslogd -n
64 pts/0 S 0:00 python /usr/bin/bgpcfgd
72 pts/0 Sl 0:03 /usr/lib/frr/zebra -A 127.0.0.1 -s 90000000 -M fpm -M
75 pts/0 S 0:00 /usr/lib/frr/staticd -A 127.0.0.1
78 pts/0 Sl 0:00 /usr/lib/frr/bgpd -A 127.0.0.1 -M snmp
84 pts/0 Sl 0:00 fpmsyncd (edited)

@lguohan
Copy link
Collaborator

lguohan commented Jul 23, 2019

why do we need watchfrr, we are using supervisord to manage frr. according to manual, watchfrr is for systemd integration.

@nikos-github
Copy link
Collaborator Author

nikos-github commented Jul 23, 2019

@lguohan @xinliu-seattle @zhenggen-xu @MichelMoriniaux @mslocrian When frr was introduced to sonic, supervisord wasn't there. Supervisord came much later and recently you transitioned frr to supervisord without review from the community or the key frr stakeholders and that is breaking sonic and frr functionality for many. All the functionality of watchfrr is being used and required by those running frr. In addition frr-reload has a requirement on watchfrr too for incremental non-disruptive config push.

@lguohan
Copy link
Collaborator

lguohan commented Jul 25, 2019

why does frr-reload require watchfrr?

@nikos-github
Copy link
Collaborator Author

nikos-github commented Jul 25, 2019

@lguohan @xinliu-seattle @zhenggen-xu @MichelMoriniaux @mslocrian frr-reload is part of frr-pythontools pkg which requires watchfrr in order to work properly. In addition, watchfrr provides integrated config functionality among other things through commands that are not available if watchfrr is not running. watchfrr has been running since day 1 when frr was introduced into sonic 2+ yrs back and it's a requirement.

mssonicbld added a commit that referenced this issue Mar 22, 2024
…atically (#18437)

#### Why I did it
src/sonic-utilities
```
* fb4a090d - (HEAD -> 202311, origin/202311) [config] Add Table hard dependency check (#3159) (4 hours ago) [jingwenxie]
* 86f3de58 - [ipintutil]Handle exception in show ip interfaces command (#3182) (4 hours ago) [Sudharsan Dhamal Gopalarathnam]
* 5bfc3b40 - [Techsupport]Handle SAI kv pair if present in sai common profile (#3196) (4 hours ago) [Sudharsan Dhamal Gopalarathnam]
* f4ef7681 - [fast/warm-reboot] Put ERR message in syslog when a failure is seen (#3186) (4 hours ago) [Vaibhav Hemant Dixit]
* db36df24 - [Bug] Fix fw_setenv illegel character issue (#3201) (4 hours ago) [xumia]
* a58b78c8 - [config] Add YANG alerting for override (#3188) (4 hours ago) [jingwenxie]
* 943a6846 - Update port2alias (#3217) (4 hours ago) [abdosi]
* 3236fbf4 - [show] Update show run all to cover all asic config in multiasic (#3148) (#3224) (5 hours ago) [mssonicbld]
```
#### How I did it
#### How to verify it
#### Description for the changelog
mssonicbld added a commit that referenced this issue Mar 28, 2024
…atically (#18240)

#### Why I did it
src/sonic-utilities
```
* bdc57206 - (HEAD -> master, origin/master, origin/HEAD) Revert "Fix for Switch Port Modes and VLAN CLI Enhancement (#3108)" (#3246) (89 minutes ago) [jingwenxie]
* e35452b7 - Modify "show interface transceiver status" CLI to show SW cmis state (#3238) (2 days ago) [mihirpat1]
* 04a33e1f - Add "state" field in CONFIG_DB a toggle of the fabric port monitor feature (#2932) (2 days ago) [jfeng-arista]
* 3c489ba5 - Enhance route-check for multi-asic platforms (#3216) (5 days ago) [Deepak Singhal]
* c149e48b - [chassis] Add chassis support for CLI "config qos reload" (#3233) (6 days ago) [wenyiz2021]
* d8541add - Update port2alias (#3217) (8 days ago) [abdosi]
* d4688a8f - [graceful reboot] Add the pre_reboot_hook script execution, add the watchdog arm before the reboot (#3203) (8 days ago) [Vadym Hlushko]
* 125f36f3 - [ipintutil]Handle exception in show ip interfaces command (#3182) (10 days ago) [Sudharsan Dhamal Gopalarathnam]
* 9d532017 - [chassis][show-runningconfig] Fix the show runningconfiguration all issue on the Supervisor (#3194) (2 weeks ago) [Marty Y. Lok]
* 1a9261ce - [Techsupport]Handle SAI kv pair if present in sai common profile (#3196) (2 weeks ago) [Sudharsan Dhamal Gopalarathnam]
* 7466dc4a - Skip the validation of action in acl-loader if capability table in STATE_DB is empty (#3199) (2 weeks ago) [bingwang-ms]
* b879b658 - [Bug] Fix fw_setenv illegel character issue (#3201) (3 weeks ago) [xumia]
* 0b41a560 - [config] Add YANG alerting for override (#3188) (3 weeks ago) [jingwenxie]
* 24683b0c - [show] multi-asic show running test residue (#3198) (3 weeks ago) [jingwenxie]
* 995a797a - CLI to skip polling for periodic information for a port in DomInfoUpdateTask thread (#3187) (3 weeks ago) [mihirpat1]
* 9aa9eaa5 - [config] Add Table hard dependency check (#3159) (3 weeks ago) [jingwenxie]
* 5f0ffcca - [fast/warm-reboot] Put ERR message in syslog when a failure is seen (#3186) (4 weeks ago) [Vaibhav Hemant Dixit]
* 92220dcf - Fix for Switch Port Modes and VLAN CLI Enhancement (#3108) (4 weeks ago) [Saba Akram]
```
#### How I did it
#### How to verify it
#### Description for the changelog
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants