Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[201811] Zebra daemon crash with FRR docker #3365

Open
tonytitus opened this issue Aug 19, 2019 · 1 comment
Open

[201811] Zebra daemon crash with FRR docker #3365

tonytitus opened this issue Aug 19, 2019 · 1 comment
Assignees

Comments

@tonytitus
Copy link
Contributor

tonytitus commented Aug 19, 2019

Description

[201811] Zebra daemon crash with FRR docker

Steps to reproduce the issue:

  1. T1 Topology
  2. reboot or systemctl restart swss

Describe the results you received:
Zebra daemon core dump

Describe the results you expected:
No core dumps

Additional information you deem important (e.g. issue happens only occasionally):

**Output of `show version`:**
 show version

SONiC Software Version: SONiC.SONiC201811_Innovium1.0-dirty-20190809.063016
Distribution: Debian 9.9
Kernel: 4.9.0-9-amd64
Build commit: a41d9a5
Build date: Fri Aug  9 16:31:17 UTC 2019
Built by: admin@sjc-prd-d77

/var/log/quagga/zebra.log

Aug 19 13:29:06.181218 sonic WARNING bgp#zebra[76]: 0:20c0:a836:0:d1::/64: Route install failed
Aug 19 13:29:06.181271 sonic ERR bgp#zebra[76]: netlink-cmd (NS 0) error: No such device, type=RTM_NEWROUTE(24), seq=268552, pid=4059903160
Aug 19 13:29:06.181271 sonic WARNING bgp#zebra[76]: 0:20c0:a836:0:e0::/64: Route install failed
Aug 19 13:29:06.181271 sonic ERR bgp#zebra[76]: netlink-cmd (NS 0) error: No such device, type=RTM_NEWROUTE(24), seq=268553, pid=4059903160
Aug 19 13:29:06.181310 sonic WARNING bgp#zebra[76]: 0:20c0:a836:0:e1::/64: Route install failed
Aug 19 13:29:06.181422 sonic CRIT bgp#zebra[76]: Assertion `data_len' failed in file zebra/zebra_fpm.c, line 927, function zfpm_build_updates
Aug 19 13:29:06.181546 sonic CRIT bgp#zebra[76]: Backtrace for 8 stack frames:
Aug 19 13:29:06.181672 sonic CRIT bgp#zebra[76]: [bt 0] /usr/lib/libfrr.so.0(zlog_backtrace+0x3a) [0x7fbd4511d97a]
Aug 19 13:29:06.181717 sonic CRIT bgp#zebra[76]: [bt 1] /usr/lib/libfrr.so.0(_zlog_assert_failed+0x61) [0x7fbd4511ded1]
Aug 19 13:29:06.181717 sonic CRIT bgp#zebra[76]: [bt 2] /usr/lib/frr/modules/zebra_fpm.so(+0x3f47) [0x7fbd4241af47]
Aug 19 13:29:06.181717 sonic CRIT bgp#zebra[76]: [bt 3] /usr/lib/libfrr.so.0(thread_call+0x60) [0x7fbd4513d790]
Aug 19 13:29:06.181789 sonic CRIT bgp#zebra[76]: [bt 4] /usr/lib/libfrr.so.0(frr_run+0xb8) [0x7fbd4511b5c8]
Aug 19 13:29:06.181789 sonic CRIT bgp#zebra[76]: [bt 5] /usr/lib/frr/zebra(main+0x2a0) [0x55a053a08750]
Aug 19 13:29:06.181789 sonic CRIT bgp#zebra[76]: [bt 6] /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf5) [0x7fbd43bc0b45]
Aug 19 13:29:06.181801 sonic CRIT bgp#zebra[76]: [bt 7] /usr/lib/frr/zebra(+0x14d4e) [0x55a053a08d4e]
Aug 19 13:29:06.181801 sonic CRIT bgp#zebra[76]: Current thread function zfpm_write_cb, scheduled from file zebra/zebra_fpm.c, line 452
@tonytitus tonytitus changed the title [201811] Zebra daemon crash [201811] Zebra daemon crash with FRR docker Aug 21, 2019
@pavel-shirshov
Copy link
Contributor

@tonytitus:
It's hard to say what was the trigger for FRR crash.
You could ask FRR guys here, but from my point of view you need to provide more information or debug by yourself.
First, you need to understand why you have netlink No such device issue.
Can you find another errors in the syslog?

mssonicbld added a commit that referenced this issue Jun 19, 2024
…atically (#19345)

#### Why I did it
src/sonic-utilities
```
* 3eb462f3 - (HEAD -> 202305, origin/202305) Improve load_mingraph to wait eth0 restart before exit (#3365) (#3371) (83 minutes ago) [Hua Liu]
```
#### How I did it
#### How to verify it
#### Description for the changelog
mssonicbld added a commit that referenced this issue Jun 23, 2024
…atically (#19369)

#### Why I did it
src/sonic-utilities
```
* 4f7825a7 - (HEAD -> 202311, origin/202311) Improve load_mingraph to wait eth0 restart before exit (#3365) (#3372) (34 hours ago) [Hua Liu]
* 2d7c6cd0 - Fix double hex to decimal conversion (#3267) (2 days ago) [Yuanzhe]
* 06fcbc78 - Update TRANSCEIVER_FIRMWARE_INFO table for all targets in sfputil (#3370) (2 days ago) [mihirpat1]
```
#### How I did it
#### How to verify it
#### Description for the changelog
mssonicbld added a commit that referenced this issue Jun 25, 2024
…atically (#19333)

#### Why I did it
src/sonic-utilities
```
* 0e6a55ef - (HEAD -> master, origin/master, origin/HEAD) [fast-reboot] Backup database after syncd/swss stopped (#3342) (20 hours ago) [Stepan Blyshchak]
* c51758df - fix: fix show bgp summary output typo (#3375) (25 hours ago) [Chenyang Wang]
* cf7bfa29 - Add the definition of `log` in `script decode-syseeprom` (#3383) (4 days ago) [Yutong Zhang]
* 9d206af7 - [chassis][mutli-asic][lldp] "show lldp table/neighbors" displays error message in output on multi-asis linecard. (#3358) (5 days ago) [Marty Y. Lok]
* 515265a5 - Update TRANSCEIVER_FIRMWARE_INFO table for all targets in sfputil (#3370) (6 days ago) [mihirpat1]
* 3df762f1 - [config] no op if Golden Config is invalid (#3367) (7 days ago) [jingwenxie]
* 8f715acf - [config]Support single file reload for multiasic (#3349) (7 days ago) [jingwenxie]
* 31f5fa8e - Improve load_mingraph to wait eth0 restart before exit (#3365) (7 days ago) [Hua Liu]
```
#### How I did it
#### How to verify it
#### Description for the changelog
arun1355492 pushed a commit to arun1355492/sonic-buildimage that referenced this issue Jul 26, 2024
…atically (sonic-net#19333)

#### Why I did it
src/sonic-utilities
```
* 0e6a55ef - (HEAD -> master, origin/master, origin/HEAD) [fast-reboot] Backup database after syncd/swss stopped (sonic-net#3342) (20 hours ago) [Stepan Blyshchak]
* c51758df - fix: fix show bgp summary output typo (sonic-net#3375) (25 hours ago) [Chenyang Wang]
* cf7bfa29 - Add the definition of `log` in `script decode-syseeprom` (sonic-net#3383) (4 days ago) [Yutong Zhang]
* 9d206af7 - [chassis][mutli-asic][lldp] "show lldp table/neighbors" displays error message in output on multi-asis linecard. (sonic-net#3358) (5 days ago) [Marty Y. Lok]
* 515265a5 - Update TRANSCEIVER_FIRMWARE_INFO table for all targets in sfputil (sonic-net#3370) (6 days ago) [mihirpat1]
* 3df762f1 - [config] no op if Golden Config is invalid (sonic-net#3367) (7 days ago) [jingwenxie]
* 8f715acf - [config]Support single file reload for multiasic (sonic-net#3349) (7 days ago) [jingwenxie]
* 31f5fa8e - Improve load_mingraph to wait eth0 restart before exit (sonic-net#3365) (7 days ago) [Hua Liu]
```
#### How I did it
#### How to verify it
#### Description for the changelog
liushilongbuaa pushed a commit to liushilongbuaa/sonic-buildimage that referenced this issue Aug 1, 2024
…atically (sonic-net#19333)

#### Why I did it
src/sonic-utilities
```
* 0e6a55ef - (HEAD -> master, origin/master, origin/HEAD) [fast-reboot] Backup database after syncd/swss stopped (sonic-net#3342) (20 hours ago) [Stepan Blyshchak]
* c51758df - fix: fix show bgp summary output typo (sonic-net#3375) (25 hours ago) [Chenyang Wang]
* cf7bfa29 - Add the definition of `log` in `script decode-syseeprom` (sonic-net#3383) (4 days ago) [Yutong Zhang]
* 9d206af7 - [chassis][mutli-asic][lldp] "show lldp table/neighbors" displays error message in output on multi-asis linecard. (sonic-net#3358) (5 days ago) [Marty Y. Lok]
* 515265a5 - Update TRANSCEIVER_FIRMWARE_INFO table for all targets in sfputil (sonic-net#3370) (6 days ago) [mihirpat1]
* 3df762f1 - [config] no op if Golden Config is invalid (sonic-net#3367) (7 days ago) [jingwenxie]
* 8f715acf - [config]Support single file reload for multiasic (sonic-net#3349) (7 days ago) [jingwenxie]
* 31f5fa8e - Improve load_mingraph to wait eth0 restart before exit (sonic-net#3365) (7 days ago) [Hua Liu]
```
#### How I did it
#### How to verify it
#### Description for the changelog
mssonicbld added a commit that referenced this issue Aug 3, 2024
…atically (#19806)

#### Why I did it
src/sonic-utilities
```
* cee3e8e0 - (HEAD -> 202405, origin/202405) Enable show ip bgp on sup and -n all for show ip bgp network (#3417) (4 hours ago) [Changrong Wu]
* 26f5d757 - Add the definition of `log` in `script decode-syseeprom` (#3383) (4 hours ago) [Yutong Zhang]
* af18c89c - Improve load_mingraph to wait eth0 restart before exit (#3365) (4 hours ago) [Hua Liu]
* 73e12564 - [chassis][mutli-asic][lldp] "show lldp table/neighbors" displays error message in output on multi-asis linecard. (#3358) (4 hours ago) [Marty Y. Lok]
* 69dab7ac - [pbh]: Fix show PBH counters when cache is partial (#3356) (4 hours ago) [Nazarii Hnydyn]
* ae20defd - [fast-reboot] Backup database after syncd/swss stopped (#3342) (4 hours ago) [Stepan Blyshchak]
* a9876ee6 - Add Checkpoint and Rollback for Multi ASIC. (#3299) (4 hours ago) [Xincun Li]
* 7643ac3c - [DPB]Fixing return code of breakout command on failure (#3357) (4 hours ago) [Sudharsan Dhamal Gopalarathnam]
```
#### How I did it
#### How to verify it
#### Description for the changelog
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants