Page MenuHomePhabricator

FRR crashing triggered by RPKI
Open, Requires assessmentPublicBUG

Description

I currently have a problem with FRR crashing in combination with RPKI. The router is running VyOS 1.2.4-epa1 but a similar error occured when running 1.2.3. The same configuration did not show any errors with version 1.2.2. The RPKI validator used in the backend is Routinator.

Dec 13 08:39:09 rt-1 bgpd[1209]: [EC 100663314] Attempting to process an I/O event but for fd: 45(4) no thread to handle this!
Dec 13 08:43:01 rt-1 bgpd[1209]: Received signal 11 at 1576222981 (si_addr 0x2, PC 0x55aa62e883a5); aborting...
Dec 13 08:43:01 rt-1 bgpd[1209]: Backtrace for 11 stack frames:
Dec 13 08:43:01 rt-1 bgpd[1209]: /usr/lib/x86_64-linux-gnu/frr/libfrr.so.0(zlog_backtrace_sigsafe+0x67) [0x7f9e60e523f7]
Dec 13 08:43:01 rt-1 bgpd[1209]: /usr/lib/x86_64-linux-gnu/frr/libfrr.so.0(zlog_signal+0x113) [0x7f9e60e52853]e60e523f7]
Dec 13 08:43:01 rt-1 bgpd[1209]: /usr/lib/x86_64-linux-gnu/frr/libfrr.so.0(+0x71305) [0x7f9e60e73305]60e52853]e60e523f7]
Dec 13 08:43:01 rt-1 bgpd[1209]: /lib/x86_64-linux-gnu/libpthread.so.0(+0xf890) [0x7f9e5fc7c890]3305]60e52853]e60e523f7]
Dec 13 08:43:01 rt-1 bgpd[1209]: /usr/lib/frr/bgpd(bgp_table_range_lookup+0x65) [0x55aa62e883a5]3305]60e52853]e60e523f7]
Dec 13 08:43:01 rt-1 bgpd[1209]: /usr/lib/x86_64-linux-gnu/frr/modules/bgpd_rpki.so(+0x5042) [0x7f9e5c0d7042]]e60e523f7]
Dec 13 08:43:01 rt-1 bgpd[1209]: /usr/lib/x86_64-linux-gnu/frr/libfrr.so.0(thread_call+0x60) [0x7f9e60e80b20]]e60e523f7]
Dec 13 08:43:01 rt-1 bgpd[1209]: /usr/lib/x86_64-linux-gnu/frr/libfrr.so.0(frr_run+0xd8) [0x7f9e60e505d8]b20]]e60e523f7]
Dec 13 08:43:01 rt-1 bgpd[1209]: /usr/lib/frr/bgpd(main+0x2ff) [0x55aa62e32b4f]run+0xd8) [0x7f9e60e505d8]b20]]e60e523f7]
Dec 13 08:43:01 rt-1 bgpd[1209]: /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf5) [0x7f9e5f8e3b45]b20]]e60e523f7]
Dec 13 08:43:01 rt-1 bgpd[1209]: /usr/lib/frr/bgpd(+0x3cb6c) [0x55aa62e34b6c]_main+0xf5) [0x7f9e5f8e3b45]b20]]e60e523f7]
Dec 13 08:43:01 rt-1 bgpd[1209]: in thread bgpd_sync_callback scheduled from bgpd/bgp_rpki.c:509#012883a5); aborting...
Dec 13 08:43:01 rt-1 watchfrr[1154]: [EC 268435457] bgpd state -> down : read returned EOF
Dec 13 08:43:01 rt-1 watchfrr[1154]: bgpd state -> up : connect succeeded
Dec 13 08:43:01 rt-1 zebra[1202]: [EC 4043309116] Client 'vnc' encountered an error and is shutting down.
Dec 13 08:43:01 rt-1 watchfrr[1154]: [EC 268435457] bgpd state -> down : unexpected read error: Connection reset by peer
Dec 13 08:43:01 rt-1 zebra[1202]: [EC 4043309116] Client 'bgp' encountered an error and is shutting down.
Dec 13 08:43:01 rt-1 zebra[1202]: client 30 disconnected. 0 vnc routes removed from the rib
Dec 13 08:43:01 rt-1 zebra[1202]: client 27 disconnected. 77337 bgp routes removed from the rib
Dec 13 08:43:06 rt-1 watchfrr[1154]: [EC 100663303] Forked background command [pid 4044]: /usr/lib/frr/watchfrr.sh restart bgpd
Dec 13 08:43:06 rt-1 zebra[1202]: client 27 says hello and bids fair to announce only bgp routes vrf=0
Dec 13 08:43:06 rt-1 zebra[1202]: client 30 says hello and bids fair to announce only vnc routes vrf=0
Dec 13 08:43:06 rt-1 watchfrr[1154]: bgpd state -> up : connect succeeded

Details

Difficulty level
Unknown (require assessment)
Version
1.2.4-epa1
Why the issue appeared?
Will be filled on close
Is it a breaking change?
Perfectly compatible

Event Timeline

MrXermon created this task.Dec 13 2019, 7:55 AM

Hi @MrXermon
Can you describe how we can reproduce this bug?
Can you show share your configuration?

Actually i'm currently unable to reproduce the bug because since i removed the configuration for RPKI everything works fine. Even more interesting is that my second router with the exact same configuration does not have the problem.