Discovered substitute servers not removed when they go offline

  • Open
  • quality assurance status badge
Details
One participant
  • Ludovic Courtès
Owner
unassigned
Submitted by
Ludovic Courtès
Severity
important
L
L
Ludovic Courtès wrote on 28 Feb 2024 00:05
(address . bug-guix@gnu.org)
871q8xh5ja.fsf@inria.fr
Hello,

Substitute servers discovered over Avahi don’t disappear when they go
off-line: they remain in /var/guix/discover/publish “forever” (?) and
‘guix substitute’ keeps trying to connect to them, in vain.

The problem AIUI is that the mDNS records are still valid according to
their TTL and thus cached by avahi-daemon; we can only notice that
things go wrong when trying to resolve:

Toggle snippet (7 lines)
$ avahi-browse _guix_publish._tcp -r
+ enp0s31f6 IPv6 guix-publish-xyz _guix_publish._tcp local
+ enp0s31f6 IPv4 guix-publish-xyz _guix_publish._tcp local
Failed to resolve service 'guix-publish-xyz' of type '_guix_publish._tcp' in domain 'local': Timeout reached
Failed to resolve service 'guix-publish-xyz' of type '_guix_publish._tcp' in domain 'local': Timeout reached

(One way to reproduce that is to ‘pkill -9 avahi-daemon’ on the machine
that advertises the service.)

Problem is that ‘avahi-browse-service-thread’ attempts to resolve
services once for all; thus it never notices that services went away, at
least not until some long TTL has expired.

Ludo’.
L
L
Ludovic Courtès wrote on 4 Apr 2024 23:36
control message for bug #69442
(address . control@debbugs.gnu.org)
87bk6ou7xv.fsf@gnu.org
severity 69442 important
quit
?
Your comment

Commenting via the web interface is currently disabled.

To comment on this conversation send an email to 69442@debbugs.gnu.org

To respond to this issue using the mumi CLI, first switch to it
mumi current 69442
Then, you may apply the latest patchset in this issue (with sign off)
mumi am -- -s
Or, compose a reply to this issue
mumi compose
Or, send patches to this issue
mumi send-email *.patch