DNS Cluster Sync incomplete

I have a main server with about 200 domains, most of them imported from a previous instance. There is a slave with everything setup according to the docs (DNS clusters and DNSSEC | Hestia Control Panel), checked thrice and rebooted multiple times, but the slave only receives about 20 domains, and querying it for other domains it refuses.
Both servers have a public IPv4.

Example:

❯ drill ftt.gmbh @ns1.iridion.it
;; ->>HEADER<<- opcode: QUERY, rcode: NOERROR, id: 23232
;; flags: qr aa rd ; QUERY: 1, ANSWER: 1, AUTHORITY: 0, ADDITIONAL: 0 
;; QUESTION SECTION:
;; ftt.gmbh.	IN	A

;; ANSWER SECTION:
ftt.gmbh.	300	IN	A	89.58.52.124

;; AUTHORITY SECTION:

;; ADDITIONAL SECTION:

;; Query time: 28 msec
;; SERVER: 89.58.52.124
;; WHEN: Mon Nov  6 22:59:31 2023
;; MSG SIZE  rcvd: 42

❯ drill ftt.gmbh @ns2.iridion.it
;; ->>HEADER<<- opcode: QUERY, rcode: REFUSED, id: 37833
;; flags: qr rd ; QUERY: 1, ANSWER: 0, AUTHORITY: 0, ADDITIONAL: 0 
;; QUESTION SECTION:
;; ftt.gmbh.	IN	A

;; ANSWER SECTION:

;; AUTHORITY SECTION:

;; ADDITIONAL SECTION:

;; Query time: 31 msec
;; SERVER: 152.89.105.120
;; WHEN: Mon Nov  6 22:59:33 2023
;; MSG SIZE  rcvd: 26

Found a workaround by executing v-add-remote-dns-domain for each domain of each user, but that does not feel right

v-sync-dns-cluster should do the trick…

But it does more or less the same …

no it did not, that is the odd thing, as said I executed it multiple times on the master node

It is designed to do that and loop trough all the domains…

And then do send the records to the new server:

And

Does the same …

This problem is unfortunately persisting.
v-sync-dns-cluster runs for a long time, but domains are missing.

If I call v-add-remote-dns-domain, I get a connection error even though the connection works just fine:

$ v-add-remote-dns-domain USER DOMAIN
Error: 152.89.105.XXX connection failed

@dpin I don’t know what happened but it seems fine again maybe an update?

I have had the same issue, at least 2 times a week. Everything works again with a manual sync

v-sync-dns-cluster 

now it was out of sync again, I ran v-sync-dns-cluster and it said Error: connection failed but now things are back in sync, curious

now I am getting servfails for all new domains - one thing to keep in mind is to regularly update your ns2 as well

probaly have a look at bind entries in syslog, sync works properly over here, there should be no need to run v-sync-dns-cluster.

now I updated my ns2 after a while and ended up in a very curious situation where ns2 refuses any query while ns1 still returns a servfail for new domains, even after dns-domain-rebuilds and restarts

EDIT: the ns2 issue was a temporary thing, the servfail for the new domain persists and there are no relevant logs I can find

SERVFAIL fixed, see [Feature] SRV records in the DNS zone editor · Issue #2645 · hestiacp/hestiacp · GitHub