Hello the Community,
my installation consists of 10 e510 in the following setup:
-A public access network (vlan11) and a management network (vlan92) are routed to the internet via a linux router as shown at network_topology.png
-E510s are configured using cnmaestro. See vlan settings at cnmaestro_vlan_settings.png.
After about a week of normal operation one e510 went offline without any apparent reason. The next day, a 2h power cut brought the whole network down. After power was restored, the device reappeared fully operational BUT, another e510 went offline...
Trying to realise what is going on, I run netdiscover and tcpdump on the linux router. On both cases, I had the following results:
a) the offline AP is discoverable with netdiscover, at the management network interface (eth1)
b) the offline AP shows up with THREE IP addresses (all at the same MAC, the e510 ethernet MAC): its expected IP address on the management network, an IP address from the public access network and 192.168.0.1. I must repeat that all 3 IPs are found on eth1 which is the management network, which is not directly connected to the public access network.
c) the offline AP is continuously trying to communicate with the router (172.20.2.2) but rather fails to receive the replies as can bee seen at the tcpdump output (172.20.2.78 is the offline e510):
19:39:42.728061 ARP, Request who-has 172.20.2.2 tell 172.20.2.78, length 46
19:39:42.728074 ARP, Reply 172.20.2.2 is-at 00:0c:29:XX:XX:XX (oui Unknown), length 28
...<ignoring irrelevant capture>...
19:39:43.720053 ARP, Request who-has 172.20.2.2 tell 172.20.2.78, length 46
19:39:43.720078 ARP, Reply 172.20.2.2 is-at 00:0c:29:XX:XX:XX (oui Unknown), length 28
...<ignoring irrelevant capture>...
19:39:59.733948 ARP, Request who-has 172.20.2.2 tell 172.20.2.78, length 46
19:39:59.733961 ARP, Reply 172.20.2.2 is-at 00:0c:29:XX:XX:XX (oui Unknown), length 28
...<ignoring irrelevant capture>...
19:40:00.726303 ARP, Request who-has 172.20.2.2 tell 172.20.2.78, length 46
19:40:00.726313 ARP, Reply 172.20.2.2 is-at 00:0c:29:XX:XX:XX (oui Unknown), length 28
19:40:01.734339 ARP, Request who-has 172.20.2.2 tell 172.20.2.78, length 46
19:40:01.734362 ARP, Reply 172.20.2.2 is-at 00:0c:29:XX:XX:XX (oui Unknown), length 28
d) the offline AP does not respond to ping requests on its proper IP on the management network nor is reachable by any other mean.
Can anyone help me understand what is going on there? My feeling is that it has to do with vlans and the way they are implemented at e510s, which does not seem to be the way I would expect.
Before I thank you for taking the time to consider this weird behaviour, please take in to account this these 10 APs are an extension to a network with another 17 ubiquiti APs, operating seamlessly for 4 years now. I really do not expect the problem having to do with the rest of the infrastructure.
Thank you very much in advance!
Niko
- network_topology.png (22.1 KB)
- cnmaestro_vlan_settings.png (22 KB)