Hey all,
This morning we ran some upgrades on one of our sites from the regular 4.7.0.1 version to 4.7.0.1STA. We have done this upgrade on over a thousand devices already, and have not had any issues. However, this morning there were 10 SMs that did not come back online…
We had the customers power cycle their equipment, and it did not bring them back up. We just sent out a tech to one of them, and while he was getting dhcp from the SM properly (so it was not defaulted) he was unable to get into the radio on either the IP we assign them or via the backdoor IP. We had him perform a factory reset on the radio, and he was unable to get to it via the backdoor OR its default 192 address. According to the tech, the SM at their location is boot looping every 15 seconds.
These are all Force 200 5GHz radios. I wouldn’t have been too surprised if a bunch of elevated units had decided to die on us, but I have never had F200s behave like this after a software upgrade, especially when the only difference in the firmware is whether or not it is the STA version. What’s really interesting is that the firmware update job reports like 3 different statuses for these radios, it looks like some got the upgrade successfully and some did not, but none of them came back afterwards.
We already tried rebooting the AP, reverting the firmware on the AP to non-STA 4.7.0.1, changing frequencies on the AP, just about everything we could think of. It really seems to just be an issue with the upgrade procedure on the SMs themselves. At this point we are going to have to replace all of these units, some of which are in trees (so we will have to eat the climber cost as well…)
Any ideas, anybody had this happen before and managed to get the SM back to working order? It would be really crappy to have to replace 10 radios because of a routine software upgrade…