I fixed part of the problem. There is a setting in the Mikrotik router called "Keep Alive timout" that causes the rotuer to check and see if the client is still there every X seconds and if it isn't then the router drops the session. Since the canopy radios default setting is to not reply to pings at the WAN interface the mikrotik was bumping them off anytime they were idle for more than 20 seconds. I turned that function off and found that if I set the canopy radio to "kee alive" then SOME of the radios will maintain PPPoE sessions for days at a time. This also seems to keep the radios from dropping PPPoE sessions even when they were active and now it looks like they only drop the session when idle for random amounts of time.
If I set the customer radio to "Idle time out" or "Keep Alive" and I enable the "Reply to Ping on WAN Interface" then I start Pinging the WAN interfacthe the radio will maintain a PPPoE session for as long as I am pinging it.
I configured the Mikrotik's PPPoE session idle time out to 2 days (I even turned it off) and it has no affect on the problem. The "Keep alive timout" is also disabled on the mikrotik. So as far as I can tell there is no reason (assuming the mikrotik is working correctly) for it to terminate a PPPoE session. If I ping the WAN interface on the radio the radio will maintain the PPPoE session for as long as it is being pinged.
I guess at this point I need to figure out if the canopy radios are terminating the session or the if it is the Mikrotik is doing it. I don't see anything in the Mikrotik logs that appears to be the mikrotik terminating the sessions and I don't understand the canopy's logs.
Below are the logs from a radio that had just restarted a PPPoE session about 20 seconsds before I copy/pasted the logs. Can someone look at these logs and tell me the reason the PPoE session ended ? Thanks
12/22/2014 : 11:01:56 CST : rcvd PPPoE data pkt, type 0xc021
12/22/2014 : 11:01:56 CST : link 0xb420dc rcvd:
12/22/2014 : 11:01:56 CST : c0 21 05 02 00 11 6c 69 6e 6b 20 69 6e 61 63 74 69 76 65 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
12/22/2014 : 11:01:56 CST : ppp_inpkt (link 0xb420dc); prot: LCP, code TERMREQ, state OPENED, len 34
12/22/2014 : 11:01:56 CST : ppp_fsm (link 0xb420dc): LCP state: OPENED -> STOPPING
12/22/2014 : 11:01:56 CST : LCP down.
12/22/2014 : 11:01:56 CST : close (link 0xb420dc): IPCP
12/22/2014 : 11:01:56 CST : ppp_fsm (link 0xb420dc): IPCP state: OPENED -> CLOSING
12/22/2014 : 11:01:56 CST : ipcp: down
12/22/2014 : 11:01:56 CST : ppp_fsm (link 0xb420dc) sending TERMREQ, len:0
12/22/2014 : 11:01:56 CST : link 0xb420dc sending:
12/22/2014 : 11:01:56 CST : 80 21 05 11 00 04
12/22/2014 : 11:01:56 CST : kicking off timer, oldstate 9 newstate 4
12/22/2014 : 11:01:56 CST : lowerdown (link 0xb420dc): IPCP
12/22/2014 : 11:01:56 CST : ppp_fsm (link 0xb420dc): IPCP state: CLOSING -> INITIAL
12/22/2014 : 11:01:56 CST : UPAP down.
12/22/2014 : 11:01:56 CST : poe_ln_disconnect: Disconnecting
12/22/2014 : 11:01:56 CST : poe_disconnect on link 0xb420dc, pline 0xb4c3bc
12/22/2014 : 11:01:56 CST : poe_sendctl code 0xa7, state:SESSION->CLOSED, tags:
12/22/2014 : 11:01:56 CST : close (link 0xb420dc): LCP
12/22/2014 : 11:01:56 CST : ppp_fsm (link 0xb420dc): LCP state: STOPPING -> CLOSING
12/22/2014 : 11:01:56 CST : lowerdown (link 0xb420dc): LCP
12/22/2014 : 11:01:56 CST : ppp_fsm (link 0xb420dc): LCP state: CLOSING -> INITIAL
12/22/2014 : 11:01:56 CST : LCP (0xb420dc) failed, closing IPCP
12/22/2014 : 11:01:56 CST : close (link 0xb420dc): IPCP
12/22/2014 : 11:01:56 CST : ppp_fsm (link 0xb420dc): IPCP state: INITIAL -> INITIAL
12/22/2014 : 11:01:56 CST : ppp_fsm (link 0xb420dc) sending TERMACK, len:13
12/22/2014 : 11:01:56 CST : link 0xb420dc sending:
12/22/2014 : 11:01:56 CST : c0 21 06 02 00 11 6c 69 6e 6b 20 69 6e 61 63 74 69 76 65
12/22/2014 : 11:01:56 CST : ppp_allpktsend (0xb420dc): lost line
12/22/2014 : 11:01:57 CST : CleanupCallback (PPPoE Server Driven)
12/22/2014 : 11:01:58 CST : PPPoE reconnect attempt in 3 seconds. (not idle)
12/22/2014 : 11:02:01 CST : start_ppp_link: Link No Longer Idle Attempt Connect
12/22/2014 : 11:02:01 CST : Connect (Link No Longer Idle)
12/22/2014 : 11:02:01 CST : open (link 0xb420dc): IPCP
12/22/2014 : 11:02:01 CST : ppp_fsm (link 0xb420dc): IPCP state: INITIAL -> STARTING
12/22/2014 : 11:02:01 CST : ppp_fsm_init (link 0xb420dc)
12/22/2014 : 11:02:01 CST : open (link 0xb420dc): LCP
12/22/2014 : 11:02:01 CST : ppp_fsm (link 0xb420dc): LCP state: INITIAL -> STARTING
12/22/2014 : 11:02:01 CST : poe_sendctl code 0x09, state:INIT->PADISENT, tags:
12/22/2014 : 11:02:01 CST : poe_rxctlcode code 0x7
12/22/2014 : 11:02:01 CST : poe_sendctl code 0x19, state:PADISENT->PADRSENT, tags:
12/22/2014 : 11:02:01 CST : poe_rxctlcode code 0x65
12/22/2014 : 11:02:01 CST : lowerup: (link 0xb420dc) LCP
12/22/2014 : 11:02:01 CST : ppp_fsm (link 0xb420dc): LCP state: STARTING -> REQSENT
12/22/2014 : 11:02:01 CST : ppp_fsm (link 0xb420dc) sending CONFREQ, len:10
12/22/2014 : 11:02:01 CST : link 0xb420dc sending:
12/22/2014 : 11:02:01 CST : c0 21 01 06 00 0e 01 04 05 d4 05 06 00 b4 20 e8
12/22/2014 : 11:02:01 CST : kicking off timer, oldstate 1 newstate 6
12/22/2014 : 11:02:02 CST : rcvd PPPoE data pkt, type 0xc021
12/22/2014 : 11:02:02 CST : link 0xb420dc rcvd:
12/22/2014 : 11:02:02 CST : c0 21 01 01 00 12 03 04 c0 23 01 04 05 d4 05 06 8a 4b f4 5f 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
12/22/2014 : 11:02:02 CST : ppp_inpkt (link 0xb420dc); prot: LCP, code CONFREQ, state REQSENT, len 34
12/22/2014 : 11:02:02 CST : lcp_reqci: rcvd AUTHTYPE
12/22/2014 : 11:02:02 CST : (c023)
12/22/2014 : 11:02:02 CST : (ACK)
12/22/2014 : 11:02:02 CST : lcp_reqci: rcvd MRU
12/22/2014 : 11:02:02 CST : (1492)
12/22/2014 : 11:02:02 CST : (ACK)
12/22/2014 : 11:02:02 CST : lcp_reqci: rcvd MAGICNUMBER
12/22/2014 : 11:02:02 CST : (8a4bf45f)
12/22/2014 : 11:02:02 CST : (ACK)
12/22/2014 : 11:02:02 CST : lcp_reqci: returning CONFACK.
12/22/2014 : 11:02:02 CST : ppp_fsm (link 0xb420dc): LCP state: REQSENT -> ACKSENT
12/22/2014 : 11:02:02 CST : ppp_fsm (link 0xb420dc) sending CONFACK, len:14
12/22/2014 : 11:02:02 CST : link 0xb420dc sending:
12/22/2014 : 11:02:02 CST : c0 21 02 01 00 12 03 04 c0 23 01 04 05 d4 05 06 8a 4b f4 5f
12/22/2014 : 11:02:02 CST : rcvd PPPoE data pkt, type 0xc021
12/22/2014 : 11:02:02 CST : link 0xb420dc rcvd:
12/22/2014 : 11:02:02 CST : c0 21 02 06 00 0e 01 04 05 d4 05 06 00 b4 20 e8 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
12/22/2014 : 11:02:02 CST : ppp_inpkt (link 0xb420dc); prot: LCP, code CONFACK, state ACKSENT, len 34
12/22/2014 : 11:02:02 CST : LCP rcvd ACK: OK
12/22/2014 : 11:02:02 CST : ppp_fsm (link 0xb420dc): LCP state: ACKSENT -> OPENED
12/22/2014 : 11:02:02 CST : lowerup: (link 0xb420dc) AUTH
12/22/2014 : 11:02:02 CST : link 0xb420dc sending:
12/22/2014 : 11:02:02 CST : c0 23 01 01 00 1c 0e 63 77 74 74 68 5f 69 6e 73 70 65 63 74 31 08 73 70 61 72 6b 6c 65 73
12/22/2014 : 11:02:02 CST : upap_sauth: Sent id 1.
12/22/2014 : 11:02:02 CST : rcvd PPPoE data pkt, type 0xc023
12/22/2014 : 11:02:02 CST : link 0xb420dc rcvd:
12/22/2014 : 11:02:02 CST : c0 23 02 00 00 0d 08 4c 6f 67 69 6e 20 6f 6b 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
12/22/2014 : 11:02:02 CST : upap_input link b420dc: code: AUTHACK, id 0, len 13
12/22/2014 : 11:02:02 CST : Login ok
12/22/2014 : 11:02:02 CST : lowerup: (link 0xb420dc) IPCP
12/22/2014 : 11:02:02 CST : ppp_fsm (link 0xb420dc): IPCP state: STARTING -> REQSENT
12/22/2014 : 11:02:02 CST : ipcp_addci: ouraddr==0.0.0.0
12/22/2014 : 11:02:02 CST : ppp_fsm (link 0xb420dc) sending CONFREQ, len:18
12/22/2014 : 11:02:02 CST : link 0xb420dc sending:
12/22/2014 : 11:02:02 CST : 80 21 01 12 00 16 03 06 00 00 00 00 81 06 00 00 00 00 83 06 00 00 00 00
12/22/2014 : 11:02:02 CST : kicking off timer, oldstate 1 newstate 6
12/22/2014 : 11:02:02 CST : rcvd PPPoE data pkt, type 0x8021
12/22/2014 : 11:02:02 CST : link 0xb420dc rcvd:
12/22/2014 : 11:02:02 CST : 80 21 01 01 00 0a 03 06 42 2b 23 fe 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
12/22/2014 : 11:02:02 CST : ppp_inpkt (link 0xb420dc); prot: IPCP, code CONFREQ, state REQSENT, len 34
12/22/2014 : 11:02:02 CST : ipcp: received ADDR
12/22/2014 : 11:02:02 CST : IP:66.43.35.254:
12/22/2014 : 11:02:02 CST :
12/22/2014 : 11:02:02 CST : (ACK)
12/22/2014 : 11:02:02 CST : ipcp: returning Configure-ACK
12/22/2014 : 11:02:02 CST : ppp_fsm (link 0xb420dc): IPCP state: REQSENT -> ACKSENT
12/22/2014 : 11:02:02 CST : ppp_fsm (link 0xb420dc) sending CONFACK, len:6
12/22/2014 : 11:02:02 CST : link 0xb420dc sending:
12/22/2014 : 11:02:02 CST : 80 21 02 01 00 0a 03 06 42 2b 23 fe
12/22/2014 : 11:02:02 CST : rcvd PPPoE data pkt, type 0x8057
12/22/2014 : 11:02:02 CST : link 0xb420dc rcvd:
12/22/2014 : 11:02:02 CST : 80 57 01 01 00 0e 01 0a 00 00 00 00 00 f0 51 d5 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
12/22/2014 : 11:02:02 CST : ppp_inpkt: (link 0xb420dc): unsupported prot 8057
12/22/2014 : 11:02:02 CST : link 0xb420dc sending:
12/22/2014 : 11:02:02 CST : c0 21 08 06 00 2c 80 57 01 01 00 0e 01 0a 00 00 00 00 00 f0 51 d5 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
12/22/2014 : 11:02:02 CST : rcvd PPPoE data pkt, type 0x8281
12/22/2014 : 11:02:02 CST : link 0xb420dc rcvd:
12/22/2014 : 11:02:02 CST : 82 81 01 01 00 04 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
12/22/2014 : 11:02:02 CST : ppp_inpkt: (link 0xb420dc): unsupported prot 8281
12/22/2014 : 11:02:02 CST : link 0xb420dc sending:
12/22/2014 : 11:02:02 CST : c0 21 08 06 00 2c 82 81 01 01 00 04 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
12/22/2014 : 11:02:02 CST : rcvd PPPoE data pkt, type 0x8021
12/22/2014 : 11:02:02 CST : link 0xb420dc rcvd:
12/22/2014 : 11:02:02 CST : 80 21 03 12 00 16 03 06 a2 dc b7 2e 81 06 08 08 08 08 83 06 08 08 04 04 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
12/22/2014 : 11:02:02 CST : ppp_inpkt (link 0xb420dc); prot: IPCP, code CONFNAK, state ACKSENT, len 34
12/22/2014 : 11:02:02 CST : local IP address 162.220.183.46
12/22/2014 : 11:02:02 CST : ipcp rcv NAK: DNS (0x81) address 8.8.8.8
12/22/2014 : 11:02:02 CST : ipcp rcv NAK: DNS (0x83) address 8.8.4.4
12/22/2014 : 11:02:02 CST : ppp_fsm (link 0xb420dc): IPCP state: ACKSENT -> ACKSENT
12/22/2014 : 11:02:02 CST : ppp_fsm (link 0xb420dc) sending CONFREQ, len:18
12/22/2014 : 11:02:02 CST : link 0xb420dc sending:
12/22/2014 : 11:02:02 CST : 80 21 01 13 00 16 03 06 a2 dc b7 2e 81 06 08 08 08 08 83 06 08 08 04 04
12/22/2014 : 11:02:02 CST : rcvd PPPoE data pkt, type 0x8021
12/22/2014 : 11:02:02 CST : link 0xb420dc rcvd:
12/22/2014 : 11:02:02 CST : 80 21 02 13 00 16 03 06 a2 dc b7 2e 81 06 08 08 08 08 83 06 08 08 04 04 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
12/22/2014 : 11:02:02 CST : ppp_inpkt (link 0xb420dc); prot: IPCP, code CONFACK, state ACKSENT, len 34
12/22/2014 : 11:02:02 CST : ppp_fsm (link 0xb420dc): IPCP state: ACKSENT -> OPENED
12/22/2014 : 11:02:02 CST : ipcp: up