
I'm seeing some minor packet loss traversing Level3 in Chicago. We connect to their network in Detroit, and we're clean until the first hop into Chicago. Anyone else seeing anything like that? --- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8 tel. 519-985-8410 fax. 519-985-8409

On Tue, 2009-08-18 at 13:05 -0400, Clayton Zekelman wrote:
I'm seeing some minor packet loss traversing Level3 in Chicago.
We connect to their network in Detroit, and we're clean until the first hop into Chicago.
Anyone else seeing anything like that? Out of downtown Chicago I see the following:
sudo traceroute-nanog www.mnsi.net traceroute to www.mnsi.net (216.8.137.229), 64 hops max, 40 byte packets 1 10.10.10.1 (10.10.10.1) 1 ms 7 ms 4 ms 2 10.20.0.1 (10.20.0.1) 13 ms 9 ms 7 ms 3 vl2.aggr1.chgo.il.rcn.net (207.229.191.130) 8 ms 11 ms 7 ms 4 tge3-1.border2.eqnx.il.rcn.net (207.172.19.159) 12 ms 17 ms 15 ms 5 te-8-3.car3.Chicago1.Level3.net (4.71.101.73) 9 ms 7 ms 13 ms 6 ae-32-52.ebr2.Chicago1.Level3.net (4.68.101.62) 8 ms 13 ms 18 ms 7 ae-8-8.car1.Detroit1.Level3.net (4.69.133.241) 14 ms 14 ms 16 ms 8 ae-11-11.car2.Detroit1.Level3.net (4.69.133.246) 16 ms 14 ms 18 ms 9 MANAGED-NET.car2.Detroit1.Level3.net (64.152.144.78) 16 ms 15 ms 17 ms L3 looks clean from me to you,or as close as I can get, might the problem be in Detroit? Anyone? Richard Golodner

Further to the issue below - I'm typically seeing packet loss at hop 5. This particular trace doesn't show it though, although there are timeouts on hop 6. 1 br0.win.mnsi.net (216.8.137.193) 0 msec 0 msec 0 msec 2 ge-6-12-222.car2.Detroit1.Level3.net (64.152.144.77) 0 msec 4 msec 0 msec 3 ae-11-11.car1.Detroit1.Level3.net (4.69.133.245) 4 msec 0 msec 4 msec 4 ae-8-8.ebr2.Chicago1.Level3.net (4.69.133.242) 8 msec 4 msec 16 msec 5 ae-5.ebr2.Chicago2.Level3.net (4.69.140.194) 8 msec 8 msec 8 msec 6 * * ae-2-2.ebr2.Washington1.Level3.net (4.69.132.70) 32 msec 7 ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 36 msec ae-82-82.csw3.Washington1.Level3.net (4.69.134.154) 36 msec ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 32 msec 8 ae-2-79.edge2.Washington4.Level3.net (4.68.17.83) 28 msec ae-3-89.edge2.Washington4.Level3.net (4.68.17.147) 40 msec ae-1-69.edge2.Washington4.Level3.net (4.68.17.19) 24 msec 9 savvis-level3-te.Washington1.Level3.net (4.68.110.102) 32 msec 28 msec 40 msec 10 * * * 11 cr1-bundle-pos1.NewYork.savvis.net (204.70.196.137) 28 msec * 28 msec 12 206.24.199.2 48 msec 32 msec 32 msec 13 toroonnldr01.bb.telus.com (154.11.6.71) 32 msec 192 msec 36 msec 14 host82.145.115.209.in-addr.arpa (209.115.145.82) 40 msec 40 msec 40 msec 15 209.202.116.58 40 msec 48 msec 36 msec 16 www.govital.net (209.202.88.24) 36 msec 40 msec 36 msec At 01:05 PM 8/18/2009, Clayton Zekelman wrote:
I'm seeing some minor packet loss traversing Level3 in Chicago.
We connect to their network in Detroit, and we're clean until the first hop into Chicago.
Anyone else seeing anything like that?
--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8
tel. 519-985-8410 fax. 519-985-8409
_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages
No virus found in this incoming message. Checked by AVG - www.avg.com Version: 8.5.409 / Virus Database: 270.13.60/2311 - Release Date: 08/18/09 06:03:00
--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8 tel. 519-985-8410 fax. 519-985-8409

It would be helpful if you could use something like mtr instead of traceroute in this case. The below trace could be indicating ICMP de-prioritisation on L3 routers, which is known to be enabled, but could also be an indicator of packet loss starting at hop #5 and "trickling down" through succeeding hops (possibly #10 or #11). mtr --order=LSRNABW can help in diagnosing this. -- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB | On Tue, Aug 18, 2009 at 01:36:58PM -0400, Clayton Zekelman wrote:
Further to the issue below - I'm typically seeing packet loss at hop 5. This particular trace doesn't show it though, although there are timeouts on hop 6.
1 br0.win.mnsi.net (216.8.137.193) 0 msec 0 msec 0 msec 2 ge-6-12-222.car2.Detroit1.Level3.net (64.152.144.77) 0 msec 4 msec 0 msec 3 ae-11-11.car1.Detroit1.Level3.net (4.69.133.245) 4 msec 0 msec 4 msec 4 ae-8-8.ebr2.Chicago1.Level3.net (4.69.133.242) 8 msec 4 msec 16 msec 5 ae-5.ebr2.Chicago2.Level3.net (4.69.140.194) 8 msec 8 msec 8 msec 6 * * ae-2-2.ebr2.Washington1.Level3.net (4.69.132.70) 32 msec 7 ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 36 msec ae-82-82.csw3.Washington1.Level3.net (4.69.134.154) 36 msec ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 32 msec 8 ae-2-79.edge2.Washington4.Level3.net (4.68.17.83) 28 msec ae-3-89.edge2.Washington4.Level3.net (4.68.17.147) 40 msec ae-1-69.edge2.Washington4.Level3.net (4.68.17.19) 24 msec 9 savvis-level3-te.Washington1.Level3.net (4.68.110.102) 32 msec 28 msec 40 msec 10 * * * 11 cr1-bundle-pos1.NewYork.savvis.net (204.70.196.137) 28 msec * 28 msec 12 206.24.199.2 48 msec 32 msec 32 msec 13 toroonnldr01.bb.telus.com (154.11.6.71) 32 msec 192 msec 36 msec 14 host82.145.115.209.in-addr.arpa (209.115.145.82) 40 msec 40 msec 40 msec 15 209.202.116.58 40 msec 48 msec 36 msec 16 www.govital.net (209.202.88.24) 36 msec 40 msec 36 msec
At 01:05 PM 8/18/2009, Clayton Zekelman wrote:
I'm seeing some minor packet loss traversing Level3 in Chicago.
We connect to their network in Detroit, and we're clean until the first hop into Chicago.
Anyone else seeing anything like that?
--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8
tel. 519-985-8410 fax. 519-985-8409
_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages
No virus found in this incoming message. Checked by AVG - www.avg.com Version: 8.5.409 / Virus Database: 270.13.60/2311 - Release Date: 08/18/09 06:03:00
--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8
tel. 519-985-8410 fax. 519-985-8409
_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

I'm here in Kansas City, KS region and a trace to that IP in Detroit for Level3 shows this with the flags you have added for mtr. Level3 is just having a bad week me thinks. fed11.home.lan (0.0.0.0) Tue Aug 18 13:28:42 2009 Keys: Help Display mode Restart statistics Order of fields quit Packets Pings Host Loss% Snt Rcv Last Avg Best Wrst 1. router.home.lan 0.0% 274 274 0.2 0.2 0.1 0.3 2. ks-76-7-1-1.sta.embarqhsd.net 0.0% 274 274 7.7 8.3 6.2 114.8 3. ks-76-7-255-241.sta.embarqhsd.net 0.0% 274 274 8.4 8.6 6.1 114.4 4. ge-7-19.car1.StLouis1.Level3.net 41.5% 273 159 34.1 26.6 12.6 213.9 5. ae-11-11.car2.StLouis1.Level3.net 47.3% 273 144 12.6 25.9 12.6 205.6 6. ae-4-4.ebr2.Chicago1.Level3.net 1.1% 273 270 19.2 26.4 18.5 140.8 7. ae-8-8.car1.Detroit1.Level3.net 0.0% 273 273 27.8 38.8 24.1 229.2 8. ge-6-12-222.car2.Detroit1.Level3. 0.0% 273 273 26.5 39.1 24.1 239.0 --Brett Jeremy Chadwick wrote:
It would be helpful if you could use something like mtr instead of traceroute in this case. The below trace could be indicating ICMP de-prioritisation on L3 routers, which is known to be enabled, but could also be an indicator of packet loss starting at hop #5 and "trickling down" through succeeding hops (possibly #10 or #11).
mtr --order=LSRNABW can help in diagnosing this.

The packet loss shown at hops #4, #5, and possibly #6 is either ICMP deprioritisation or high CPU utilisation on said routers. My vote is the lesser, given that it's common these days, and most backbone providers are known to use it (L3, Abovenet, Sprint, Verizon/MCI, and AT&T just to name a few). The 18ms jump between hop #3 and #4 is likely normal due to geographic distance, though I don't know where hop #3 is located; #4 is obviously Missouri. The same goes for the 12ms increase between #6 and #7 If this evidence was presented to Level 3, I can assure you they'd tell you the same thing. Here's a similar example showing the exact behaviour I describe but within the Comcast network. Note hop #6: HOST: icarus.home.lan Loss% Snt Rcv Last Avg Best Wrst 1. -------------- 0.0% 45 45 0.3 0.4 0.3 2.4 2. ??? 100.0 45 0 0.0 0.0 0.0 0.0 3. 68.85.191.253 0.0% 45 45 8.4 8.1 6.7 11.8 4. 68.85.154.149 0.0% 45 45 16.3 13.1 8.9 16.3 5. 68.86.90.137 0.0% 45 45 14.1 13.1 10.9 17.1 6. 68.86.85.181 97.8% 45 1 22.4 22.4 22.4 22.4 7. 4.71.118.9 0.0% 45 45 13.6 14.9 11.4 49.1 8. 4.68.18.195 0.0% 45 45 14.0 26.1 11.5 172.7 9. 4.79.219.106 0.0% 45 45 14.0 14.5 12.4 17.3 10. 209.128.95.111 0.0% 45 45 13.5 14.3 12.0 27.8 11. 72.20.109.194 0.0% 45 45 14.0 15.2 12.0 40.0 12. 72.20.106.125 0.0% 45 45 12.7 14.2 12.2 19.4 Here's another. Note hops #7, #8, and #9 (all Cogent), and also note in this example the increased latency at hop #7 and #8 which doesn't continue downwards through later hops: HOST: icarus.home.lan Loss% Snt Rcv Last Avg Best Wrst 1. -------------- 0.0% 45 45 0.4 0.4 0.3 0.4 2. ??? 100.0 45 0 0.0 0.0 0.0 0.0 3. 68.85.191.253 0.0% 45 45 6.9 8.1 6.5 12.6 4. 68.85.154.149 0.0% 45 45 9.0 10.0 8.6 12.6 5. 68.86.91.225 0.0% 45 45 11.1 11.9 10.7 13.7 6. 68.86.85.78 0.0% 45 45 12.0 14.3 11.6 42.5 7. 154.54.11.105 68.9% 45 14 191.4 26.3 11.9 191.4 8. 154.54.28.81 51.1% 45 22 206.6 37.7 12.9 206.6 9. 66.28.4.150 46.7% 45 24 14.9 16.8 14.5 29.7 10. 38.112.39.114 0.0% 45 45 16.9 17.4 15.4 20.7 11. 38.104.134.30 0.0% 45 45 14.7 18.5 13.9 26.6 12. ??? 100.0 45 0 0.0 0.0 0.0 0.0 The easiest way to determine if there's a problem is whether or not the loss witnessed at a hop continues down through succeeding hops. Here's a real life example (IPs removed for security reasons): HOST: ---------------------- Loss% Snt Last Avg Best Wrst StDev 1. --------------- 0.0% 60 6.8 6.0 0.4 85.9 16.1 2. --------------- 0.0% 60 1.0 48.1 0.6 320.0 76.4 3. 204.70.193.101 86.7% 60 29.3 122.4 23.5 290.8 106.1 4. 206.24.227.105 86.7% 60 50.7 9.4 1.3 50.7 17.5 5. 204.70.192.53 88.3% 60 15.0 11.3 1.6 15.6 6.6 6. 204.70.194.178 88.3% 60 15.3 25.4 15.1 34.7 9.8 7. 204.70.192.70 86.7% 60 34.7 40.9 34.6 84.2 17.5 8. 204.70.194.18 88.3% 60 35.0 34.9 34.7 35.1 0.1 9. 204.70.194.10 88.3% 60 34.9 35.0 34.7 35.6 0.3 10. 208.175.175.18 88.3% 60 35.7 35.6 35.2 36.2 0.4 11. 216.52.191.11 95.0% 60 35.8 35.7 35.6 35.8 0.1 12. --------------- 100.0 60 0.0 0.0 0.0 0.0 0.0 What you see here was an issue with SAVVIS. The root cause was a router of theirs which rebooted unexpectedly. The destination (hop #12) drops ICMP, so 100% loss there was completely normal -- the rest wasn't. This was determined by comparing the loss to historic data when the issue with SAVVIS wasn't occurring. And one more example -- which just occurred about 10 minutes ago on my home Comcast connection: HOST: ---------------------- Loss% Snt Rcv Last Avg Best Wrst 1. --------------- 28.9% 45 32 0.5 0.6 0.3 5.6 2. ??? 100.0 45 0 0.0 0.0 0.0 0.0 3. 68.85.191.253 28.9% 45 32 9.8 12.4 6.8 134.1 4. 68.85.154.149 28.9% 45 32 10.6 10.7 8.7 16.1 5. 68.86.90.141 28.9% 45 32 12.8 13.3 10.7 36.0 6. 68.86.85.181 28.9% 45 32 13.1 14.4 11.7 29.4 7. 68.86.86.50 28.9% 45 32 16.6 17.1 15.3 21.8 8. 75.149.228.254 28.9% 45 32 27.2 20.6 14.7 40.3 9. 209.58.116.50 28.9% 45 32 28.2 17.3 14.5 34.4 10. 216.6.33.6 28.9% 45 32 17.8 17.2 15.1 23.1 11. 144.223.243.21 28.9% 45 32 16.5 17.5 15.9 21.3 12. 144.232.8.111 28.9% 45 32 20.0 20.8 19.1 25.5 13. 144.232.24.35 28.9% 45 32 23.2 24.3 22.8 28.9 14. 144.232.20.60 31.1% 45 31 76.5 78.9 75.5 110.0 15. 144.232.20.3 28.9% 45 32 76.3 76.7 74.9 81.4 16. 144.223.34.242 28.9% 45 32 76.6 76.7 74.9 81.7 17. 64.127.129.10 28.9% 45 32 107.7 89.0 85.8 107.7 18. 96.34.2.9 28.9% 45 32 87.8 88.3 86.0 100.6 What happened here? Based on my router logs, it appears DHCP expired and numerous daemons on the router were automatically restarted (this is normal in such a circumstance). It's possible that Comcast's DHCP servers took too long to respond to a renewal and the router chose to down/up the WAN interface, or possibly restart relevant daemons. Hard to say -- the logging isn't as verbose as I'd like. My cable modem log doesn't indicate LOS nor being rebooted, so this was purely an IP-related issue, or issue with my own equipment. -- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB | On Tue, Aug 18, 2009 at 01:31:28PM -0500, Brett Cooper wrote:
I'm here in Kansas City, KS region and a trace to that IP in Detroit for Level3 shows this with the flags you have added for mtr. Level3 is just having a bad week me thinks.
fed11.home.lan (0.0.0.0) Tue Aug 18 13:28:42 2009 Keys: Help Display mode Restart statistics Order of fields quit Packets Pings Host Loss% Snt Rcv Last Avg Best Wrst 1. router.home.lan 0.0% 274 274 0.2 0.2 0.1 0.3 2. ks-76-7-1-1.sta.embarqhsd.net 0.0% 274 274 7.7 8.3 6.2 114.8 3. ks-76-7-255-241.sta.embarqhsd.net 0.0% 274 274 8.4 8.6 6.1 114.4 4. ge-7-19.car1.StLouis1.Level3.net 41.5% 273 159 34.1 26.6 12.6 213.9 5. ae-11-11.car2.StLouis1.Level3.net 47.3% 273 144 12.6 25.9 12.6 205.6 6. ae-4-4.ebr2.Chicago1.Level3.net 1.1% 273 270 19.2 26.4 18.5 140.8 7. ae-8-8.car1.Detroit1.Level3.net 0.0% 273 273 27.8 38.8 24.1 229.2 8. ge-6-12-222.car2.Detroit1.Level3. 0.0% 273 273 26.5 39.1 24.1 239.0
--Brett
Jeremy Chadwick wrote:
It would be helpful if you could use something like mtr instead of traceroute in this case. The below trace could be indicating ICMP de-prioritisation on L3 routers, which is known to be enabled, but could also be an indicator of packet loss starting at hop #5 and "trickling down" through succeeding hops (possibly #10 or #11).
mtr --order=LSRNABW can help in diagnosing this.

Is there version of mtr that include support for TCP or UDP types? A google search didn't reveal anything to me, but would be the best of mtr with the benefits of tcptraceroute. Frank -----Original Message----- From: outages-bounces@outages.org [mailto:outages-bounces@outages.org] On Behalf Of Jeremy Chadwick Sent: Tuesday, August 18, 2009 12:50 PM To: outages@outages.org Subject: Re: [outages] Level3 Chicago It would be helpful if you could use something like mtr instead of traceroute in this case. The below trace could be indicating ICMP de-prioritisation on L3 routers, which is known to be enabled, but could also be an indicator of packet loss starting at hop #5 and "trickling down" through succeeding hops (possibly #10 or #11). mtr --order=LSRNABW can help in diagnosing this. -- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB | On Tue, Aug 18, 2009 at 01:36:58PM -0400, Clayton Zekelman wrote:
Further to the issue below - I'm typically seeing packet loss at hop 5. This particular trace doesn't show it though, although there are timeouts on hop 6.
1 br0.win.mnsi.net (216.8.137.193) 0 msec 0 msec 0 msec 2 ge-6-12-222.car2.Detroit1.Level3.net (64.152.144.77) 0 msec 4 msec 0
msec
3 ae-11-11.car1.Detroit1.Level3.net (4.69.133.245) 4 msec 0 msec 4 msec 4 ae-8-8.ebr2.Chicago1.Level3.net (4.69.133.242) 8 msec 4 msec 16 msec 5 ae-5.ebr2.Chicago2.Level3.net (4.69.140.194) 8 msec 8 msec 8 msec 6 * * ae-2-2.ebr2.Washington1.Level3.net (4.69.132.70) 32 msec 7 ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 36 msec ae-82-82.csw3.Washington1.Level3.net (4.69.134.154) 36 msec ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 32 msec 8 ae-2-79.edge2.Washington4.Level3.net (4.68.17.83) 28 msec ae-3-89.edge2.Washington4.Level3.net (4.68.17.147) 40 msec ae-1-69.edge2.Washington4.Level3.net (4.68.17.19) 24 msec 9 savvis-level3-te.Washington1.Level3.net (4.68.110.102) 32 msec 28 msec 40 msec 10 * * * 11 cr1-bundle-pos1.NewYork.savvis.net (204.70.196.137) 28 msec * 28 msec 12 206.24.199.2 48 msec 32 msec 32 msec 13 toroonnldr01.bb.telus.com (154.11.6.71) 32 msec 192 msec 36 msec 14 host82.145.115.209.in-addr.arpa (209.115.145.82) 40 msec 40 msec 40 msec 15 209.202.116.58 40 msec 48 msec 36 msec 16 www.govital.net (209.202.88.24) 36 msec 40 msec 36 msec
At 01:05 PM 8/18/2009, Clayton Zekelman wrote:
I'm seeing some minor packet loss traversing Level3 in Chicago.
We connect to their network in Detroit, and we're clean until the first hop into Chicago.
Anyone else seeing anything like that?
--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8
tel. 519-985-8410 fax. 519-985-8409
_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages
No virus found in this incoming message. Checked by AVG - www.avg.com Version: 8.5.409 / Virus Database: 270.13.60/2311 - Release Date: 08/18/09 06:03:00
--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8
tel. 519-985-8410 fax. 519-985-8409
_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages
outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

The closest thing I know of is PingPlotter for Windows. I'll remind folks that TCP and UDP-based traceroute/mtr-like utilities still rely on ICMP time exceeded (type 11) being sent out to obtain the present path, so if folks are looking for a pure TCP or UDP-based traceroute or equivalent, to avoid dropping of ICMP, they're barking up the wrong tree. :-) -- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB | On Tue, Aug 18, 2009 at 10:42:14PM -0500, Frank Bulk wrote:
Is there version of mtr that include support for TCP or UDP types? A google search didn't reveal anything to me, but would be the best of mtr with the benefits of tcptraceroute.
Frank
-----Original Message----- From: outages-bounces@outages.org [mailto:outages-bounces@outages.org] On Behalf Of Jeremy Chadwick Sent: Tuesday, August 18, 2009 12:50 PM To: outages@outages.org Subject: Re: [outages] Level3 Chicago
It would be helpful if you could use something like mtr instead of traceroute in this case. The below trace could be indicating ICMP de-prioritisation on L3 routers, which is known to be enabled, but could also be an indicator of packet loss starting at hop #5 and "trickling down" through succeeding hops (possibly #10 or #11).
mtr --order=LSRNABW can help in diagnosing this.
-- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB |
On Tue, Aug 18, 2009 at 01:36:58PM -0400, Clayton Zekelman wrote:
Further to the issue below - I'm typically seeing packet loss at hop 5. This particular trace doesn't show it though, although there are timeouts on hop 6.
1 br0.win.mnsi.net (216.8.137.193) 0 msec 0 msec 0 msec 2 ge-6-12-222.car2.Detroit1.Level3.net (64.152.144.77) 0 msec 4 msec 0
msec
3 ae-11-11.car1.Detroit1.Level3.net (4.69.133.245) 4 msec 0 msec 4 msec 4 ae-8-8.ebr2.Chicago1.Level3.net (4.69.133.242) 8 msec 4 msec 16 msec 5 ae-5.ebr2.Chicago2.Level3.net (4.69.140.194) 8 msec 8 msec 8 msec 6 * * ae-2-2.ebr2.Washington1.Level3.net (4.69.132.70) 32 msec 7 ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 36 msec ae-82-82.csw3.Washington1.Level3.net (4.69.134.154) 36 msec ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 32 msec 8 ae-2-79.edge2.Washington4.Level3.net (4.68.17.83) 28 msec ae-3-89.edge2.Washington4.Level3.net (4.68.17.147) 40 msec ae-1-69.edge2.Washington4.Level3.net (4.68.17.19) 24 msec 9 savvis-level3-te.Washington1.Level3.net (4.68.110.102) 32 msec 28 msec 40 msec 10 * * * 11 cr1-bundle-pos1.NewYork.savvis.net (204.70.196.137) 28 msec * 28 msec 12 206.24.199.2 48 msec 32 msec 32 msec 13 toroonnldr01.bb.telus.com (154.11.6.71) 32 msec 192 msec 36 msec 14 host82.145.115.209.in-addr.arpa (209.115.145.82) 40 msec 40 msec 40 msec 15 209.202.116.58 40 msec 48 msec 36 msec 16 www.govital.net (209.202.88.24) 36 msec 40 msec 36 msec
At 01:05 PM 8/18/2009, Clayton Zekelman wrote:
I'm seeing some minor packet loss traversing Level3 in Chicago.
We connect to their network in Detroit, and we're clean until the first hop into Chicago.
Anyone else seeing anything like that?
--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8
tel. 519-985-8410 fax. 519-985-8409
_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages
No virus found in this incoming message. Checked by AVG - www.avg.com Version: 8.5.409 / Virus Database: 270.13.60/2311 - Release Date: 08/18/09 06:03:00
--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8
tel. 519-985-8410 fax. 519-985-8409
_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages
outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

Good point. Is the ICMP rate-limiting of these largish routers ingress and egress, or only ingress-only? Frank -----Original Message----- From: outages-bounces@outages.org [mailto:outages-bounces@outages.org] On Behalf Of Jeremy Chadwick Sent: Wednesday, August 19, 2009 12:21 AM To: outages@outages.org Subject: Re: [outages] Level3 Chicago The closest thing I know of is PingPlotter for Windows. I'll remind folks that TCP and UDP-based traceroute/mtr-like utilities still rely on ICMP time exceeded (type 11) being sent out to obtain the present path, so if folks are looking for a pure TCP or UDP-based traceroute or equivalent, to avoid dropping of ICMP, they're barking up the wrong tree. :-) -- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB | On Tue, Aug 18, 2009 at 10:42:14PM -0500, Frank Bulk wrote:
Is there version of mtr that include support for TCP or UDP types? A google search didn't reveal anything to me, but would be the best of mtr with the benefits of tcptraceroute.
Frank
-----Original Message----- From: outages-bounces@outages.org [mailto:outages-bounces@outages.org] On Behalf Of Jeremy Chadwick Sent: Tuesday, August 18, 2009 12:50 PM To: outages@outages.org Subject: Re: [outages] Level3 Chicago
It would be helpful if you could use something like mtr instead of traceroute in this case. The below trace could be indicating ICMP de-prioritisation on L3 routers, which is known to be enabled, but could also be an indicator of packet loss starting at hop #5 and "trickling down" through succeeding hops (possibly #10 or #11).
mtr --order=LSRNABW can help in diagnosing this.
-- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB |
On Tue, Aug 18, 2009 at 01:36:58PM -0400, Clayton Zekelman wrote:
Further to the issue below - I'm typically seeing packet loss at hop 5. This particular trace doesn't show it though, although there are timeouts on hop 6.
1 br0.win.mnsi.net (216.8.137.193) 0 msec 0 msec 0 msec 2 ge-6-12-222.car2.Detroit1.Level3.net (64.152.144.77) 0 msec 4 msec 0
msec
3 ae-11-11.car1.Detroit1.Level3.net (4.69.133.245) 4 msec 0 msec 4 msec 4 ae-8-8.ebr2.Chicago1.Level3.net (4.69.133.242) 8 msec 4 msec 16 msec 5 ae-5.ebr2.Chicago2.Level3.net (4.69.140.194) 8 msec 8 msec 8 msec 6 * * ae-2-2.ebr2.Washington1.Level3.net (4.69.132.70) 32 msec 7 ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 36 msec ae-82-82.csw3.Washington1.Level3.net (4.69.134.154) 36 msec ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 32 msec 8 ae-2-79.edge2.Washington4.Level3.net (4.68.17.83) 28 msec ae-3-89.edge2.Washington4.Level3.net (4.68.17.147) 40 msec ae-1-69.edge2.Washington4.Level3.net (4.68.17.19) 24 msec 9 savvis-level3-te.Washington1.Level3.net (4.68.110.102) 32 msec 28 msec 40 msec 10 * * * 11 cr1-bundle-pos1.NewYork.savvis.net (204.70.196.137) 28 msec * 28 msec 12 206.24.199.2 48 msec 32 msec 32 msec 13 toroonnldr01.bb.telus.com (154.11.6.71) 32 msec 192 msec 36 msec 14 host82.145.115.209.in-addr.arpa (209.115.145.82) 40 msec 40 msec 40 msec 15 209.202.116.58 40 msec 48 msec 36 msec 16 www.govital.net (209.202.88.24) 36 msec 40 msec 36 msec
At 01:05 PM 8/18/2009, Clayton Zekelman wrote:
I'm seeing some minor packet loss traversing Level3 in Chicago.
We connect to their network in Detroit, and we're clean until the first hop into Chicago.
Anyone else seeing anything like that?
--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8
tel. 519-985-8410 fax. 519-985-8409
_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages
No virus found in this incoming message. Checked by AVG - www.avg.com Version: 8.5.409 / Virus Database: 270.13.60/2311 - Release Date: 08/18/09 06:03:00
--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8
tel. 519-985-8410 fax. 519-985-8409
_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages
outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages
outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

That's an excellent question -- and one I've always wondered myself. This is purely speculative, but I believe outbound ICMP (e.g. sent from the router to whatever src solicited it) is what's de-prioritised. Someone more familiar with Cisco and Juniper equipment might know for certain. -- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB | On Wed, Aug 19, 2009 at 09:55:40AM -0500, Frank Bulk - iName.com wrote:
Good point. Is the ICMP rate-limiting of these largish routers ingress and egress, or only ingress-only?
Frank
-----Original Message----- From: outages-bounces@outages.org [mailto:outages-bounces@outages.org] On Behalf Of Jeremy Chadwick Sent: Wednesday, August 19, 2009 12:21 AM To: outages@outages.org Subject: Re: [outages] Level3 Chicago
The closest thing I know of is PingPlotter for Windows.
I'll remind folks that TCP and UDP-based traceroute/mtr-like utilities still rely on ICMP time exceeded (type 11) being sent out to obtain the present path, so if folks are looking for a pure TCP or UDP-based traceroute or equivalent, to avoid dropping of ICMP, they're barking up the wrong tree. :-)
-- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB |
On Tue, Aug 18, 2009 at 10:42:14PM -0500, Frank Bulk wrote:
Is there version of mtr that include support for TCP or UDP types? A google search didn't reveal anything to me, but would be the best of mtr with the benefits of tcptraceroute.
Frank
-----Original Message----- From: outages-bounces@outages.org [mailto:outages-bounces@outages.org] On Behalf Of Jeremy Chadwick Sent: Tuesday, August 18, 2009 12:50 PM To: outages@outages.org Subject: Re: [outages] Level3 Chicago
It would be helpful if you could use something like mtr instead of traceroute in this case. The below trace could be indicating ICMP de-prioritisation on L3 routers, which is known to be enabled, but could also be an indicator of packet loss starting at hop #5 and "trickling down" through succeeding hops (possibly #10 or #11).
mtr --order=LSRNABW can help in diagnosing this.
-- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB |
On Tue, Aug 18, 2009 at 01:36:58PM -0400, Clayton Zekelman wrote:
Further to the issue below - I'm typically seeing packet loss at hop 5. This particular trace doesn't show it though, although there are timeouts on hop 6.
1 br0.win.mnsi.net (216.8.137.193) 0 msec 0 msec 0 msec 2 ge-6-12-222.car2.Detroit1.Level3.net (64.152.144.77) 0 msec 4 msec 0
msec
3 ae-11-11.car1.Detroit1.Level3.net (4.69.133.245) 4 msec 0 msec 4 msec 4 ae-8-8.ebr2.Chicago1.Level3.net (4.69.133.242) 8 msec 4 msec 16 msec 5 ae-5.ebr2.Chicago2.Level3.net (4.69.140.194) 8 msec 8 msec 8 msec 6 * * ae-2-2.ebr2.Washington1.Level3.net (4.69.132.70) 32 msec 7 ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 36 msec ae-82-82.csw3.Washington1.Level3.net (4.69.134.154) 36 msec ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 32 msec 8 ae-2-79.edge2.Washington4.Level3.net (4.68.17.83) 28 msec ae-3-89.edge2.Washington4.Level3.net (4.68.17.147) 40 msec ae-1-69.edge2.Washington4.Level3.net (4.68.17.19) 24 msec 9 savvis-level3-te.Washington1.Level3.net (4.68.110.102) 32 msec 28 msec 40 msec 10 * * * 11 cr1-bundle-pos1.NewYork.savvis.net (204.70.196.137) 28 msec * 28 msec 12 206.24.199.2 48 msec 32 msec 32 msec 13 toroonnldr01.bb.telus.com (154.11.6.71) 32 msec 192 msec 36 msec 14 host82.145.115.209.in-addr.arpa (209.115.145.82) 40 msec 40 msec 40 msec 15 209.202.116.58 40 msec 48 msec 36 msec 16 www.govital.net (209.202.88.24) 36 msec 40 msec 36 msec
At 01:05 PM 8/18/2009, Clayton Zekelman wrote:
I'm seeing some minor packet loss traversing Level3 in Chicago.
We connect to their network in Detroit, and we're clean until the first hop into Chicago.
Anyone else seeing anything like that?
--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8
tel. 519-985-8410 fax. 519-985-8409
_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages
No virus found in this incoming message. Checked by AVG - www.avg.com Version: 8.5.409 / Virus Database: 270.13.60/2311 - Release Date: 08/18/09 06:03:00
--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8
tel. 519-985-8410 fax. 519-985-8409
_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages
outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages
outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

'Jeremy Chadwick' wrote:
That's an excellent question -- and one I've always wondered myself.
This is purely speculative, but I believe outbound ICMP (e.g. sent from the router to whatever src solicited it) is what's de-prioritised.
Someone more familiar with Cisco and Juniper equipment might know for certain.
Usually packets destined to the control-plane of the system are prioritized based on criteria. It is better to let routing control protocols (e.g. ospf, bgp, isis) through first than someone pinging or running a traceroute. Packets *through* the router take the normal forwarding path and are not affected by this system. There may be system defaults based on hardware/software, but Cisco has CoPP and I believe Juniper uses a firewall on the lo0 interface (been a while since I touched one) for user-defined rules. -- Devon

Yes, and when pure prioritization is not available, ISPs will use rate limiting too which is one of the ways Level 3 restricts ICMP to the cpu on the core devices. Also, the car device is an edge router so there could be congestion on a customer port too when higher response times are seen on the other side of a hop. Response times that settle could be the control plane/data plane issue or could be once traffic gets to a far end there's an asymetric path that goes a different return path rather than back across the link seen on the forward traceroute. All these are why simple pings and traceroutes don't always tell the story. * Devon True was thought to have said:
'Jeremy Chadwick' wrote:
That's an excellent question -- and one I've always wondered myself.
This is purely speculative, but I believe outbound ICMP (e.g. sent from the router to whatever src solicited it) is what's de-prioritised.
Someone more familiar with Cisco and Juniper equipment might know for certain.
Usually packets destined to the control-plane of the system are prioritized based on criteria. It is better to let routing control protocols (e.g. ospf, bgp, isis) through first than someone pinging or running a traceroute. Packets *through* the router take the normal forwarding path and are not affected by this system.
There may be system defaults based on hardware/software, but Cisco has CoPP and I believe Juniper uses a firewall on the lo0 interface (been a while since I touched one) for user-defined rules.
-- Devon
_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

I know Nlayer has multiple cities with Level3, try doing ping/ traceroutes using their Looking Glass. I also provided l3 LG too. http://lg.nlayer.net/ http://lg.level3.net/ Clayton Zekelman wrote:
I'm seeing some minor packet loss traversing Level3 in Chicago. We connect to their network in Detroit, and we're clean until the first hop into Chicago. Anyone else seeing anything like that?
participants (10)
-
'Jeremy Chadwick'
-
Brett Cooper
-
Clayton Zekelman
-
Craig Pierantozzi
-
Devon True
-
Frank Bulk
-
Frank Bulk - iName.com
-
Jeremy Chadwick
-
Matthew Walker
-
Richard Golodner