Level3 Chicago - Outages - Outages Lists

newer
Internet at Connect ISP in Fiji is...

Level3 Chicago

older
Re: [outages] Level3 Chicago

Clayton Zekelman

18 Aug 2009 18 Aug '09

10:05 a.m.

I'm seeing some minor packet loss traversing Level3 in Chicago. We connect to their network in Detroit, and we're clean until the first hop into Chicago. Anyone else seeing anything like that? --- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8 tel. 519-985-8410 fax. 519-985-8409

Reply

Sign in to reply online Use email software

Show replies by date

Richard Golodner

18 Aug 18 Aug

10:29 a.m.

New subject: [outages] Level3 Chicago

On Tue, 2009-08-18 at 13:05 -0400, Clayton Zekelman wrote:

I'm seeing some minor packet loss traversing Level3 in Chicago.

We connect to their network in Detroit, and we're clean until the first hop into Chicago.

Anyone else seeing anything like that? Out of downtown Chicago I see the following:

sudo traceroute-nanog www.mnsi.net traceroute to www.mnsi.net (216.8.137.229), 64 hops max, 40 byte packets 1 10.10.10.1 (10.10.10.1) 1 ms 7 ms 4 ms 2 10.20.0.1 (10.20.0.1) 13 ms 9 ms 7 ms 3 vl2.aggr1.chgo.il.rcn.net (207.229.191.130) 8 ms 11 ms 7 ms 4 tge3-1.border2.eqnx.il.rcn.net (207.172.19.159) 12 ms 17 ms 15 ms 5 te-8-3.car3.Chicago1.Level3.net (4.71.101.73) 9 ms 7 ms 13 ms 6 ae-32-52.ebr2.Chicago1.Level3.net (4.68.101.62) 8 ms 13 ms 18 ms 7 ae-8-8.car1.Detroit1.Level3.net (4.69.133.241) 14 ms 14 ms 16 ms 8 ae-11-11.car2.Detroit1.Level3.net (4.69.133.246) 16 ms 14 ms 18 ms 9 MANAGED-NET.car2.Detroit1.Level3.net (64.152.144.78) 16 ms 15 ms 17 ms L3 looks clean from me to you,or as close as I can get, might the problem be in Detroit? Anyone? Richard Golodner

Reply

Sign in to reply online Use email software

Clayton Zekelman

10:36 a.m.

New subject: [outages] Level3 Chicago

Further to the issue below - I'm typically seeing packet loss at hop 5. This particular trace doesn't show it though, although there are timeouts on hop 6. 1 br0.win.mnsi.net (216.8.137.193) 0 msec 0 msec 0 msec 2 ge-6-12-222.car2.Detroit1.Level3.net (64.152.144.77) 0 msec 4 msec 0 msec 3 ae-11-11.car1.Detroit1.Level3.net (4.69.133.245) 4 msec 0 msec 4 msec 4 ae-8-8.ebr2.Chicago1.Level3.net (4.69.133.242) 8 msec 4 msec 16 msec 5 ae-5.ebr2.Chicago2.Level3.net (4.69.140.194) 8 msec 8 msec 8 msec 6 * * ae-2-2.ebr2.Washington1.Level3.net (4.69.132.70) 32 msec 7 ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 36 msec ae-82-82.csw3.Washington1.Level3.net (4.69.134.154) 36 msec ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 32 msec 8 ae-2-79.edge2.Washington4.Level3.net (4.68.17.83) 28 msec ae-3-89.edge2.Washington4.Level3.net (4.68.17.147) 40 msec ae-1-69.edge2.Washington4.Level3.net (4.68.17.19) 24 msec 9 savvis-level3-te.Washington1.Level3.net (4.68.110.102) 32 msec 28 msec 40 msec 10 * * * 11 cr1-bundle-pos1.NewYork.savvis.net (204.70.196.137) 28 msec * 28 msec 12 206.24.199.2 48 msec 32 msec 32 msec 13 toroonnldr01.bb.telus.com (154.11.6.71) 32 msec 192 msec 36 msec 14 host82.145.115.209.in-addr.arpa (209.115.145.82) 40 msec 40 msec 40 msec 15 209.202.116.58 40 msec 48 msec 36 msec 16 www.govital.net (209.202.88.24) 36 msec 40 msec 36 msec At 01:05 PM 8/18/2009, Clayton Zekelman wrote:

I'm seeing some minor packet loss traversing Level3 in Chicago.

We connect to their network in Detroit, and we're clean until the first hop into Chicago.

Anyone else seeing anything like that?

--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8

tel. 519-985-8410 fax. 519-985-8409

_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

No virus found in this incoming message. Checked by AVG - www.avg.com Version: 8.5.409 / Virus Database: 270.13.60/2311 - Release Date: 08/18/09 06:03:00

--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8 tel. 519-985-8410 fax. 519-985-8409

Reply

Sign in to reply online Use email software

Jeremy Chadwick

10:50 a.m.

New subject: [outages] Level3 Chicago

It would be helpful if you could use something like mtr instead of traceroute in this case. The below trace could be indicating ICMP de-prioritisation on L3 routers, which is known to be enabled, but could also be an indicator of packet loss starting at hop #5 and "trickling down" through succeeding hops (possibly #10 or #11). mtr --order=LSRNABW can help in diagnosing this. -- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB | On Tue, Aug 18, 2009 at 01:36:58PM -0400, Clayton Zekelman wrote:

Further to the issue below - I'm typically seeing packet loss at hop 5. This particular trace doesn't show it though, although there are timeouts on hop 6.

1 br0.win.mnsi.net (216.8.137.193) 0 msec 0 msec 0 msec 2 ge-6-12-222.car2.Detroit1.Level3.net (64.152.144.77) 0 msec 4 msec 0 msec 3 ae-11-11.car1.Detroit1.Level3.net (4.69.133.245) 4 msec 0 msec 4 msec 4 ae-8-8.ebr2.Chicago1.Level3.net (4.69.133.242) 8 msec 4 msec 16 msec 5 ae-5.ebr2.Chicago2.Level3.net (4.69.140.194) 8 msec 8 msec 8 msec 6 * * ae-2-2.ebr2.Washington1.Level3.net (4.69.132.70) 32 msec 7 ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 36 msec ae-82-82.csw3.Washington1.Level3.net (4.69.134.154) 36 msec ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 32 msec 8 ae-2-79.edge2.Washington4.Level3.net (4.68.17.83) 28 msec ae-3-89.edge2.Washington4.Level3.net (4.68.17.147) 40 msec ae-1-69.edge2.Washington4.Level3.net (4.68.17.19) 24 msec 9 savvis-level3-te.Washington1.Level3.net (4.68.110.102) 32 msec 28 msec 40 msec 10 * * * 11 cr1-bundle-pos1.NewYork.savvis.net (204.70.196.137) 28 msec * 28 msec 12 206.24.199.2 48 msec 32 msec 32 msec 13 toroonnldr01.bb.telus.com (154.11.6.71) 32 msec 192 msec 36 msec 14 host82.145.115.209.in-addr.arpa (209.115.145.82) 40 msec 40 msec 40 msec 15 209.202.116.58 40 msec 48 msec 36 msec 16 www.govital.net (209.202.88.24) 36 msec 40 msec 36 msec

At 01:05 PM 8/18/2009, Clayton Zekelman wrote:

...
I'm seeing some minor packet loss traversing Level3 in Chicago.

We connect to their network in Detroit, and we're clean until the first hop into Chicago.

Anyone else seeing anything like that?

--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8

tel. 519-985-8410 fax. 519-985-8409

_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

No virus found in this incoming message. Checked by AVG - www.avg.com Version: 8.5.409 / Virus Database: 270.13.60/2311 - Release Date: 08/18/09 06:03:00

--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8

tel. 519-985-8410 fax. 519-985-8409

_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

Reply

Sign in to reply online Use email software

Brett Cooper

11:31 a.m.

New subject: [outages] Level3 Chicago

I'm here in Kansas City, KS region and a trace to that IP in Detroit for Level3 shows this with the flags you have added for mtr. Level3 is just having a bad week me thinks. fed11.home.lan (0.0.0.0) Tue Aug 18 13:28:42 2009 Keys: Help Display mode Restart statistics Order of fields quit Packets Pings Host Loss% Snt Rcv Last Avg Best Wrst 1. router.home.lan 0.0% 274 274 0.2 0.2 0.1 0.3 2. ks-76-7-1-1.sta.embarqhsd.net 0.0% 274 274 7.7 8.3 6.2 114.8 3. ks-76-7-255-241.sta.embarqhsd.net 0.0% 274 274 8.4 8.6 6.1 114.4 4. ge-7-19.car1.StLouis1.Level3.net 41.5% 273 159 34.1 26.6 12.6 213.9 5. ae-11-11.car2.StLouis1.Level3.net 47.3% 273 144 12.6 25.9 12.6 205.6 6. ae-4-4.ebr2.Chicago1.Level3.net 1.1% 273 270 19.2 26.4 18.5 140.8 7. ae-8-8.car1.Detroit1.Level3.net 0.0% 273 273 27.8 38.8 24.1 229.2 8. ge-6-12-222.car2.Detroit1.Level3. 0.0% 273 273 26.5 39.1 24.1 239.0 --Brett Jeremy Chadwick wrote:

It would be helpful if you could use something like mtr instead of traceroute in this case. The below trace could be indicating ICMP de-prioritisation on L3 routers, which is known to be enabled, but could also be an indicator of packet loss starting at hop #5 and "trickling down" through succeeding hops (possibly #10 or #11).

mtr --order=LSRNABW can help in diagnosing this.

Reply

Sign in to reply online Use email software

Jeremy Chadwick

12:56 p.m.

New subject: [outages] Level3 Chicago

The packet loss shown at hops #4, #5, and possibly #6 is either ICMP deprioritisation or high CPU utilisation on said routers. My vote is the lesser, given that it's common these days, and most backbone providers are known to use it (L3, Abovenet, Sprint, Verizon/MCI, and AT&T just to name a few). The 18ms jump between hop #3 and #4 is likely normal due to geographic distance, though I don't know where hop #3 is located; #4 is obviously Missouri. The same goes for the 12ms increase between #6 and #7 If this evidence was presented to Level 3, I can assure you they'd tell you the same thing. Here's a similar example showing the exact behaviour I describe but within the Comcast network. Note hop #6: HOST: icarus.home.lan Loss% Snt Rcv Last Avg Best Wrst 1. -------------- 0.0% 45 45 0.3 0.4 0.3 2.4 2. ??? 100.0 45 0 0.0 0.0 0.0 0.0 3. 68.85.191.253 0.0% 45 45 8.4 8.1 6.7 11.8 4. 68.85.154.149 0.0% 45 45 16.3 13.1 8.9 16.3 5. 68.86.90.137 0.0% 45 45 14.1 13.1 10.9 17.1 6. 68.86.85.181 97.8% 45 1 22.4 22.4 22.4 22.4 7. 4.71.118.9 0.0% 45 45 13.6 14.9 11.4 49.1 8. 4.68.18.195 0.0% 45 45 14.0 26.1 11.5 172.7 9. 4.79.219.106 0.0% 45 45 14.0 14.5 12.4 17.3 10. 209.128.95.111 0.0% 45 45 13.5 14.3 12.0 27.8 11. 72.20.109.194 0.0% 45 45 14.0 15.2 12.0 40.0 12. 72.20.106.125 0.0% 45 45 12.7 14.2 12.2 19.4 Here's another. Note hops #7, #8, and #9 (all Cogent), and also note in this example the increased latency at hop #7 and #8 which doesn't continue downwards through later hops: HOST: icarus.home.lan Loss% Snt Rcv Last Avg Best Wrst 1. -------------- 0.0% 45 45 0.4 0.4 0.3 0.4 2. ??? 100.0 45 0 0.0 0.0 0.0 0.0 3. 68.85.191.253 0.0% 45 45 6.9 8.1 6.5 12.6 4. 68.85.154.149 0.0% 45 45 9.0 10.0 8.6 12.6 5. 68.86.91.225 0.0% 45 45 11.1 11.9 10.7 13.7 6. 68.86.85.78 0.0% 45 45 12.0 14.3 11.6 42.5 7. 154.54.11.105 68.9% 45 14 191.4 26.3 11.9 191.4 8. 154.54.28.81 51.1% 45 22 206.6 37.7 12.9 206.6 9. 66.28.4.150 46.7% 45 24 14.9 16.8 14.5 29.7 10. 38.112.39.114 0.0% 45 45 16.9 17.4 15.4 20.7 11. 38.104.134.30 0.0% 45 45 14.7 18.5 13.9 26.6 12. ??? 100.0 45 0 0.0 0.0 0.0 0.0 The easiest way to determine if there's a problem is whether or not the loss witnessed at a hop continues down through succeeding hops. Here's a real life example (IPs removed for security reasons): HOST: ---------------------- Loss% Snt Last Avg Best Wrst StDev 1. --------------- 0.0% 60 6.8 6.0 0.4 85.9 16.1 2. --------------- 0.0% 60 1.0 48.1 0.6 320.0 76.4 3. 204.70.193.101 86.7% 60 29.3 122.4 23.5 290.8 106.1 4. 206.24.227.105 86.7% 60 50.7 9.4 1.3 50.7 17.5 5. 204.70.192.53 88.3% 60 15.0 11.3 1.6 15.6 6.6 6. 204.70.194.178 88.3% 60 15.3 25.4 15.1 34.7 9.8 7. 204.70.192.70 86.7% 60 34.7 40.9 34.6 84.2 17.5 8. 204.70.194.18 88.3% 60 35.0 34.9 34.7 35.1 0.1 9. 204.70.194.10 88.3% 60 34.9 35.0 34.7 35.6 0.3 10. 208.175.175.18 88.3% 60 35.7 35.6 35.2 36.2 0.4 11. 216.52.191.11 95.0% 60 35.8 35.7 35.6 35.8 0.1 12. --------------- 100.0 60 0.0 0.0 0.0 0.0 0.0 What you see here was an issue with SAVVIS. The root cause was a router of theirs which rebooted unexpectedly. The destination (hop #12) drops ICMP, so 100% loss there was completely normal -- the rest wasn't. This was determined by comparing the loss to historic data when the issue with SAVVIS wasn't occurring. And one more example -- which just occurred about 10 minutes ago on my home Comcast connection: HOST: ---------------------- Loss% Snt Rcv Last Avg Best Wrst 1. --------------- 28.9% 45 32 0.5 0.6 0.3 5.6 2. ??? 100.0 45 0 0.0 0.0 0.0 0.0 3. 68.85.191.253 28.9% 45 32 9.8 12.4 6.8 134.1 4. 68.85.154.149 28.9% 45 32 10.6 10.7 8.7 16.1 5. 68.86.90.141 28.9% 45 32 12.8 13.3 10.7 36.0 6. 68.86.85.181 28.9% 45 32 13.1 14.4 11.7 29.4 7. 68.86.86.50 28.9% 45 32 16.6 17.1 15.3 21.8 8. 75.149.228.254 28.9% 45 32 27.2 20.6 14.7 40.3 9. 209.58.116.50 28.9% 45 32 28.2 17.3 14.5 34.4 10. 216.6.33.6 28.9% 45 32 17.8 17.2 15.1 23.1 11. 144.223.243.21 28.9% 45 32 16.5 17.5 15.9 21.3 12. 144.232.8.111 28.9% 45 32 20.0 20.8 19.1 25.5 13. 144.232.24.35 28.9% 45 32 23.2 24.3 22.8 28.9 14. 144.232.20.60 31.1% 45 31 76.5 78.9 75.5 110.0 15. 144.232.20.3 28.9% 45 32 76.3 76.7 74.9 81.4 16. 144.223.34.242 28.9% 45 32 76.6 76.7 74.9 81.7 17. 64.127.129.10 28.9% 45 32 107.7 89.0 85.8 107.7 18. 96.34.2.9 28.9% 45 32 87.8 88.3 86.0 100.6 What happened here? Based on my router logs, it appears DHCP expired and numerous daemons on the router were automatically restarted (this is normal in such a circumstance). It's possible that Comcast's DHCP servers took too long to respond to a renewal and the router chose to down/up the WAN interface, or possibly restart relevant daemons. Hard to say -- the logging isn't as verbose as I'd like. My cable modem log doesn't indicate LOS nor being rebooted, so this was purely an IP-related issue, or issue with my own equipment. -- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB | On Tue, Aug 18, 2009 at 01:31:28PM -0500, Brett Cooper wrote:

I'm here in Kansas City, KS region and a trace to that IP in Detroit for Level3 shows this with the flags you have added for mtr. Level3 is just having a bad week me thinks.

fed11.home.lan (0.0.0.0) Tue Aug 18 13:28:42 2009 Keys: Help Display mode Restart statistics Order of fields quit Packets Pings Host Loss% Snt Rcv Last Avg Best Wrst 1. router.home.lan 0.0% 274 274 0.2 0.2 0.1 0.3 2. ks-76-7-1-1.sta.embarqhsd.net 0.0% 274 274 7.7 8.3 6.2 114.8 3. ks-76-7-255-241.sta.embarqhsd.net 0.0% 274 274 8.4 8.6 6.1 114.4 4. ge-7-19.car1.StLouis1.Level3.net 41.5% 273 159 34.1 26.6 12.6 213.9 5. ae-11-11.car2.StLouis1.Level3.net 47.3% 273 144 12.6 25.9 12.6 205.6 6. ae-4-4.ebr2.Chicago1.Level3.net 1.1% 273 270 19.2 26.4 18.5 140.8 7. ae-8-8.car1.Detroit1.Level3.net 0.0% 273 273 27.8 38.8 24.1 229.2 8. ge-6-12-222.car2.Detroit1.Level3. 0.0% 273 273 26.5 39.1 24.1 239.0

--Brett

Jeremy Chadwick wrote:

...
It would be helpful if you could use something like mtr instead of traceroute in this case. The below trace could be indicating ICMP de-prioritisation on L3 routers, which is known to be enabled, but could also be an indicator of packet loss starting at hop #5 and "trickling down" through succeeding hops (possibly #10 or #11).

mtr --order=LSRNABW can help in diagnosing this.

Reply

Sign in to reply online Use email software

Frank Bulk

8:42 p.m.

New subject: [outages] Level3 Chicago

Is there version of mtr that include support for TCP or UDP types? A google search didn't reveal anything to me, but would be the best of mtr with the benefits of tcptraceroute. Frank -----Original Message----- From: outages-bounces@outages.org [mailto:outages-bounces@outages.org] On Behalf Of Jeremy Chadwick Sent: Tuesday, August 18, 2009 12:50 PM To: outages@outages.org Subject: Re: [outages] Level3 Chicago It would be helpful if you could use something like mtr instead of traceroute in this case. The below trace could be indicating ICMP de-prioritisation on L3 routers, which is known to be enabled, but could also be an indicator of packet loss starting at hop #5 and "trickling down" through succeeding hops (possibly #10 or #11). mtr --order=LSRNABW can help in diagnosing this. -- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB | On Tue, Aug 18, 2009 at 01:36:58PM -0400, Clayton Zekelman wrote:

Further to the issue below - I'm typically seeing packet loss at hop 5. This particular trace doesn't show it though, although there are timeouts on hop 6.

1 br0.win.mnsi.net (216.8.137.193) 0 msec 0 msec 0 msec 2 ge-6-12-222.car2.Detroit1.Level3.net (64.152.144.77) 0 msec 4 msec 0

msec

3 ae-11-11.car1.Detroit1.Level3.net (4.69.133.245) 4 msec 0 msec 4 msec 4 ae-8-8.ebr2.Chicago1.Level3.net (4.69.133.242) 8 msec 4 msec 16 msec 5 ae-5.ebr2.Chicago2.Level3.net (4.69.140.194) 8 msec 8 msec 8 msec 6 * * ae-2-2.ebr2.Washington1.Level3.net (4.69.132.70) 32 msec 7 ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 36 msec ae-82-82.csw3.Washington1.Level3.net (4.69.134.154) 36 msec ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 32 msec 8 ae-2-79.edge2.Washington4.Level3.net (4.68.17.83) 28 msec ae-3-89.edge2.Washington4.Level3.net (4.68.17.147) 40 msec ae-1-69.edge2.Washington4.Level3.net (4.68.17.19) 24 msec 9 savvis-level3-te.Washington1.Level3.net (4.68.110.102) 32 msec 28 msec 40 msec 10 * * * 11 cr1-bundle-pos1.NewYork.savvis.net (204.70.196.137) 28 msec * 28 msec 12 206.24.199.2 48 msec 32 msec 32 msec 13 toroonnldr01.bb.telus.com (154.11.6.71) 32 msec 192 msec 36 msec 14 host82.145.115.209.in-addr.arpa (209.115.145.82) 40 msec 40 msec 40 msec 15 209.202.116.58 40 msec 48 msec 36 msec 16 www.govital.net (209.202.88.24) 36 msec 40 msec 36 msec

At 01:05 PM 8/18/2009, Clayton Zekelman wrote:

...
I'm seeing some minor packet loss traversing Level3 in Chicago.

We connect to their network in Detroit, and we're clean until the first hop into Chicago.

Anyone else seeing anything like that?

--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8

tel. 519-985-8410 fax. 519-985-8409

_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

No virus found in this incoming message. Checked by AVG - www.avg.com Version: 8.5.409 / Virus Database: 270.13.60/2311 - Release Date: 08/18/09 06:03:00

--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8

tel. 519-985-8410 fax. 519-985-8409

_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

Reply

Sign in to reply online Use email software

Jeremy Chadwick

10:20 p.m.

New subject: [outages] Level3 Chicago

The closest thing I know of is PingPlotter for Windows. I'll remind folks that TCP and UDP-based traceroute/mtr-like utilities still rely on ICMP time exceeded (type 11) being sent out to obtain the present path, so if folks are looking for a pure TCP or UDP-based traceroute or equivalent, to avoid dropping of ICMP, they're barking up the wrong tree. :-) -- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB | On Tue, Aug 18, 2009 at 10:42:14PM -0500, Frank Bulk wrote:

Is there version of mtr that include support for TCP or UDP types? A google search didn't reveal anything to me, but would be the best of mtr with the benefits of tcptraceroute.

Frank

-----Original Message----- From: outages-bounces@outages.org [mailto:outages-bounces@outages.org] On Behalf Of Jeremy Chadwick Sent: Tuesday, August 18, 2009 12:50 PM To: outages@outages.org Subject: Re: [outages] Level3 Chicago

It would be helpful if you could use something like mtr instead of traceroute in this case. The below trace could be indicating ICMP de-prioritisation on L3 routers, which is known to be enabled, but could also be an indicator of packet loss starting at hop #5 and "trickling down" through succeeding hops (possibly #10 or #11).

mtr --order=LSRNABW can help in diagnosing this.

-- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB |

On Tue, Aug 18, 2009 at 01:36:58PM -0400, Clayton Zekelman wrote:

...
Further to the issue below - I'm typically seeing packet loss at hop 5. This particular trace doesn't show it though, although there are timeouts on hop 6.

1 br0.win.mnsi.net (216.8.137.193) 0 msec 0 msec 0 msec 2 ge-6-12-222.car2.Detroit1.Level3.net (64.152.144.77) 0 msec 4 msec 0

msec

...
3 ae-11-11.car1.Detroit1.Level3.net (4.69.133.245) 4 msec 0 msec 4 msec 4 ae-8-8.ebr2.Chicago1.Level3.net (4.69.133.242) 8 msec 4 msec 16 msec 5 ae-5.ebr2.Chicago2.Level3.net (4.69.140.194) 8 msec 8 msec 8 msec 6 * * ae-2-2.ebr2.Washington1.Level3.net (4.69.132.70) 32 msec 7 ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 36 msec ae-82-82.csw3.Washington1.Level3.net (4.69.134.154) 36 msec ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 32 msec 8 ae-2-79.edge2.Washington4.Level3.net (4.68.17.83) 28 msec ae-3-89.edge2.Washington4.Level3.net (4.68.17.147) 40 msec ae-1-69.edge2.Washington4.Level3.net (4.68.17.19) 24 msec 9 savvis-level3-te.Washington1.Level3.net (4.68.110.102) 32 msec 28 msec 40 msec 10 * * * 11 cr1-bundle-pos1.NewYork.savvis.net (204.70.196.137) 28 msec * 28 msec 12 206.24.199.2 48 msec 32 msec 32 msec 13 toroonnldr01.bb.telus.com (154.11.6.71) 32 msec 192 msec 36 msec 14 host82.145.115.209.in-addr.arpa (209.115.145.82) 40 msec 40 msec 40 msec 15 209.202.116.58 40 msec 48 msec 36 msec 16 www.govital.net (209.202.88.24) 36 msec 40 msec 36 msec

At 01:05 PM 8/18/2009, Clayton Zekelman wrote:

...
I'm seeing some minor packet loss traversing Level3 in Chicago.

We connect to their network in Detroit, and we're clean until the first hop into Chicago.

Anyone else seeing anything like that?

--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8

tel. 519-985-8410 fax. 519-985-8409

_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

No virus found in this incoming message. Checked by AVG - www.avg.com Version: 8.5.409 / Virus Database: 270.13.60/2311 - Release Date: 08/18/09 06:03:00

--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8

tel. 519-985-8410 fax. 519-985-8409

_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

Reply

Sign in to reply online Use email software

Frank Bulk - iName.com

19 Aug 19 Aug

7:55 a.m.

New subject: [outages] Level3 Chicago

Good point. Is the ICMP rate-limiting of these largish routers ingress and egress, or only ingress-only? Frank -----Original Message----- From: outages-bounces@outages.org [mailto:outages-bounces@outages.org] On Behalf Of Jeremy Chadwick Sent: Wednesday, August 19, 2009 12:21 AM To: outages@outages.org Subject: Re: [outages] Level3 Chicago The closest thing I know of is PingPlotter for Windows. I'll remind folks that TCP and UDP-based traceroute/mtr-like utilities still rely on ICMP time exceeded (type 11) being sent out to obtain the present path, so if folks are looking for a pure TCP or UDP-based traceroute or equivalent, to avoid dropping of ICMP, they're barking up the wrong tree. :-) -- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB | On Tue, Aug 18, 2009 at 10:42:14PM -0500, Frank Bulk wrote:

Is there version of mtr that include support for TCP or UDP types? A google search didn't reveal anything to me, but would be the best of mtr with the benefits of tcptraceroute.

Frank

-----Original Message----- From: outages-bounces@outages.org [mailto:outages-bounces@outages.org] On Behalf Of Jeremy Chadwick Sent: Tuesday, August 18, 2009 12:50 PM To: outages@outages.org Subject: Re: [outages] Level3 Chicago

It would be helpful if you could use something like mtr instead of traceroute in this case. The below trace could be indicating ICMP de-prioritisation on L3 routers, which is known to be enabled, but could also be an indicator of packet loss starting at hop #5 and "trickling down" through succeeding hops (possibly #10 or #11).

mtr --order=LSRNABW can help in diagnosing this.

-- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB |

On Tue, Aug 18, 2009 at 01:36:58PM -0400, Clayton Zekelman wrote:

...
Further to the issue below - I'm typically seeing packet loss at hop 5. This particular trace doesn't show it though, although there are timeouts on hop 6.

1 br0.win.mnsi.net (216.8.137.193) 0 msec 0 msec 0 msec 2 ge-6-12-222.car2.Detroit1.Level3.net (64.152.144.77) 0 msec 4 msec 0

msec

...
3 ae-11-11.car1.Detroit1.Level3.net (4.69.133.245) 4 msec 0 msec 4 msec 4 ae-8-8.ebr2.Chicago1.Level3.net (4.69.133.242) 8 msec 4 msec 16 msec 5 ae-5.ebr2.Chicago2.Level3.net (4.69.140.194) 8 msec 8 msec 8 msec 6 * * ae-2-2.ebr2.Washington1.Level3.net (4.69.132.70) 32 msec 7 ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 36 msec ae-82-82.csw3.Washington1.Level3.net (4.69.134.154) 36 msec ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 32 msec 8 ae-2-79.edge2.Washington4.Level3.net (4.68.17.83) 28 msec ae-3-89.edge2.Washington4.Level3.net (4.68.17.147) 40 msec ae-1-69.edge2.Washington4.Level3.net (4.68.17.19) 24 msec 9 savvis-level3-te.Washington1.Level3.net (4.68.110.102) 32 msec 28 msec 40 msec 10 * * * 11 cr1-bundle-pos1.NewYork.savvis.net (204.70.196.137) 28 msec * 28 msec 12 206.24.199.2 48 msec 32 msec 32 msec 13 toroonnldr01.bb.telus.com (154.11.6.71) 32 msec 192 msec 36 msec 14 host82.145.115.209.in-addr.arpa (209.115.145.82) 40 msec 40 msec 40 msec 15 209.202.116.58 40 msec 48 msec 36 msec 16 www.govital.net (209.202.88.24) 36 msec 40 msec 36 msec

At 01:05 PM 8/18/2009, Clayton Zekelman wrote:

...
I'm seeing some minor packet loss traversing Level3 in Chicago.

We connect to their network in Detroit, and we're clean until the first hop into Chicago.

Anyone else seeing anything like that?

--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8

tel. 519-985-8410 fax. 519-985-8409

_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

No virus found in this incoming message. Checked by AVG - www.avg.com Version: 8.5.409 / Virus Database: 270.13.60/2311 - Release Date: 08/18/09 06:03:00

--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8

tel. 519-985-8410 fax. 519-985-8409

_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

Reply

Sign in to reply online Use email software

'Jeremy Chadwick'

8:07 a.m.

New subject: [outages] Level3 Chicago

That's an excellent question -- and one I've always wondered myself. This is purely speculative, but I believe outbound ICMP (e.g. sent from the router to whatever src solicited it) is what's de-prioritised. Someone more familiar with Cisco and Juniper equipment might know for certain. -- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB | On Wed, Aug 19, 2009 at 09:55:40AM -0500, Frank Bulk - iName.com wrote:

Good point. Is the ICMP rate-limiting of these largish routers ingress and egress, or only ingress-only?

Frank

-----Original Message----- From: outages-bounces@outages.org [mailto:outages-bounces@outages.org] On Behalf Of Jeremy Chadwick Sent: Wednesday, August 19, 2009 12:21 AM To: outages@outages.org Subject: Re: [outages] Level3 Chicago

The closest thing I know of is PingPlotter for Windows.

I'll remind folks that TCP and UDP-based traceroute/mtr-like utilities still rely on ICMP time exceeded (type 11) being sent out to obtain the present path, so if folks are looking for a pure TCP or UDP-based traceroute or equivalent, to avoid dropping of ICMP, they're barking up the wrong tree. :-)

-- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB |

On Tue, Aug 18, 2009 at 10:42:14PM -0500, Frank Bulk wrote:

...
Is there version of mtr that include support for TCP or UDP types? A google search didn't reveal anything to me, but would be the best of mtr with the benefits of tcptraceroute.

Frank

-----Original Message----- From: outages-bounces@outages.org [mailto:outages-bounces@outages.org] On Behalf Of Jeremy Chadwick Sent: Tuesday, August 18, 2009 12:50 PM To: outages@outages.org Subject: Re: [outages] Level3 Chicago

It would be helpful if you could use something like mtr instead of traceroute in this case. The below trace could be indicating ICMP de-prioritisation on L3 routers, which is known to be enabled, but could also be an indicator of packet loss starting at hop #5 and "trickling down" through succeeding hops (possibly #10 or #11).

mtr --order=LSRNABW can help in diagnosing this.

-- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB |

On Tue, Aug 18, 2009 at 01:36:58PM -0400, Clayton Zekelman wrote:

...
Further to the issue below - I'm typically seeing packet loss at hop 5. This particular trace doesn't show it though, although there are timeouts on hop 6.

1 br0.win.mnsi.net (216.8.137.193) 0 msec 0 msec 0 msec 2 ge-6-12-222.car2.Detroit1.Level3.net (64.152.144.77) 0 msec 4 msec 0

msec

...
3 ae-11-11.car1.Detroit1.Level3.net (4.69.133.245) 4 msec 0 msec 4 msec 4 ae-8-8.ebr2.Chicago1.Level3.net (4.69.133.242) 8 msec 4 msec 16 msec 5 ae-5.ebr2.Chicago2.Level3.net (4.69.140.194) 8 msec 8 msec 8 msec 6 * * ae-2-2.ebr2.Washington1.Level3.net (4.69.132.70) 32 msec 7 ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 36 msec ae-82-82.csw3.Washington1.Level3.net (4.69.134.154) 36 msec ae-72-72.csw2.Washington1.Level3.net (4.69.134.150) 32 msec 8 ae-2-79.edge2.Washington4.Level3.net (4.68.17.83) 28 msec ae-3-89.edge2.Washington4.Level3.net (4.68.17.147) 40 msec ae-1-69.edge2.Washington4.Level3.net (4.68.17.19) 24 msec 9 savvis-level3-te.Washington1.Level3.net (4.68.110.102) 32 msec 28 msec 40 msec 10 * * * 11 cr1-bundle-pos1.NewYork.savvis.net (204.70.196.137) 28 msec * 28 msec 12 206.24.199.2 48 msec 32 msec 32 msec 13 toroonnldr01.bb.telus.com (154.11.6.71) 32 msec 192 msec 36 msec 14 host82.145.115.209.in-addr.arpa (209.115.145.82) 40 msec 40 msec 40 msec 15 209.202.116.58 40 msec 48 msec 36 msec 16 www.govital.net (209.202.88.24) 36 msec 40 msec 36 msec

At 01:05 PM 8/18/2009, Clayton Zekelman wrote:

...
I'm seeing some minor packet loss traversing Level3 in Chicago.

We connect to their network in Detroit, and we're clean until the first hop into Chicago.

Anyone else seeing anything like that?

--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8

tel. 519-985-8410 fax. 519-985-8409

_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

No virus found in this incoming message. Checked by AVG - www.avg.com Version: 8.5.409 / Virus Database: 270.13.60/2311 - Release Date: 08/18/09 06:03:00

--- Clayton Zekelman Managed Network Systems Inc. (MNSi) 344-300 Tecumseh Rd. E. Windsor, Ontario N8X 5E8

tel. 519-985-8410 fax. 519-985-8409

_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

Reply

Sign in to reply online Use email software

Devon True

8:50 a.m.

New subject: [outages] Level3 Chicago

'Jeremy Chadwick' wrote:

That's an excellent question -- and one I've always wondered myself.

This is purely speculative, but I believe outbound ICMP (e.g. sent from the router to whatever src solicited it) is what's de-prioritised.

Someone more familiar with Cisco and Juniper equipment might know for certain.

Usually packets destined to the control-plane of the system are prioritized based on criteria. It is better to let routing control protocols (e.g. ospf, bgp, isis) through first than someone pinging or running a traceroute. Packets *through* the router take the normal forwarding path and are not affected by this system. There may be system defaults based on hardware/software, but Cisco has CoPP and I believe Juniper uses a firewall on the lo0 interface (been a while since I touched one) for user-defined rules. -- Devon

Reply

Sign in to reply online Use email software

Craig Pierantozzi

11:47 a.m.

New subject: [outages] Level3 Chicago

Yes, and when pure prioritization is not available, ISPs will use rate limiting too which is one of the ways Level 3 restricts ICMP to the cpu on the core devices. Also, the car device is an edge router so there could be congestion on a customer port too when higher response times are seen on the other side of a hop. Response times that settle could be the control plane/data plane issue or could be once traffic gets to a far end there's an asymetric path that goes a different return path rather than back across the link seen on the forward traceroute. All these are why simple pings and traceroutes don't always tell the story. * Devon True was thought to have said:

'Jeremy Chadwick' wrote:

...
That's an excellent question -- and one I've always wondered myself.

This is purely speculative, but I believe outbound ICMP (e.g. sent from the router to whatever src solicited it) is what's de-prioritised.

Someone more familiar with Cisco and Juniper equipment might know for certain.

Usually packets destined to the control-plane of the system are prioritized based on criteria. It is better to let routing control protocols (e.g. ospf, bgp, isis) through first than someone pinging or running a traceroute. Packets *through* the router take the normal forwarding path and are not affected by this system.

There may be system defaults based on hardware/software, but Cisco has CoPP and I believe Juniper uses a firewall on the lo0 interface (been a while since I touched one) for user-defined rules.

-- Devon

_______________________________________________ outages mailing list outages@outages.org https://puck.nether.net/mailman/listinfo/outages

Reply

Sign in to reply online Use email software

Matthew Walker

18 Aug 18 Aug

11:33 a.m.

New subject: [outages] was Level3 Chicago

I know Nlayer has multiple cities with Level3, try doing ping/ traceroutes using their Looking Glass. I also provided l3 LG too. http://lg.nlayer.net/ http://lg.level3.net/ Clayton Zekelman wrote:

I'm seeing some minor packet loss traversing Level3 in Chicago. We connect to their network in Detroit, and we're clean until the first hop into Chicago. Anyone else seeing anything like that?

Reply

Sign in to reply online Use email software

6041

Age (days ago)

6042

Last active (days ago)

Download

12 comments

10 participants

tags

participants (10)

'Jeremy Chadwick'
Brett Cooper
Clayton Zekelman
Craig Pierantozzi
Devon True
Frank Bulk
Frank Bulk - iName.com
Jeremy Chadwick
Matthew Walker
Richard Golodner