Broadcom NetXtremeII Packet Loss/PE1950
Matt Saladna
msaladna at apisnetworks.com
Thu Jan 3 13:29:09 CST 2008
Hi,
Has anyone noticed sporadic packet loss with the BCM5708 NICs on Linux?
I have been attempting to track down the root cause, but everything is
turning up empty.
The symptoms are once every few hundred HTTP requests or so, there will
be a timeout; same goes for DNS resolution. It looks like any TCP or
UDP request has a 1 in 30 or so chance of abruptly timing out on this
card. I have been testing by running a lookup query to 4.2.2.1 and
averaging out trip times with the following script:
( set -o pipefail ; let COUNT=0 ; RET=0 ; until [ $RET -ne 0 -o $COUNT
-gt 1000 ] ; do dig +short +trace @4.2.2.1 google.com | grep 'from
server' | awk '{print $7}' ; RET=$? ; ((COUNT++)) ; done ) | awk '{ SUM
+= $1; COUNT += 1; } END { print SUM/COUNT, COUNT }' ; clock
Three of the servers running CentOS 4 with the BCM5708 chipset die
within the first 30 queries:
dig: couldn't get address for 'H.ROOT-SERVERS.NET': not found
722.948 77
Two of the three servers are running the official 2.6.22.1 kernel, while
another is running on 2.6.24-rc6. There is a fourth server, a PE SC1435
with the Broadcom BCM5721 chipset, which performs the lookups
flawlessly. All four servers are on the same switch in the facility
effectively ruling out network issues on the uplink. I removed the TOE
from one of the troubled PE1950s last weekend, but that hasn't affected
packet loss. Packet loss occurs independent of any offload settings
made via ethtool. I have turned all of the parameters off and on with
no success.
Finally, just for kicks, I decided to run the DNS lookup test on a
development machine with a mirror image of the filesystem. I'm running
2.6.22.2 with Via's VT6102 (Rhine-II) embedded NIC. This works without
a problem either.
Thanks!
Matt Saladna
Apis Networks
More information about the Linux-PowerEdge
mailing list