NIC Flapping problem?
Peter.Talmadge at CyberTools.biz
Peter.Talmadge at CyberTools.biz
Thu Jun 15 08:58:20 CDT 2006
Hi all,
Sometimes our NIC will drop connection for no apparent reason, and
usually shortly thereafter come back online. However sometimes it
will freeze up the machine.
We have tried many hardware solutions including switching out the
NIC's, switching the firewall and switch devices, and even switching
out the motherboard which had an onboard Gb card. Nothing seems to
cure it, and we never had these problems prior to upgrading to RHEL4
from RedHat 9.
Our latest effort is to slow down the NIC to 100Mb. We have set the
card to 100Mb/full duplex/autonegotiation off. However, we have had
the NIC hang on us again yesterday. In the /var/log/messages file,
as usual there is message like:
Jun 14 12:30:19 maple kernel: NETDEV WATCHDOG: eth0: transmit timed out
Jun 14 12:30:23 maple kernel: e1000: eth0: e1000_watchdog: NIC Link
is Up 100 Mbps Full Duplex
We are running a FortiGate60 firewall, and a D-Link Gigabit switch
that after this incident has been replaced by a Dell PowerConnect
2708. The server is a PowerEdge 2650. I have tried to include some
diagnostics that might help below. Any help would be
appreciated. One other question, is this the official place to get
the latest e1000 driver version
http://sourceforge.net/projects/e1000, and would it help to update the driver?
ethtools -S eth0:
NIC statistics:
rx_packets: 47386363
tx_packets: 88662955
rx_bytes: 4111506176
tx_bytes: 2296849700
rx_errors: 39526
tx_errors: 0
rx_dropped: 0
tx_dropped: 0
multicast: 0
collisions: 0
rx_length_errors: 19930
rx_over_errors: 0
rx_crc_errors: 12476
rx_frame_errors: 7003
rx_fifo_errors: 117
rx_no_buffer_count: 0
rx_missed_errors: 117
tx_aborted_errors: 0
tx_carrier_errors: 0
tx_fifo_errors: 0
tx_heartbeat_errors: 0
tx_window_errors: 0
tx_abort_late_coll: 0
tx_deferred_ok: 0
tx_single_coll_ok: 0
tx_multi_coll_ok: 0
rx_long_length_errors: 0
rx_short_length_errors: 0
rx_align_errors: 7003
tx_tcp_seg_good: 595533
tx_tcp_seg_failed: 0
rx_flow_control_xon: 0
rx_flow_control_xoff: 0
tx_flow_control_xon: 0
tx_flow_control_xoff: 0
rx_long_byte_count: 4111506176
rx_csum_offload_good: 47135867
rx_csum_offload_errors: 52
lspci -vv:
03:01.0 Ethernet controller: Intel Corporation 82544GC Gigabit
Ethernet Controller (LOM) (rev 02)
Subsystem: Dell PRO/1000 XT Network Connection
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop-
ParErr- Stepping- SERR+ FastB2B-
Status: Cap+ 66Mhz+ UDF- FastB2B- ParErr-
DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 64 (63750ns min), Cache Line Size 10
Interrupt: pin A routed to IRQ 5
Region 0: Memory at fdae0000 (64-bit, non-prefetchable) [size=128K]
Region 2: Memory at fdac0000 (64-bit, non-prefetchable) [size=128K]
Region 4: I/O ports at dce0 [size=32]
Expansion ROM at fdb00000 [disabled] [size=128K]
Capabilities: [dc] Power Management version 2
Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA
PME(D0+,D1-,D2-,D3hot+,D3cold+)
Status: D0 PME-Enable- DSel=0 DScale=1 PME-
Capabilities: [e4] PCI-X non-bridge device.
Command: DPERE- ERO+ RBC=0 OST=0
Status: Bus=3 Dev=1 Func=0 64bit+ 133MHz+ SCD- USC-,
DC=simple, DMMRBC=2, DMOST=0, DMCRS=1, RSCEM-
Capabilities: [f0] Message Signalled Interrupts: 64bit+
Queue=0/0 Enable-
Address: 0000000000000000 Data: 0000
ifconfig:
eth0 Link encap:Ethernet HWaddr 00:06:5B:F3:ED:9E
inet addr:10.10.10.67 Bcast:10.10.10.255 Mask:255.255.255.0
inet6 addr: fe80::206:5bff:fef3:ed9e/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:47388158 errors:39547 dropped:117 overruns:117
frame:39430
TX packets:88664487 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:4111761171 (3.8 GiB) TX bytes:2297142353 (2.1 GiB)
Base address:0xdce0 Memory:fdae0000-fdb00000
More information about the Linux-PowerEdge
mailing list