I am having an extremely strange issue with a BL460c1 (G1) blade in a c7000 enclosure. I could not for the life of me get the machine to ping the gateway or any other blade in the same enclosure (yes, the subnet mask was correct and quadruple-checked), although pinging the local IP works. I was almost convinced that it was a network or hardware issue, so I asked someone to install Windows on that blade to confirm that it failed there as well. To my surprise, it worked fine in Windows after installing the network driver and simply setting the IP address (the same IP I was trying to configure CentOS with).
The interfaces (eth0 and eth1) come up fine. The cards are:
03:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5708S Gigabit Ethernet (rev 12)
07:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5708S Gigabit Ethernet (rev 12)
NetworkManager is disabled. There is no bonding configured. The network firmware is the latest available from the HP website (bootcode 4.4.1, CLP 1.3.6). The setup should be simple. If I run:
ifconfig eth0 172.12.34.112 netmask 255.255.255.0
I am supposed to be able to:
ping 172.12.34.1
However, I get:
PING 172.12.34.1 (172.12.34.1) 56(84) bytes of data.
From 172.12.34.112 icmp_seq=2 Destination Host Unreachable
From 172.12.34.112 icmp_seq=3 Destination Host Unreachable
From 172.12.34.112 icmp_seq=4 Destination Host Unreachable
From 172.12.34.112 icmp_seq=5 Destination Host Unreachable
I cannot even ping blades in the same enclosure. Yet in Windows Server 2008, with the same IP configured, pinging the same addresses works.
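Since "Destination Host Unreachable" is reported from my own address (172.12.34.112), the kernel is giving up on ARP resolution before any ICMP packet leaves the box. A minimal sketch of how that could be confirmed, assuming iproute2 is installed; `neigh_state` is a hypothetical helper name, and the snippet is not output captured from this blade:

```shell
# Hypothetical helper (not part of the troubleshooting above): print the
# neighbour-cache state that `ip neigh show` reports for one IP address.
# Reads `ip neigh show` output on stdin and assumes the iproute2 layout in
# which the address is the first field and the state (REACHABLE, STALE,
# FAILED, INCOMPLETE, ...) is the last field.
neigh_state() {
    awk -v ip="$1" '$1 == ip { print $NF; found = 1 }
                    END { if (!found) print "NO_ENTRY" }'
}

# Intended use on the blade (not executed here):
#   ping -c 1 172.12.34.1
#   ip neigh show dev eth0 | neigh_state 172.12.34.1
```

A state of FAILED or INCOMPLETE for the gateway would mean that ARP requests are going out (or are being dropped) without a usable reply coming back, which would point at layer 2 rather than at IP configuration.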
I tried:
- the disable_msi=1 parameter of the bnx2 driver
- a fresh install of CentOS 6.3
- a fresh install of RHEL 6.2
- a live CD of CentOS 5.5
- Clonezilla-ubuntu and systemrescuecd (kernel 3.2) live CDs
- CentOS 6.3 with the kernel(-headers,-firmware,-devel) updated to the latest (RPMs were copied with a USB/iLO)
- the bnx2 network driver available from the HP website
- the bnx2 network driver available from Broadcom
Other than a check_ncic warning with the stock 6.3 driver (which doesn't appear with the other drivers), bnx2 isn't logging anything problematic in dmesg or /var/log/messages. Strangely, tcpdump shows random traffic destined for different IPs (probably from the same enclosure), but those IPs do not respond to ping either.
However, I got a feeling of deja vu in the midst of all this. I recall setting up an RHEL machine somewhere else 2-3 years ago and hitting the same symptom: the network worked with Windows but not with Linux. It turned out that autonegotiation was disabled on the gigabit network (my few attempts at playing with ethtool did not help). Searching online led me to people saying it ought to be enabled anyway, as a standard requires it for gigabit. When the network guys enabled autoneg, it started working in Linux.
I am wondering if I'm facing the same issue here, as I see ethtool saying:
Advertised auto-negotiation: No
Speed: 1000Mb/s
I tried some ethtool settings (setting autoneg off and forcing 1000, other random options, etc.) but it didn't help, though possibly I was trying the wrong things. I am not very familiar with HP blades, but it seems I cannot enable autoneg for this blade (?). I do not have direct access to the hardware (everything was done through iLO).
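What looks odd to me in the ethtool output is that autonegotiation is reported as on while nothing is advertised for it. A rough sketch for pulling out just those two fields, assuming the `ethtool <iface>` output layout shown in this post; `autoneg_summary` is a hypothetical helper name, and the `ethtool -s` lines in the comments only illustrate the kind of settings I mean, not the exact commands I ran:

```shell
# Hypothetical helper: summarise the two auto-negotiation fields from
# `ethtool <iface>` output read on stdin.
autoneg_summary() {
    awk -F': *' '
        /Advertised auto-negotiation:/ { adv = $2 }   # what the NIC advertises
        /^[ \t]*Auto-negotiation:/     { neg = $2 }   # current autoneg setting
        END { printf "advertised=%s negotiated=%s\n", adv, neg }
    '
}

# Intended use on the blade (not executed here):
#   ethtool eth0 | autoneg_summary
#
# Settings of the kind mentioned above (standard ethtool options):
#   ethtool -s eth0 speed 1000 duplex full autoneg off   # force 1000/full
#   ethtool -s eth0 autoneg on                           # re-enable autoneg
#   ethtool -r eth0                                      # restart negotiation
```

For the eth0 output further down, this would report `advertised=No negotiated=on`. Whether that combination is actually the cause is exactly what I am unsure about.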
Has anyone faced issues with a gigabit network with autoneg disabled? Or are there any other ideas? This is the only Linux machine in the entire network.
Here is some extra info:
ethtool eth0:
Settings for eth0:
    Supported ports: [ FIBRE ]
    Supported link modes:   1000baseT/Full
                            2500baseX/Full
    Supports auto-negotiation: Yes
    Advertised link modes:  1000baseT/Full
                            2500baseX/Full
    Advertised pause frame use: No
    Advertised auto-negotiation: No
    Speed: 1000Mb/s
    Duplex: Full
    Port: FIBRE
    PHYAD: 2
    Transceiver: internal
    Auto-negotiation: on
    Supports Wake-on: g
    Wake-on: g
    Link detected: yes
-----------------------
modinfo bnx2:
filename:       /lib/modules/2.6.32-279.11.1.el6.x86_64/kernel/drivers/net/bnx2.ko
...
version:        2.2.1
license:        GPL
description:    Broadcom NetXtreme II BCM5706/5708/5709/5716 Driver
...
vermagic:       2.6.32-279.11.1.el6.x86_64 SMP mod_unload modversions
...
-----------------------
route -n:
Kernel IP routing table
Destination     Gateway         Genmask         Flags Metric Ref    Use Iface
172.12.34.0     0.0.0.0         255.255.255.0   U     0      0        0 eth0
169.254.0.0     0.0.0.0         255.255.0.0     U     1002   0        0 eth0
0.0.0.0         172.12.34.1     0.0.0.0         UG    0      0        0 eth0
---------------------
ifconfig -a:
eth0      Link encap:Ethernet  HWaddr 00:19:BB:34:FA:70
          inet addr:172.12.34.112  Bcast:172.12.34.255  Mask:255.255.255.0
          inet6 addr: fe80::219:bbff:fe34:fa70/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:6800 errors:0 dropped:0 overruns:0 frame:0
          TX packets:34 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:4616962 (4.4 MiB)  TX bytes:4827 (4.7 KiB)
          Interrupt:16 Memory:f6000000-f6012800

eth1      Link encap:Ethernet  HWaddr 00:19:BB:34:FA:78
          BROADCAST MULTICAST  MTU:1500  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:0 (0.0 b)  TX bytes:0 (0.0 b)
          Interrupt:16 Memory:fa000000-fa012800

lo        Link encap:Local Loopback
          inet addr:127.0.0.1  Mask:255.0.0.0
          ....
traceroute 172.12.34.1:
traceroute to 172.12.34.1 (172.12.34.1), 30 hops max, 60 byte packets
 1  172.12.34.112 (172.12.34.112)  3000.686 ms !H  3000.673 ms !H  3000.661 ms !H
....
traceroute 172.12.34.112:
traceroute to 172.12.34.112 (172.12.34.112), 30 hops max, 60 byte packets
 1  172.12.34.112 (172.12.34.112)  0.041 ms  0.011 ms  0.009 ms
-xrx