I am trying to
copy a large amount (about 50 GB) of data over my network from a Hyper-V-hosted virtual machine running Ubuntu 11.04 (Natty Narwhal) to another (non-virtual) Ubuntu host that I plan to use for testing upgrades to one of our web applications. The problem I am having is with the virtual machine, which I shall refer to in what follows as "source.host". This machine is running 64-bit Ubuntu Server with the 2.6.38-8-server kernel
and the Microsoft Linux Integration Components for Hyper-V kernel modules (hv_utils, hv_timesource, hv_netvsc, hv_blkvsc, hv_storvsc,
and hv_vmbus) loaded. It uses a Hyper-V "synthetic network adapter" for its networking interface.
To do the
copy, I log on to the machine with the data
and run the following commands (Call the remote machine "destination.host".):
$ cd /path/to/data
$ tar -cvf - datafolder/ | ssh
[email protected] "cat > ~/data.tar"
This runs for a while
and then suddenly stops after transferring somewhere from 2-6 GB. The terminal on the source.host machine displays a Write failed: broken pipe error. The odd part is this: after this occurs, the "source.host" machine is no longer able to talk to the rest of the network. I cannot ping any other hosts on the network from the "source.host" machine,
and I cannot ping the "source.host" machine from any other host on the network. I am equally unable to access the any of the web services hosted on "source.host".
Running ifconfig on "source.host" shows the network adapter to be up
and running as usual with the correct IP address
and everything. I tried restarting the networking service with
$ /etc/init.d/networking restart
but the problem does not go away. Restarting the machine makes it capable of talking to the network again -- it can ping
and be pinged by other hosts,
and the web services are also accessible
and usable as normal -- but attempting the
copy operation again results in the same failure, requiring another restart.
As an experiment, I tried replacing the tar -- ssh pipeline above with a straight scp:
$ scp -r datafolder/
[email protected]:~
but to no avail
Thinking that the issue might have to do with the kernel packet-send buffers filling up, I tried increasing the buffer size to 12 MB (up from the 128 KB default) with
# echo 12582911 > /proc/sys/net/core/wmem_max
but this also had no effect.
I'm guessing at this point that it might be a problem with the Microsoft synthetic network driver, but I don't really know. Does anyone have any suggestions?
Thank you very much in advance!