Geographically distributed file system with preferred locality

Posted by dpb on Server Fault See other posts from Server Fault or by dpb
Published on 2010-03-25T00:36:38Z Indexed on 2010/03/25 0:43 UTC
Read the original article Hit count: 400

Hi All --

I'm building a application that needs to distribute a standard file server across a few sites over a WAN. Basically, each site needs to write a lot of misc files of varying size (some in the 100s MB range, but most small), and the application is written such that collisions aren't a problem. I'd like to have a system set up that meets the following qualifications:

  1. Each site can store files in a shared "namespace". That is, all the files would show up in the same filesystem.
  2. Each site would not send data over the WAN unless necessary. I.e., there would be local storage on each side of the WAN that would be "merged" into the same logical filesystem.
  3. Linux & Free ($$$) is a must.

Basically, something like a central NFS share would meet most of the requirements, however it would not allow the locally written data to stay local. All data from remote sides of the WAN would be copied locally all the time.

I have looked into Lustre, and have run some successful tests with it, however, it appears to distribute files fairly uniformly across the distributed storage. I have dug through the documentation and have not found anything that automatically will "prefer" local storage over remote storage. Even something that went with the lowest latency storage would be fine. It would work most of the time, which would meet this application's requirements.

Any ideas?

© Server Fault or respective owner

Related posts about linux

Related posts about nfs