[CentOS] ZFS on Linux in production?

Chuck Munro chuckm at seafoam.net
Fri Oct 25 18:14:41 UTC 2013


On 10/25/2013, 05:00 , centos-request at centos.org wrote:
> We are a CentOS shop, and have the lucky, fortunate problem of having
> ever-increasing amounts of data to manage. EXT3/4 becomes tough to
> manage when you start climbing, especially when you have to upgrade, so
> we're contemplating switching to ZFS.
>
> As of last spring, it appears that ZFS On Linuxhttp://zfsonlinux.org/
> calls itself production ready despite a version number of 0.6.2, and
> being acknowledged as unstable on 32 bit systems.
>
> However, given the need to do backups, zfs send sounds like a godsend
> over rsync which is running into scaling problems of its own. (EG:
> Nightly backups are being threatened by the possibility of taking over
> 24 hours per backup)
>
> Was wondering if anybody here could weigh in with real-life experience?
> Performance/scalability?
>
> -Ben

FWIW, I manage a small IT shop with a redundant pair of ZFS file servers 
running the  zfsonlinux.org  package on 64-bit ScientificLinux-6 
platforms.  CentOS-6 would work just as well.  Installing it with yum 
couldn't be simpler, but configuring it takes a bit of reading and 
experimentation.  I reserved a bit more than 1GByte of RAM for each 
TByte of disk.

One machine (20 useable TBytes in raid-z3) is the SMB server for all of 
the clients, and the other machine (identically configured) sits in the 
background acting as a hot spare.  Users tell me that performance is 
quite good.

After about 2 months of testing, there have been no problems whatsoever, 
although I'll admit the servers do not operate under much stress.  There 
is a cron job on each machine that does a scrub every Sunday.

The old ext4 primary file servers have been shut down and the ZFS boxes 
put into production, although one of the old ext4 servers will remain 
rsync'd to the new machines for a few more months (just in case).

The new servers have the zfsonlinux repositories configured for manual 
updates, but the two machines tend to be left alone unless there are 
important security updates or new features I need.

To keep the two servers in sync I use 'lsyncd' which is essentially a 
front-end for rsync that cuts down thrashing and overhead dramatically 
by excluding the full filesystem scan and using inotify to figure out 
what to sync.  This allows almost-real-time syncing of the backup 
machine.  (BTW, you need to crank the resources for inotify waaaaay up 
for large filesystems with a couple million files.)

So far, so good.  I still have a *lot* to learn about ZFS and its 
feature set, but for now it's doing the job very nicely.  I don't miss 
the long ext4 periodic fsck's one bit  :-)

YMMV, of course,
Chuck



More information about the CentOS mailing list