Re: ZFS vs BTRFS

17 Apr 2012

      On Tue, 17 Apr 2012, Toby Corkindale <toby.corkindale@strategicdata.com.au> 
wrote:
...
On 17/04/12 11:25, Trent W. Buck wrote:
...
Russell Coker wrote:
...
One of my clients needs to reliably store terabytes of data which is
mostly comprised of data files in the 10MB - 15MB size range.  The
data files will almost never be re-written [...]
Will they ever be re-read? ;-)
Yes.  There will be batch jobs that process them.
...
For that matter, do you really need all the files stored on a single
machine? No matter what level of RAID you use, you still have a single
point of failure in the server hardware.
Why not pick up several smaller servers and use a cluster filesystem
instead?
A cluster filesystem requires significantly more work to administer.  It needs 
to have at least 3 servers to avoid split-brain problems and will either 
involve performance loss (in the case of a cluster where the blocks are shared 
via Ethernet) or significant extra expense (in the case of shared block 
devices).

http://www.dell.com/au/business/p/poweredge-t110-2/pd
http://www.dell.com/au/business/p/poweredge-t610/pd

A cluster filesystem also doesn't solve any real problems.  A Dell server 
which can handle 8 disks (all that is needed for the next few years) is not 
particularly expensive.  A PowerEdge T610 costs $2100 plus about $200 per disk 
(or a lot more per disk if you buy disks from Dell).  A PowerEdge T110 costs 
$700 and can handle 4 disks.  If you had three of the T110 servers in a 
cluster they would cost the same as a single T610 and provide the same 
effective capacity, however you would purchase 12 disks instead of 8 and thus 
the hardware cost would involve an additional 4 disks ($800 or more if you buy 
disks from Dell).  Then there's the cost of sysadmin work.

Now one thing that a cluster can solve is a server failing at some unexpected 
time (*).  But in this case the use is going to be 9-5 operation.  Batch jobs 
will be run overnight, but getting a cluster fail-over event to not interrupt 
the batch jobs would be more effort than it's worth.  Getting a system that 
can handle 8*SATA disks isn't THAT difficult.  In the unlikely event that a 
Dell server entirely broke and Dell couldn't fix it fast enough it would be OK 
to install a white-box system as a temporary replacement.  So the failure 
recovery case will not be preserving 24*7 operation, but not wasting too much 
9-5 time after the problem has been discovered.

http://etbe.coker.com.au/2010/08/04/clusters-dont-work/

(*) In theory, in practice I haven't observed that happening.  See the above 
URL.

-- 
My Main Blog         http://etbe.coker.com.au/
My Documents Blog    http://doc.coker.com.au/