A shared filesystem is commonly used among distributed containers to share data and state.
This benchmark aims to test various shared filesystem solutions for (geo-)distributed containers and to find the best one in terms of read/write throughput, responsiveness, and scalability.
The usage pattern of the shared filesystem by distributed containers is characterized as follows:
- Must be POSIX-compliant
- Frequent small random reads/writes, usually more reads than writes
- Occasional bulk data transfers, typically at a scale ranging from ~10 GB to ~1000 GB
- Concurrent reads/writes by multiple clients; the concurrency ranges from 2-3 to ~100 clients
Based on the scenario and usage pattern, we define the performance metrics for evaluating a solution. In general, we evaluate the following aspects of every solution.
We measure read and write throughput to evaluate the performance of a filesystem in bulk data transfers, which we define as transfers involving GBs of data.
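As a rough illustration, the following minimal sketch times a large sequential write and read through a mounted filesystem. The mount point /mnt/sharedfs and the 10 GiB transfer size are hypothetical choices for the sketch, not part of the benchmark specification:

```python
import os
import time

MOUNT = "/mnt/sharedfs"          # hypothetical mount point of the filesystem under test
PATH = os.path.join(MOUNT, "bulk.bin")
SIZE = 10 * 1024**3              # total bytes to transfer (10 GiB)
CHUNK = 4 * 1024**2              # 4 MiB per I/O request

def write_throughput():
    """Sequential write throughput in MB/s."""
    buf = os.urandom(CHUNK)
    start = time.monotonic()
    with open(PATH, "wb") as f:
        for _ in range(SIZE // CHUNK):
            f.write(buf)
        f.flush()
        os.fsync(f.fileno())     # ensure the data actually reaches the storage backend
    return SIZE / (time.monotonic() - start) / 1e6

def read_throughput():
    """Sequential read throughput in MB/s."""
    start = time.monotonic()
    with open(PATH, "rb") as f:
        while f.read(CHUNK):
            pass
    return SIZE / (time.monotonic() - start) / 1e6
```

Note that the page cache should be dropped before the read pass (e.g., via /proc/sys/vm/drop_caches on Linux); otherwise the measurement reflects cache hits rather than actual filesystem throughput.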
Responsiveness is defined as the average time to complete an operation on a filesystem. We measure responsiveness to evaluate the performance of a filesystem under small random read/write operations, which typically constitute the majority of operations on a filesystem. The operations include file creation, read, write, and deletion.
Responsiveness also reflects the overhead of data access in a shared file system: with a large number of operations, the delay incurred by each operation accumulates and can become a significant portion of the end-to-end runtime of the application performing them.
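The following minimal sketch shows how such per-operation latencies could be sampled, again assuming the hypothetical /mnt/sharedfs mount point. It times many small create/write, read, and delete operations and reports the averages in milliseconds:

```python
import os
import statistics
import time

MOUNT = "/mnt/sharedfs"   # hypothetical mount point of the filesystem under test

def op_latency(n=1000, size=4096):
    """Average latency (ms) of create+write, read, and delete on small files."""
    payload = os.urandom(size)
    create, read, delete = [], [], []
    for i in range(n):
        path = os.path.join(MOUNT, f"lat-{i}.bin")

        t = time.monotonic()
        with open(path, "wb") as f:       # file creation + small write
            f.write(payload)
        create.append(time.monotonic() - t)

        t = time.monotonic()
        with open(path, "rb") as f:       # small read
            f.read()
        read.append(time.monotonic() - t)

        t = time.monotonic()
        os.remove(path)                   # deletion
        delete.append(time.monotonic() - t)

    ms = lambda xs: statistics.mean(xs) * 1000
    return {"create+write": ms(create), "read": ms(read), "delete": ms(delete)}
```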
Scalability is reflected by 1) the ability to grow/shrink the storage capacity and 2) the ability to serve multiple distributed clients performing concurrent reads/writes.
We evaluate 1) qualitatively for every filesystem by checking whether it allows dynamic growing/shrinking of storage capacity and how simple scaling up and down is.
We evaluate 2) by measuring the throughput and responsiveness of the target filesystem as the number of clients increases.
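A minimal sketch of this measurement, reusing the small-file workload style from above, is shown below. The mount point and client counts are again hypothetical, and for simplicity the clients here are processes on one host, whereas the actual benchmark runs them on separate hosts:

```python
import multiprocessing as mp
import os
import time

MOUNT = "/mnt/sharedfs"   # hypothetical mount point of the filesystem under test

def client_workload(client_id, n_ops=200, size=4096):
    """One simulated client: create, read back, and delete small files; returns ops/s."""
    payload = os.urandom(size)
    start = time.monotonic()
    for i in range(n_ops):
        path = os.path.join(MOUNT, f"client{client_id}-{i}.bin")
        with open(path, "wb") as f:
            f.write(payload)
        with open(path, "rb") as f:
            f.read()
        os.remove(path)
    # Each iteration performs three operations: create+write, read, delete.
    return 3 * n_ops / (time.monotonic() - start)

def scale_test(client_counts=(2, 4, 8, 16, 32, 64, 100)):
    """Record aggregate operation rate as the number of concurrent clients grows."""
    for n in client_counts:
        with mp.Pool(n) as pool:
            rates = pool.map(client_workload, range(n))
        print(f"{n:3d} clients: aggregate {sum(rates):.1f} ops/s")

if __name__ == "__main__":
    scale_test()
```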
With these performance aspects defined, we mainly use the following metrics to quantify the performance of a file system:
- Read/write throughput (MB/s)
- Operation latency (ms)
- Maximum number of concurrent clients (without crashing the filesystem or causing a significant performance drop)
- Steps needed for growing/shrinking storage capacity
The filesystem solutions to be evaluated are listed below:
- NFSv4
- NFS Ganesha
- GlusterFS
- CephFS
- Ceph RBD + NFSv4
- GlusterFS + NFS Ganesha
- Ceph RBD + NFS Ganesha
In addition, some solutions provide multiple configurations (beyond performance tuning) for different use cases, which are likely to impact performance, e.g., the GlusterFS volume type and the distribution of backend Ceph OSDs.
The native GlusterFS client uses FUSE [reference], which incurs context-switch overhead between user and kernel space and can degrade performance. An alternative way to access GlusterFS is libgfapi, which bypasses FUSE and thus avoids the context-switch overhead. NFS Ganesha is one of the projects that use this library to access GlusterFS, and it can presumably deliver some performance gain.
The FSAL (File System Abstraction Layer) backend of NFS Ganesha for Ceph currently uses libcephfs to access CephFS, which is itself a layer above librados, the library at the heart of Ceph. Therefore, running NFS Ganesha over CephFS is pointless from a performance standpoint, since it introduces two more layers compared to mounting CephFS directly.
A user experience reported on the ceph-users mailing list supports this intuition to some extent.
Ceph RBD is exposed to clients as a local block device rather than a file system. In other words, it cannot handle file and metadata management, which a file system is responsible for.
In our experiment, when a Ceph RBD image with a file system (XFS) on top was mapped on multiple hosts, the contents of the file system were not synchronized across hosts in time, and a manual unmount/remount was required to force synchronization. As reported in this thread, running ext4 with -m 0 (the mode in which metadata is maintained in the file system instead of the OS) tends to cause synchronization problems, because a local file system is unable to handle concurrent operations performed by multiple distributed clients. In contrast, NFS is a shared file system known to properly handle concurrent operations from multiple clients. Hence, it is advisable to run NFS on top of Ceph RBD to provide shared storage to clients.