Iozone - Filesystem Benchmark
Iozone [-a|-A] [-s filesize_Kb]
[-r record_size_Kb] [-f [path]filename]
[-i test] [-E] [-p] [-m] [-M]
[-t children] [-h] [-o] [-l
min_number_procs] [-u max_number_procs] [-v]
[-R] [-x] [-d microseconds] [-F path1
path2...] [-V pattern ] [-j stride]
[-T] [-C] [-B] [-D] [-G] [-I]
[-H depth] [-k depth] [-U
mount_point] [-S cache_size] [-O] [-L
line_size] [-K] [-N] [-Q] [-P
start_cpu] [-c] [-e] [-b Excel.xls]
[-J milliseconds] [-X [path]filename] [-Y
[path]filename] [-w] [-W] [-z] [-Z]
[-n min_filesize_Kb] [-g max_filesize_Kb]
[-y min_recordsize_Kb] [-q max_recordsize_Kb]
[-+d] [-+u] [-+m client_filename] [-+n]
[-+N] [-+p percent_read] [-+r] [-+t]
[-+l] [-+L] [-+D] [-+A madvise_selector]
[-+h hostname] [-+T] [-+w Percent
de-dupable.]
Iozone is a filesystem benchmark tool. The benchmark
generates and measures a variety of file operations. Iozone has been
ported to many machines and runs under many operating systems. This document
will cover the many different types of operations that are tested as well as
coverage of all of the command line options.
Iozone is useful for determining a broad filesystem
analysis of a vendor's computer platform. The benchmark tests file I/O
performance for the following operations.
Read, write, re-read, re-write, read backwards, read strided, fread, fwrite,
random read/write, pread/pwrite variants
While computers are typically purchased with an application in
mind it is also likely that over time the application mix will change. Many
vendors have enhanced their operating systems to perform well for some
frequently used applications. Although this accelerates the I/O for those
few applications it is also likely that the system may not perform well for
other applications that were not targeted by the operating system. An
example of this type of enhancement is: Database. Many operating systems
have tested and tuned the filesystem so it works well with databases. While
the database users are happy, the other users may not be so happy as the
entire system may be giving all of the system resources to the database
users at the expense of all other users. As time rolls on the system
administrator may decide that a few more office automation tasks could be
shifted to this machine. The load may now shift from a random reader
application (database) to a sequential reader. The users may discover that
the machine is very slow when running this new application and become
dissatisfied with the decision to purchase this platform. By using
Iozone to get a broad filesystem performance coverage the buyer is
much more likely to see any hot or cold spots and pick a platform and
operating system that is more well balanced.
- -a
- Used to select full automatic mode. Produces output that covers all tested
file operations for record sizes of 4k to 16M for file sizes of 64k to
512M.
- -A
- This version of automatic mode provides more coverage but consumes a bunch
of time. The -a option will automatically stop using transfer sizes
less than 64k once the file size is 32M or larger. This saves time. The
-A option tells Iozone that you are willing to wait and want
dense coverage for small transfers even when the file size is very large.
NOTE: This option is deprecated in Iozone version 3.61. Use
-az -i 0 -i 1 instead.
- -b filename
- Used to specify a filename that will be used for output of an Excel
compatible file that contains the results.
- -B
- Use mmap() files. This causes all of the temporary files being measured to
be created and accessed with the mmap() interface. Some applications
prefer to treat files as arrays of memory. These applications mmap() the
file and then just access the array with loads and stores to perform file
I/O.
- -c
- Include close() in the timing calculations. This is useful only if you
suspect that close() is broken in the operating system currently under
test. It can be useful for NFS Version 3 testing as well to help identify
if the nfs3_commit is working well.
- -C
- Show bytes transferred by each child in throughput testing. Useful if your
operating system has any starvation problems in file I/O or in process
management.
- -d #
- Microsecond delay out of barrier. During the throughput tests all threads
or processes are forced to a barrier before beginning the test. Normally,
all of the threads or processes are released at the same moment. This
option allows one to delay a specified time in microseconds between
releasing each of the processes or threads.
- -D
- Use msync(MS_ASYNC) on mmap files. This tells the operating system that
all the data in the mmap space needs to be written to disk
asynchronously.
- -e
- Include flush (fsync,fflush) in the timing calculations
- -E
- Used to select the extension tests. Only available on some platforms. Uses
pread interfaces.
- -f filename
- Used to specify the filename for the temporary file under test. This is
useful when the unmount option is used. When testing with unmount between
tests it is necessary for the temporary file under test to be in a
directory that can be unmounted. It is not possible to unmount the current
working directory as the process Iozone is running in this
directory.
- -F filename filename filename ?
- Specify each of the temporary file names to be used in the throughput
testing. The number of names should be equal to the number of processes or
threads that are specified.
- -g #
- Set maximum file size (in Kbytes) for auto mode. One may also specify
-g #k (size in Kbytes) or -g #m (size in
Mbytes) or -g #g (size in Gbytes). See -n for minimum
file size.
- -G
- Use msync(MS_SYNC) on mmap files. This tells the operating system that all
the data in the mmap space needs to be written to disk synchronously.
- -h
- Displays help screen.
- -H #
- Use POSIX async I/O with # async operations. Iozone will use
POSIX async I/O with a bcopy from the async buffers back into the
applications buffer. Some versions of MSC NASTRAN perform I/O this way.
This technique is used by applications so that the async I/O may be
performed in a library and requires no changes to the applications
internal model.
- -i #
- Used to specify which tests to run. (0=write/rewrite, 1=read/re-read,
2=random-read/write, 3=Read-backwards, 4=Re-write-record, 5=stride-read,
6=fwrite/re-fwrite, 7=fread/Re-fread, 8=mixed workload,
9=pwrite/Re-pwrite, 10=pread/Re-pread, 11=pwritev/Re-pwritev,
12=preadv/Re-preadv). One will always need to specify 0 so that any of the
following tests will have a file to measure.
-i # -i # -i # is also supported
so that one may select more than one test.
- -I
- Use DIRECT IO if possible for all file operations. Tells the filesystem
that all operations to the file are to bypass the buffer cache and go
directly to disk. (not available on all platforms)
- -j #
- Set stride of file accesses to (# * record size). The stride read test
will read records at this stride.
- -J #
- Millisecond delay before each I/O operation. This simulates the cpu
compute cycle of an application that precedes an I/O operation. One may
also use -X or -Y to control the compute cycle on a per I/O
operation basis.
- -k #
- Use POSIX async I/O (no bcopy) with # async operations.
Iozone will use POSIX async I/O and will not perform any extra
bcopys. The buffers used by Iozone will be handed to the async I/O
system call directly.
- -K
- Inject some random accesses in the testing.
- -l #
- Set the lower limit on number of processes to run. When running throughput
tests this option allows the user to specify the least number of processes
or threads to start. This option should be used in conjunction with the
-u option.
- -L #
- Set processor cache line size to value (in bytes). Tells Iozone the
processor cache line size. This is used internally to help speed up the
test.
- -m
- Tells Iozone to use multiple buffers internally. Some applications
read into a single buffer over and over. Others have an array of buffers.
This option allows both types of applications to be simulated.
Iozone´s default behavior is to re-use internal buffers.
This option allows one to override the default and to use multiple
internal buffers.
- -M
- Iozone will call uname() and will put the string in the output
file.
- -n #
- Set minimum file size (in Kbytes) for auto mode. One may also specify
-n #k (size in Kbytes) or -n #m (size in
Mbytes) or -n #g (size in Gbytes). See -g for maximum
file size.
- -N
- Report results in microseconds per operation.
- -o
- Writes are synchronously written to disk. (O_SYNC). Iozone will
open the files with the O_SYNC flag. This forces all writes to the file to
go completely to disk before returning to the benchmark.
- -O
- Give results in operations per second.
- -p
- This purges the processor cache before each file operation. Iozone
will allocate another internal buffer that is aligned to the same
processor cache boundary and is of a size that matches the processor
cache. It will zero fill this alternate buffer before beginning each test.
This will purge the processor cache and allow one to see the memory
subsystem without the acceleration due to the processor cache.
- -P #
- Bind processes/threads to processors, starting with this cpu #.
Only available on some platforms. The first sub process or thread will
begin on the specified processor. Future processes or threads will be
placed on the next processor. Once the total number of cpus is exceeded
then future processes or threads will be placed in a round robin
fashion.
- -q #
- Set maximum record size (in Kbytes) for auto mode. One may also specify
-q #k (size in Kbytes) or -q #m (size in
Mbytes) or -q #g (size in Gbytes). See -y for minimum
record size.
- -Q
- Create offset/latency files. Iozone will create latency versus
offset data files that can be imported with a graphics package and
plotted. This is useful for finding if certain offsets have very high
latencies. Such as the point where UFS will allocate its first indirect
block. One can see from the data the impacts of the extent allocations for
extent based filesystems with this option.
- -r #
- Used to specify the record size, in Kbytes, to test. One may also specify
-r #k (size in Kbytes) or -r #m (size in
Mbytes) or -r #g (size in Gbytes).
- -R
- Generate Excel report. Iozone will generate an Excel compatible
report to standard out. This file may be imported with Microsoft Excel
(space delimited) and used to create a graph of the filesystem
performance. Note: The 3D graphs are column oriented. You will need to
select this when graphing as the default in Excel is row oriented
data.
- -s #
- Used to specify the size, in Kbytes, of the file to test. One may also
specify -s #k (size in Kbytes) or -s #m (size
in Mbytes) or -s #g (size in Gbytes).
- -S #
- Set processor cache size to value (in Kbytes). This tells Iozone
the size of the processor cache. It is used internally for buffer
alignment and for the purge functionality.
- -t #
- Run Iozone in a throughput mode. This option allows the user to
specify how many threads or processes to have active during the
measurement.
- -T
- Use POSIX pthreads for throughput tests. Available on platforms that have
POSIX threads.
- -u #
- Set the upper limit on number of processes to run. When running throughput
tests this option allows the user to specify the greatest number of
processes or threads to start. This option should be used in conjunction
with the -l option.
- -U mountpoint
- Mount point to unmount and remount between tests. Iozone will
unmount and remount this mount point before beginning each test. This
guarantees that the buffer cache does not contain any of the file under
test.
- -v
- Display the version of Iozone.
- -V #
- Specify a pattern that is to be written to the temporary file and
validated for accuracy in each of the read tests.
- -w
- Do not unlink temporary files when finished using them.
- -W
- Lock file when reading or writing.
- -x
- Turn off stone-walling. Stonewalling is a technique used internally to
Iozone. It is used during the throughput tests. The code starts all
threads or processes and then stops them on a barrier. Once they are all
ready to start then they are all released at the same time. The moment
that any of the threads or processes finish their work then the entire
test is terminated and throughput is calculated on the total I/O that was
completed up to this point. This ensures that the entire measurement was
taken while all of the processes or threads were running in parallel. This
flag allows one to turn off the stonewalling and see what happens.
- -X filename
- Used to specify a filename that will be used for the write telemetry
information. The file contains lines with offset, size,
delay_in_milliseconds. Each of these lines are used to perform an I/O
operation. This is used when an application's specific I/O operations are
known, and one wishes to benchmark the system with this specific
application file behavior.
- -y #
- Set minimum record size (in Kbytes) for auto mode. One may also specify
-y #k (size in Kbytes) or -y #m (size in
Mbytes) or -y #g (size in Gbytes). See -q for maximum
record size.
- -Y filename
- Used to specify a filename that will be used for the read telemetry
information. The file contains lines with offset, size,
delay_in_milliseconds. Each of these lines are used to perform an I/O
operation. This is used when an application's specific I/O operations are
known, and one wishes to benchmark the system with this specific
application file behavior.
- -z
- Used in conjunction with -a to test all possible record sizes.
Normally Iozone omits testing of small record sizes for very large
files when used in full automatic mode. This option forces Iozone
to include the small record sizes in the automatic tests also.
- -Z
- Enable mixing of mmap I/O and file I/O.
- -+m filename
- Used to specify a filename that will be used to specify the clients in a
distributed measurement. The file contains one line for each client. The
fields are space delimited. Field 1 is the client name. Field 2 is the
working directory, on the client, where Iozone will run. Field 3 is the
path to the executable Iozone on the client.
- -+n
- No retests selected. Used to prevent retests from running.
- -+N
- No truncating or deleting of previous test file before the sequential
write test. Useful only after -w is used in previous command to leave the
test file in place for reuse. This flag is of limited use, when a single
retest is not enough, or to easily control when sequential write retests
occur without file truncation or deletion.
- -+u
- Used to enable CPU statistics collection.
- -+d
- Diagnostic mode to troubleshoot a broken file I/O subsystem.
- -+p percentage_reads
- Used to set the percentage of threads/processes that will perform read
testing in the mixed workload test case.
- -+r
- Enable O_RSYNC | O_SYNC on all testing.
- -+l
- Enable byte range locking.
- -+L
- Enable byte range locking & shared file mode.
- -+D
- Enable O_DSYNC on all testing.
- -+t
- Enable network performance test. Use with -+m
- -+A#
- Enable madvise behavior. 0 = normal, 1=random, 2=sequential, 3=dontneed,
4=willneed
- -+B
- Enable sequential mixed workload testing.
- -+T
- Enable time stamps logging.
- -+h
- Manually set hostname.
- -+w#
- Percentage of data to be de-dupable between files.
Original Author: William D. Norcott. wnorcott@us.oracle.com
Features & extensions: Don Capps capps@iozone.org