Skip to content

File system and storage benchmark that uses a custom language to generate a large variety of workloads. Adapted to numa workloads - NUMA support added

License

Notifications You must be signed in to change notification settings

jeromecst/filebench

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Filebench - A Model Based File System Workload Generator
========================================================


ABOUT THIS FORK
---------------

This fork adds support for NUMA architectures. A keyword has been added to the
model language. A NUMA "node" can be specified when defining a fileset, a thread
or a process.

In the case of threads and processes, the "node" keywords refers to the NUMA
node where the initial placement of the tasks takes place. They are bind to
the node, and no later migration is possible during the benchmarks.

When defining a "node" for a fileset, it will allocate the page cache on the
specified node.

The node option can therefore be used to simulate complex setup on a NUMA
architecture. Note that "prealloc" is needed for fileset, otherwise there will be
no pre allocation on the specified node, and the later allocation could be made
on another node.

See example below for a NUMA workload.

INTRODUCTION
------------

Filebench is a file system and storage benchmark that can generate a large
variety of workloads. Unlike typical benchmarks it is extremely flexible and
allows to specify application's I/O behavior using its extensive Workload Model
Language (WML). Users can either describe desired workloads from scratch or use
(with or without modifications) workload personalities shipped with Filebench
(e.g., mail-, web-, file-, and database-server workloads). Filebench is equally
good for micro- and macro-benchmarking, quick to setup, and relatively easy to
use.


INSTALLATION
------------

Filebench compilation and installation is simple, and can be completed in two
steps. However, if you have downloaded a release tarball from Github
(https://github.com/filebench/filebench/releases), you can go ahead and skip to
Step 2.

The libnuma-dev package is required to compile this version.


Step 1: Generating autotool scripts

Makefile.in and configure files are not included in the repo, so they will have
to be generated. To do so, make sure you have libtoolize and automake tools
installed and run the following commands:

$ libtoolize
$ aclocal
$ autoheader
$ automake --add-missing
$ autoconf


Step 2: Compilation and installation

Before proceeding, make sure yacc and lex are available in your system. Then,
you can proceed with configuration, compilation, and installation as usual:

$ ./configure
$ make
$ sudo make install


QUICK-START GUIDE
-----------------

To quickly introduce a new user to Filebench we use two examples demonstrating
two different typical Filebench workflows: 1) describe a user-defined workload
in Workload Model Language (WML) and generate the workload; 2) generate one of
the predefined Filebench workloads.
The description in this guide assumes Filebench 1.5-alpha1 or higher.


Example 1: User-defined workloads

First step is to create the description of the workload (so called workload
personality) in WML language. Typically, workload personalities are stored in
files with '.f' extension. In this example we describe a very simple workload
consisting of two processes with three threads each. Every thread continuously
picks a file among many, reads it, and then closes the file. Here is the
corresponding workload personality:

01  define fileset name="testF",entries=10000,filesize=16k,prealloc,path="/tmp"
02
03  define process name="readerP",instances=2 {
04    thread name="readerT",instances=3 {
05      flowop openfile name="openOP",filesetname="testF"
06      flowop readwholefile name="readOP",filesetname="testF"
07      flowop closefile name="closeOP"
08    }
09  }
10
11  run 60

Four main entities in Filebench are 'filesets', 'processes' consisting of
'threads', and 'flowops'. In the first line we define a fileset containing
10,000 files of 16KiB size each in /tmp directory. Filebench is instructed to
precreate all files in the fileset before executing the actual workload.

In the third and fourth lines we define two identical processes each consisting
of three identical threads. Every thread in Filebench repeats flowops (operations)
defined in it in a loop. Lines 05-07 describe the flowops that constitute the
threads: open a file in "testF" fileset, read the file  completely, and close
it. Finally, in the 11th line we indicate to run the workload for 60 seconds.

In more complex workloads one can define any number of filesets, multiple
different processes and threads, use a variety of flowops and attributes, and
more. Refer to the complete WML vocabulary for details at
https://github.com/filebench/filebench/wiki/Workload-model-language

Assuming that the personality is saved in 'readfiles.f' file, one can then
generate corresponding workload by running 'filebench -f readfiles.f' command.


Example 2: Pre-defined workloads

Filebench comes with several predefined micro- and macro-workloads (e.g.,
webserver, fileserver, mailserver) which are also described in WML, not
differently than the workload in Example 1 above. In the source code tree,
workloads are located in the workloads/ directory and are typically installed in
/usr/local/share/filebench/workloads/ during 'make install' (though this can
differ from one installation to another).

We do *not* recommend to directly use workload files from workloads/ or
/usr/local/share/filebench/workloads/ directories. The main reason is that these
workloads *are not properly sized* (e.g., in terms of the dataset sizes) to a
particular system. For instance, the initial dataset size of the webserver
workload is only slightly larger than 16MiB, which is typically not the size you
want to test the system containing multiple gigabytes of RAM with.

So, instead, copy webserver workload to any other directory:

$ cp /usr/local/share/filebench/workloads/webserver.f mywebserver.f

Then edit the copied file to increase the dataset size by setting the number of
files ('entries' attribute of a fileset) to an appropriate value. Finally, run
the workload:

$ filebench -f mywebserver.f

An extended discussion on how to scale Filebench workloads can be found at
https://github.com/filebench/filebench/wiki/Scaling-Filebench-workloads

Example 3: NUMA Workload

set $dir=/tmp
set $filesize=2g
set $iosize=10m

define fileset name=bigfile0,path=$dir,size=$filesize,prealloc,node=0,dirwidth=20,entries=5,reuse
define fileset name=bigfile1,path=$dir,size=$filesize,prealloc,node=1,dirwidth=20,entries=5,reuse
define fileset name=bigfile2,path=$dir,size=$filesize,prealloc,node=2,dirwidth=20,entries=5,reuse
define fileset name=bigfile3,path=$dir,size=$filesize,prealloc,node=3,dirwidth=20,entries=5,reuse

define process name=filereader3,instances=1
{
  thread name=filereaderthread,memsize=300m,instances=10,node=0
  {
    flowop read name=write-file,filesetname=bigfile3,random,iosize=$iosize
  }
}

define process name=filereader2,instances=1
{
  thread name=filereaderthread,memsize=300m,instances=10,node=3
  {
    flowop read name=write-file,filesetname=bigfile2,random,iosize=$iosize
  }
}

define process name=filereader1,instances=1
{
  thread name=filereaderthread,memsize=300m,instances=10,node=2
  {
    flowop read name=write-file,filesetname=bigfile1,random,iosize=$iosize
  }
}

define process name=filereader0,instances=1
{
  thread name=filereaderthread,memsize=300m,instances=10,node=1
  {
    flowop read name=write-file,filesetname=bigfile0,random,iosize=$iosize
  }
}

echo  "FileMicro-ReadRand Version 2.2 personality successfully loaded"
psrun -1 30



SUPPORT
-------

To ask questions about Filebench, report bugs, or request new features, you can
use GitHub's issue tracking system. This is the central hub for both user support
and bug tracking, and can be found at https://github.com/filebench/filebench/issues

About

File system and storage benchmark that uses a custom language to generate a large variety of workloads. Adapted to numa workloads - NUMA support added

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C 64.9%
  • C++ 12.2%
  • Filebench WML 10.4%
  • Yacc 7.6%
  • Roff 2.4%
  • M4 1.2%
  • Other 1.3%