• Stars
    star
    140
  • Rank 261,473 (Top 6 %)
  • Language
    Python
  • License
    MIT License
  • Created about 5 years ago
  • Updated 5 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Tool which enables the creation of synthetic storage workloads, automates the execution and results collection of synthetic storage benchmarks.

fiosynth

FB fio Synthetic Benchmark Suite for Flash ver 3.5.43

Examples

Run workload suite on a single device:

fiosynth -d /dev/sdc -w LE_Flash -f LE_Flash_sdc

Run workload suite on all data devices:

fiosynth -d ALL -w LE_Flash -f LE_Flash_noRAID

Run workload suite on all mdraid devices:

fiosynth -d ALLRAID -w LE_Flash -f LE_Flash_MDRAID

Run workload suite on all data devices while collecting smartctl logs

fiosynth -d ALL -w LE_Flash -f LE_Flash_noRAID -t "./smartAll.sh"

Run a dry run of workload suite for all mdraid devices:

fiosynth -d ALLRAID -w LE_Flash -f LE_Flash_MDRAID -r y

Run a workload suite across multiple servers:

fiosynth -d ALL -w LE_Flash -f LE_Flash_server -s <server1> -s <server2>

Run a workload suite from a server list:

fiosynth -d ALL -w LE_Flash -f LE_Flash_server -l '/path/servers.txt'

Requirements

fiosynth should work

  • Linux (CentOS, RedHat, Ubuntu)

Installing fiosynth

To install fiosynth simply git clone this repo and have the latest versions of pip with setuptools installed

Installing into Python environment

$ git clone https://github.com/facebookincubator/FioSynth
$ sudo apt install python3 python3-pip fio
$ cd FioSynth
$ sudo python3 setup.py install
$ fiosynth -h

Executing fiosynth without installing

Simply execute the fiosynth.py entry script.

$ cd FioSynth
$ python3 fiosynth.py -h

Note: You may require sudo if installing to System's Python environment

Note: We recommend using a virtualenv to avoid polluting the Systems dependencies.

How fiosynth works

Requires the use of workload suite files (in JSON format). Workload suite files can be edited to adjust the number of precondition cycles, number of runs, number of workloads and individual workload parameters. Use the dry run option to verify that benchmark suite executes as expected. All workloads are normalized on a per TB bases. If capacity is not specified, capacity will be based on capacity of the first specified device. If specified, a vendor specific health monitor tool will be executed before and after each benchmark suite executes and logged in "health.log" file. Log can be used to calculate write amplification factor. Raw results are stored in individual JSON files but are also parsed into single a csv file for easy visualization in Excel. Prefix for result filename must be specified. This will define the name of the result files directory and csv filename.

Input arguments (3 of arguments are required):

 -d
      (Required) device path for single device target, ALL for all
      all data devices, or ALLRAID for all mdraid devices
 -w
      (Required) filename for workload suite (default = )
 -f
      (Required) Results filename (default = )
 -c
      (Optional) specify capacity in TB (default = <device capacity>)
 -r
      (Optional) Set to y to do dry run (default = n)
 -t
      (Optional) Enter Health Monitoring Tool Syntax (default = )
 -p
      (Optional) Set to n to skip drive prep (default = y)
 -n
      (Optional) Specific the number of run cycles (default = 3 )
 -g
      (Optional) Set to y to enable flash configuration logging (default = n)
 -s
      (Optional) Add a server for workload to run on when in server mode.
      For multiple servers add multiple times. (default = )
 -l
      (Optional) Add a text file listing a server on each line (default = )

Workload suites

PeakWklds
      Executes workloads to measure the peak small block random and large
      block sequential performance of a single or multiple flash devices.

LE_Flash
      Executes rate limited workloads with similar I/O characteristics to
      applications that run on Type VIII (Low Flash) production hardware
      on a single or multiple flash devices. Workloads use a combination
      of read iops bins (low, medium, high) and write DWPD (Drive Writes
      Per Day) bins that correlate to various Type VIII flash application
      I/O profiles.  Workload alias names are a combination of read I/O
      block size, read iops bins and write DWPD bins.

HE_Flash
      Executes rate limited workloads with similar I/O characteristics to
      applications that run on Type VI and III (High Endurance Flash)
      production hardware on a single or multiple flash devices.
      Workloads use a combination of read iops bins (low, medium, high)
      and write DWPD (Drive Writes Per Day) bins that correlate to various
      Type VI flash application I/O  profiles.  Workload alias names are
      a combination of read I/O block size, read iops bins and write DWPD
      bins.

TrimRate
      Executes a workload that evaluates the raw trim performance of a
      flash device.

LE_Flash_Short
      Short version of LE_Flash.  Executes a limited set of rate limited
      workloads to provide a high level view of performance in a few hours.

HE_Flash_Short
      Short version of HE_Flash.  Executes a limited set of rate limited
      workloads to provide a high level view of performance in a few hours.

readhammer
      Executes a workload that evaluates a flash device ability to
      continually read from a small LBA range on a single or multiple
      flash devices.
      See README-READHAMMER.txt for more information.

prep
      Executes two full capacity writes on a single or multiple flash
      devices to get the flash close to steady state.

Creating/Modifying Workload Suites

Workload suites are json formatted files with a list of run rules for fio. Follow the formatting used in other workload suites. Generally, if all parameters in the workload suite workload file is note defined, it will not run.

The following parameters need to be defined globally:

run_cycles
      Number of times the entire workload suite will run.

precondition_cycles
      Number of times the flash will be preconditioned before each
      run cycle.  A full TRIM of the flash device will complete before
      the flash is preconditioned.  Each precondition consist of 2 full
      sequential writes of the flash device.

devices_in_global
      Specifies if the devices should be defined in the global field or
      within jobs.  Most fio job files should define devices_in_global
      parameter to Y and can scale to multiple drives using the numjob fio
      parameter.  Some jobfiles will not run properly with the devices
      defined in the global section.  These jobfiles need to create
      new job section for each device and the device_in_global parameter
      should be set to N.
      For more on fio job file format, see section 4 of
      https://github.com/axboe/fio/blob/master/HOWTO

The following parameters are workload specific:

template
      Fio jobfile name that should be used.

alias
      Name of the workload.  Use a name that describes the workload that
      is being executed.

TIME
      Sets the "runtime" parameter in the fio job file (where applicable).

BLKSIZE
      Sets the "blocksize" and "blockalign" parameter in the fio job file
      (where applicable).

DEPTH
      Sets the "iodepth" parameter in the fio job file (where applicable).

RRATE
      Sets the "rate_iops" parameter in the fio job file (where applicable).

SIZE
      Sets the "size" parameter in the fio job file (where applicable).

W1THINK, W2THINK, W3THINK & W4THINK
      Sets the "thinktime" parameter in the fio job file (where applicable).
      W1THINK and W2THINK are the think time in between each write. The smaller
      the number, the higher the write rate. 6pDWPD writes at a rate of about
      72 MB/s per TB. W3THINK and W4THINK are typically used as the think
      time in between each trim command.

Client/Server Mode

To run in client/server mode, ensure that the following requirements are met:

  • Both your client and all of your servers must have the same version of fio

  • Any devices you want to test on the clients must be unmounted from the filesystem

  • You may want to prepend "nohup" to the command when you call this tool, that way if your ssh session ends, the test will continue to run on your devserver.

Join the fiosynth community

See the CONTRIBUTING file for how to help out.

License

MIT License

Copyright (c) Facebook, Inc. and its affiliates.

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

More Repositories

1

SocketRocket

A conforming Objective-C WebSocket client library.
Objective-C
9,534
star
2

katran

A high performance layer 4 load balancer
C
4,674
star
3

AITemplate

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Python
4,515
star
4

velox

A C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.
C++
3,474
star
5

cinder

Cinder is Meta's internal performance-oriented production version of CPython.
Python
3,458
star
6

FBX2glTF

A command-line tool for the conversion of 3D model assets on the FBX file format to the glTF file format.
C++
2,060
star
7

spectrum

A client-side image transcoding library.
C++
1,987
star
8

oomd

A userspace out-of-memory killer
C++
1,795
star
9

fastmod

A fast partial replacement for the codemod tool
Rust
1,648
star
10

xar

executable archive format
Python
1,571
star
11

Bowler

Safe code refactoring for modern Python.
Python
1,532
star
12

submitit

Python 3.8+ toolbox for submitting jobs to Slurm
Python
1,245
star
13

gloo

Collective communications library with various primitives for multi-machine training.
C++
1,181
star
14

fizz

C++14 implementation of the TLS-1.3 standard
C++
1,128
star
15

dhcplb

dhcplb is Facebook's implementation of a load balancer for DHCP.
Go
1,046
star
16

below

A time traveling resource monitor for modern Linux systems
Rust
1,029
star
17

OnlineSchemaChange

A tool for performing online schema changes on MySQL.
Python
965
star
18

Glean

System for collecting, deriving and working with facts about source code.
Hack
923
star
19

Battery-Metrics

Library that helps in instrumenting battery related system metrics.
Java
736
star
20

retrie

Retrie is a powerful, easy-to-use codemodding tool for Haskell.
Haskell
500
star
21

superconsole

The superconsole crate provides a handler and building blocks for powerful, yet minimally intrusive TUIs. It is cross platform, supporting Windows 7+, Linux, and MacOS. Rustaceans who want to create non-interactive TUIs can use the component composition building block system to quickly deploy their code.
Rust
477
star
22

nvdtools

A set of tools to work with the feeds (vulnerabilities, CPE dictionary etc.) distributed by National Vulnerability Database (NVD)
Go
446
star
23

nimble

New file format for storage of large columnar datasets.
C++
419
star
24

infima

A UI framework that provides websites with the minimal CSS and JS needed to get started with building a modern responsive beautiful website
HTML
405
star
25

CG-SQL

CG/SQL is a compiler that converts a SQL Stored Procedure like language into C for SQLite. SQLite has no stored procedures of its own. CG/CQL can also generate other useful artifacts for testing and schema maintenance.
HTML
391
star
26

TTPForge

The TTPForge is a Cybersecurity Framework for developing, automating, and executing attacker Tactics, Techniques, and Procedures (TTPs).
Go
320
star
27

flowtorch

This library would form a permanent home for reusable components for deep probabilistic programming. The library would form and harness a community of users and contributors by focusing initially on complete infra and documentation for how to use and create components.
Jupyter Notebook
300
star
28

ptr

Python Test Runner.
Python
284
star
29

senpai

Senpai is an automated memory sizing tool for container applications.
Python
270
star
30

fbjni

A library designed to simplify the usage of the Java Native Interface
C++
260
star
31

dynolog

Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also integrates with pytorch and can trigger traces for distributed training applications.
C++
251
star
32

gazebo

A Rust library containing a collection of small well-tested primitives.
Rust
235
star
33

reindeer

Reindeer is a tool to transform Rust Cargo dependencies into generated Buck build rules
Rust
177
star
34

dispenso

The project provides high-performance concurrency, enabling highly parallel computation.
C++
174
star
35

GeoLift

GeoLift is an end-to-end geo-experimental methodology based on Synthetic Control Methods used to measure the true incremental effect (Lift) of ad campaign.
R
171
star
36

oculus-linux-kernel

The Linux kernel code for Oculus devices
C
156
star
37

FCR

FBNet-Command-Runner: A thrift service to run commands on heterogeneous Network devices with configurable parameters.
154
star
38

dataclassgenerate

DataClassGenerate (or simply DCG) is a Kotlin compiler plugin that addresses an Android APK size overhead from Kotlin data classes.
Kotlin
154
star
39

hsthrift

The Haskell Thrift Compiler. This is an implementation of the Thrift spec that generates code in Haskell. It depends on the fbthrift project for the implementation of the underlying transport.
Haskell
148
star
40

meta-code-verify

Code Verify is an open source web browser extension that confirms that your Facebook, Messenger, Instagram, and WhatsApp Web code hasn’t been tampered with or altered, and that the Web experience you’re getting is the same as everyone else’s.
TypeScript
137
star
41

tacquito

Tacquito is an open source TACACs+ server written in Go that implements RFC8907
Go
93
star
42

go-qfext

a fast counting quotient filter implementation in golang
Go
91
star
43

momentum

A library for human kinematic motion and numerical optimization solvers to apply human motion
C++
89
star
44

ForgeArmory

ForgeArmory provides TTPs that can be used with the TTPForge (https://github.com/facebookincubator/ttpforge).
Swift
80
star
45

antlir

ANoTher Linux Image buildeR
Rust
76
star
46

sks

Secure Key Storage (SKS) is a library for Go that abstracts Security Hardware on laptops.
Go
72
star
47

dcrpm

A tool to detect and correct common issues around RPM database corruption.
Python
72
star
48

ConversionsAPI-Tag-for-GoogleTagManager

This repository will contain the artifacts needed for setting up Conversions API implementation on Google Tag Manager's serverside. Please follow the instructions https://www.facebook.com/business/help/702509907046774
Smarty
64
star
49

InjKit

Injection Kit. It is a java bytecode processing library for bytecode injection and transformation.
Java
55
star
50

obs-plugins

OBS Plugins
C++
54
star
51

glTFVariantMeld

An application that accepts files on the glTF format, interprets them as variants of an over-arching whole, and melds them together.
Rust
51
star
52

haberdashery

A collection of high-performance crypto implementations.
Rust
44
star
53

later

A framework for python asyncio with batteries included for people writing services in python asyncio
Python
39
star
54

go2chef

A Golang tool to bootstrap a system from zero so that it's able to run Chef to be managed
Go
39
star
55

CommutingZones

Commuting zones are geographic areas where people live and work and are useful for understanding local economies, as well as how they differ from traditional boundaries. These zones are a set of boundary shapes built using aggregated estimates of home and work locations. Data used to build commuting zones is aggregated and de-identified.
JavaScript
39
star
56

ConversionsAPI-Client-for-GoogleTagManager

This repository will contain the artifacts needed for setting up Conversions API implementation on Google Tag Manager's serverside. Primarily we will be hosting, - ConversionsAPI(Facebook) Client - listens on the events fired to GTM Server and maps them to common GTM schema. - ConversionsAPI(Facebook) Tag - server tag that fires events to CAPI.For more details on Design here https//fburl.com/uae68vlr
37
star
57

Facebook-Pixel-for-Wordpress

A plugin for advertisers who use Wordpress to enable them easily setup the facebook pixel.
JavaScript
37
star
58

strobelight

Meta's fleetwide profiler framework
C++
28
star
59

buck2-change-detector

Given a Buck2 built project and a set of changes (e.g. from source control) compute the targets that may have changed. Sometimes known as a target determinator, useful for optimizing a CI system.
Rust
28
star
60

wordpress-messenger-customer-chat-plugin

Messenger Customer Chat Plugin for WordPress
PHP
27
star
61

CP4M

CP4M is a conversational marketing platform which enables advertisers to integrate their customer-facing chatbots with FB Messenger/WhatsApp, in order to meet customers where they are and drive native conversations on the advertiser's owned infra.
Java
27
star
62

rush

RUSH (Reliable - unreliable - Streaming Protocol)
C++
26
star
63

MY_ENUM

Small c++ macro library to add compile-time introspection to c++ enum classes.
C++
17
star
64

SafeC

Library containing safer alternatives/wrappers for insecure C APIs.
C++
16
star
65

go-belt

It is an implementation-agnostic Go(lang) package to generalize observability tooling (logger, metrics, tracer and so on) and provide ability to use any of these tools with a standard context. Essentially it is an attempt to standardize observability API in Go.
Go
16
star
66

spark-ar-core-libs

Core libraries that can be used in Spark AR. You can import each library depends on your requirements.
TypeScript
15
star
67

scrut

Scrut is a testing toolkit for CLI applications. A tool to scrutinize terminal programs without fuss.
Rust
15
star
68

sado

A macOS signed-app shim for running daemons with reliable capabilities.
Swift
12
star
69

Portal-Kernel

Kernel Code for Portal.
C
11
star
70

npe-toolkit

Libraries, guides, blueprints, and sample code, to enable rapidly building 0-1 applications on iOS, Android and web.
TypeScript
10
star
71

Eigen-FBPlugins

This is collection of plugins extending Eigen arrays/matrices with main focus on using them for computer vision. In particular, this project should provide support for multichannel arrays (missing in vanilla Eigen) and seamless integration between Eigen types and OpenCV functions.
C++
9
star
72

isometric_pattern_matcher

A new isometric calibration pattern - which should/might lead to higher accuracy calibrations compared to existing solutions (checkerboards, patterns of circles).
C++
8
star
73

dnf-plugin-cow

Code to enable Copy on Write features being upstreamed in rpm and librepo
Shell
8
star
74

jupyterhub_fb_authenticator

JupyterHub Facebook Authenticator is a Facebook OAuth authenticator built on top of OAuthenticator.
Python
8
star
75

wireguard_py

Cython library for Wireguard
C
7
star
76

kernel-patches-daemon

Sync Patchwork series's with Github pull requests
Python
6
star
77

meta-fbvuln

OpenEmbedded meta-layer that allows producing a vulnerability manifest alongside a Yocto build. The produced manifest is suitable for ongoing vulnerability scanning of fielded software.
5
star
78

gazebo_lint

A Rust linter that provides various suggestions based on the new primitives offered in the `gazebo` library.
Rust
4
star
79

language-capirca

Adds syntax highlighting for Capirca filetypes in Atom. Capirca is an open source standard for writing vendor-neutral firewall policies as originally released by Google: https://github.com/google/capirca
3
star
80

cinderx

cinderx
C++
3
star
81

fbc_owrt_feed

Facebook Connectivity OpenWrt Feed. Package feed for OpenWrt router OS by Facebook Connectivity programme.
Lua
2
star
82

cutlass-fork

A Meta fork of NV CUTLASS repo.
C++
2
star
83

hwbits_lib

Abstraction of hardware register-level protocols in a python semantic names.
Python
1
star