  • Stars: 17
  • Rank: 1,257,181 (Top 25 %)
  • Language: Assembly
  • Created over 7 years ago
  • Updated almost 3 years ago

Repository Details

Measure instruction latency and throughput

More Repositories

1. likwid
   Performance monitoring and benchmarking suite
   Language: C; Stars: 1,660

2. OSACA
   Open Source Architecture Code Analyzer
   Language: Jupyter Notebook; Stars: 238

3. kerncraft
   Loop Kernel Analysis and Performance Modeling Toolkit
   Language: Jupyter Notebook; Stars: 86

4. pycachesim
   Python Cache Hierarchy Simulator
   Language: Jupyter Notebook; Stars: 84

5. pylikwid
   Python interface for the LIKWID C API (https://github.com/RRZE-HPC/likwid)
   Language: C; Stars: 42

6. TheBandwidthBenchmark
   The ultimate memory bandwidth benchmark
   Language: C; Stars: 37

7. asmbench
   A Benchmark Toolkit for Assembly Instructions Using the LLVM JIT
   Language: Python; Stars: 16

8. MachineState
   This CLI tool and Python3 module collects the current system state for documentation
   Language: Python; Stars: 13

9. lbm-benchmark-kernels
   Simple LBM kernels for benchmarking and performance evaluation
   Language: C; Stars: 13

10. MD-Bench
    A performance-oriented prototyping harness for state of the art Molecular Dynamics algorithms
    Language: C; Stars: 12

11. GHOST
    General, Hybrid and Optimized Sparse Toolkit (Bitbucket mirror)
    Language: C; Stars: 10

12. stempel
    Stencil TEMPlate Engineering Library
    Language: Python; Stars: 7

13. ThePerformanceLogbook
    A template for documenting PE projects
    Language: Shell; Stars: 7

14. INSPECT
    INSPECT: Intranode Stencil Performance Collection
    Language: Shell; Stars: 7

15. HPCCG-F90
    A Fortran 90 port of the Mantevo HPCCG SpMVM benchmark
    Language: Fortran; Stars: 6

16. LMS
    LIKWID Monitoring Stack
    Language: Python; Stars: 6

17. RACE
    The Recursive Algebraic Coloring Engine
    Language: C++; Stars: 4

18. CLPE-Hands-On
    Language: C++; Stars: 3

19. A64FX_SpMV_hands-on
    SpMV hands-on exercise with SVE intrinsics (ACLE) for teaching
    Language: C++; Stars: 3

20. DFG-PE
    Exchange platform for projects from DFG Call on "Performance Engineering für wissenschaftliche Software"
    Stars: 3

21. pmbs2020-paper-artifact
    Artifact Repository for PMBS 2020 paper "Performance Modeling of Streaming Kernels and Sparse Matrix-Vector Multiplication on A64FX"
    Language: C++; Stars: 3

22. loop_adapt
    Loop profiling with system adjustments for timestep-based applications
    Language: C; Stars: 2

23. OSACA-Artifact-Appendix
    Measurements and reproducibility instructions for the Master Thesis "Cross-Architecture Automatic Critical Path Detection For In-Core Performance Analysis" by Jan Laukemann
    Language: Assembly; Stars: 2

24. TheBandwidthBenchmark-F90
    Fortran version of the ultimate teaching bandwidth benchmark.
    Language: Fortran; Stars: 2

25. miniMD
    A modified fork from Mantevo miniMD
    Language: C++; Stars: 1

26. OSACA-CP-2019
    Reproducibility artifacts for critical path analysis with OSACA
    Language: Assembly; Stars: 1

27. MPIBench
    A benchmark suite for MPI libraries
    Language: C; Stars: 1

28. TheBenchmarkGame
    The Bandwidth Benchmark in all programming languages on the planet
    Language: C; Stars: 1

29. pmbs2018-paper-artifact
    Artifact Repository for PMBS 2018 Paper "Automated Instruction Stream Throughput Prediction for Intel and AMD Microarchitectures"
    Language: Assembly; Stars: 1

30. DDOT-Bench
    A ddot benchmark with various accuracy enhanced variants (including optimized Kahan)
    Language: C; Stars: 1

31. gather-bench
    An X86 gather instruction performance benchmark
    Language: C; Stars: 1

32. Makefile-template
    A generic Makefile template for C, C++ and Fortran programs
    Language: C; Stars: 1