• Stars
    62
  • Rank 490,914 (Top 10 %)
  • Language
    Python
  • License
    Apache License 2.0
  • Created about 2 years ago
  • Updated about 1 month ago

Repository Details

An I/O benchmark for deep learning applications

More Repositories

1

ai-science-training-series

Jupyter Notebook
205
star
2

TensorFlowFoam

Integrating the TensorFlow 1.15 C-API into OpenFOAM 5.0 for data-driven CFD algorithm development
C++
201
star
3

PythonFOAM

In-situ data analyses and machine learning with OpenFOAM and Python
C++
161
star
4

balsam

High throughput workflows and automation for HPC
Python
76
star
5

ALCF_Hands_on_HPC_Workshop

The ALCF hosts a regular simulation, data, and learning workshop to help users scale their applications. This repository contains the examples used in the workshop.
HTML
54
star
6

CompPerfWorkshop

ALCF Computational Performance Workshop
Python
34
star
7

ATPESC_MachineLearning

Lecture and hands-on material for Track 8 (Machine Learning) of the Argonne Training Program on Extreme-Scale Computing
LLVM
32
star
8

molan

Data analytics for molecular solids melting points
Jupyter Notebook
26
star
9

active-learning-md

Active learning workflow developed as a part of the upcoming article "Machine Learning Inter-Atomic Potentials Generation Driven by Active Learning: A Case Study for Amorphous and Liquid Hafnium dioxide"
Python
25
star
10

ALCFBeginnersGuide

Cuda
24
star
11

llm-workshop

Jupyter Notebook
23
star
12

AI4ScienceTutorial

A tutorial for students that surveys basic ML techniques in IPython notebook format.
Jupyter Notebook
22
star
13

THAPI

A tracing infrastructure for heterogeneous computing applications.
C
22
star
14

user-guides

ALCF Systems User Documentation
HTML
20
star
15

GettingStarted

Collection of small examples for running on ALCF resources
C
16
star
16

AIaccelerators-SC23-tutorial

AI Accelerators-SC23-tutorial Repository
Jupyter Notebook
11
star
17

HPC-Patterns

Examples of how to use GPUs with parallel programming paradigms (MPI, OpenMP, SYCL)
C++
10
star
18

AIaccelerators-SC22-tutorial

AI Accelerator Tutorial SC22
Python
8
star
19

alcl

Argonne Leadership Computing Facility OpenCL tutorial
C
8
star
20

alcf-nccl-tests

NCCL tests for ALCF machines
Roff
7
star
21

alcf4_benchmarks

Repository for ALCF-4 Benchmarks as defined in the RFP.
6
star
22

copper

Scalable data movement in exascale supercomputers
C++
6
star
23

ThetaGPU-Docs

Staging area for Theta-GPU documentation
Python
5
star
24

SimAI-Bench

ALCF benchmarks for coupled simulation and AI workflows
Python
4
star
25

summer-school-2024

4
star
26

pbs_utils

Scripts for working with PBS
Shell
4
star
27

SyclCPLX

Header-only SYCL complex number library
C++
4
star
28

ai-testbed-userdocs

AI-Testbed at ALCF provides an infrastructure for the next-generation of AI-accelerator machines.
HTML
4
star
29

nexus

Scripts and tools for IRI applications at ALCF
Python
4
star
30

checkpoint_restart

This repo provides instructions on how to do checkpoint/restart for large-scale simulations on exascale machines
Shell
3
star
31

dlSoftwareTests

Simple tests to verify ML/DL environments on ALCF HPC resources are working correctly.
Python
3
star
32

DL-Profiling-Examples

Example scripts and profiling demonstrations for deep learning models
Python
3
star
33

THAPI-spack

This is a spack environment for THAPI.
Python
3
star
34

AIAccelerators-AE

AD/AE repo for the paper on AI Accelerator Evaluation
Python
3
star
35

IntroSYCLtutorial

Jupyter Notebook
2
star
36

dlio_ml_workloads

Reference workloads for DLIO Benchmark
Python
2
star
37

CCS

CCS (C Configuration Space and Tuning Library) aims at providing interoperability between autotuning frameworks and applications with auto-tuning needs. It does so by providing a C interface to describe autotuning problems and autotuners.
C
2
star
38

autoperf

Core autoperf source
C
2
star
39

polaris-userdocs

Temporary documentation for Polaris resource
HTML
2
star
40

LLM-Inference-Bench

LLM-Inference-Bench
Jupyter Notebook
2
star
41

conda_install_scripts

Some install scripts for miniconda that include tensorflow and keras in the intel channel.
Shell
2
star
42

mlprof

Profiling tools for performance studies of competing ML frameworks on HPC systems
Python
2
star
43

container-registry

Contains images and recipes to run container images on Theta, ThetaGPU, Polaris and Sunspot
Python
2
star
44

dl_scaling

Scaling deep learning on HPC systems
Python
2
star
45

alcf-osu-benchmarks

OSU benchmarks on ALCF systems
Roff
1
star
46

scalable_conda_env

Shows how to set up a conda environment for large-scale runs
Python
1
star
47

storage

Storage benchmarks
C
1
star
48

inference-endpoints

This is a repository with examples to run inference endpoints on various ALCF clusters
Jupyter Notebook
1
star
49

tc-ipu

Implementation of triangle counting for Graphcore IPU
Emacs Lisp
1
star
50

balsam-serial-mode-profiling

Contains application tools and scripts for measuring balsam performance in serial mode.
Python
1
star
51

theta_conda_channel

Files for building anaconda cloud packages for Theta.
Shell
1
star
52

docker_image_recipes

A repository for image recipe files useful for ALCF systems.
Dockerfile
1
star
53

babeltrace2-ruby

Babeltrace2 Ruby bindings
Ruby
1
star
54

ALCF-AI-Testbed

HTML
1
star
55

cast-to-yaml

Extract information from a C AST
Ruby
1
star