• Stars
    star
    3
  • Rank 3,963,521 (Top 79 %)
  • Language
    Python
  • Created 12 months ago
  • Updated 7 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Using DeepSpeed and Nvidia DALI to train various models to solve CIFAR-10

More Repositories

1

supercharger

Supercharge Open-Source AI Models
Python
348
star
2

dora

Implementation of DoRA
Python
276
star
3

Zpng

Better lossless compression than PNG with a simpler algorithm
C
267
star
4

wirehair

Wirehair : O(N) Fountain Code for Large Data
C++
267
star
5

self-discover

Implementation of Google's SELF-DISCOVER
Python
263
star
6

WLANOptimizer

Single-header C library that fixes WiFi performance issues for online gaming and other low-latency real-time network traffic.
C++
223
star
7

longhair

Longhair : O(N^2) Cauchy Reed-Solomon Block Erasure Code for Small Data
C++
156
star
8

leopard

Leopard-RS : O(N Log N) MDS Reed-Solomon Block Erasure Code for Large Data
C++
135
star
9

shorthair

Shorthair : Generational Block Streaming Erasure Codes
C++
128
star
10

TimeSync

TimeSync: Time Synchronization Library in Portable C++
C++
122
star
11

cm256

Fast GF(256) Cauchy MDS Block Erasure Codec in C
C++
107
star
12

tonk

Tonk : Reliable UDP (rUDP) Network Library and Infinite Window Erasure Code
C++
101
star
13

Zdepth

Zdepth :: Streaming Depth Compressor in C++ for Azure Kinect DK
C++
97
star
14

xrcap

Azure Kinect multi-camera secure network capture/record/replay
C++
76
star
15

snowshoe

Snowshoe - Portable, Secure, Fast Elliptic Curve Math Library in C
C++
62
star
16

XRmonitors

XRmonitors : User-Friendly Virtual Multi-Monitors for the Workplace
C++
54
star
17

tabby

Tabby - Strong, Fast, and Portable Cryptographic Signatures, Handshakes, and Password Authentication
C++
50
star
18

gf256

GF256 - Fast 8-bit Galois Field Math in C
C++
50
star
19

kvm

Low-Bandwidth IP KVM using Raspberry Pi 4
C++
48
star
20

siamese

Siamese : Infinite-Window Streaming Erasure Code (HARQ)
C++
47
star
21

bitnet_cpu

Experiments with BitNet inference on CPU
C++
46
star
22

ZdepthLossy

Lossy version of Zdepth using video encoders
C++
41
star
23

fecal

FEC-AL : O(N^2) Fountain Code for Small Data
C++
36
star
24

CauchyCaterpillar

Cauchy Caterpillar : O(N^2) Short-Window Streaming Erasure Code
C++
35
star
25

aiwebcam2

Second attempt at AI webcam, this time with OpenAI API
Python
32
star
26

upsampling

Image Upsampling with PyTorch
Python
22
star
27

calico

Calico - Strong, Fast, and Portable Authenticated Encryption
C++
21
star
28

cymric

Cymric - Portable secure random number generator
C++
20
star
29

loraftp

File transfer between two Raspberry Pis using the LoRa Pi HAT from Waveshare
C
19
star
30

mau

Network simulator for reliable UDP testing in C++
C++
16
star
31

oaillama3

Simple setup to self-host LLaMA3-70B model with an OpenAI API
16
star
32

lllm

Latent Large Language Models
Python
16
star
33

PacketAllocator

C++ Memory allocator for packet queues that free() in roughly the same order that they alloc().
C++
15
star
34

spectral_ssm

Implementation of Spectral State Space Models
Python
15
star
35

libcat

Common code library
C++
14
star
36

AutoAudiobook

Automatically create an audiobook using OpenAI
Python
14
star
37

minigpt4

MiniGPT-4 :: Updated to Torch 2.0, simple setup, easier API, cut out training code
Python
13
star
38

sdxl

SDXL GPU cluster scripts
Python
13
star
39

dataloader

High-performance tokenized language data-loader for Python C++ extension
C++
12
star
40

fp61

Experiment: Fast finite field Fp=2^61-1 in C++
C++
11
star
41

cuda_float_compress

Python package for compressing floating-point PyTorch tensors
Cuda
10
star
42

phind

Locally hosted: 60% HumanEval
Python
8
star
43

rtmp_receiver

Simple unidirectional RTMP video stream receiver
C++
7
star
44

counter

C++ wrapper for counters that can roll-over (e.g. timestamps/ack-ids)
C++
6
star
45

z16

16-bit monochrome image compressor based on Zstd
C
5
star
46

AQLM

Fixes for AQLM
Python
5
star
47

llamanal.cpp

Static code analysis for C++ projects using llama.cpp and the best LLM you can run offline without an expensive GPU.
C
5
star
48

unfiltered-diffusers

Simple fork that disables NSFW filter
Python
5
star
49

hloc

Python
4
star
50

boss-balloon

BossBalloon.io
TypeScript
4
star
51

libcatid

Automatically exported from code.google.com/p/libcatid
C++
3
star
52

voron

Voron 3D Printer files
3
star
53

halide-test

Test v14/v15 performance regression
CMake
3
star
54

logger

Feature-rich portable C++ logging subsystem in 650 lines of code
C++
3
star
55

chainlit-anthropic

Chainlit AI UI with Anthropic Backend
Python
3
star
56

fastest_gf_matrix_mult

A fairly hacked together piece of code that can quickly search for the optimal GF polynomials and Cauchy matrices for XOR-based GF matrix multiplication for erasure code encoders. May be useful if the matrices are of fixed size.
C++
3
star
57

recfilter-2020-fail

This is a failed attempt to port Recfilter to latest Halide from 2020. Maybe someone else can figure this out?
C++
2
star
58

bentpipe

Simple UDP rebroadcaster
C++
2
star
59

whisper3

Testing out Whisper 3
Python
2
star
60

sphynx

Sphynx - High Performance Network Transport Layer
C++
2
star
61

textworld_llm_benchmark

TextWorld LLM Benchmark
Python
2
star
62

pixel-perfect-sfm

pixel-perfect-sfm with some minor fixes
C++
2
star
63

train_ticket

Gated PRNGs are all you need? Spoilers: No.
Python
2
star
64

CatsChoice

PRNG Parameter Generation
C++
1
star
65

CRC16Recovery

Optimized CRC16 with error recovery in C
C++
1
star
66

Splane

Archived old code - /ban_ids/ is kind of interesting
C
1
star
67

rust_webgl_demo

Hello World : Rust Web Assembly
Rust
1
star
68

blog2022

HTML
1
star
69

swe_agent_playground

swe_agent_playground
1
star
70

Exapunks_Solutions

Exapunks Walkthrough Solutions
1
star
71

audio_prediction

Simple audio prediction example with RNN
Python
1
star
72

never_forget

Implementation of Overcoming Catastrophic Forgetting
Python
1
star
73

DependencyInjected

DependencyInjected : Light-weight and powerful Dependency Injection pattern for C++
C++
1
star
74

quicsend

quicsend :: Super-fast Internet-ready file transfer right from Python
C++
1
star