• Stars
    star
    63
  • Rank 484,938 (Top 10 %)
  • Language Cuda
  • License
    BSD 3-Clause "New...
  • Created over 8 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Subset of BLAS routines optimized for NVIDIA GPUs

More Repositories

1

exageostat

A High Performance Unified Framework for Geostatistics on Manycore Systems.
C
36
star
2

hicma

HiCMA: Hierarchical Computations on Manycore Architectures
Jupyter Notebook
27
star
3

girih

This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distributed memory systems
C
20
star
4

bemfmm

Extreme scale FMM-accelerated boundary integral equation solver for wave scattering.
C++
16
star
5

rdf-exp

C++
15
star
6

kfun3d

Unstructured computations on emerging architectures.
C
13
star
7

polar

Distributed-memory, double-precision, polar decomposition (QDWH/ZOLO-PD) of a dense matrix, svd (QDWH/ZOLOPD-SVD) of a dense matrix
C
12
star
8

h2opus

H2Opus: a performance-oriented library for hierarchical matrices
C++
11
star
9

exageostatR

An R Package for the Maximum Likelihood Evaluation on Large-Scale Spatial Datasets using Many-core Systems.
R
9
star
10

al4san

AL4SAN stands for an Abstraction Layer library For Standardizing APIs of task-based eNgines.
C
9
star
11

tlrmvm

C++
7
star
12

ksvd

The KAUST SVD (KSVD) is a high performance software framework for computing a dense SVD on distributed-memory manycore systems.
C
7
star
13

stars-h

Software for Testing Accuracy, Reliability and Scalability of Hierarchical computations.
C
6
star
14

ecrc_cmake

This project provides a collection of CMake modules that can be shared among projects using CMake as build system. For now it is mainly constituted of "Find" modules that help detecting installed libraries on the system.
CMake
6
star
15

ExaGeoStatCPP

C++
5
star
16

kblas

4
star
17

BeBeCA

A benchmark for evaluating approximate betweenness centrality algorithms.
C++
4
star
18

README

Useful information - Readme first
3
star
19

kblas-cpu

A subset of BLAS routines optimized for x86 architectures
C
2
star
20

dare

C++
2
star
21

quark

C
1
star
22

ACR

Accelerated Cyclic Reduction - Solver for Structured Linear Systems
C
1
star
23

hcore

BLAS operations for matrices in tile low-rank format
C
1
star
24

hcorepp

C++ API for the BLAS of Tile Low-rank Matrix Algebra
C++
1
star
25

moao

MOAO simulation framework for manycore architectures.
C++
1
star