• Stars
    star
    118
  • Rank 298,122 (Top 6 %)
  • Language
    Python
  • License
    GNU General Publi...
  • Created about 7 years ago
  • Updated 7 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

an example of a CUDA extension for PyTorch using CuPy which computes the Hadamard product of two tensors

pytorch-extension

This is an example of a CUDA extension/function/layer for PyTorch which uses CuPy to compute the Hadamard product of two tensors.

For a more advanced extension that uses CuPy as well, please see: https://github.com/sniklaus/softmax-splatting
And yet another extension that uses CuPy can be found here: https://github.com/sniklaus/revisiting-sepconv

setup

Make sure to install CuPy, which can be done using pip install cupy or alternatively using one of the provided binary packages as outlined in the CuPy repository.

usage

There is no separate build process necessary, simply run python run.py to test it. A minimal example of how the sample extension can be used is also shown below.

import torch

import hadamard # the custom layer

class Network(torch.nn.Module):
	def __init__(self):
		super().__init__()
	# end

	def forward(self, tenOne, tenTwo):
		return hadamard.hadamard_func.apply(tenOne, tenTwo)
	# end
# end

netNetwork = Network().cuda()

tenOne = torch.rand(64, 3, 128, 128).cuda().requires_grad_()
tenTwo = torch.rand(64, 3, 128, 128).cuda().requires_grad_()

tenOut = netNetwork(tenOne, tenTwo)
tenExpected = torch.mul(tenOne, tenTwo)

print(torch.sum(tenOut.data - tenExpected.data), '<-- should be 0.0')

license

Please refer to the appropriate file within this repository.

More Repositories

1

3d-ken-burns

an implementation of 3D Ken Burns Effect from a Single Image using PyTorch
Python
1,497
star
2

sepconv-slomo

an implementation of Video Frame Interpolation via Adaptive Separable Convolution using PyTorch
Python
1,008
star
3

pytorch-pwc

a reimplementation of PWC-Net in PyTorch that matches the official Caffe version
Python
602
star
4

pytorch-hed

a reimplementation of Holistically-Nested Edge Detection in PyTorch
Python
451
star
5

softmax-splatting

an implementation of softmax splatting for differentiable forward warping using PyTorch
Python
436
star
6

pytorch-liteflownet

a reimplementation of LiteFlowNet in PyTorch that matches the official Caffe version
Python
400
star
7

pytorch-spynet

a reimplementation of Optical Flow Estimation using a Spatial Pyramid Network in PyTorch
Python
298
star
8

wasm-raytracer

a performance comparison of a simple raytracer in JavaScript, asm.js, WebAssembly, and GLSL
HTML
168
star
9

pytorch-unflow

a reimplementation of UnFlow in PyTorch that matches the official TensorFlow version
Python
143
star
10

youtube-watchmarker

a browser extension that keeps track of your YouTube watch history and marks videos that you have already watched
JavaScript
141
star
11

revisiting-sepconv

an implementation of Revisiting Adaptive Convolutions for Video Frame Interpolation using PyTorch
Python
77
star
12

arxiv-doom

a parody of the ever-increasing amount of papers that appear on arXiv
HTML
32
star
13

teaching-vision

the framework for my computer vision class, in which the students are ought to solve various exercises
Python
24
star
14

teaching-webdev

the framework for my full stack web development class, in which the students are ought to solve various exercises
HTML
22
star
15

bookmark-tab

JavaScript
14
star
16

teaching-minichess

the framework for my advanced artificial intelligence class, in which an artificial chess player is ought to be implemented
C
14
star
17

nes-memoryview

visualizing the value of each individual byte in an emulated NES as a time series
HTML
9
star
18

teaching-confour

the framework for my advanced artificial intelligence class, in which an connect-four player is ought to be implemented
C
3
star
19

resume

my personal resume written in LaTeX for others to use
HTML
1
star