GlassyWing/nvae

Stars: 108
Rank: 321,259 (Top 7%)
Language: Python
License: Apache License 2.0
Created over 4 years ago
Updated over 3 years ago

An unofficial toy implementation of "NVAE: A Deep Hierarchical Variational Autoencoder"

nvae

This implementation is tiny and intended only for learning and validation. The model structure:

As shown in the figure above, the model architecture is very simple. Images are scaled to 64x64 to reduce the amount of computation.

The following operations are not implemented because they are refinements that greatly increase the computational cost:

IAF (inverse autoregressive flow)
Discretized mixture of logistics distribution (replaced with a lighter adaptive loss)

Example

Hair and other details are still blurry, mostly because the training images are low-quality 64x64 images and the model is really tiny.

Usage

Training

Just specify the image directory; run python train.py -h to see more options.

python train.py --dataset_path <img_directory> --batch_size 128

During training, the dataloader crops the central area of each image and scales it to 64 x 64. Checkpoints are saved to the checkpoints directory.
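The preprocessing step can be illustrated with a small sketch. It assumes the "central area" is the largest centered square, which this README does not state; the function name `center_crop_box` and the example dimensions are hypothetical and not taken from the repository.

```python
def center_crop_box(width, height):
    """Return (left, top, right, bottom) of the largest centered square.

    Assumption: the crop is the largest centered square; the repository
    may use a different crop size.
    """
    side = min(width, height)
    left = (width - side) // 2
    top = (height - side) // 2
    return left, top, left + side, top + side


TARGET_SIZE = (64, 64)  # output resolution stated in this README

# Example dimensions chosen only for illustration.
box = center_crop_box(178, 218)

# With Pillow, the box could then be applied to an opened image as:
#   img.crop(box).resize(TARGET_SIZE)
```

Cropping to a square before resizing keeps the aspect ratio, so faces are not stretched when the image is reduced to 64 x 64.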

Generate images

Replace the default checkpoint path in random_sample.py and run the example as follows:

python random_sample.py

Or generate a 768 x 768 image containing 144 sub-images:

python gen_imgs.py
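The numbers above are consistent with a 12 x 12 grid of 64 x 64 tiles (768 / 64 = 12, and 12 x 12 = 144). The sketch below only works out that arithmetic; the square, unpadded layout and the helper name `tile_origins` are assumptions, not details confirmed by gen_imgs.py.

```python
TILE = 64     # sub-image side, per this README
CANVAS = 768  # output image side, per this README


def tile_origins(canvas=CANVAS, tile=TILE):
    """Top-left (x, y) pixel offsets of every tile, listed row by row.

    Assumption: tiles fill a square grid with no padding between them.
    """
    per_side = canvas // tile
    return [(col * tile, row * tile)
            for row in range(per_side)
            for col in range(per_side)]


origins = tile_origins()
print(len(origins))             # 144
print(origins[0], origins[-1])  # (0, 0) (704, 704)
```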

Variable control experiment

The highest-level variables control advanced attributes such as face shape, hairstyle, background, gender, and direction:

Secondary variables seem to control facial muscles:

The lowest-level variables seem to be just some noise:
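One plausible way to run such an experiment is to hold the latent variables at every level fixed except one, resample that level, and compare the decoded images. The README does not describe the actual procedure, so the sketch below is an assumption: the function names, the level sizes, and the use of independent standard-normal draws are all hypothetical (in a hierarchical VAE the prior at lower levels normally depends on the higher levels).

```python
import random


def sample_latents(level_sizes, rng):
    """Draw one standard-normal vector per hierarchy level."""
    return [[rng.gauss(0.0, 1.0) for _ in range(n)] for n in level_sizes]


def resample_level(latents, level, rng):
    """Copy the latents, redrawing only the chosen level."""
    varied = [list(z) for z in latents]
    varied[level] = [rng.gauss(0.0, 1.0) for _ in latents[level]]
    return varied


rng = random.Random(0)
LEVEL_SIZES = [4, 8, 16]  # hypothetical sizes, highest level first

base = sample_latents(LEVEL_SIZES, rng)
variant = resample_level(base, level=0, rng=rng)

# `variant` differs from `base` only at level 0. Decoding both with a
# trained model would show which attributes that level controls.
```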

bi-lstm-crf

Chinese word segmentation and part-of-speech tagging based on Bi-LSTM + CRF, implemented with Keras

text-detection-ocr

Chinese text detection and recognition based on CTPN + DenseNet, using Keras and TensorFlow

transformer-word-segmenter

Sequence labeling based on Universal Transformer (Transformer encoder) and CRF, for Chinese word segmentation and part-of-speech tagging

fourier-feature-networks

An unofficial PyTorch implementation of "Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains"

yolo_deepsort

Fast MOT based on YOLO + DeepSORT; supports YOLOv3 and YOLOv4

transformer-keras

An implementation of the Transformer model from the paper "Attention Is All You Need", using Keras + TensorFlow.

stomp_ws_py

Stomp over websocket for python

better-jieba

A better Java version of jieba

ode_rnn

Fitting differential operators with an RNN

keras_dataloader

DataLoader for keras

ogan-torch

An unofficial pytorch implementation for OGAN

graph_gru

Reconstructed GRU, used to process the graph sequence.

stable-net

Unofficial pytorch implementation of StableNet

TorchDiffusion

A diffusion model implementation based on LibTorch

gon_emb

Using Fourier feature mapping to strengthen the performance of GON ("Gradient Origin Networks")

searching-recommend

A component retrieval and recommendation system based on Solr and collaborative filtering

nn_precipitation_forecast

Research on 1-60 day precipitation forecasting based on neural networks

components-recommend

An ItemCF-based component recommendation model for items that have a sequential relationship

OnLSTM-torch

OnLSTM implemented in PyTorch

learn-spring

A Maven project containing source code written while learning Spring

tobacco_diseases_example

A usage example for a tobacco disease prediction tool

nn_book_answers

Exercises from the book "Neural Networks and Deep Learning" (《神经网络与深度学习》)

stock_crawler

Stock quote crawler

DailyPaper

A news reader client built with Material Design

yolo3_torch

YOLOv3 video detection

srapp

Web application for the Solr-based component retrieval and recommendation system

tokenizer

Chinese word segmentation based on a Bi-LSTM + CRF model, implemented with ND4J

GlassyWing.github.io

Personal blog publishing page

sent_embedding

Sentence embedding

spark-runner

Runs Spark GraphX