Piji Li (@lipiji)
  • Stars
    star
    3,316
  • Global Rank 8,789 (Top 0.4 %)
  • Followers 1,458
  • Following 607
  • Registered almost 13 years ago
  • Most used languages
    Python
    68.5 %
    Java
    9.3 %
    Shell
    5.6 %
    C++
    5.6 %
    HCL
    3.7 %
    R
    1.9 %
    HTML
    1.9 %
    MATLAB
    1.9 %
    Objective-C
    1.9 %

Top repositories

1

App-DL

Deep Learning and applications in Startups, CV, NLP
801
star
2

AIStartups

Startups about artificial intelligence. (DM, ML, NLP, CV...)
595
star
3

SongNet

Code for ACL 2020 paper "Rigid Formats Controlled Text Generation":https://www.aclweb.org/anthology/2020.acl-main.68/
Python
226
star
4

neural-summ-cnndm-pytorch

Neural abstractive summarization (seq2seq + copy (or pointer network) + coverage) in pytorch on CNN/Daily Mail
Python
203
star
5

Guyu

Chinese GPT2: pre-training and fine-tuning framework for text generation
Python
188
star
6

TranSummar

Transformer for abstractive summarization on cnn/daily-mail and gigawords
Python
139
star
7

JRNN

LSTM and GRU in JAVA
Java
114
star
8

PG_BOW_DEMO

Image Classification using Bag of Words and Spatial Pyramid BoW
C++
110
star
9

hierarchical-encoder-decoder

Hierarchical encoder-decoder framework for sequences of words, sentences, paragraphs and documents using LSTM and GRU in Theano
Python
109
star
10

TtT

code for ACL2021 paper "Tail-to-Tail Non-Autoregressive Sequence Prediction for Chinese Grammatical Error Correction"
Python
99
star
11

PG_Curve

Matlab code for computing and visualization: Confusion Matrix, Precision/Recall, ROC, Accuracy, F-Measure etc. for Classification.
Objective-C
91
star
12

PG_DEEP

demo of deep belief nets
C++
69
star
13

rnn-theano

RNN(LSTM, GRU) in Theano with mini-batch training; character-level language models in Theano
Python
69
star
14

variational-autoencoder-theano

Variational Autoencoders (VAEs) in Theano for Images and Text
Python
55
star
15

DRGD-LCSTS

code for "Deep Recurrent Generative Decoder for Abstractive Text Summarization"
Python
53
star
16

dialogue-hred-vhred

HRED VHRED VHCR for Multi-Turn Dialogue Systems
Python
44
star
17

lipiji.github.io

HTML
31
star
18

datasets

datasets for NLP research
24
star
19

HFT

code:Hidden factors and hidden topics: Understanding rating dimensions with review text.
Shell
23
star
20

uChecker

Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"
18
star
21

rnn-pytorch

study pytorch
Python
15
star
22

NRT-theano

Code for our SIGIR'2017 paper "Neural Rating Regression with Abstractive Tips Generation for Recommendation"
Python
14
star
23

data-summ-cnn_dailymail

non-anonymized cnn/dailymail dataset for text summarization
Python
12
star
24

JLBFGS

Limited-memory Broyden-Fletcher-Goldfarb-Shanno (L-BFGS) in Java
Java
10
star
25

PG_PageRank

pagerank using mapreduce format
Shell
9
star
26

vae-salience

"Salience Estimation via Variational Auto-Encoders for Multi-Document Summarization"
HCL
9
star
27

PG_PLSA

plsa demo in python
Python
7
star
28

neural-dialogue-s2s-weibo-py3

Python
7
star
29

world2vec

Pre-trained word and phrase vectors
7
star
30

neural-topic-model

neural topic model based on VAE - theano
Python
6
star
31

stopwords

6
star
32

t5_summarization

Python
6
star
33

language_model_transformer

language model via transformer
Python
6
star
34

vae-salience-ramds

"Reader-Aware Multi-Document Summarization: An Enhanced Model and The First Dataset"
HCL
6
star
35

gan-bow-text

Generative Adversarial Network (GAN) for text modeling
Python
5
star
36

bert_zh_open200g_wordpiece

Python
5
star
37

jilp

Java ILP is a simplified java interface to (mixed) integer linear programming solvers like, e.g., lp_solve, Glpk, SAT4J (0-1 ILP), CPLEX, or Mosek.
Java
4
star
38

cws-seq2seq

Chinese Word Segment using Seq2Seq Framework.
4
star
39

SwarmRank

Particle Swarm Optimization for Classification and Recommender Systems
MATLAB
4
star
40

PG_ROC_PR_R

ROC and PR curve using R
R
4
star
41

collaborative-topic-regression

C++
4
star
42

textrank_keyword_summary

textrank based keywords extraction and summarization
Python
3
star
43

interpretability-methods

gradients based interpretability methods
Python
3
star
44

tokenizer_zh

Python
3
star
45

TextAdventure

3
star
46

pointer_generator_csc_lstm

Python
3
star
47

PG_LINEAR

L_p-Regularized logistic regression using Gradient Decent (batch)
Shell
3
star
48

TopCJ

Top Conference Timeline
3
star
49

PATG

Code for WWW2019 paper "Persona-Aware Tips Generation"
2
star
50

gan-intro-theano

Generative Adversarial Networks (GAN) example in Theano.
Python
2
star
51

water_level_prediction

Python
2
star
52

S5

2
star
53

adversarial-variational-autoencoders

Adversarial Variational Auto-Encoders (AVAEs) in Theano
Python
2
star
54

TextRefiner

Java
1
star
55

neural-fig2txt

Python
1
star
56

VecComp

Vector completion using sparse coding
Python
1
star
57

instruction_data

Python
1
star
58

corr

Python
1
star
59

pointer_generator_transformer

Python
1
star
60

WebHarvester

Web-Harvest is Open Source Web Data Extraction tool written in Java. This is an extension of the original version.
Java
1
star
61

Finetrain-BERT

Python
1
star
62

big_tpl_zh_10_base

Python
1
star
63

mlp-theano

MLP using Theano
Python
1
star
64

civrealm

CivRealm: A Learning and Reasoning Odyssey for Decision-Making Agents: https://civrealm.github.io/civrealm/
Python
1
star
65

RAG4DocReader

llamaindex testing
Python
1
star