• Stars
    star
    1
  • Language
    Python
  • Created about 1 year ago
  • Updated about 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

An open Chinese language byte piece encoding tokenization models.

More Repositories

1

CAIL

中国法研杯-司法人工智能挑战赛(CAIL2018-2020)
Python
89
star
2

icme2019

短视频内容理解与推荐竞赛
Python
77
star
3

Chinese-PreTrained-BERT

We released BERT-wwm, a Chinese pre-training model based on Whole Word Masking technology, and models closely related to this technology. 我们发布了基于全词遮罩(Whole Word Masking)技术的中文预训练模型BERT-wwm,以及与此技术密切相关的模型
Python
53
star
4

lawa

“法阿”中文分词:做最好的 Python 法律中文分词组件
Python
22
star
5

Chinese-PreTrained-XLNet

本项目提供了面向中文的XLNet预训练模型,旨在丰富中文自然语言处理资源,提供多元化的中文预训练模型选择。 我们欢迎各位专家学者下载使用,并共同促进和发展中文资源建设。
Python
11
star
6

Open-Prompt-Research

Some thoughts on prompts for Large Language Models.
Python
9
star
7

lawaplugin

lawa-plugin是Elasticsearch的中文分词插件,后端模型是由法律法规、案例、期刊语料统计而成,并具有新词发现功能。
Java
6
star
8

rct-intel-hw-recommend

2021微信大数据挑战赛
Python
6
star
9

aft-pytorch

Unofficial PyTorch implementation of **Attention Free Transformer**'s layers by [Zhai](https://twitter.com/zhaisf?lang=en), et al. [[abs](https://openreview.net/forum?id=pW--cu2FCHY), [pdf](https://arxiv.org/pdf/2105.14103.pdf)] from Apple Inc.
Python
6
star
10

OpenAI-Tech-Research

A project following OpenAI's cutting edge technology
Jupyter Notebook
4
star
11

Open-GPT

GPT is a belief. This project provides a code library for efficiently training Chinese GPT. This project uses the nanoGPT framework and the novel optimizer algorithm sophia.
Jupyter Notebook
4
star
12

DeepLayerNorm

An unofficial implement of DeepNorm. https://arxiv.org/pdf/2203.00555.pdf
Python
4
star
13

dezhoukv-cpp

DezhouKV的C++版本实现(C++ implementation of DezhouKV database)
Roff
3
star
14

academic

A site of self introduction in acdemic.
Vue
2
star
15

Open-Transformer

An open transformer project with T5-style architecture.
Jupyter Notebook
1
star
16

Lanton

A C implementation of Google's consistent hashing algorithm used in Maglev system(https://github.com/kkdai/maglev)
C
1
star
17

CRF

A Conditional Random Field Model based Chinese Word Segmentation Project.
Python
1
star
18

Open-BERT-IR

Open BERT rank pretrained models.
Jupyter Notebook
1
star
19

PyCRM

CRM system.
HTML
1
star
20

CBLSTM

A comparision of C-LSTM and BiLSTM on sentiment classification task in Chinese.
Python
1
star
21

lawrouge

“法摘”中英文摘要评价:做最好的 Python 中英文摘要评价组件
Python
1
star
22

dezhoukv

一个普适性、可扩展、高性能、高可用的KV集群,可以同时架设在数十到数千台集群服务器上,存储几百TB数据。
Python
1
star
23

trading_okcoin

www.okcoin.cn
Groovy
1
star
24

courtpy

scrapy the court.
Python
1
star
25

LSTM

A LSTM based Chinese Segment Project.
Vue
1
star
26

APSP

An algorithm to calculate All Paires Shortest Path efficiently.
Java
1
star
27

CAIL2021

Some competitions
Python
1
star