• Stars
    star
    182
  • Rank 209,847 (Top 5 %)
  • Language
    C
  • License
    Apache License 2.0
  • Created about 6 years ago
  • Updated over 4 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

面向中文电子病历的命名实体识别

运行流程:

@Author zhoukaiyin

任务描述

本评测任务为面向中文电子病历的命名实体识别,即对于给定的一组电子病历纯文本文档,任务的目标是识别并抽取出与医学临床相关的实体提及(entity mention),并将它们归类到预先定义好的类别(pre-defined categories),比如症状,药品,手术等。

第一步:数据处理(Linux)

$python raw2bio.py -1 #将训练数据分词并贴上字典特征
$python raw2bio.py -2 #将标签数据分词并贴上标签
$python raw2bio.py -3 #将标签保存成pickle文件为了后面将训练数据与标签合在一起
$python raw2bio.py -4 #将标签与训练数据文本接起来构成如下格式
$python raw2bio.py -1 test #将测试数据处理成需要的格式

第二部:模型训练(Linux)

$bash wapiti_ccks.sh #训练模型,模型储存在/eval/bio_ccks中

第三部分:获得结果(Linux)

$python get_result.py #提取结果文件,结果保存在CCKS_result中其格式为BIO和finall中格式为官方标签格式
$python onefile.py #将结果转成提交格式

结果文件

Flyon\CCKS_CRF\eval\result.txt

Wapiti is a simple and fast discriminative sequence labeling toolkit ( http://wapiti.limsi.fr )。A little same as CRF++

注:可以尝试BERT,ALBERT等预训练模型 参见:NLPGNN

More Repositories

1

BERT-NER

Use Google's BERT for named entity recognition (CoNLL-2003 as the dataset).
Python
1,231
star
2

NLPGNN

1. Use BERT, ALBERT and GPT2 as tensorflow2.0's layer. 2. Implement GCN, GAN, GIN and GraphSAGE based on message passing.
Python
329
star
3

Bert-BiLSTM-CRF

This model base on bert-as-service. Model structure : bert-embedding bilstm crf.
Python
38
star
4

Relation_Extration

Relation Classification via Convolutional Deep Neural Network
Python
13
star
5

Sequence_Labeling

BiLSTM+CRF
Python
10
star
6

Attention_CNN

Relation Classification via Multi-Level Attention CNNs
Python
6
star
7

CCLNER

Python
5
star
8

knowledge-graph

a QA Demo based on KG! use scrapy and jena.
Python
4
star
9

drugbank_search

The drugbank database is stored locally with es to provide highly customizable searches!
Python
4
star
10

Pytorch-BiLSTM-Attention-CRF

4
star
11

AGAC

Supplementary data for Bioinformatics.
AMPL
3
star
12

Variational-autoencoder

Knowledge about Variational autoencoder!
Python
3
star
13

Crawler

Download related informations from drugbank using requests and selenium.webdriver.
Python
3
star
14

Machine-Learning

Python
3
star
15

Gumble_lstm

Tensorflow 实现,将Gumble 函数用在 LSTM的每一个units中
Python
3
star
16

NLP_Dataset

Repo for collect NLP realted datasets!
2
star
17

Machine_learning

Jupyter Notebook
2
star
18

JavaDemo

java 学习笔记
Java
2
star
19

2019SpringTextM

Python
2
star
20

Algorithm-python

日常刷题存放
Python
2
star
21

GDAMDB

Python
2
star
22

tfmetrics

package "tfmetrics" will be used to eval f1_score, recall_score, precision_score in tensorflow>1.12
Python
2
star
23

Python

This file used to record my study details!
Python
1
star
24

Tensorflow2.0-Demo

Python
1
star
25

R-Chip-seq

R
1
star
26

crf-lstm-text

1
star
27

Intelligent-optimization-method

Code for Intelligent optimization method
Python
1
star
28

breastcancer

HTML
1
star
29

TensorFlow

Python
1
star
30

NLPGNNDATA

1
star
31

High-performance-parallel-computing

Some codes to practice Parallel Computing
C
1
star
32

Web_Crawler

爬虫小项目,用来抓取智联招娉网站上的招聘信息,并存入mongodb中。
Python
1
star