• Stars
    star
    5
  • Rank 2,861,937 (Top 57 %)
  • Language
    Jupyter Notebook
  • Created almost 8 years ago
  • Updated over 7 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Stuff for the upcoming IR course 2017

More Repositories

1

Turku-neural-parser-pipeline

A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more than 50 languages. Top ranker in the CoNLL-18 Shared Task.
Python
103
star
2

FinBERT

BERT model trained from scratch on Finnish
Shell
86
star
3

Finnish-dep-parser

The Finnish dependency parsing pipeline being developed by the Turku NLP group. Documentation:
Python
49
star
4

wikibert

BERT models for many languages created from Wikipedia texts
31
star
5

Text_Mining_Course

Stuff for the Text Mining course
Jupyter Notebook
25
star
6

ocr-correction

Post-processing OCR errors with seq2seq models
Python
25
star
7

finngen-tools

Tools for training causal language models for Finnish
Python
25
star
8

Deep_Learning_in_LangTech_course

Materials for the University of Turku course TKO_8965 Deep Learning in Human Language Technology (previously named TKO_2101 Natural Language Processing)
Jupyter Notebook
14
star
9

bert-eval

Python
9
star
10

turku-ner-corpus

Open broad-coverage corpus for Finnish named entity recognition.
Python
9
star
11

turku-one

Turku OntoNotes Entities Corpus (TurkuONE)
8
star
12

pubmed_parses

Syntactic parses and named entity recognition for PubMed abstracts and PubMed Central full documents
8
star
13

finnish-generative-model-eval

Evaluation of Finnish generative models
Python
6
star
14

class-explainer

Python
5
star
15

Finnish_PropBank

Finnish Proposition Bank
CSS
4
star
16

intro-to-nlp

Introduction to Natural Language Processing
Jupyter Notebook
4
star
17

register-labeling

Python
4
star
18

RAG-web-app

HTML
3
star
19

Turku-paraphrase-corpus

Python
3
star
20

biBERT

Finnish English bilingual BERT models
3
star
21

BINF_Programming

Stuff for the BINF programming course (@fginter)
Jupyter Notebook
3
star
22

ATP_kurssi

Jupyter Notebook
3
star
23

multilingual-register-labeling

Multilingual, multilabel modeling of registers
Python
3
star
24

CAFA3

University of Turku CAFA3 project
Python
3
star
25

conll17-system

Instructions for TurkuNLP system in CoNLL 2017 Shared Task on Multilingual Parsing from Raw Text to Universal Dependencies.
Shell
2
star
26

WAC-XII

Data presented in the paper "From Web Crawl to Clean Register-Annotated Corpora"
2
star
27

textual-data-analysis-course

Jupyter Notebook
2
star
28

DIKI1002-Working-with-Text-in-Python

Jupyter Notebook
2
star
29

FinCORE

Finnish Corpus of Online REgisters
Python
2
star
30

BioCreativeVI_CHEMPROT_RE

Deep learning-based systems for biomedical relation extraction: recognizing the statements of relations between chemical compounds/drugs and genes/proteins from biomedical literature. The code is developed for our participation in the BioCreative VI Task 5 (CHEMPROT) challenge. Contact: [email protected]
Python
2
star
31

BioCreativeVI_BioID_assignment

Python
2
star
32

Corpus-linguistics

Code and data for the examples and use cases described in the article "Määrällinen korpuslingvistiikka" to be published in the book "Kielentutkimuksen metodologian käsikirja" in Finnish.
Python
2
star
33

korona-tweets

stuff for our korona-tweets
Python
1
star
34

ocr_errors_simulator

Functions and codes used to determine probabilities on OCR errors and simulate them
Python
1
star
35

registerlabeling

Python
1
star
36

Cell-line-recognition

Cell line names recognition and normalization
CSS
1
star
37

Digi_menetelmat

Johdatus digitaalisiin ihmistieteisiin -kurssin työpaja "Digitaaliset ihmistieteet kielentutkimuksessa: tekstinlouhinta"
Python
1
star
38

Multilingual-register-corpora

French Corpus of Online REgisters (FreCORE) and Swedish Corpus of Online REgisters (SweCORE)
1
star
39

BHE

End-to-end System for Bacteria Habitat Extraction: Named-entity recognition (NER), named-entity normalization, relation extraction. email: [email protected]
Python
1
star
40

dolly-fi

Finnish version of databricks-dolly-15k instruction dataset
Python
1
star
41

sentiment-target-corpus

Targeted sentiment corpus
1
star
42

dep_search

JavaScript
1
star
43

deepfin-tools

DeepFin tools
Python
1
star
44

SRNNMT

Sentence representation for translation finding
Python
1
star
45

CORE-corpus

1
star
46

FinCORE_full

1
star
47

oasst-fi

Open Assistant dataset translated to Finnish
Python
1
star
48

DigiHum16

Random course notes for the DigiHum16 course
Jupyter Notebook
1
star
49

wikipedia-toxicity-data-fi

Python
1
star
50

toxicity-classifier

Repository for all things related to classifying whether a text is toxic or not using data from https://github.com/TurkuNLP/wikipedia-toxicity-data-fi
Python
1
star
51

PB_solr

Work towards indexing the Finnish Parsebank in SOLR
Python
1
star
52

TDT_editor

The tree editor used to annotate the Turku Dependency Treebank. Vintage code, but putting it online in case someone finds it in any way useful.
Python
1
star
53

pytorch-registerlabeling

Jupyter Notebook
1
star