  • Stars
    44
  • Rank 634,439 (Top 13 %)
  • Language
    R
  • License
    Mozilla Public Li...
  • Created over 4 years ago
  • Updated over 4 years ago


Repository Details

Contextualised Embeddings and Language Modelling using BERT and Friends using R

More Repositories

1. taskscheduleR (R, 331 stars): Schedule R scripts/processes with the Windows task scheduler.
2. image (C++, 270 stars): Computer Vision and Image Recognition algorithms for R users.
3. udpipe (C++, 209 stars): R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit.
4. audio.whisper (C, 109 stars): Transcribe audio files using the "Whisper" Automatic Speech Recognition model from R.
5. ruimtehol (C++, 100 stars): R package to Embed All the Things! using StarSpace.
6. BTM (C++, 93 stars): Biterm Topic Modelling for Short Text with R.
7. textrank (R, 76 stars): Summarise text by finding relevant sentences and keywords using the Textrank algorithm.
8. word2vec (C++, 69 stars): Distributed Representations of Words using word2vec.
9. pattern.nlp (R, 67 stars): R package to perform sentiment analysis and Parts of Speech tagging for Dutch/French/English/German/Spanish/Italian.
10. crfsuite (C, 62 stars): Labelling Sequential Data in Natural Language Processing with R, using CRFsuite.
11. textplot (R, 53 stars): Text Plots.
12. ETM (R, 49 stars): Topic Modelling in Semantic Embedding Spaces.
13. doc2vec (C++, 46 stars): Distributed Representations of Sentences and Documents.
14. RDRPOSTagger (R, 35 stars): R package for Ripple Down Rules-based Part-Of-Speech Tagging (RDRPOS), supporting more than 45 languages.
15. spark.sas7bdat (R, 26 stars): Read in SAS data in parallel into Apache Spark.
16. sentencepiece (C++, 24 stars): R package for Byte Pair Encoding / Unigram modelling based on Sentencepiece.
17. GAlogger (R, 23 stars): Log R Events and R Usage to Google Analytics.
18. BelgiumMaps.StatBel (R, 16 stars): Administrative boundaries of Belgium based on Open Data available at Statistics Belgium.
19. tokenizers.bpe (C++, 14 stars): R package for Byte Pair Encoding based on YouTokenToMe.
20. dlib (C++, 13 stars): Allows R users to work with dlib through Rcpp.
21. udpipe.models.ud (R, 11 stars): Custom udpipe models.
22. nametagger (C++, 10 stars): Named Entity Recognition with the Nametag Maximum Entropy Markov model.
23. drat (4 stars)