• Stars
    star
    69
  • Rank 452,512 (Top 9 %)
  • Language
    C++
  • License
    Apache License 2.0
  • Created over 4 years ago
  • Updated 3 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Distributed Representations of Words using word2vec

More Repositories

1

taskscheduleR

Schedule R scripts/processes with the Windows task scheduler.
R
331
star
2

image

Computer Vision and Image Recognition algorithms for R users
C++
270
star
3

udpipe

R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
C++
209
star
4

audio.whisper

Transcribe audio files using the "Whisper" Automatic Speech Recognition model from R
C
109
star
5

ruimtehol

R package to Embed All the Things! using StarSpace
C++
100
star
6

BTM

Biterm Topic Modelling for Short Text with R
C++
93
star
7

textrank

Summarise text by finding relevant sentences and keywords using the Textrank algorithm
R
76
star
8

pattern.nlp

R package to perform sentiment analysis and Parts of Speech tagging for Dutch/French/English/German/Spanish/Italian
R
67
star
9

crfsuite

Labelling Sequential Data in Natural Language Processing with R - using CRFsuite
C
62
star
10

textplot

Text Plots
R
53
star
11

ETM

Topic Modelling in Semantic Embedding Spaces
R
49
star
12

doc2vec

Distributed Representations of Sentences and Documents
C++
46
star
13

golgotha

Contextualised Embeddings and Language Modelling using BERT and Friends using R
R
44
star
14

RDRPOSTagger

R package for Ripple Down Rules-based Part-Of-Speech Tagging (RDRPOS). On more than 45 languages.
R
35
star
15

spark.sas7bdat

Read in SAS data in parallel into Apache Spark
R
26
star
16

sentencepiece

R package for Byte Pair Encoding / Unigram modelling based on Sentencepiece
C++
24
star
17

GAlogger

Log R Events and R Usage to Google Analytics
R
23
star
18

BelgiumMaps.StatBel

Administrative boundaries of Belgium based on Open Data available at Statistics Belgium
R
16
star
19

tokenizers.bpe

R package for Byte Pair Encoding based on YouTokenToMe
C++
14
star
20

dlib

allowing R users to work with dlib through Rcpp
C++
13
star
21

udpipe.models.ud

custom udpipe models
R
11
star
22

nametagger

Named Entity Recognition with the Nametag Maximum Entropy Markov model
C++
10
star
23

drat

4
star