• Stars
    star
    4
  • Rank 3,304,323 (Top 66 %)
  • Language
    R
  • Created 12 months ago
  • Updated 11 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Data and code supporting the study Manhattan, Euclidean, and their Siblings: Exploring Exotic Similarity Measures in Text Classification

More Repositories

1

stylo

R package for stylometric analyses
R
168
star
2

100_english_novels

A benchmark corpus of 100 English novels, covering the 19th and the beginning of the 20th century
11
star
3

stylo_howto

Documentation for 'stylo', an R package for text analysis, suitable for authorship attribution, stylometry, and other multivariate analysis tasks in the domain of (literary) texts
TeX
10
star
4

tidystopwords

Customizable lists of stopwords in multiple languages
R
6
star
5

A_Small_Collection_of_British_Fiction

A selection of 28 classic British novels from the 19th century (including a few late 18th-century items). Full text versions, in plain text format, harvested from trustworthy public domain sites.
6
star
6

100_polish_novels

A benchmark corpus of 100 Polish novels, covering the 19th and the beginning of the 20th century
4
star
7

68_german_novels

A benchmark corpus of 68 German novels, covering the 19th and the beginning of the 20th century
4
star
8

DHAbstracts_biblio_style

A bibliographic style definition for Digital Humanities 2016 conference
3
star
9

computationalstylistics.github.io

SCSS
3
star
10

NT_Vulgate

2
star
11

stylometry_of_papyri

2
star
12

preprints

A selection of pre-prints by the members of the Group
1
star
13

word_frequencies

Code for the study on improving relative word frequencies
1
star
14

presentations

HTML
1
star
15

litRiddle

The package contains the data of a reader survey about fiction in Dutch, a description of the novels the readers rated, and the results of stylistic measurements of the novels. The package also contains functions to combine, analyze, and visualize these data.
R
1
star