• Stars
    star
    5
  • Rank 2,861,937 (Top 57 %)
  • Language
    HTML
  • Created about 6 years ago
  • Updated almost 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Set of tools we used to create, cultivate and process datasets for our math2vec project.

More Repositories

1

pdf-benchmark

A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents
Python
17
star
2

d3-dataset

The official repository for the LREC'22 paper "D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science Research"
13
star
3

PhysWikiQuiz

>>PhysWikiQuiz<< - a Physics Question Generation and Interrogation System
Python
11
star
4

MathMLTools

This project provides various tools for processing content MathML with Java.
Java
10
star
5

citeplag

Prototype of an external plagiarism detection system that combines the analysis of citations and text in academic documents to improve the identification of disguised forms of academic plagiarism
PHP
10
star
6

vmext

vmext: A Visualization Tool for Mathematical Expression Trees
JavaScript
9
star
7

FormulaCloudData

Discovering Mathematical Objects of Interest - A Study of Mathematical Notations
Java
9
star
8

AnnoMathTeX

>>AnnoMathTeX<< - a LaTeX formula annotation facilitation and recommendation tool for STEM documents
Python
7
star
9

LaCASt

LaCASt - A LaTeX Translator for Computer Algebra Systems
Java
7
star
10

sherlock

Sherlock Plagiarism Detector by Rob Pike & Loki
C
6
star
11

mathosphere

Java
6
star
12

latexPaperTemplate

This repository serves as a template for LaTeX papers.
TeX
6
star
13

zotero-backup

Python
5
star
14

MathMLben

A quality benchmark for MathML
HTML
4
star
15

grespa

A tool to obtain and analyze data from Google Scholar
HTML
4
star
16

formula-concept-retrieval

Methods for Formula Concept Discovery (FCD) and Formula Concept Recognition (FCR)
HTML
4
star
17

preprint_generator

CiteAssist generates preprints, suggests related papers, and adds BibTeX annotation to the PDF
TypeScript
4
star
18

decentralized-open-science

Decentralized Open Science
3
star
19

MathRecGoldStandData

A gold standard dataset for recommending scientific documents with mathematical content.
Python
3
star
20

OpenScienceTemplate

Template for OpenScience projects
3
star
21

node-mathml

VMEdit: A visual wikidata aware content MathML editor
JavaScript
3
star
22

imageplag

ImagePlag is an adaptive, scalable, and extensible image-based plagiarism detection system suitable for analyzing a wide range of image similarities.
VBA
3
star
23

eCoachSql

SQL checker for ecoach
Java
2
star
24

Electronic-Laboratory-Notebook

Python
2
star
25

bc_p2p

A peer to peer implementation of confidential bibliographic coupling detection.
Jupyter Notebook
2
star
26

dataAnnoMathTex

data repo for https://AnnoMathTeX.wmfLabs.org
TeX
2
star
27

cs-insights-uptime

Uptime tracker for endpoints of the cs-insights project.
2
star
28

StorageUI

Online Publishing System Utilizing Peer-2-Peer to Support Priority Claims and Data Accessibility
HTML
2
star
29

MathWikiLink

MathWikiLink - an entity linking system for mathematical formulae
Python
2
star
30

acst

ACademic-STorage-cluster
Shell
2
star
31

chem_formula_extractor

The chemical formula extraction parses PDF files and extracts all checmical entities from these files.
Python
2
star
32

recvis-frontend

This repository contains front-end source code for RecVis project.
JavaScript
2
star
33

pds_ws23_LastName_FirstName

Template Git for the PDS
1
star
34

LLM-Investig-MathStackExchange

This repository contains the resources used for SIGIR'2024 paper "Can LLMs Master Math? Investigating Large Language Models on Math Stack Exchange"
Python
1
star
35

ws21-swt-007

Java
1
star
36

TEIMMA-Reuse-Annotator

TE (Text) - IM(Image) - MA(Math) reuse annotator
Python
1
star
37

texvc-ocaml

texvc component of the MediaWiki Math extension
OCaml
1
star
38

citrec

Java
1
star
39

cl-osa

Cross-language plagiarism detection using Wikidata
Java
1
star
40

bib

Biblography
TeX
1
star
41

docker-receval

Recommender System Evaluations
Python
1
star
42

MathMLConverters

Collection of service calls to convert from various input formats to MathML
Java
1
star
43

WikidataListGenerator

Creates a list of Page titles and their corresponding Wikidata Items
Java
1
star
44

node-cytoscape-mathml

Cytoscape MathML is a plugin to the graph visualization cytoscape with the dagre-plugin and provides the interactivity with the MathML tree.
JavaScript
1
star
45

news-story-identification

Implementation for news story identification
Python
1
star
46

docker-latexml

LaTeXML docker container repo
Dockerfile
1
star
47

MathMLSim

Similarity calculation module for MathML formulae
Java
1
star