Scientific Information Analytics Group, Prof. Gipp (@gipplab)

Top repositories

1

d3-dataset

The official repository for the LREC'22 paper "D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science Research"
13
star
2

pdf-benchmark

A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents
Python
13
star
3

PhysWikiQuiz

>>PhysWikiQuiz<< - a Physics Question Generation and Interrogation System
Python
10
star
4

citeplag

Prototype of an external plagiarism detection system that combines the analysis of citations and text in academic documents to improve the identification of disguised forms of academic plagiarism
PHP
10
star
5

MathMLTools

This project provides various tools for processing content MathML with Java.
Java
9
star
6

FormulaCloudData

Discovering Mathematical Objects of Interest - A Study of Mathematical Notations
Java
9
star
7

vmext

vmext: A Visualization Tool for Mathematical Expression Trees
JavaScript
8
star
8

LaCASt

LaCASt - A LaTeX Translator for Computer Algebra Systems
Java
7
star
9

sherlock

Sherlock Plagiarism Detector by Rob Pike & Loki
C
6
star
10

AnnoMathTeX

>>AnnoMathTeX<< - a LaTeX formula annotation facilitation and recommendation tool for STEM documents
Python
6
star
11

mathosphere

Java
6
star
12

latexPaperTemplate

This repository serves as a template for LaTeX papers.
TeX
6
star
13

math2vec

Set of tools we used to create, cultivate and process datasets for our math2vec project.
HTML
5
star
14

zotero-backup

Python
5
star
15

MathMLben

A quality benchmark for MathML
HTML
4
star
16

grespa

A tool to obtain and analyze data from Google Scholar
HTML
4
star
17

formula-concept-retrieval

Methods for Formula Concept Discovery (FCD) and Formula Concept Recognition (FCR)
HTML
4
star
18

decentralized-open-science

Decentralized Open Science
3
star
19

OpenScienceTemplate

Template for OpenScience projects
3
star
20

node-mathml

VMEdit: A visual wikidata aware content MathML editor
JavaScript
3
star
21

imageplag

ImagePlag is an adaptive, scalable, and extensible image-based plagiarism detection system suitable for analyzing a wide range of image similarities.
VBA
3
star
22

eCoachSql

SQL checker for ecoach
Java
2
star
23

Electronic-Laboratory-Notebook

Python
2
star
24

bc_p2p

A peer to peer implementation of confidential bibliographic coupling detection.
Jupyter Notebook
2
star
25

MathRecGoldStandData

A gold standard dataset for recommending scientific documents with mathematical content.
Python
2
star
26

cs-insights-uptime

Uptime tracker for endpoints of the cs-insights project.
2
star
27

StorageUI

Online Publishing System Utilizing Peer-2-Peer to Support Priority Claims and Data Accessibility
HTML
2
star
28

MathWikiLink

MathWikiLink - an entity linking system for mathematical formulae
Python
2
star
29

dataAnnoMathTex

data repo for https://AnnoMathTeX.wmfLabs.org
TeX
2
star
30

acst

ACademic-STorage-cluster
Shell
2
star
31

recvis-frontend

This repository contains front-end source code for RecVis project.
JavaScript
2
star
32

chem_formula_extractor

The chemical formula extraction parses PDF files and extracts all checmical entities from these files.
Python
2
star
33

pds_ws23_LastName_FirstName

Template Git for the PDS
1
star
34

LLM-Investig-MathStackExchange

This repository contains the resources used for SIGIR'2024 paper "Can LLMs Master Math? Investigating Large Language Models on Math Stack Exchange"
Python
1
star
35

ws21-swt-007

Java
1
star
36

TEIMMA-Reuse-Annotator

TE (Text) - IM(Image) - MA(Math) reuse annotator
Python
1
star
37

texvc-ocaml

texvc component of the MediaWiki Math extension
OCaml
1
star
38

citrec

Java
1
star
39

cl-osa

Cross-language plagiarism detection using Wikidata
Java
1
star
40

MathMLConverters

Collection of service calls to convert from various input formats to MathML
Java
1
star
41

bib

Biblography
TeX
1
star
42

docker-receval

Recommender System Evaluations
Python
1
star
43

news-story-identification

Implementation for news story identification
Python
1
star
44

WikidataListGenerator

Creates a list of Page titles and their corresponding Wikidata Items
Java
1
star
45

node-cytoscape-mathml

Cytoscape MathML is a plugin to the graph visualization cytoscape with the dagre-plugin and provides the interactivity with the MathML tree.
JavaScript
1
star
46

MathMLSim

Similarity calculation module for MathML formulae
Java
1
star