• Stars
    star
    1
  • Language
    Scheme
  • License
    MIT License
  • Created about 3 years ago
  • Updated about 3 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Collection of code and data for ACM/IEEE JCDL 2021 paper: "Garbage, Glitter, or Gold: Assigning Multi-dimensional QualityScores to Social Media Seeds for Web Archive Collections"

More Repositories

1

ipwb

InterPlanetary Wayback: A distributed and persistent archive replay system using IPFS
Python
606
star
2

archivenow

A Tool To Push Web Resources Into Web Archives
Python
403
star
3

CarbonDate

Estimating the age of web resources
HTML
94
star
4

warrick

Recover lost websites from the Web Infrastructure
HTML
85
star
5

MemGator

A Memento Aggregator CLI and Server in Go
Go
55
star
6

sumgram

sumgram is a tool that summarizes a collection of text documents by generating the most frequent sumgrams (conjoined ngrams)
Python
55
star
7

tweetedat

TweetedAt tells the time of a tweet based on its tweet id
HTML
43
star
8

FollowerCountHistory

Crawler that grabs Twitter follower counts across time via internet archives given account user name
Python
31
star
9

ORS

Object Resource Stream and CDXJ Drafts
15
star
10

MementoEmbed

A service that provides archive-aware oEmbed-compatible embeddable surrogates (social cards, thumbnails, etc.) for archived web pages (mementos).
HTML
15
star
11

Reconstructive

A ServiceWorker for client-side reconstruction of composite mementos
JavaScript
13
star
12

QueryClassification

Source code for domain classification (scholar or non-scholar) of a web query.
Python
11
star
13

raintale

A Python utility for publishing a social media story built from archived web pages to multiple services.
Python
11
star
14

MementoMap

A Tool to Summarize Web Archive Holdings
Python
9
star
15

off-topic-memento-toolkit

This system evaluates a collection of mementos (archived web pages) to determine which are off topic. The collection can be part of an Archive-It collection, a single TimeMap, or stored in a WARC file.
Python
9
star
16

Memento-aware-Browser

Chromium based memento-aware browser
9
star
17

aiu

A library for interacting with web archive collections at Archive-It, Trove, Pandora, and more.
Python
8
star
18

tmvis

An archival thumbnail visualization server
JavaScript
7
star
19

web-memento-damage

Web service to estimate the damage that exists on a memento
JavaScript
6
star
20

Extract-URLs

6
star
21

archivefacebook

JavaScript
6
star
22

storygraph-suite

A collection of software used by StoryGraphs (http://storygraph.cs.odu.edu/)
Python
5
star
23

US-Congress

Twitter handles for US Congress
5
star
24

IPNS-Blockchain

An IPNS implementation using Blockchain with Memento support
5
star
25

Scholar-Groups

HTML
5
star
26

hypercane

A toolkit for developing algorithms that sample mementos from a web archive collection.
Python
5
star
27

mementos-fixity

Python
4
star
28

archive_profiler

Scripts to generate profiles of various Web archives
Python
4
star
29

acm-paper-template

A starter LaTeX template for ACM conferences such as JCDL
TeX
4
star
30

wdill

What Did It Look Like?
Python
4
star
31

wsdlthesis

ODU WS-DL Thesis/Dissertation LaTeX Template
TeX
3
star
32

oduwsdl.github.io

ODU Web Science and Digital Libraries Research Group (WS-DL) home page.
HTML
3
star
33

odusci-etd-template

ODU College of Sciences LaTeX template for Theses and Dissertations - Overleaf sync
TeX
3
star
34

dsa-puddles

This repository stores the stories, summaries, and other visualizations of the Dark and Stormy Archives Project.
2
star
35

NwalaTextUtils

Collection of functions for processing text
Python
2
star
36

archive_profiles

A repository for collecting profiles of various web archiving services and updating as they evolve.
HTML
2
star
37

SSAuth

Python
2
star
38

top-news-selectors

Top News Selectors (tns): Top news parsing from select websites
HTML
2
star
39

University-Twitter-Engagement

2
star
40

2020DemFollowerGraph

This repository contains Twitter follower growth graphs for 2020 Democratic Party Candidates.
JavaScript
2
star
41

accesslog-parser

Web server access log parser and CLI tool with added features for web archive replay logs
Python
1
star
42

TwitterLabels

Analyzing the issues such as missing labels, temporal violations in archived Twitter using @realDonaldTrump mementos.
R
1
star
43

Analysing-change-in-Twitter-UI

Analysing change in Twitter UI
Python
1
star
44

SampleURLs

A collection of various URI sample setst
1
star
45

access-patterns

Access patterns of robots vs. humans in the Internet Archive and Portuguese web archive using web archive access logs.
Shell
1
star
46

seed-analyzer

Scripts to analyze collection seeds for their diversity and entropy
Python
1
star
47

Recommending-Archived-Webpages

"Expanding the Usage of Web Archives by Recommending Archived Webpages using only the URI
"
1
star
48

MergeArabicNames

Python
1
star
49

2024-research-expo

1
star
50

dsa

Repository for the collective work of the Dark and Stormy Archives project.
Shell
1
star
51

storygraphbot

Python
1
star
52

2022-research-expo

2022 Web Science & Digital Libraries Research Group Expo
1
star
53

utils

Assorted utility scripts for various tasks
Python
1
star
54

2021-research-expo

2021 Web Science & Digital Libraries Research Group Expo -- 2021-04-12, noon-2:30pm EDT
1
star
55

offtopic-goldstandard-data

Data for testing the Offtopic detection software
Python
1
star