There are no reviews yet. Be the first to send feedback to the community and the maintainers!
aiflows
🤖🌊 aiFlows: The building blocks of your collaborative AIGoogleTrendsAnchorBank
Google Trends, made easy.GenIE
The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch.transformers-CFG
🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face TransformersSynthIE
The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Generation: SynthIE and the Case of Information Extraction".llm-latent-language
Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".homepage2vec
Language-Agnostic Website Embedding and ClassificationCr5
Code and data for the WSDM '19 paper "Crosslingual Document Embedding as Reduced-Rank Ridge Regression (Cr5)"GPTurk
quootstrap
Unsupervised method for extracting quotation-speaker pairs from large news corpora.GCD
YouNiverse
Code for the dataset paper: "YouNiverse: Large-Scale Channel and Video Metadata from English-Speaking YouTube"Quotebank
Code and data for the WSDM '21 paper "Quotebank: A Corpus of Quotations from a Decade of News"WikiHist.html
This is a repo containing all code and steps taken to download, setup the process and convert the whole English Wikipedia history from Wikitext to HTML format.understanding-decoding
The data and the PyTorch implementation for the models and experiments in the paper "Language Model Decoding as Likelihood–Utility Alignment".LAMEN
entity-matchers
Source code for "A Critical Re-evaluation of Neural Methods for Entity Alignment"pairformance
Tool to perform paired evaluation of automatic systemsWikiPDA
Crosslingual Topic Modeling with WikiPDAunfun
Code and data for the AAAI'19 paper "Reverse-Engineering Satire, or 'Paper on Computational Humor Accepted Despite Making Serious Advances'"invariant-language-models
A framework to train language models to learn invariant representations.eigenthemes
Source code for "Low-rank Subspaces for Unsupervised Entity Linking"causal-distances
property-inference-attacks
Modular framework for property inference attacks on deep neural networksGraphCyclesRemoval
Implementation of "A fast and effective heuristic for the feedback arc set problem"secvm-server
The server to collect user data and learn an SVM in the SecVM projectKLearn
BT-eval
Code to reproduce experiments of the ACL 2021 publication on the evaluation of NLP systems with the BT mechanismNegativity_in_2016_campaign
Code for the Paper "United States Politicians' Tone Became More Negative with 2016 Primary Campaigns"flows
The flows libraryamplification_paradox
This repo contains the simulation code for the paper "The Amplification Paradox in Recommender Systems"llm-grounding-analysis
distribution-inference-risks
Distribution Inference Risks: Identifying and Mitigating Sources of Leakagewiki_pageviews_covid
Data and code for the paper "Sudden Attention Shifts on Wikipedia During the COVID-19 Crisis"descartes
The PyTorch implementation for the models in the paper "Descartes: Generating Short Descriptions of Wikipedia Articles"nelight
laughing-head
Code for the laughing head papermdic
Code and data for the paper: "Message Distortion in Information Cascades" (TheWebConf2019)wiki_image_classification
Wikipedia Image Classification projectyoutube-embeddings
YouTube channel embeddings and social dimensions140_to_280
Repository for the paper “How Constraints Affect Content: The Case of Twitter's Switch from 140 to 280” published at ICWSM’18WikipediaAsWebGateway
CELMOC
Framework for Cost-Effective Language Model Choicequotebank-toolkit
Scripts for cleaning and enriching Quotebankpost-mortem-memory
broccoli-plugin
structuring-wikipedia-articles
Structuring Wikipedia Articles with Section Recommendationsfoodle-trends
when_sheep_shop
Repository for the article "When Sheep Shop: Measuring Herding Effects in Product Ratings with Natural Experiments" published at WWW2018manosphere_to_altright
DIPPS
anticipated-vs-actual
deplatforming_dataset
SpokespersonAttributionCOVID
Repository of code and data for the paper "The effect of spokesperson attribution on public health message sharing during the COVID-19 pandemic".WCNPruning
A framework to clean the Wikipedia category network.wikipedia-citation-engagement
Quantifying Engagement with Citations on Wikipedia https://arxiv.org/abs/2001.08614Love Open Source and this site? Check out how you can help us