allennlp
An open-source NLP research library, built on PyTorch.
OLMo
Modeling, training, eval, and inference code for OLMo
RL4LMs
A modular RL library to fine-tune language models to human preferences
longformer
Longformer: The Long-Document Transformer
bilm-tf
Tensorflow implementation of contextualized word representations from bi-directional language models
scispacy
A full spaCy pipeline and models for scientific/biomedical documents.
bi-att-flow
Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granularity and uses a bi-directional attention flow mechanism to achieve a query-aware context representation without early summarization.
scibert
A BERT model for scientific text.
open-instruct
ai2thor
An open-source platform for Visual AI.
dolma
Data and tools for generating and inspecting OLMo pre-training data.
XNOR-Net
ImageNet classification using binary Convolutional Neural Networks
s2orc
S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/
mmc4
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
scitldr
objaverse-xl
Objaverse-XL is a Universe of 10M+ 3D Objects. Contains API Scripts for Downloading and Processing!
papermage
library supporting NLP and CV research on scientific papers
natural-instructions
Expanding natural instructions
visprog
Official code for VisProg (CVPR 2023 Best Paper!)
science-parse
Science Parse parses scientific papers (in PDF form) and returns them in structured form.
pdffigures2
Given a scholarly PDF, extract figures, tables, captions, and section titles.
writing-code-for-nlp-research-emnlp2018
A companion repository for the "Writing code for NLP Research" Tutorial at EMNLP 2018
tango
Organize your experiments into discrete steps that can be cached and reused throughout the lifetime of your research project.
allennlp-models
Officially supported AllenNLP models
specter
SPECTER: Document-level Representation Learning using Citation-informed Transformers
dont-stop-pretraining
Code associated with the Don't Stop Pretraining ACL 2020 paper
unified-io-2
macaw
Multi-angle c(q)uestion answering
lumos
Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"
document-qa
scholarphi
An interactive PDF reader.
deep_qa
A deep NLP library, based on Keras / tf, focused on question answering (but useful for other NLP too)
acl2018-semantic-parsing-tutorial
Materials from the ACL 2018 tutorial on neural semantic parsing
unifiedqa
UnifiedQA: Crossing Format Boundaries With a Single QA System
pawls
Software that makes labeling PDFs easy.
OLMoE
OLMoE: Open Mixture-of-Experts Language Models
kb
KnowBert -- Knowledge Enhanced Contextual Word Representations
PeerRead
Data and code for Kang et al., NAACL 2018's paper titled "A Dataset of Peer Reviews (PeerRead): Collection, Insights and NLP Applications"
reward-bench
RewardBench: the first evaluation tool for reward models.
naacl2021-longdoc-tutorial
openie-standalone
Quality information extraction at web scale.
Holodeck
CVPR 2024: Language Guided Generation of 3D Embodied AI Environments.
python-package-template
A template repo for Python packages
allenact
An open source framework for research in Embodied-AI from AI2.
ir_datasets
Provides a common interface to many IR ranking datasets.
s2orc-doc2json
Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)
acl2022-zerofewshot-tutorial
OLMo-Eval
Evaluation suite for LLMs
procthor
Scaling Embodied AI by Procedurally Generating Interactive 3D Houses
fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
FineGrainedRLHF
beaker-cli
A collaborative platform for rapid and reproducible research.
comet-atomic-2020
spv2
Science-parse version 2
scifact
Data and models for the SciFact verification task.
objaverse-rendering
Scripts for rendering Objaverse
ScienceWorld
ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.
unified-io-inference
allennlp-demo
Code for the AllenNLP demo.
citeomatic
A citation recommendation system that allows users to find relevant citations for their paper drafts. The tool is backed by Semantic Scholar's OpenCorpus dataset.
cartography
Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics
savn
Learning to Learn how to Learn: Self-Adaptive Visual Navigation using Meta-Learning (https://arxiv.org/abs/1812.00971)
vampire
Variational Methods for Pretraining in Resource-limited Environments
vila
Incorporating VIsual LAyout Structures for Scientific Text Classification
s2-folks
Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions.
hidden-networks
cord19
Get started with CORD-19
mmda
multimodal document analysis
PRIMER
The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization
catwalk
This project studies the performance and robustness of language models and task-adaptation methods.
dnw
Discovering Neural Wirings (https://arxiv.org/abs/1906.00586)
deepfigures-open
Companion code to the paper "Extracting Scientific Figures with Distantly Supervised Neural Networks"
tpu_pretrain
LM Pretraining with PyTorch/TPU
allentune
Hyperparameter Search for AllenNLP
SciREX
Data/Code Repository for https://api.semanticscholar.org/CorpusID:218470122
scidocs
Dataset accompanying the SPECTER model
lm-explorer
interactive explorer for language models
pdffigures
Command line tool to extract figures, tables, and captions from scholarly documents in PDF form.
OpenBookQA
Code for experiments on OpenBookQA from the EMNLP 2018 paper "Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book Question Answering"
peS2o
Pretraining Efficiently on S2ORC!
gooaq
Question-answers, collected from Google
allennlp-as-a-library-example
A simple example for how to build your own model using AllenNLP as a dependency.
embodied-clip
Official codebase for EmbCLIP
multimodalqa
alexafsm
With alexafsm, developers can model dialog agents with first-class concepts such as states, attributes, transition, and actions. alexafsm also provides visualization and other tools to help understand, test, debug, and maintain complex FSM conversations.
allennlp-semparse
A framework for building semantic parsers (including neural module networks) with AllenNLP, built by the authors of AllenNLP
scicite
Repository for NAACL 2019 paper on Citation Intent prediction
ai2thor-rearrangement
Visual Room Rearrangement
commonsense-kg-completion
medicat
Dataset of medical images, captions, subfigure-subcaption annotations, and inline textual references
real-toxicity-prompts
s2search
The Semantic Scholar Search Reranker
aristo-mini
Aristo mini is a light-weight question answering system that can quickly evaluate Aristo science questions with an evaluation web server and the provided baseline solvers.
gpv-1
A task-agnostic vision-language architecture as a step towards General Purpose Vision
flex
Few-shot NLP benchmark for unified, rigorous eval
elastic
manipulathor
ManipulaTHOR, a framework that facilitates visual manipulation of objects using a robotic arm
spoc-robot-training
SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World
propara
ProPara (Process Paragraph Comprehension) dataset and models
ARC-Solvers
ARC Question Solvers