awesome-document-similarity
A curated list of resources on document similarity measures (papers, tutorials, code, ...)
pytorch-bert-document-classification
Enriching BERT with Knowledge Graph Embedding for Document Classification (PyTorch)
scincl
Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)
llm-datasets
A collection of datasets for language model pretraining, including scripts for downloading, preprocessing, and sampling.
semantic-document-relations
Implementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between Wikipedia Articles"
legal-document-similarity
Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal Literature Recommendations"
clp-transfer
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning
aspect-document-embeddings
Code, dataset & models for the paper Specialized Document Embeddings for Aspect-based Similarity of Research Papers (#JCDL2022)
german-language-models
A collection of German GPT language models
awesome-contrastive-learning-for-nlp
A collection of papers about contrastive learning for natural language processing.
wikipedia-article-recommendations
Survey data and Python code for the ICADL 2021 paper "A Qualitative Evaluation of User Preference for Link-based vs. Text-based Recommendations of Wikipedia Articles"
getting-started
covid-vaccination-appointment
Leaflet.Sim
Leaflet.Sim is a framework for location-based simulations with Leaflet maps that can visualise moving markers, which can change their style, and events over time on a map.
emnlp2022-papers
finetune-evaluation-harness
CmdLineSlideShow
Command line script for generating rich slide shows from a set of images with transition effects and audio, using ImageMagick and FFmpeg.
Wikipedia2Lucene
Import a Wikipedia XML Dump from HDFS to Lucene index or Elasticsearch and retrieve similar Wikipedia articles based on Lucene's MoreLikeThis query.
kibana-reallybettermap
Multiple locations for Kibana's bettermap panel
news-visualization
News visualization with Elasticsearch and Kibana, including NER, sentiment analysis and geo locations.
data-sourcing
turkish-lm-bias
Investigating Gender Bias in Turkish Language Models