  • Stars
    341
  • Rank 119,890 (Top 3 %)
  • Language
    Jupyter Notebook
  • License
    Apache License 2.0
  • Created about 4 years ago
  • Updated about 1 year ago

Repository Details

API & web app to answer questions about COVID-19, using NLP (question answering) and trusted data sources.

This open source project serves two purposes.

  1. Collection and evaluation of a Question Answering dataset to improve existing QA/search methods - COVID-QA
  2. Question matching capabilities: Provide trustworthy answers to questions about COVID-19 via NLP - outdated

COVID-QA

Update 14th April, 2020: We are open sourcing the first batch of SQuAD-style question answering annotations. Thanks to Tony Reina for managing the process and to the many professional annotators who spent valuable time looking through COVID-related research papers.
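To give a sense of what "SQuAD style" annotations look like, here is a minimal sketch that walks a SQuAD-layout JSON document and checks that each answer's character offset points at the annotated text. The field names (`data`, `paragraphs`, `context`, `qas`, `answers`, `answer_start`) follow the public SQuAD convention; the record itself is a made-up placeholder, not an entry from the COVID-QA dataset, and the helper name `iter_examples` is hypothetical.

```python
import json

# Hypothetical one-record example in the SQuAD layout. The text is invented
# for illustration and is NOT taken from the COVID-QA dataset.
squad_json = """
{
  "data": [
    {
      "title": "example-paper",
      "paragraphs": [
        {
          "context": "The incubation period is typically around five days.",
          "qas": [
            {
              "id": "q1",
              "question": "How long is the incubation period?",
              "answers": [{"text": "around five days", "answer_start": 35}]
            }
          ]
        }
      ]
    }
  ]
}
"""


def iter_examples(squad):
    """Yield (question, answer_text, offset_ok) for every answer in a SQuAD-style dict."""
    for article in squad["data"]:
        for paragraph in article["paragraphs"]:
            context = paragraph["context"]
            for qa in paragraph["qas"]:
                for answer in qa["answers"]:
                    start = answer["answer_start"]
                    text = answer["text"]
                    # The annotated span must match the context at the given offset.
                    offset_ok = context[start:start + len(text)] == text
                    yield qa["question"], text, offset_ok


examples = list(iter_examples(json.loads(squad_json)))
for question, text, ok in examples:
    print(question, "->", text, "(offset ok)" if ok else "(offset mismatch)")
```

A check like this is a cheap sanity pass before feeding annotations to an extractive QA model, since misaligned offsets are a common source of silent training errors.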

FAQ matching

Update 17th June, 2020: As the pandemic is thankfully slowing down and other information sources have caught up, we decided to take our hosted API and UI offline. We will keep the repository here as an inspiration for other projects and to share the COVID-QA dataset.

⚑ Problem

  • People have many questions about COVID-19
  • Answers are scattered on different websites
  • Finding the right answers takes a lot of time
  • Trustworthiness of answers is hard to judge
  • Many answers quickly become outdated

💡 Idea

  • Aggregate FAQs and texts from trustworthy data sources (WHO, CDC ...)
  • Provide a UI where people can ask questions
  • Use NLP to match incoming questions of users with meaningful answers
  • Users can provide feedback about answers to improve the NLP model and flag outdated or wrong answers
  • Display most common queries without good answers to guide data collection and model improvements

βš™οΈ Tech

  • Scrapers to collect data
  • Elasticsearch to store texts, FAQs, embeddings
  • NLP models implemented via Haystack to find answers by a) detecting similar questions in FAQs and b) detecting answers in free texts (extractive QA)
  • React Frontend
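
The FAQ-matching idea (embed the incoming question, compare it with stored FAQ questions, return the answer of the closest one) can be sketched as follows. This is an illustration under stated assumptions, not the project's implementation: it uses neither Haystack nor Elasticsearch, the FAQ entries are invented, and the word-count "embedding" is a toy stand-in for a learned sentence embedding. The names `embed`, `cosine`, `match_faq` and the `threshold` value are hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical FAQ entries; in the described setup these would be scraped
# from trusted sources and stored alongside their embeddings.
FAQS = [
    ("What are the symptoms of COVID-19?",
     "Common symptoms include fever, cough and tiredness."),
    ("How does the virus spread?",
     "It spreads mainly through respiratory droplets."),
    ("Should I wear a mask?",
     "Follow the guidance of your local health authority."),
]


def embed(text):
    """Toy embedding: lowercase word counts (a stand-in for a sentence embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def match_faq(query, faqs=FAQS, threshold=0.3):
    """Return (faq_question, answer, score) for the closest FAQ, or None if nothing is close enough."""
    query_vec = embed(query)
    scored = [(cosine(query_vec, embed(question)), question, answer)
              for question, answer in faqs]
    score, question, answer = max(scored, key=lambda item: item[0])
    if score < threshold:
        return None  # no good match: a candidate for the "queries without good answers" list
    return question, answer, score


print(match_faq("what symptoms does covid-19 have"))
print(match_faq("best pizza topping"))
```

Returning `None` below a threshold, rather than always returning the nearest FAQ, is what makes it possible to log unanswered queries and use them to guide data collection.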

More Repositories

  1. haystack (Python, 13,738 stars)
     🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
  2. FARM (Python, 1,723 stars)
     🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
  3. haystack-tutorials (Jupyter Notebook, 203 stars)
     Here you can find all the Tutorials for Haystack 📓
  4. haystack-cookbook (Jupyter Notebook, 122 stars)
     👩🏻‍🍳 A collection of example notebooks
  5. prompthub (Go, 114 stars)
  6. haystack-demos (Python, 102 stars)
     Fully working applications that demonstrate how to use Haystack to implement common NLP use cases
  7. haystack-core-integrations (Python, 60 stars)
     Additional packages (components, document stores and the likes) to extend the capabilities of Haystack version 2.0 and onwards
  8. rasa-haystack (Python, 45 stars)
  9. haystack-integrations (32 stars)
     🚀 A list of Haystack Integrations, maintained by the community or deepset.
  10. haystack-website (TypeScript, 31 stars)
      Contents moved to https://github.com/deepset-ai/haystack-home
  11. canals (Python, 27 stars)
      A component orchestration engine
  12. haystack-search-pipeline-streamlit (Python, 21 stars)
      🚀 Template Haystack Search Application with Streamlit
  13. hayhooks (Python, 18 stars)
      Deploy Haystack pipelines behind a REST Api.
  14. bert-tensorflow (Python, 18 stars)
      original Google AI BERT repro (Tensorflow) with TPU Tensorboard logging and LAMB optimizer
  15. haystack-sagemaker (Jupyter Notebook, 13 stars)
      🚀 This repo is a showcase of how you can use models deployed on AWS SageMaker in your Haystack Retrieval Augmented Generative AI pipelines
  16. haystack-home (HTML, 12 stars)
      Website for Haystack, the open source LLM framework
  17. haystack-helm (Smarty, 8 stars)
      Repo containing haystack helm charts
  18. deepset-cloud-sdk (Python, 8 stars)
      A Python SDK to interact with deepset Cloud
  19. visualizer-source-text (5 stars)
      Source text for deepset Cloud pipeline nodes
  20. document-store (Python, 5 stars)
  21. biqa-llm (Python, 5 stars)
  22. haystack-rest-api (Python, 4 stars)
  23. templates (3 stars)
      Usable templates for your work.
  24. prompthub-py (Python, 3 stars)
  25. charts (Smarty, 1 star)
  26. haystack-json-schema (1 star)
      JSON Schema for validating Haystack Pipeline YAML files
  27. .github (1 star)