covid_answers

Covid19 answers is a web application that provides a practical way to query research papers about the coronavirus from the Kaggle CORD-19 dataset.

Architecture

The architecture focuses on a pipeline that allows rapid iteration, since new papers are constantly being added. It is being developed on Google Cloud Platform, with public Docker images hosted on Docker Hub, so the app can be deployed anywhere.

The result of this pipeline is an app built around the following three modules:

  • UI built with the Plotly Dash framework to query documents.
  • API that returns relevant documents for an input question or topic.
  • AI server with natural language models that create embeddings and answer questions.
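
As a rough illustration of how the API module could rank documents for a question, here is a minimal sketch. It is not the project's code: `embed`, `cosine`, and `relevant_documents` are hypothetical names, the bag-of-words vector is only a stand-in for the embeddings produced by the AI server, and the exhaustive scan stands in for the Annoy index.

```python
import math
from collections import Counter


def embed(text):
    """Toy stand-in for the AI server: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def relevant_documents(question, documents, top_k=2):
    """Return up to top_k document titles, most similar first.

    `documents` maps a title to its text. Documents that share no
    token with the question are dropped.
    """
    query_vector = embed(question)
    scored = [
        (cosine(query_vector, embed(text)), title)
        for title, text in documents.items()
    ]
    scored.sort(reverse=True)
    return [title for score, title in scored[:top_k] if score > 0]


# Hypothetical sample data, not taken from the CORD-19 dataset.
papers = {
    "paper_a": "incubation period of the virus",
    "paper_b": "mask effectiveness in hospitals",
    "paper_c": "vaccine trial results",
}
top = relevant_documents("virus incubation period", papers, top_k=1)
```

In the real app the UI would send the question to the API, and the API would delegate embedding to the AI server rather than compute it locally.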

Ideas and work in progress

  • Refactor Annoy index creation and test a Kubeflow pipeline.
  • Consider Annoy alternatives such as Faiss, or a search engine such as Elasticsearch.
  • Remove the document index from the webapp image. Create a new API to query similar documents.
  • Add QA functionality to the UI.
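
One possible way to approach the index-related ideas above is to hide the index behind a small interface, so the web app no longer ships the index and the backend (Annoy, Faiss, or a search engine) can be swapped. The sketch below is an assumption about how that could look, not the planned design: `SimilarityIndex`, `BruteForceIndex`, and `similar_documents` are hypothetical names, and the brute-force backend is an exact placeholder for an approximate index.

```python
from __future__ import annotations

import math
from typing import Protocol, Sequence


class SimilarityIndex(Protocol):
    """Anything that can return the positions of the k nearest vectors."""

    def query(self, vector: Sequence[float], k: int) -> list[int]: ...


class BruteForceIndex:
    """Exact nearest-neighbour search by angular distance.

    Placeholder backend: an Annoy- or Faiss-backed class exposing the
    same `query` method could replace it without touching callers.
    """

    def __init__(self, vectors: Sequence[Sequence[float]]) -> None:
        self._vectors = [list(v) for v in vectors]

    @staticmethod
    def _angular(a: Sequence[float], b: Sequence[float]) -> float:
        dot = sum(x * y for x, y in zip(a, b))
        norm_a = math.sqrt(sum(x * x for x in a))
        norm_b = math.sqrt(sum(y * y for y in b))
        if norm_a == 0 or norm_b == 0:
            return 1.0
        return 1.0 - dot / (norm_a * norm_b)

    def query(self, vector: Sequence[float], k: int) -> list[int]:
        ranked = sorted(
            range(len(self._vectors)),
            key=lambda i: self._angular(vector, self._vectors[i]),
        )
        return ranked[:k]


def similar_documents(index: SimilarityIndex, doc_ids, vector, k=3):
    """What a 'similar documents' API handler could return for a vector."""
    return [doc_ids[i] for i in index.query(vector, k)]


# Hypothetical two-dimensional embeddings for three papers.
index = BruteForceIndex([[1.0, 0.0], [0.0, 1.0], [0.9, 0.1]])
neighbours = similar_documents(index, ["p1", "p2", "p3"], [1.0, 0.0], k=2)
```

Keeping the handler dependent only on the `query` method means the web app image would need just the API client, while the index itself lives with the service that builds it.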

Covid19 Icon made by Freepik from www.flaticon.com