• Stars
    star
    156
  • Rank 239,589 (Top 5 %)
  • Language
    Python
  • Created about 1 year ago
  • Updated 12 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning

📖 Paper

We propose a Multi-disciplinary Collaboration (MC) framework. The framework works in five stages: (i) expert gathering: gather experts from distinct disciplines according to the clinical question; (ii) analysis proposition: domain experts put forward their own analysis with their expertise; (iii) report summarization: compose a summarized report on the basis of a previous series of analyses; (iv) collaborative consultation: engage the experts in discussions over the summarized report. The report will be revised iteratively until an agreement from all the experts is reached; (v) decision making: derive a final decision from the unanimous report.

Requirements

Install all required python dependencies:

pip install -r requirements.txt

Data

We evaluate our MC framework on two benchmark datasets MedQA, MedMCQA, and PubMedQA, as well as six subtasks most relevant to the medical domain from MMLU datasets including anatomy, clinical knowledge, college medicine, medical genetics, professional medicine, and college biology.

Please check our Google Drive: https://drive.google.com/file/d/11qNzDYIlimGGJ1fhQn2ux6w_rfFgJbyo/view?usp=sharing

Implementations

Input your own openai api key in api_utils.py.

sh inference.sh

Cite Us

If you find this project useful, please cite the following paper:

@article{tang2023MedAgents,
      title={ML-MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning}, 
      author={Xiangru Tang and Anni Zou and Zhuosheng Zhang and Yilun Zhao and Xingyao Zhang and Arman Cohan and Mark Gerstein},
      year={2023},
      journal={arXiv preprint arXiv:2311.10537},
}

Acknowledgments

We thank Ziming Li (HKUST) and Junlong Li (Shanghai Jiao Tong University) for participating in discussions at the early stages of the project.

More Repositories

1

ML-Bench

The Official Repo of ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.09835)
Python
353
star
2

ThermoNet

ThermoNet is a computational method for quantitative prediction of the impact of single-point mutations on protein thermodynamic stability. The core algorithm of ThermoNet is an ensemble of deep 3D convolutional neural networks.
Python
107
star
3

Struc-Bench

Python
42
star
4

BioCoder

Jupyter Notebook
34
star
5

SAMChain

Python
34
star
6

MUSIC

MUltiScale enrIchment Calling for ChIP-Seq Datasets
C++
21
star
7

MolLM

A Unified Language Model to Integrate Biomedical Text with 2D and 3D Molecular Representations
Python
19
star
8

GenAI4Drug

19
star
9

aloft

ALOFT, the Annotation Of Loss-of-Function Transcripts, provides extensive functional annotations to loss-of-function variants in the human genome.
Python
18
star
10

MatchedFilter

Prediction of Active Enhancers and Promoters in a Cell type specific manner using shape matching of histone and/or DNase-I hypersensitivity profiles
Python
16
star
11

forest-fire-clustering

A clustering algorithm that can perform internal validation inspired by forest fire dynamics and self-organized criticality
Python
16
star
12

DeepVelo

Jupyter Notebook
14
star
13

starrpeaker

STARRPeaker: STARR-seq peak caller
Python
14
star
14

SVFX

Machine learning framework to quantify pathogenicity of structural variants
Python
12
star
15

siglasso

Optimizing Cancer Mutation Signatures Jointly with Sampling Likelihood
R
10
star
16

HiC-spector

Spectral and reproducibility analysis of Hi-C contact maps
Julia
10
star
17

LESSeq

Local Event-based analysis of alternative Splicing using RNA-Seq
C
7
star
18

idash19he

C++
7
star
19

STRESS

C
7
star
20

FunSeq2

A flexible framework to annotate and prioritize cancer somatic mutations.
Perl
7
star
21

PsychENCODE-DSPN

#DSPN
MATLAB
6
star
22

topicnet

R
5
star
23

entexBERT

The DNABERT model for ENTEx
Python
5
star
24

OrthoClust

Julia
5
star
25

texp

TeXP is a pipeline to gauge the autonomous transcription level of L1 subfamilies using short read RNA-seq data
Makefile
5
star
26

LARVA

Large-scale Analysis of Recurrent Variants in noncoding Annotations
C++
4
star
27

PeakSeq

Identifying and ranking peak regions in ChIP-Seq experiments
C++
4
star
28

MrTADFinder

a method to identify TADs in Hi-C data
Julia
4
star
29

UMG

A network propagation method to prioritize long tail genes in cancer
Python
3
star
30

RSEQtools

A modular framework to analyze RNA-Seq data using compact and anonymized data summaries.
C
3
star
31

RADAR

RADAR source code
Python
3
star
32

step-back-profiling

Jupyter Notebook
3
star
33

mhealthCI

HTML
3
star
34

FusionSeq

Identifying fusion transcripts by using paired-end RNA Sequencing
C
2
star
35

MEDIQA-Chat-2023

MEDIQA-Chat-2023 GersteinLab solution
Jupyter Notebook
2
star
36

papers.gersteinlab.org

Go
2
star
37

decoasthma

Code to reproduce the analyses described in the manuscript, "Approaches for integrating heterogeneous RNA-seq data reveals cross-talk between microbes and genes in asthmatic patients"
HTML
2
star
38

idash22bc

Solution to track 1 of iDASH 2022 secure genome analysis competition
JavaScript
2
star
39

shiny-dim-reduction

This project applies dimensionality reduction to tabular data and generates applications for visualizing the results.
R
2
star
40

idash19bc

Solution to iDASH'19 challenge track 1
JavaScript
2
star
41

vat

Variant Annotation Tool
PHP
2
star
42

FANCY

Predicting privacy risk of functional genomics data
Jupyter Notebook
2
star
43

retrodup

An integrated retroduplication caller based on 1) exon-exon junction and 2) discordant reads
Perl
2
star
44

chronODE

Jupyter Notebook
2
star
45

HotCommics

C
1
star
46

Frustration

Python
1
star
47

uORFs

Output files and source code for our project predicting functional upstream open reading frames.
Python
1
star
48

PLIGHT

Privacy Leakage through Inference across Genotypic HMM Trajectories
Python
1
star
49

Public_Documents

A Repository for Gerstein Lab public documents.
1
star
50

Loregic

R
1
star
51

nimbus

R
1
star
52

ABCD

Python
1
star
53

TIP

TIP
R
1
star
54

MOAT

Mutations Overburdening Annotations Tool
C++
1
star
55

IQSeq

Isoform quantification from RNA-seq data
C
1
star
56

dAsthma

R
1
star
57

scan-atac-sim

SCAN-ATAC Sim is a single-cell ATAC-seq data simulator used to benchmark various single-cell ATAC-seq data analysis methods.
C++
1
star
58

MOBI

Shell
1
star
59

GRAM

GRAM: A GeneRAlized Model to predict the molecular effect of a non-coding variant in a cell type-specific manner
R
1
star
60

idash20FL

Jupyter Notebook
1
star
61

Factual_ClinicalSumm

Python
1
star