  • This repository has been archived on 03/Jul/2023
  • Stars: 148
  • Rank 248,510 (Top 5 %)
  • Language: Jupyter Notebook
  • License: MIT License
  • Created over 4 years ago
  • Updated over 2 years ago

Repository Details

ReinventCommunity (Jupyter notebook tutorials for REINVENT 3.2)

This repository is a collection of useful Jupyter notebooks, code snippets and example JSON files illustrating the use of REINVENT 3.2. At the moment, the following notebooks are supported:

  • Complete_Use-Case_DRD2_Demo: a full-fledged use case using public data on DRD2, including use of predictive models and elucidating general considerations
  • Create_Model_Demo: explains how to initialize a new model (prior / agent) for REINVENT, which can then be trained in a transfer learning setup
  • Data_Preparation: tutorial on how to prepare (clean, filter and standardize) data from a source such as ChEMBL to be used for training
  • Model_Building_Demo: shows how to train a predictive (QSAR) model to be used with REINVENT based on the public DRD2 dataset (classification problem)
  • Reinforcement_Learning_Demo: example reinforcement learning run that uses a selection of scoring function components to iteratively generate novel compounds with ever higher scores
  • Reinforcement_Learning_Demo_Selectivity: example illustrating the use of the relatively complicated selectivity_component to optimize potency against a target while simultaneously pushing for a low potency against one or more off-targets
  • Reinforcement_Learning_Demo_Tanimoto: very simple (only 1, easy-to-understand component) transfer learning example
  • Reinforcement_Learning_Exploitation_Demo: illustrates the exploitation scenario, where the aim is to find solutions within an already well-defined subspace of chemical space
  • Reinforcement_Learning_Exploration_Demo: illustrates the exploration scenario, where the aim is to generate a varied set of solutions to a less stringently defined problem
  • Reinforcement_Learning_Demo_DockStream: illustrates the use of DockStream in REINVENT, allowing the generative model to gradually optimize the docking score of proposed compounds. For more information on DockStream, please see the DockStream repository and the corresponding DockStreamCommunity repository for tutorial notebooks on DockStream as a standalone molecular docking tool.
  • Reinforcement_Learning_Demo_Icolos: illustrates the use of Icolos in REINVENT using a docking scenario.
  • Sampling_Demo: once an agent has been trained and is producing interesting results, it can be used to generate more compounds without actually changing it further - this is facilitated by the sampling mode
  • Score_Transformations: many components produce scores on an arbitrary scale, but REINVENT needs each score normalized to a number between 0 and 1 (with values close to 1 meaning "good"); this tutorial shows how to use the score transformations implemented for that purpose
  • Scoring_Demo: shows the scoring mode, which scores a set of existing compound definitions (for example, prior to starting a project) with a scoring function definition
  • Transfer_Learning_Demo: this tutorial illustrates the transfer learning mode, which usually is used to "pre-train" an agent before reinforcement learning in case no adequate naive prior is available or to focus an already existing agent further
  • Transfer_Learning_Demo_Teachers_Forcing: same as Transfer_Learning_Demo above, with an explanation of teacher forcing
  • Lib-INVENT_RL1_QSAR: Lib-INVENT example reinforcement learning run using a QSAR model
  • Lib-INVENT_RL2_QSAR_RF: Lib-INVENT example reinforcement learning run using a random forest (RF) QSAR model
  • Lib-INVENT_RL3_ROCS_RF: Lib-INVENT example reinforcement learning using OpenEye's ROCS 3D similarity (requires an OpenEye license)
  • Link-INVENT_RL: Link-INVENT example reinforcement learning
  • Automated_Curriculum_Learning_demo: illustrates the automated curriculum learning running mode. The example demonstrates how to set up a curriculum to guide the REINVENT agent to sample a target molecular scaffold. This scenario represents a complex objective, as the target scaffold is not present in the training set for the prior model
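
The normalization described under Score_Transformations can be sketched in a few lines of plain Python. This is only an illustration of the idea, not REINVENT's own implementation: the function names `sigmoid_transform` and `reverse_sigmoid_transform`, the `low` / `high` / `steepness` parameters and the example docking score are all assumptions made for this sketch.

```python
import math


def sigmoid_transform(raw, low, high, steepness=10.0):
    """Map a raw score to a value between 0 and 1 (higher raw = closer to 1).

    `low` and `high` mark the raw-score window in which the output changes
    most; both names are hypothetical and chosen for this sketch only.
    """
    midpoint = (low + high) / 2.0
    # Express the distance from the midpoint in units of the window width.
    z = steepness * (raw - midpoint) / (high - low)
    # Clip so that math.exp never overflows for extreme raw values.
    z = max(-30.0, min(30.0, z))
    return 1.0 / (1.0 + math.exp(-z))


def reverse_sigmoid_transform(raw, low, high, steepness=10.0):
    """Same curve, mirrored: lower raw values end up closer to 1."""
    return 1.0 - sigmoid_transform(raw, low, high, steepness)


# A raw score exactly in the middle of the window maps to 0.5.
mid_score = sigmoid_transform(5.0, low=0.0, high=10.0)

# Hypothetical docking score, where more negative means "better":
docking_score = reverse_sigmoid_transform(-9.5, low=-12.0, high=-6.0)
```

The mirrored variant is useful for components such as docking scores, where a lower raw value is the desirable outcome but the normalized score still has to approach 1 for "good" compounds.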

More Repositories

  1. aizynthfinder (Python, 565 stars): A tool for retrosynthetic planning
  2. GraphINVENT (Python, 361 stars): Graph neural networks for molecular design.
  3. Reinvent (Python, 335 stars)
  4. Chemformer (Python, 202 stars)
  5. REINVENT4 (Python, 165 stars): AI molecular design tool for de novo design, scaffold hopping, R-group replacement, linker design and molecule optimization.
  6. deep-molecular-optimization (Python, 132 stars): Molecular optimization by capturing chemist's intuition using the Seq2Seq with attention and the Transformer
  7. DockStream (Python, 91 stars): DockStream: A Docking Wrapper to Enhance De Novo Molecular Design
  8. QSARtuna (Jupyter Notebook, 77 stars): QSARtuna: QSAR model building with the optuna framework
  9. PaRoutes (Python, 62 stars): Home of the PaRoutes framework for benchmarking multi-step retrosynthesis predictions.
  10. reaction_utils (Python, 62 stars): Utilities for working with datasets of chemical reactions, reaction templates and template extraction.
  11. pysmilesutils (Python, 56 stars): Utilities for working with SMILES based encodings of molecules for deep learning (PyTorch oriented)
  12. Icolos (Python, 53 stars): Icolos: A workflow manager for structure based post-processing of de novo generated small molecules
  13. Lib-INVENT (Jupyter Notebook, 49 stars)
  14. MolBART (Python, 43 stars): Pretrained SMILES transformation model for finetuning for diverse molecular tasks.
  15. maize (Python, 31 stars): A graph-based workflow manager for computational chemistry pipelines
  16. DockStreamCommunity (Jupyter Notebook, 24 stars)
  17. aizynthtrain (Python, 21 stars): Tools to train synthesis prediction models
  18. reinvent-hitl (Python, 21 stars): Code for paper "Human-in-the-Loop Assisted de Novo Molecular Design".
  19. route-distances (Python, 20 stars): Tools and routines to calculate distances between synthesis routes and to cluster them.
  20. Deep-Drug-Coder (Python, 17 stars)
  21. Lib-INVENT-dataset (Python, 15 stars)
  22. SMILES-RL (Python, 12 stars)
  23. reinvent-scoring (Python, 10 stars)
  24. NonadditivityAnalysis (Jupyter Notebook, 9 stars): Notebook for standardization of activity data, nonadditivity analysis and its evaluation.
  25. Levenshtein (Python, 8 stars): Levenshtein SMILES augmentation for reaction datasets
  26. Siamese-RNN-Self-Attention (Python, 7 stars): Contains code for Siamese Recurrent Neural Network with Self-Attention for Bioactivity Prediction
  27. IcolosCommunity (Jupyter Notebook, 6 stars): Repository contains jupyter notebooks illustrating the use of the Icolos workflow manager
  28. reaction-graph-link-prediction (Python, 6 stars)
  29. maize-contrib (Python, 5 stars): Contributed and additional nodes for maize
  30. MMP_project (Jupyter Notebook, 5 stars): Code for paper
  31. reinvent-chemistry (Python, 5 stars)
  32. molwall (Python, 4 stars): MolWall: "Wall of molecules" interface to see and rate molecules
  33. reinvent-models (Python, 4 stars)
  34. reinforcement-learning-active-learning (Python, 4 stars)
  35. IcolosData (Rich Text Format, 1 star): Contains the data required for the example workflows and jupyter notebooks utilizing the Icolos workflow manager
  36. reinvent-scoring-gpflow (Python, 1 star): Code for paper "Human-in-the-Loop Assisted de Novo Molecular Design".