Existing Workflows
This section provides general descriptions of existing PerMedCoE Workflows, as well as links to Workflow GitHub repositories.
Basic examples
A basic example application comprised of a single building block can be found in this repository.
A more complex application involving several building blocks that internally
call gromacs
can be found here.
Tip
Note that, for these basic examples, the workflow repository contains both the Building Block and the application. In other examples, Building Blocks are maintained in a separate repository.
COVID-19 Multiscale Modelling of the Virus and Patients’ Tissue
Uses multiscale simulations to predict patient-specific SARS‑CoV‑2 severity subtypes (moderate, severe or control), using single-cell RNA-Seq data, MaBoSS and PhysiBoSS. Boolean models are used to determine the behaviour of individual agents as a function of extracellular conditions and the concentration of different substrates, including the number of virions. Predictions of severity subtypes are based on a meta-analysis of personalised model outputs simulating cellular apoptosis regulation in epithelial cells infected by SARS‑CoV‑2.
The workflow uses the following building blocks, described in order of execution:
High-throughput mutant analysis
Single-cell processing
Personalise patient
PhysiBoSS
Analysis of all simulations
For details on individual workflow steps, see the user documentation for each building block.
Drug Synergies Screening
This pipeline simulates a drug screening on personalised cell line models. It automatically builds Boolean models of interest, then uses cell lines data (expression, mutations, copy number variations) to personalise them as MaBoSS models. Finally, this pipeline simulates multiple drug intervention on these MaBoSS models, and lists drug synergies of interest.
The workflow uses the following building blocks, described in order of execution:
Build model from species
Personalise patient
MaBoSS
Print drug results
For details on individual workflow steps, see the user documentation for each building block. GitHub repository
Single drug prediction
Complementarily, the workflow supports single drug response predictions to provide a baseline prediction in cases where drug response information for a given drug and cell line is not available. As an input, the workflow needs basal gene expression data for a cell, the drug targets (they need to be known for untested drugs) and optionally CARNIVAL features (sub-network activity predicted with CARNIVAL building block) and predicts log(IC50) values. This workflow uses a custom matrix factorization approach built with Google JAX and trained with gradient descent. The workflow can be used both for training a model, and for predicting new drug responses.
The workflow uses the following building blocks in order of execution (for training a model):
- Carnival_gex_preprocess
Preprocessed the basal gene expression data from GDSC. The input is a matrix of Gene x Sample expression data.
- Progeny
Using the preprocessed data, it estimates pathway activities for each column in the data (for each sample). It returns a matrix of Pathways x Samples with activity values for 11 pathways.
- Omnipath
It downloads latest Prior Knowledge Network of signalling. This building block can be ommited if there exists already a csv file with the network.
- TF Enrichment
For each sample, transcription factor activities are estimated using Dorothea.
- CarnivalPy
Using the TF activities estimated before, it runs Carnival to obtain a sub-network consistent with the TF activities (for each sample).
- Carnival_feature_merger
Preselect a set of genes by the user (if specified) and merge the features with the basal gene expression data.
- ML Jax Drug Prediction
Trains a model using the combined features to predict IC50 values from GDSC.
For details on individual workflow steps, please check the scripts that use each individual building block in the workflow GitHub repository.