
Perturbation Data

Functional Genomics

We generate large perturbation datasets to power your AI models of cell and disease biology for use in target identification, target validation, and drug discovery.


How It Works

Our functional genomics product addresses three key problems you face when training AI models for early drug discovery: data availability, quality, and uniformity. We generate large, high-fidelity transcriptomic and phenotypic datasets in the disease context of your choice.

We introduce genetic and chemical perturbations in major cell types and provide you with your readouts of choice. Our highly automated workflow means you get data back in as little as 3 weeks.

Select from a range of perturbations and readout types

Readouts (measurements)

  • DRUG-seq (high-throughput bulk RNA-seq): transcript abundance

  • High Content Imaging: images, fluorescence intensity

  • High Throughput Verify (Amp-seq): indel formation in the target gene

Perturbations (type of perturbation)

  • Arrayed: chemical

  • Arrayed: genetic (CRISPRi, CRISPR KO, and siRNAs), delivered by Lipofectamine transfection or lentiviral transduction

Cell models (examples)

  • Primary cells: iPSC-derived neurons, fibroblasts

  • Standard cell lines: A549, HEK293, U2OS, etc.

  • Co-cultures: human + mouse neurons

Designed with your needs in mind

01

Flexibility & Customization

Bring your own cell line or choose one of ours. Select a compound library and your data readout preference.

02

Scale & Speed

Tens of thousands of in vitro chemical and genetic perturbations in each cell type, all delivered in as little as 3 weeks.

03

Simple & Easy Terms

Fee-for-service only: no royalties, no milestones. And you own the data, always. Our version of an easy button.

Datasets

Interact with Sample Datasets

GDPx3

  • Release date: May 2025

  • Data package: metadata for a subset of 7 test compounds and 6 controls; PDF outlining the experimental setup; PDF outlining the data analysis pipeline steps

  • Cell types: A549 cells (human non-small cell lung carcinoma epithelial cell line), aortic smooth muscle cells, dermal fibroblasts, aortic endothelial cells

  • Perturbations: 46 compounds total: 40 compounds at 4 concentrations, 2 time points, and 4 replicates, plus 6 controls

  • Readout: High Content Imaging (Cell Painting) with 2200 x 2200 image dimensions

  • Plate density: 384-well plate

  • Available dataset size: ~55 GB / cell type compressed, 100 GB uncompressed

GDPx2

  • Release date: December 2024

  • Data package: metadata for a subset of 10 compounds and all controls; PDF outlining the experimental setup

  • Cell types: human melanocytes, aortic smooth muscle cells, dermal fibroblasts, skeletal muscle myoblasts

  • Perturbations: 85 compounds, 6 concentrations, 4 replicates

  • Readout: Transcriptomic (DRUG-seq) with a sequencing depth of 2M reads

  • Plate density: 384-well plate

  • Available dataset size: ~200 GB / cell type

GDPx1

  • Release date: September 2024

  • Data package: metadata for the 1,264-compound screen; raw UMI counts for a subset of 20 compounds; PDF outlining the experimental setup

  • Cell types: A549 cells (human non-small cell lung carcinoma epithelial cell line)

  • Perturbations: 1,264 compounds, 2 concentrations, 4 replicates

  • Readout: Transcriptomic (DRUG-seq) with a sequencing depth of 2M reads

  • Plate density: 384-well plate

  • Available dataset size: 60 GB
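
To get a feel for the scale of each sample dataset, the perturbation and plate-density details listed for GDPx1, GDPx2, and GDPx3 can be turned into a rough well count. The Python sketch below is illustrative only. It assumes that every combination of compound, concentration, time point, and replicate occupies exactly one well, that the figures apply per cell type, and that control and empty wells are ignored, so the plate numbers are a lower bound rather than the actual plate layout. The names `designs`, `treated_wells`, and `min_plates` are hypothetical helpers, not part of any delivered data package.

```python
from math import ceil, prod

# Plate density listed for all three sample datasets.
WELLS_PER_PLATE = 384

# Design factors copied from the perturbation details of each dataset.
# Assumption: controls are left out because their layout is not described.
designs = {
    "GDPx3": {"compounds": 40, "concentrations": 4, "time_points": 2, "replicates": 4},
    "GDPx2": {"compounds": 85, "concentrations": 6, "replicates": 4},
    "GDPx1": {"compounds": 1264, "concentrations": 2, "replicates": 4},
}


def treated_wells(design):
    """Number of treated wells if each factor combination fills one well."""
    return prod(design.values())


def min_plates(wells, wells_per_plate=WELLS_PER_PLATE):
    """Smallest number of plates that can hold the given number of wells."""
    return ceil(wells / wells_per_plate)


# Map each dataset name to (treated wells, minimum number of plates).
summary = {
    name: (treated_wells(design), min_plates(treated_wells(design)))
    for name, design in designs.items()
}

for name, (wells, plates) in summary.items():
    print(f"{name}: {wells} treated wells, at least {plates} plates of {WELLS_PER_PLATE} wells")
```

Under these assumptions the 1,264-compound GDPx1 screen works out to 10,112 treated wells, or at least 27 plates, before any controls are added.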

Tell us what data you need. We’ll generate a large perturbation dataset to power your AI model.

Need data? Let's talk.
