Building the World’s Largest Spatial Multiomics Dataset in Oncology–CeGaT’s Role in the MOSAIC Project

To advance personalized cancer therapy, Owkin’s MOSAIC project is building the largest spatial multiomics dataset in oncology. The goal: to train AI models that detect disease subtypes, uncover biomarkers, and guide treatment decisions. The foundation of this endeavor is high-resolution molecular data from 7,000 tumor samples across seven cancer types – connecting cancer cells and their microenvironment. That is where CeGaT comes in: as the project’s European sequencing partner, we generate the high-resolution molecular data that powers the MOSAIC project.

Our role: high-resolution sequencing and smart data analysis
We receive tumor samples from four of the five participating clinical centers. Each sample undergoes a full set of sequencing analyses:

Graphic showing four sequencing approaches in MOSAIC: Single-Cell RNA Sequencing, Spacial Transcriptome Sequencing, Transcriptome Sequencing and Whole Exome Sequencing.

Figure 1 | Four sequencing approaches in MOSAIC.

Beyond sample processing, we perform secondary bioinformatic analyses, including sequencing quality control, alignment, variant calling, and annotation. Our in-house workflows are optimized for reproducibility, scalability, and consistent data quality across all modalities.

Designed for scale – with reliable, high-quality performance
Owkin launched the project in January 2024. With over 2,000 samples already processed, we have managed high volumes with consistent performance. The first results are already becoming visible: MOSAIC has recently launched its online ‘MOSAIC Window’ – offering researchers a first glimpse into the insights this project could deliver. These early insights are built on a robust sequencing framework that ensures consistency, reproducibility, and data quality across thousands of samples. To meet the project’s demanding requirements, we adapted our workflows – including setting up an additional version of our single-cell RNA sequencing product– to ensure reliable quality and efficiency at scale.

Enabling complex research – with depth and infrastructure
With extensive experience in research, CeGaT supports multiomics projects of all sizes. In MOSAIC, we contribute more than data – we provide the expertise, infrastructure, and flexibility required for successful execution.

We are proud to be part of this groundbreaking initiative – helping shape the future of personalized cancer care with high-quality sequencing.

July 3, 2025 | Sequencing |