Citation
Orvis J, et al. Nat Methods. 2021 Jun 25.
doi: 10.1038/s41592-021-01200-9
PMID: 34172972
Vast quantities of multi-omic data have been produced to characterize the development and diversity of cell types in the cerebral cortex of humans and other mammals. To more fully harness the collective discovery potential of these data, we have assembled gene-level transcriptomic data from 188 published studies of neocortical development, including the transcriptomes of >33 million single-cells, extensive spatial transcriptomic experiments and RNA sequencing of sorted cells and bulk tissues. Applying joint matrix decomposition to mouse, macaque and human data in this collection, we defined transcriptome dynamics that are conserved across mammalian neurogenesis and which elucidate the evolution of outer, or basal, radial glial cells. Decomposition of adult human neocortical data identified layer-specific signatures in mature neurons and, in combination with transfer learning methods in NeMO Analytics, enabled the charting of their early developmental emergence and protracted maturation across years of postnatal life. Interrogation of data from cerebral organoids demonstrated that while many molecular elements of in vivo development are recapitulated in vitro, specific transcriptomic programs in neuronal maturation are absent. We invite computational biologists and cell biologists without coding expertise to use NeMO Analytics and to fuel it with emerging data.
Joint decomposition of scRNA-seq data in the NeMO Analytics neocortical development data collection
Schematic of NeMO Analytics data resources employed in conjunction with joint decomposition and transfer learning approaches. This is an outline of specific analyses in this report as well as a description of a general approach we invite others to take on. Analysis-ready datasets and detailed sample metadata can be downloaded from NeMO Analytics and analyzed offline. Elements learned from joint decomposition can then be uploaded to NeMO Analytics to explore their dynamics across the broad data collection. In this flow, the offline analysis could be any exploratory technique applied to mutli-omics data matrices that produces gene signatures in the form of simple lists or quantitative loadings, e.g. PCA, clustering, or differential expression analysis.
The central notion behind our approach is to:
Figure 1: scRNA-seq in mouse, macaque and human mid-gestational neocortex
Figure 2: Bulk, single-cell and spatial transcriptomic data in mouse, macaque and human mid-gestational neocortex
NeocortexDevoHsInVivo: in vivo data from the developing human neocortex
NeocortexDevoHsInVitro: cerebral organoids neural differentiation in human stem cell models of the neocortex
NeocortexDevoMmInVivo: in vivo data from the developing mouse neocortex
NeocortexDevoNHPInVivo: in vivo data from the developing macaque, marmoset, and chimpanzee neocortex
NeocortexEvoDevo: selected subset of in vivo datasets from mouse, macaque and human developing neocortex
AdultNeoctxLayers: in vivo data from the mature human neocortex and spatial transcriptomics across species
MammalianEmbryo: mammalian embryo development and the emergence of the neural lineage and telencephalon
MammCtxDev.jNMF.p7: Seven patterns defining conserved transcriptomic dynamics across scRNA-seq from mouse, macaque, and human neocortical neurogenesis (Figures 1 & 2, Sonthalia et al, where p5=RG/cycling, p4=IPC, p7=nascent neuron, p2=maturing neuron).
MammCtxDev.jNMF.p40: Forty patterns defining conserved transcriptomic dynamics across scRNA-seq from mouse, macaque, and human neocortical neurogenesis (Figure 3, Sonthalia et al, where p27=oRG).
HsCtxLayer.jNMF.p20: Twenty patterns defining shared transcriptomic signatures in snRNA-seq from adult human neocortical neurons from layer-specific microdissections across 8 neocortical regions and 5 donors (Figures 4 & 5, Sonthalia et al, where p1=subplate/L6b, p9=L6CT, p15=L6IT, p13=L5/6NP, p6=L5IT, p19=L4primate-specific, p4=L4conserved, p17=L2/3, p7=L2).
CellCycleGenes: Two groups of genes - those expressed specifically in S- or G2M-phase of the cell cycle (from Seurat).
CtxDiseaseGeneLists: Many neocortical disease risk gene lists (from MatoBlanco et al 2024).
Each cell in the table links to the projection of 1 of the NeMO analytics data collections described above into 1 of the gene signatures described above, allowing the exploration of each of the gene signatures as it change across the millions of samples in the NeMO Analytics neocortical data collections (the first column of links can be used to view the expression of individual genes across the data collections). Red highlighted cells indicate specific [data collection :: gene signature] combinations of particular interest.
Gene signatures | Individual gene links | MammCtxDev.jNMF.p7 | MammCtxDev.jNMF.p40 | HsCtxLayer.jNMF.p20 | CellCycleGenes | CtxDiseaseGeneLists | |
---|---|---|---|---|---|---|---|
Transcriptome data collections | |||||||
Figure 1 | LINK | LINK | LINK | LINK | LINK | LINK | |
Figure 2 | LINK | LINK | LINK | LINK | LINK | LINK | |
NeocortexDevoHsInVivo | LINK | LINK | LINK | LINK | LINK | LINK | |
NeocortexDevoHsInVitro | LINK | LINK | LINK | LINK | LINK | LINK | |
NeocortexDevoMmInVivo | LINK | LINK | LINK | LINK | LINK | LINK | |
NeocortexDevoNHPInVivo | LINK | LINK | LINK | LINK | LINK | LINK | |
NeocortexEvoDevo | LINK | LINK | LINK | LINK | LINK | LINK | |
AdultNeoctxLayers | LINK | LINK | LINK | LINK | LINK | LINK | |
MammalianEmbryo | LINK | LINK | LINK | LINK | LINK | LINK |
We invite researchers to add their own datasets and gene signatures to this environment to explore their own data, ideas & hypotheses and to expand the discovery potential of this shared community research resource: