Citation
Orvis J, et al. Nat Methods. 2021 Jun 25.
doi: 10.1038/s41592-021-01200-9
PMID: 34172972
To enable researchers to more fully harness the collective discovery potential of multi-omic data in the public domain, we have assembled gene-level transcriptomic data from ~200 studies of neocortical development and in vitro models. Applying joint matrix decomposition to mouse, macaque and human data, we define transcriptome dynamics that are conserved across neocortical neurogenesis and identify a program that emerges in ventricular progenitors, is later expressed in neurogenic outer, or basal, radial glia of primates, but is limited to gliogenic precursors in the rodent. Decomposition of adult human neocortical data identified layer-specific signatures in excitatory neurons, enabling the charting of their developmental emergence and protracted maturation, which is in stark contrast to the early peaking expression of layer-defining transcription factors. Interrogation of data from cerebral organoids demonstrated that while broad elements of in vivo development are recapitulated in vitro, many layer-specific transcriptomic programs in neuronal maturation are absent. We invite cell biologists without coding expertise to use NeMO Analytics in their research and to fuel it with their own emerging data.
Joint decomposition of scRNA-seq data in the NeMO Analytics neocortical development data collection
Schematic of NeMO Analytics data resources employed in conjunction with joint decomposition and transfer learning approaches. This is an outline of specific analyses in this report as well as a description of a general approach we invite others to take on. Analysis-ready datasets and detailed sample metadata can be downloaded from NeMO Analytics and analyzed offline. Elements learned from joint decomposition can then be uploaded to NeMO Analytics to explore their dynamics across the broad data collection. In this flow, the offline analysis could be any exploratory technique applied to mutli-omics data matrices that produces gene signatures in the form of simple lists or quantitative loadings, e.g. PCA, clustering, or differential expression analysis.
The central notion behind our approach is to:
Figure 1: scRNA-seq in mouse, macaque and human mid-gestational neocortex
Figure 2: Bulk, single-cell and spatial transcriptomic data in mouse, macaque and human mid-gestational neocortex
NeocortexDevoHsInVivo: in vivo data from the developing human neocortex
NeocortexDevoHsInVitro: cerebral organoids neural differentiation in human stem cell models of the neocortex
NeocortexDevoMmInVivo: in vivo data from the developing mouse neocortex
NeocortexDevoNHPInVivo: in vivo data from the developing macaque, marmoset, and chimpanzee neocortex
NeocortexEvoDevo: selected subset of in vivo datasets from mouse, macaque and human developing neocortex
AdultNeoctxLayers: in vivo data from the mature human neocortex and spatial transcriptomics across species
MammalianEmbryo: mammalian embryo development and the emergence of the neural lineage and telencephalon
MammCtxDev.jNMF.p7: Seven patterns defining conserved transcriptomic dynamics across scRNA-seq from mouse, macaque, and human neocortical neurogenesis (Figures 1 & 2, Sonthalia et al, where p5=RG/cycling, p4=IPC, p7=nascent neuron, p2=maturing neuron).
MammCtxDev.jNMF.p40: Forty patterns defining conserved transcriptomic dynamics across scRNA-seq from mouse, macaque, and human neocortical neurogenesis (Figure 3, Sonthalia et al, where p27=oRG).
HsCtxLayer.jNMF.p20: Twenty patterns defining shared transcriptomic signatures in snRNA-seq from adult human neocortical neurons from layer-specific microdissections across 8 neocortical regions and 5 donors (Figures 4 & 5, Sonthalia et al, where p1=subplate/L6b, p9=L6CT, p15=L6IT, p13=L5/6NP, p6=L5IT, p19=L4primate-specific, p4=L4conserved, p17=L2/3, p7=L2).
CellCycleGenes: Two groups of genes - those expressed specifically in S- or G2M-phase of the cell cycle (from Seurat).
CtxDiseaseGeneLists: Many neocortical disease risk gene lists (from MatoBlanco et al 2024).
Each cell in the table links to the projection of 1 of the NeMO analytics data collections described above into 1 of the gene signatures described above, allowing the exploration of each of the gene signatures as it change across the millions of samples in the NeMO Analytics neocortical data collections (the first column of links can be used to view the expression of individual genes across the data collections). Red highlighted cells indicate specific [data collection :: gene signature] combinations of particular interest.
| Gene signatures | Individual gene links | MammCtxDev.jNMF.p7 | MammCtxDev.jNMF.p40 | HsCtxLayer.jNMF.p20 | CellCycleGenes | CtxDiseaseGeneLists | |
|---|---|---|---|---|---|---|---|
| Transcriptome data collections | |||||||
| Figure 1 | LINK | LINK | LINK | LINK | LINK | LINK | |
| Figure 2 | LINK | LINK | LINK | LINK | LINK | LINK | |
| NeocortexDevoHsInVivo | LINK | LINK | LINK | LINK | LINK | LINK | |
| NeocortexDevoHsInVitro | LINK | LINK | LINK | LINK | LINK | LINK | |
| NeocortexDevoMmInVivo | LINK | LINK | LINK | LINK | LINK | LINK | |
| NeocortexDevoNHPInVivo | LINK | LINK | LINK | LINK | LINK | LINK | |
| NeocortexEvoDevo | LINK | LINK | LINK | LINK | LINK | LINK | |
| AdultNeoctxLayers | LINK | LINK | LINK | LINK | LINK | LINK | |
| MammalianEmbryo | LINK | LINK | LINK | LINK | LINK | LINK |
We invite researchers to add their own datasets and gene signatures to this environment to explore their own data, ideas & hypotheses and to expand the discovery potential of this shared community research resource: