NOTCH2NL ( NOTCH2-N-terminus-like ) genes arose from ape-specific chromosome 1 segmental duplications implicated in human brain cortical expansion, including an incomplete NOTCH2 gene. Genetic characterization of these loci and their regulation is complicated because they are embedded in large, nearly identical duplications that predispose to recurrent microdeletion syndromes. Using near-complete long-read assemblies generated from 70 human and 12 ape haploid genomes, we show independent recurrent duplication among apes with protein-coding copies emerging in humans 2.2–3.7 million years ago. We distinguish NOTCH2NL paralogs present in every human haplotype ( NOTCH2NLA ) from copy-number-variable ones. We also characterize large-scale structural variation, including gene conversion, for 28% of haplotypes, leading to a previously undescribed paralog, NOTCH2tv . Finally, we apply Fiber-seq and long-read transcript sequencing to human dorsal forebrain organoids to characterize the regulatory landscape and find that the most fixed paralogs, NOTCH2 and NOTCH2NLA , harbor the greatest number of paralog-specific elements potentially driving their regulation.
Genetic diversity and regulatory features of human-specific NOTCH2NL duplications
Antonacci, Francesca;
2026-01-01
Abstract
NOTCH2NL ( NOTCH2-N-terminus-like ) genes arose from ape-specific chromosome 1 segmental duplications implicated in human brain cortical expansion, including an incomplete NOTCH2 gene. Genetic characterization of these loci and their regulation is complicated because they are embedded in large, nearly identical duplications that predispose to recurrent microdeletion syndromes. Using near-complete long-read assemblies generated from 70 human and 12 ape haploid genomes, we show independent recurrent duplication among apes with protein-coding copies emerging in humans 2.2–3.7 million years ago. We distinguish NOTCH2NL paralogs present in every human haplotype ( NOTCH2NLA ) from copy-number-variable ones. We also characterize large-scale structural variation, including gene conversion, for 28% of haplotypes, leading to a previously undescribed paralog, NOTCH2tv . Finally, we apply Fiber-seq and long-read transcript sequencing to human dorsal forebrain organoids to characterize the regulatory landscape and find that the most fixed paralogs, NOTCH2 and NOTCH2NLA , harbor the greatest number of paralog-specific elements potentially driving their regulation.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


