Intron Biology

Introns are non-coding sequences within genes that are transcribed into RNA but are spliced out before translation into protein. Despite introns being ubiquitous features of eukaryotic genomes, their function and persistence in genomes remain elusive. My research aims to unravel the role of introns in chromatin structure and explore how their architecture and sequences influence transcriptional regulation.
Predicting Chromatin Features from Intron Architecture
To understand the relationship between intron architecture and chromatin features in the model plant Arabidopsis thaliana, we predicted histone modifications, histone variants, DNA methylation, and gene expression patterns from gene features. Our analysis revealed that first intron position is predictive of active chromatin marks enriched near transcription start sites, while intron number correlates with distinct chromatin marks found in gene bodies of broadly expressed genes.
Notably, we identified enrichment of the gene body mark H3K36me3 within introns, a pattern that appears to be plant-specific. To validate these associations, we compared recently duplicated gene pairs with divergent intron architectures and confirmed that intron number is positively associated with H3K4me1, H3K36me3, H2A.X, meCG methylation, and broad expression across tissues.