TY - JOUR
T1 - A Cyclic Permutation Approach to Removing Spatial Dependency between Clustered Gene Ontology Terms
AU - Rapoport, Rachel
AU - Greenberg, Avraham
AU - Yakhini, Zohar
AU - Simon, Itamar
N1 - Publisher Copyright:
© 2024 by the authors.
PY - 2024/3
Y1 - 2024/3
N2 - Traditional gene set enrichment analysis falters when applied to large genomic domains, where neighboring genes often share functions. This spatial dependency creates misleading enrichments, mistaking mere physical proximity for genuine biological connections. Here we present Spatial Adjusted Gene Ontology (SAGO), a novel cyclic permutation-based approach, to tackle this challenge. SAGO separates enrichments due to spatial proximity from genuine biological links by incorporating the genes’ spatial arrangement into the analysis. We applied SAGO to various datasets in which the identified genomic intervals are large, including replication timing domains, large H3K9me3 and H3K27me3 domains, HiC compartments and lamina-associated domains (LADs). Intriguingly, applying SAGO to prostate cancer samples with large copy number alteration (CNA) domains eliminated most of the enriched GO terms, thus helping to accurately identify biologically relevant gene sets linked to oncogenic processes, free from spatial bias.
AB - Traditional gene set enrichment analysis falters when applied to large genomic domains, where neighboring genes often share functions. This spatial dependency creates misleading enrichments, mistaking mere physical proximity for genuine biological connections. Here we present Spatial Adjusted Gene Ontology (SAGO), a novel cyclic permutation-based approach, to tackle this challenge. SAGO separates enrichments due to spatial proximity from genuine biological links by incorporating the genes’ spatial arrangement into the analysis. We applied SAGO to various datasets in which the identified genomic intervals are large, including replication timing domains, large H3K9me3 and H3K27me3 domains, HiC compartments and lamina-associated domains (LADs). Intriguingly, applying SAGO to prostate cancer samples with large copy number alteration (CNA) domains eliminated most of the enriched GO terms, thus helping to accurately identify biologically relevant gene sets linked to oncogenic processes, free from spatial bias.
KW - GO annotations
KW - copy number alterations (CNA)
KW - cyclic permutation
KW - gene set enrichment analysis (GSEA)
KW - replication timing
KW - spatial dependencies
UR - http://www.scopus.com/inward/record.url?scp=85188684318&partnerID=8YFLogxK
U2 - 10.3390/biology13030175
DO - 10.3390/biology13030175
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 38534445
AN - SCOPUS:85188684318
SN - 2079-7737
VL - 13
JO - Biology
JF - Biology
IS - 3
M1 - 175
ER -