BRCA genes, comprising BRCA1 and BRCA2 play indispensable roles in preserving genomic stability and facilitating DNA repair mechanisms. The presence of germline mutations in these genes has been associated with increased susceptibility to various cancers, notably breast and ovarian cancers. Recent advancements in cost-effective sequencing technologies have revolutionized the landscape of cancer genomics, leading to a notable rise in the number of sequenced cancer patient genomes, enabling large-scale computational studies. In this study, we delve into the BRCA mutations in the dbSNP, housing an extensive repository of 41,177 and 44,205 genetic mutations for BRCA1 and BRCA2, respectively. Employing meticulous computational analysis from an umbrella perspective, our research unveils intriguing findings pertaining to a number of critical aspects. Namely, we discover that the majority of BRCA mutations in dbSNP have unknown clinical significance. We find that, although exon 11 for both genes contains the majority of the mutations and may seem as if it is a mutation hot spot, upon analyzing mutations per base pair, we find that all exons exhibit similar levels of mutations. Investigating mutations within introns, while we observe that the recorded mutations are generally uniformly distributed, almost all of the pathogenic mutations in introns are located close to splicing regions (at the beginning or the end). In addition to the findings mentioned earlier, we have also made other discoveries concerning mutation types and the level of confidence in observations within the dbSNP database.
翻译:BRCA基因(包括BRCA1和BRCA2)在维持基因组稳定性及促进DNA修复机制中发挥不可或缺的作用。这些基因的胚系突变与多种癌症(尤其是乳腺癌和卵巢癌)的易感性增加密切相关。近年来,经济高效的测序技术革新了癌症基因组学领域,导致测序的癌症患者基因组数量显著增加,从而支持了大规模计算研究。本研究深入探讨了dbSNP中的BRCA突变,该数据库分别包含BRCA1和BRCA2的41,177个和44,205个遗传突变。通过采用伞形视角的精细计算分析,我们的研究揭示了若干关键方面的有趣发现。具体而言,我们发现dbSNP中大多数BRCA突变的临床意义尚不明确。同时发现,尽管两个基因的第11号外显子包含大多数突变,看似为突变热点,但按碱基对分析突变后,所有外显子的突变水平实则相似。针对内含子突变的调查表明,记录的突变通常均匀分布,但几乎所有致病性内含子突变均位于剪接区域附近(起始或末端)。除上述发现外,我们还在突变类型及dbSNP数据库中观测结果的置信度方面取得了其他发现。