Spanning two decades, the Encyclopaedia of DNA Elements (ENCODE) is a collaborative research project that aims to identify all the functional elements in the human and mouse genomes. To best serve the scientific community, all data generated by the consortium is shared through a web-portal (https://www.encodeproject.org/) with no access restrictions. The fourth and final phase of the project added a diverse set of new samples (including those associated with human disease), and a wide range of new assays aimed at detection, characterization and validation of functional genomic elements. The ENCODE data portal hosts results from over 23,000 functional genomics experiments, over 800 functional elements characterization experiments (including in vivo transgenic enhancer assays, reporter assays and CRISPR screens) along with over 60,000 results of computational and integrative analyses (including imputations, predictions and genome annotations). The ENCODE Data Coordination Center (DCC) is responsible for development and maintenance of the data portal, along with the implementation and utilisation of the ENCODE uniform processing pipelines to generate uniformly processed data. Here we report recent updates to the data portal. Specifically, we have completely redesigned the home page, improved search interface, added several new pages to highlight collections of biologically related data (deeply profiled cell lines, immune cells, Alzheimer's Disease, RNA-Protein interactions, degron matrix and a matrix of experiments organised by human donors), added single-cell experiments, and enhanced the cart interface for visualisation and download of user-selected datasets.
翻译:《DNA元件百科全书》(ENCODE)是一项横跨二十年的合作研究项目,旨在鉴定人类和小鼠基因组中的所有功能元件。为更好地服务科学界,该联盟产生的所有数据均通过无访问限制的网络门户(https://www.encodeproject.org/)共享。该项目第四阶段也是最终阶段新增了多样化的样本集(包括与人类疾病相关的样本),以及旨在检测、表征和验证功能基因组元件的多种新型实验。ENCODE数据门户托管了超过23,000项功能基因组学实验的结果、800余项功能元件表征实验(包括体内转基因增强子检测、报告基因检测和CRISPR筛选)以及超过60,000项计算与整合分析结果(包括插补、预测和基因组注释)。ENCODE数据协调中心负责该数据门户的开发与维护,并实施和利用ENCODE统一处理流程生成标准化处理数据。本文报告了数据门户的最新更新:具体包括完全重新设计的首页、改进的搜索界面、新增多个专题页面以突出显示生物学相关数据集(深度分析细胞系、免疫细胞、阿尔茨海默病、RNA-蛋白质相互作用、降解决定子矩阵及按人类供者组织的实验矩阵)、新增单细胞实验,并增强了用于可视化与下载用户选定数据集的购物车界面功能。