Big earth science data offers the scientific community great opportunities. Many more studies at large-scales, over long-terms and at high resolution can now be conducted using the rich information collected by remote sensing satellites, ground-based sensor networks, and even social media input. However, the hundreds of terabytes of information collected and compiled on an hourly basis by NASA and other government agencies present a significant challenge for atmospheric scientists seeking to improve the understanding of the Earth atmospheric system. These challenges include effective discovery, organization, analysis and visualization of large amounts of data. This paper reports the outcomes of an NSF-funded project that developed a geospatial cyberinfrastructure -- the A2CI (Atmospheric Analysis Cyberinfrastructure) -- to support atmospheric research. We first introduce the service-oriented system framework then describe in detail the implementation of the data discovery module, data management module, data integration module, data analysis and visualization modules following the cloud computing principles-Data-as-a-Service, Software-as-a-Service, Platform-as-a-Service and Infrastructure-as-a-Service. We demonstrate the graphic user interface by performing an analysis between Sea Surface Temperature and the intensity of tropical storms in the North Atlantic and Pacific oceans. We expect this work to contribute to the technical advancement of cyberinfrastructure research as well as to the development of an online, collaborative scientific analysis system for atmospheric science.
翻译:地球科学大数据为科学界提供了巨大机遇。利用遥感卫星、地面传感器网络甚至社交媒体输入所收集的丰富信息,现可在大规模、长时间尺度及高分辨率下开展更多研究。然而,美国国家航空航天局及其他政府机构每小时收集和汇编的数百TB信息,为寻求深化对地球大气系统理解的大气科学家带来了重大挑战。这些挑战包括海量数据的有效发现、组织、分析和可视化。本文报告了一项美国国家科学基金会资助项目的成果,该项目开发了一种地理空间网络基础设施——A2CI(大气分析网络基础设施),以支持大气研究。我们首先介绍了面向服务的系统框架,随后详细描述了数据发现模块、数据管理模块、数据集成模块、数据分析与可视化模块的实现,这些模块遵循云计算原则——数据即服务、软件即服务、平台即服务及基础设施即服务。通过分析北大西洋和太平洋热带气旋强度与海表温度的关系,我们展示了图形用户界面。我们期望这项工作能为网络基础设施研究的技术进步,以及大气科学在线协作分析系统的开发做出贡献。