Natural disasters are increasing in frequency and severity, causing hundreds of billions of dollars in damage annually and posing growing threats to infrastructure and human livelihoods. Accurate data on roofing materials is critical for modeling building vulnerability to natural hazards such as earthquakes, floods, wildfires, and hurricanes, yet such data remain unavailable. To address this gap, we introduce RoofNet, the largest and most geographically diverse novel multimodal dataset to date, comprising over 51,500 samples from 184 geographically diverse sites pairing high-resolution Earth Observation (EO) imagery with curated text annotations for global roof material classification. RoofNet includes geographically diverse satellite imagery labeled with 14 key roofing types and is designed to enhance the fidelity of global exposure datasets through vision-language modeling (VLM). We sample EO tiles from climatically and architecturally distinct regions to construct a representative dataset. A subset of 6,000 images was annotated in collaboration with domain experts to fine-tune a VLM. We used geographic- and material-aware prompt tuning to enhance class separability. The fine-tuned model was then applied to the remaining EO tiles, with predictions refined through rule-based and human-in-the-loop verification. In addition to material labels, RoofNet provides rich metadata including roof shape, footprint area, solar panel presence, and indicators of mixed roofing materials (e.g., HVAC systems). The dataset used in earlier experiments has been removed due to licensing constraints related to imagery sources. Results based on this dataset should be interpreted with caution. Updated experiments using compliant data are in progress.
翻译:自然灾害的频率和严重程度持续增加,每年造成数千亿美元损失,对基础设施和人类生计构成日益严峻的威胁。精确的屋顶材料数据对于模拟建筑在地震、洪水、野火和飓风等自然灾害中的脆弱性至关重要,然而此类数据目前仍不可得。为弥补这一空白,我们提出了RoofNet——迄今为止规模最大、地理多样性最丰富的新型多模态数据集,包含来自184个地理分布广泛区域的51,500余个样本,将高分辨率地球观测(EO)影像与经过整理的文本注释相结合,用于全球屋顶材料分类。RoofNet包含标注有14种关键屋顶类型的地理多样化卫星影像,旨在通过视觉语言建模(VLM)提升全球暴露数据的保真度。我们从气候和建筑特征各异的区域采样EO图块,构建具有代表性的数据集。其中6,000幅图像与领域专家合作进行注释,用于微调VLM。我们采用地理和材料感知的提示调优方法以增强类别可分性。随后将微调模型应用于其余EO图块,并通过基于规则和人在回路的验证对预测结果进行优化。除材料标签外,RoofNet还提供丰富的元数据,包括屋顶形状、占地面积、太阳能电池板存在情况以及混合屋顶材料(如暖通空调系统)的标识。早期实验中使用的数据集因影像来源的许可限制已被移除。基于该数据集的实验结果应谨慎解读。使用合规数据进行的更新实验正在进行中。