A Hybrid Feature Selection and Construction Method for Detection of Wind Turbine Generator Heating Faults

Preprocessing of information is an essential step for the effective design of machine learning applications. Feature construction and selection are powerful techniques used for this aim. In this paper, a feature selection and construction approach is presented for the detection of wind turbine generator heating faults. Data were collected from Supervisory Control and Data Acquisition (SCADA) system of a wind turbine. The original features directly collected from the data collection system consist of wind characteristics, operational data, temperature measurements and status information. In addition to these original features, new features were created in the feature construction step to obtain information that can be more powerful indications of the faults. After the construction of new features, a hybrid feature selection technique was implemented to find out the most relevant features in the overall set to increase the classification accuracy and decrease the computational burden. Feature selection step consists of filter and wrapper-based parts. Filter based feature selection was applied to exclude the features which are non-discriminative and wrapper-based method was used to determine the final features considering the redundancies and mutual relations amongst them. Artificial Neural Networks were used both in the detection phase and as the induction algorithm of the wrapper-based feature selection part. The results show that, the proposed approach contributes to the fault detection system to be more reliable especially in terms of reducing the number of false fault alarms.

翻译：信息预处理是机器学习应用有效设计中的关键步骤。特征构造与选择是实现该目标的强大技术。本文提出了一种用于风电机组发电机发热故障检测的特征选择与构造方法。数据采集自某风电机组的监控与数据采集系统。直接从数据采集系统获取的原始特征包括风特性、运行数据、温度测量值和状态信息。除这些原始特征外，在特征构造步骤中还生成了新特征，以获取更能有效指示故障的信息。生成新特征后，采用混合特征选择技术从整体特征集中筛选最相关特征，以提高分类精度并降低计算负担。特征选择步骤包含基于过滤器和基于封装器的两部分：基于过滤器的特征选择用于排除非判别性特征，而基于封装器的方法则通过考虑特征间的冗余性和相互关系来确定最终特征。人工神经网络既用于故障检测阶段，也作为基于封装器的特征选择部分的归纳算法。结果表明，所提方法有助于提高故障检测系统的可靠性，尤其在减少误报警次数方面效果显著。

相关内容

特征选择

关注 5940

特征选择( Feature Selection )也称特征子集选择( Feature Subset Selection , FSS )，或属性选择( Attribute Selection )。是指从已有的M个特征(Feature)中选择N个特征使得系统的特定指标最优化，是从原始特征中选择出一些最有效特征以降低数据集维度的过程,是提高学习算法性能的一个重要手段,也是模式识别中关键的数据预处理步骤。对于一个学习算法来说,好的学习样本是训练模型的关键。

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日