Domestic violence, which is often perceived as a gendered issue among female victims, has gained increasing attention in recent years. Despite this focus, male victims of domestic abuse remain primarily overlooked, particularly in Bangladesh. Our study represents a pioneering exploration of the underexplored realm of male domestic violence (MDV) within the Bangladeshi context, shedding light on its prevalence, patterns, and underlying factors. Existing literature predominantly emphasizes female victimization in domestic violence scenarios, leading to an absence of research on male victims. We collected data from the major cities of Bangladesh and conducted exploratory data analysis to understand the underlying dynamics. We implemented 11 traditional machine learning models with default and optimized hyperparameters, 2 deep learning, and 4 ensemble models. Despite various approaches, CatBoost has emerged as the top performer due to its native support for categorical features, efficient handling of missing values, and robust regularization techniques, achieving 76% accuracy. In contrast, other models achieved accuracy rates in the range of 58-75%. The eXplainable AI techniques, SHAP and LIME, were employed to gain insights into the decision-making of black-box machine learning models. By shedding light on this topic and identifying factors associated with domestic abuse, the study contributes to identifying groups of people vulnerable to MDV, raising awareness, and informing policies and interventions aimed at reducing MDV. Our findings challenge the prevailing notion that domestic abuse primarily affects women, thus emphasizing the need for tailored interventions and support systems for male victims. ML techniques enhance the analysis and understanding of the data, providing valuable insights for developing effective strategies to combat this pressing social issue.
翻译:家庭暴力通常被视为针对女性受害者的性别议题,近年来日益受到关注。然而,男性家庭暴力受害者仍普遍被忽视,尤其在孟加拉国。本研究开创性地探索了孟加拉国背景下未被充分研究的男性家庭暴力(MDV)领域,揭示了其普遍性、模式和潜在因素。现有文献主要聚焦于家庭暴力中的女性受害现象,导致针对男性受害者的研究缺失。我们收集了孟加拉国主要城市的数据,通过探索性数据分析理解其内在机制。我们采用默认和优化超参数实施了11种传统机器学习模型、2种深度学习模型和4种集成模型。尽管尝试多种方法,CatBoost凭借其原生支持分类特征、高效处理缺失值和稳健的正则化技术脱颖而出,实现了76%的准确率,而其他模型的准确率在58-75%之间。采用可解释人工智能技术SHAP和LIME来洞察黑盒机器学习模型的决策过程。通过揭示这一议题并识别与家庭虐待相关的因素,本研究有助于识别易受MDV影响的群体,提高公众认知,并为减少MDV的政策和干预措施提供依据。我们的研究结果挑战了家庭暴力主要影响女性的普遍认知,从而强调为男性受害者制定针对性干预措施和支持体系的必要性。机器学习技术增强了数据分析与理解能力,为制定有效策略应对这一紧迫社会问题提供了宝贵见解。