After decades of improvements in the employment conditions of females in Spain, this process came to a sudden stop with the Great Spanish Recession of 2008. In this contribution, we analyse a large longitudinal corpus of national and regional news outlets employing advanced Natural Language Processing techniques to capture the valence of mentions of gender inequality expressed in the Spanish press. The automatic analysis of the news articles does indeed capture the known hardships faced by females in the Spanish labour market. Our approach can be straightforwardly generalised to other topics of interest. Assessing the sentiment and moral values expressed in the articles, we notice that females are, in the majority of cases, concerned more than males when there is a deterioration in the overall labour market conditions, based on newspaper articles. This behaviour has been present in the entire period of study (2000--2022) and looked particularly pronounced during the economic crisis of 2008 and the recent COVID-19 pandemic. Most of the time, this phenomenon looks to be more pronounced at the regional level, perhaps caused by a significant focus on local labour markets rather than on aggregate statistics or because, in local contexts, females might suffer more from an isolation or discrimination condition. Our findings contribute to a deeper understanding of the gender inequalities in Spain using alternative data, informing policymakers and stakeholders.
翻译:摘要:在西班牙女性就业状况持续改善数十年后,这一进程因2008年西班牙大萧条而骤然停滞。本研究运用先进自然语言处理技术,分析涵盖全国及地区新闻媒体的大型纵向语料库,以捕捉西班牙新闻媒体中性别不平等表述的情感效价。新闻文章的自动分析确实捕捉到了西班牙劳动力市场中女性所面临的已知困境。我们的方法可直接推广至其他相关议题。通过评估文章所表达的情绪和道德价值观,我们发现:根据报纸文章分析,在劳动力市场整体状况恶化时,多数情况下女性受到的关注程度高于男性。这一现象在研究时段(2000-2022年)始终存在,并在2008年经济危机和近期新冠疫情时期尤为显著。该现象在地区层面往往更为突出,这可能源于地方媒体更关注本地劳动力市场而非总体统计数据,或是在地方性情境中女性可能面临更严重的孤立或歧视状况。本研究利用另类数据深化了对西班牙性别不平等问题的理解,可为政策制定者和利益相关者提供参考。