In the realm of public health, vaccination stands as the cornerstone for mitigating disease risks and controlling their proliferation. The recent COVID-19 pandemic has highlighted how vaccines play a crucial role in keeping us safe. However the situation involves a mix of perspectives, with skepticism towards vaccines prevailing for various reasons such as political dynamics, apprehensions about side effects, and more. The paper addresses the challenge of comprehensively understanding and categorizing these diverse concerns expressed in the context of vaccination. Our focus is on developing a robust multi-label classifier capable of assigning specific concern labels to tweets based on the articulated apprehensions towards vaccines. To achieve this, we delve into the application of a diverse set of advanced natural language processing techniques and machine learning algorithms including transformer models like BERT, state of the art GPT 3.5, Classifier Chains & traditional methods like SVM, Random Forest, Naive Bayes. We see that the cutting-edge large language model outperforms all other methods in this context.
翻译:在公共卫生领域,疫苗接种是降低疾病风险和控制其传播的基石。近期COVID-19大流行凸显了疫苗在保障我们安全方面的关键作用。然而,现实中存在多种观点的交织,由于政治动态、对副作用的担忧等原因,对疫苗的怀疑态度普遍存在。本文旨在应对全面理解并分类疫苗接种背景下所表达的各种关切这一挑战。我们的研究重点是开发一个稳健的多标签分类器,能够根据推文中表达的对疫苗的担忧,为其分配特定的关切标签。为此,我们探索了多种先进自然语言处理技术与机器学习算法的应用,包括Transformer模型(如BERT)、最新的GPT 3.5、分类器链,以及传统方法(如SVM、随机森林、朴素贝叶斯)。研究结果表明,在该情境下,尖端的大语言模型性能优于所有其他方法。