In recent years, the use of emojis in social media has increased dramatically, making them an important element in understanding online communication. However, predicting the meaning of emojis in a given text is a challenging task due to their ambiguous nature. In this study, we propose a transformer-based approach for emoji prediction using BERT, a widely-used pre-trained language model. We fine-tuned BERT on a large corpus of text containing both text and emojis to predict the most appropriate emoji for a given text. Our experimental results demonstrate that our approach outperforms several state-of-the-art models in predicting emojis with an accuracy of over 75 \% This work has potential applications in natural language processing, sentiment analysis, and social media marketing.
翻译:近年来,社交媒体中Emoji的使用急剧增加,使其成为理解在线交流的重要元素。然而,由于Emoji具有歧义性,在给定文本中预测其含义是一项具有挑战性的任务。本研究提出一种基于Transformer的方法,利用广泛使用的预训练语言模型BERT进行Emoji预测。我们在包含文本与Emoji的庞大数据集上对BERT进行微调,以预测给定文本中最合适的Emoji。实验结果表明,我们的方法在Emoji预测任务上以超过75%的准确率优于多个最先进模型。该工作在自然语言处理、情感分析及社交媒体营销等领域具有潜在应用价值。