View : 352 Download: 0

Fine-Tuning BERT Models to Classify Misinformation on Garlic and COVID-19 on Twitter

Title
Fine-Tuning BERT Models to Classify Misinformation on Garlic and COVID-19 on Twitter
Authors
Kim M.G.Kim M.Kim J.H.Kim K.
Ewha Authors
김명규
SCOPUS Author ID
김명규scopus
Issue Date
2022
Journal Title
International Journal of Environmental Research and Public Health
ISSN
1661-7827JCR Link
Citation
International Journal of Environmental Research and Public Health vol. 19, no. 9
Keywords
bidirectional encoder representations from transformers (BERT)COVID-19garlicmisinformationTwitter
Publisher
MDPI
Indexed
SCIE; SSCI; SCOPUS WOS scopus
Document Type
Article
Abstract
Garlic-related misinformation is prevalent whenever a virus outbreak occurs. With the outbreak of COVID-19, garlic-related misinformation is spreading through social media, including Twitter. Bidirectional Encoder Representations from Transformers (BERT) can be used to classify misinformation from a vast number of tweets. This study aimed to apply the BERT model for classifying misinformation on garlic and COVID-19 on Twitter, using 5929 original tweets mentioning garlic and COVID-19 (4151 for fine-tuning, 1778 for test). Tweets were manually labeled as ‘misinformation’ and ‘other.’ We fine-tuned five BERT models (BERTBASE, BERTLARGE, BERTweet-base, BERTweet-COVID-19, and BERTweet-large) using a general COVID-19 rumor dataset or a garlicspecific dataset. Accuracy and F1 score were calculated to evaluate the performance of the models. The BERT models fine-tuned with the COVID-19 rumor dataset showed poor performance, with maximum accuracy of 0.647. BERT models fine-tuned with the garlic-specific dataset showed better performance. BERTweet models achieved accuracy of 0.897–0.911, while BERTBASE and BERTLARGE achieved accuracy of 0.887–0.897. BERTweet-large showed the best performance with maximum accuracy of 0.911 and an F1 score of 0.894. Thus, BERT models showed good performance in classifying misinformation. The results of our study will help detect misinformation related to garlic and COVID-19 on Twitter. © 2022 by the authors. Licensee MDPI, Basel, Switzerland.
DOI
10.3390/ijerph19095126
Appears in Collections:
약학대학 > 약학과 > Journal papers
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

BROWSE