View : 763 Download: 0

A sequential and intensive weighted language modeling scheme for multi-task learning-based natural language understanding

Title
A sequential and intensive weighted language modeling scheme for multi-task learning-based natural language understanding
Authors
Son S.Hwang S.Bae S.Park S.J.Choi J.-H.
Ewha Authors
최장환
SCOPUS Author ID
최장환scopus
Issue Date
2021
Journal Title
Applied Sciences (Switzerland)
ISSN
2076-3417JCR Link
Citation
Applied Sciences (Switzerland) vol. 11, no. 7
Keywords
Language modelingMulti-task learningNatural language understandingNeural networksSupervised learning
Publisher
MDPI AG
Indexed
SCIE; SCOPUS WOS scopus
Document Type
Article
Abstract
Multi-task learning (MTL) approaches are actively used for various natural language processing (NLP) tasks. The Multi-Task Deep Neural Network (MT-DNN) has contributed significantly to improving the performance of natural language understanding (NLU) tasks. However, one drawback is that confusion about the language representation of various tasks arises during the training of the MT-DNN model. Inspired by the internal-transfer weighting of MTL in medical imaging, we introduce a Sequential and IntensiveWeighted Language Modeling (SIWLM) scheme. The SIWLM consists of two stages: (1) Sequential weighted learning (SWL), which trains a model to learn entire tasks sequentially and concentrically, and (2) Intensive weighted learning (IWL), which enables the model to focus on the central task. We apply this scheme to the MT-DNN model and call this model the MTDNN-SIWLM. Our model achieves higher performance than the existing reference algorithms on six out of the eight GLUE benchmark tasks. Moreover, our model outperforms MT-DNN by 0.77 on average on the overall task. Finally, we conducted a thorough empirical investigation to determine the optimal weight for each GLUE task. © 2021 by the authors.
DOI
10.3390/app11073095
Appears in Collections:
인공지능대학 > 인공지능학과 > Journal papers
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

BROWSE