View : 345 Download: 0

Full metadata record

DC Field Value Language
dc.contributor.author배호*
dc.date.accessioned2022-08-12T16:31:22Z-
dc.date.available2022-08-12T16:31:22Z-
dc.date.issued2022*
dc.identifier.issn2471-285X*
dc.identifier.otherOAK-32107*
dc.identifier.urihttps://dspace.ewha.ac.kr/handle/2015.oak/262409-
dc.description.abstractAs modern programs grow in size and complexity, the importance of program behavior modeling is emerging in various areas. Because of the large amount of data generated by a target program and the difficulty of runtime analysis, previous works in these areas employ deep learning. However, they did not sufficiently consider the input of a target program, since, in our view, program behavior is a history of computational steps consisting of a function and its input arguments. A naive, intuitive way to embed the value of <formula><tex>$x$</tex></formula> as it is in a vector representation creates a tremendously large vector size. Instead, we found that all the values inducing the same runtime behavior can be represented as one identical characteristic value (CV). In this paper, we show that not only can a characteristic value sequence replace the argument input, but it is also efficient to use it as an input vector for a neural network. This efficiency comes from modeling the whole program with multiple LSTM-RNN models and reducing the input space of the neural network. To demonstrate the effectiveness of this replacement, we performed experiments on the problem of program behavior anomaly detection. Our results show that our model achieves better detection performance compared to previous models and similar detection performance even with smaller model sizes. We also provide a visualization of the embedded vectors extracted from the embedding layer in the neural network model to prove that the CV sequence well represents the arguments. Author*
dc.languageEnglish*
dc.publisherInstitute of Electrical and Electronics Engineers Inc.*
dc.subjectAdaptation models*
dc.subjectAnalytical models*
dc.subjectArtificial neural network*
dc.subjectCodes*
dc.subjectComputational modeling*
dc.subjectData models*
dc.subjectdeep learning*
dc.subjectfeature embedding*
dc.subjectnatural language processing*
dc.subjectNeural networks*
dc.subjectprogram behavior modeling*
dc.subjectRuntime*
dc.titleData Embedding Scheme for Efficient Program Behavior Modeling With Neural Networks*
dc.typeArticle*
dc.relation.indexSCIE*
dc.relation.indexSCOPUS*
dc.relation.startpage1*
dc.relation.lastpage12*
dc.relation.journaltitleIEEE Transactions on Emerging Topics in Computational Intelligence*
dc.identifier.doi10.1109/TETCI.2022.3146425*
dc.identifier.scopusid2-s2.0-85132503583*
dc.author.googleAhn S.*
dc.author.googleYi H.*
dc.author.googleBae H.*
dc.author.googleYoon S.*
dc.author.googlePaek Y.*
dc.contributor.scopusid배호(57205541775)*
dc.date.modifydate20240322134121*
Appears in Collections:
인공지능대학 > 사이버보안학과 > Journal papers
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

BROWSE