no code implementations • 15 Jan 2024 • Imam Nur Bani Yusuf, Lingxiao Jiang
However, a challenge in deploying deep learning for vulnerability detection is the limited availability of training data.
1 code implementation • 28 Aug 2022 • Hoang H. Nguyen, Nhat-Minh Nguyen, Chunyao Xie, Zahra Ahmadi, Daniel Kudendo, Thanh-Nam Doan, Lingxiao Jiang
Moreover, it develops a multi-metapath heterogeneous graph attention network to learn multi-level embeddings of different types of nodes and their metapaths in the heterogeneous contract graphs, which can capture the code semantics of smart contracts more accurately and facilitate both fine-grained line-level and coarse-grained contract-level vulnerability detection.
1 code implementation • 19 May 2021 • Minghuan Tan, Lei Wang, Lingxiao Jiang, Jing Jiang
In this paper, we revisit math word problems~(MWPs) from the cross-lingual and multilingual perspective.
1 code implementation • 14 Dec 2020 • Stefanus Agus Haryono, Ferdian Thung, David Lo, Lingxiao Jiang, Julia Lawall, Hong Jin Kang, Lucas Serrano, Gilles Muller
Usages of deprecated APIs in Android apps need to be updated to ensure the apps' compatibility with the old and new versions of Android OS.
Software Engineering
no code implementations • 13 Dec 2020 • Nghi D. Q. Bui, Yijun Yu, Lingxiao Jiang
We trained an InferCode model instance using the Tree-based CNN as the encoder of a large set of Java code and applied it to downstream unsupervised tasks such as code clustering, code clone detection, cross-language code search or reused under a transfer learning scheme to continue training the model weights for supervised tasks such as code classification and method name prediction.
no code implementations • 10 Nov 2020 • Stefanus Agus Haryono, Ferdian Thung, David Lo, Julia Lawall, Lingxiao Jiang
In this paper, we built a tool to automate these updates.
Software Engineering
no code implementations • 6 Sep 2020 • Nghi D. Q. Bui, Yijun Yu, Lingxiao Jiang
Corder is designed to alleviate the need of labeled data for code retrieval and code summarization tasks.
no code implementations • 5 Sep 2020 • Nghi D. Q. Bui, Yijun Yu, Lingxiao Jiang
Although syntax trees are precisely defined according to the language grammar and easier to construct and process than graphs, previous tree-based learning techniques have not been able to learn semantic information from trees to achieve better accuracy than graph-based techniques.
1 code implementation • 31 Jul 2020 • Md Rafiqul Islam Rabin, Nghi D. Q. Bui, Ke Wang, Yijun Yu, Lingxiao Jiang, Mohammad Amin Alipour
With the prevalence of publicly available source code repositories to train deep neural network models, neural program models can do well in source code analysis tasks such as predicting method names in given programs that cannot be easily done by traditional program analysis techniques.
1 code implementation • 20 Jan 2020 • Zhipeng Gao, Lingxiao Jiang, Xin Xia, David Lo, John Grundy
However, many bugs and vulnerabilities have been identified in many contracts which raises serious concerns about smart contract security, not to mention that the blockchain systems on which the smart contracts are built can be buggy.
Software Engineering
no code implementations • 27 Oct 2019 • Vinoj Jayasundara, Nghi Duy Quoc Bui, Lingxiao Jiang, David Lo
Program comprehension is a fundamental task in software development and maintenance processes.
1 code implementation • 22 Aug 2019 • Zhipeng Gao, Vinoj Jayasundara, Lingxiao Jiang, Xin Xia, David Lo, John Grundy
In addition to the uses by individual developers, SmartEmbed can also be applied to studies of smart contracts in a large scale.
Software Engineering
no code implementations • 10 Jun 2019 • Nghi D. Q. Bui, Yijun Yu, Lingxiao Jiang
However, all these approaches still require large amount of manual effort in preparing parallel program corpora, ranging from pairs of APIs, to manually identified code in different languages that are considered as functionally equivalent.
1 code implementation • 13 Mar 2018 • Nghi D. Q. Bui, Lingxiao Jiang
Our preliminary evaluations on about 40, 000 Java and C# source files from 9 software projects show that our approach can automatically learn shared embeddings for various code elements in different languages and identify their cross-language mappings with reasonable Mean Average Precision scores.
1 code implementation • 17 Oct 2017 • Nghi D. Q. Bui, Lingxiao Jiang, Yijun Yu
It is layered on top of two tree-based convolutional neural networks (TBCNNs), each of which recognizes the algorithm of code written in an individual programming language.