
On Multi-Modal Learning of Editing Source Code

Published 15 Aug 2021 in cs.SE, cs.LG, and cs.PL | arXiv:2108.06645v1

Abstract: In recent years, Neural Machine Translator (NMT) has shown promise in automatically editing source code. Typical NMT based code editor only considers the code that needs to be changed as input and suggests developers with a ranked list of patched code to choose from - where the correct one may not always be at the top of the list. While NMT based code editing systems generate a broad spectrum of plausible patches, the correct one depends on the developers' requirement and often on the context where the patch is applied. Thus, if developers provide some hints, using natural language, or providing patch context, NMT models can benefit from them. As a proof of concept, in this research, we leverage three modalities of information: edit location, edit code context, commit messages (as a proxy of developers' hint in natural language) to automatically generate edits with NMT models. To that end, we build MODIT, a multi-modal NMT based code editing engine. With in-depth investigation and analysis, we show that developers' hint as an input modality can narrow the search space for patches and outperform state-of-the-art models to generate correctly patched code in top-1 position.

Citations (51)

Summary

  • The paper introduces MODIT, a multi-modal NMT model that improves automated source code editing by integrating several input modalities, including the code context and commit messages.
  • MODIT achieves up to 167% higher top-1 accuracy than baseline models trained from scratch by exploiting contextual code representations and pre-trained parameters.
  • The study shows that commit messages can convey developer intent and steer neural models toward more accurate code edits, and that single-encoder models are effective for cross-modal learning.

Analyzing Multi-Modal Learning for Automated Source Code Editing

The paper presents a study of multi-modal Neural Machine Translation (NMT) models for automatic source code editing. The researchers develop MODIT, a multi-modal NMT-based tool that enhances automated code editing by incorporating multiple data modalities: the edit location, the surrounding code context, and commit messages that act as proxies for developers' intentions. This approach aims to improve the accuracy of generated patches, highlighting the challenges and benefits of integrating diverse forms of input to help neural models generate code edits.

Key Findings

  1. Multi-Modal Input Utilization: MODIT combines the code that requires editing, the surrounding context where the edit is applied, and natural-language guidance derived from commit messages. This combination helps the NMT model shrink the search space of possible code changes, improving both patch relevance and correctness.
  2. Comparative Performance: When benchmarked against existing models such as CodeBERT and GraphCodeBERT, MODIT significantly improves top-1 accuracy, showing up to a 167% increase over models trained from scratch. The improvement is attributed to better use of bi-directionally learned code representations and pre-trained model parameters in capturing programming-language nuances and generating syntactically and semantically correct code.
  3. Effective Contextual Integration: Explicitly providing the code fragment that needs changing as a separate input, isolated from its context yet informed by it, proved effective. MODIT leverages the contextual code to select relevant tokens and code elements, enabling precise patch generation.
  4. Single vs. Multi-Encoder Models: Notably, single-encoder setups outperformed multi-encoder configurations: self-attention over the concatenated input can span modality boundaries, so a single encoder learns cross-modal interactions directly and builds a more holistic representation of the code-related modalities, contributing to more robust patch generation.
  5. Empirical Lessons: The study highlights that the inclusion of natural language in the form of commit messages can steer neural models towards more accurate predictions by conveying developer intentions, which is crucial for tasks like bug fixing or code refactoring in practical scenarios.
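The multi-modal input described in finding 1 can be pictured as a single flattened sequence fed to one encoder. The sketch below is illustrative only: the function name, the `</s>` separator token, and the ordering of modalities are assumptions for exposition, not the paper's exact tokenization (which follows its pre-trained model's conventions).

```python
def build_multimodal_input(code_to_edit, context, commit_message, sep="</s>"):
    """Flatten the three modalities into one encoder input string.

    The fragment that needs changing comes first, then its surrounding
    code context, then the commit message as a natural-language hint.
    The separator token and ordering are illustrative assumptions.
    """
    return f" {sep} ".join([code_to_edit, context, commit_message])

example = build_multimodal_input(
    "return a - b;",                      # buggy fragment to edit
    "int add(int a, int b) { ... }",      # enclosing method (context)
    "fix addition operator",              # commit message (NL hint)
)
print(example)
```

Keeping the editable fragment as its own delimited segment is what lets the model treat it as the primary target while still attending to the context and the hint.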
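The top-1 accuracy metric behind finding 2 simply asks whether the correct patch is ranked first among the model's suggestions. A minimal sketch (function name and data are hypothetical; the paper's evaluation harness is not shown here):

```python
def top_k_accuracy(ranked_patch_lists, gold_patches, k=1):
    """Fraction of examples whose correct (gold) patch appears among the
    top-k ranked suggestions. k=1 corresponds to top-1 accuracy."""
    hits = sum(
        gold in ranked[:k]
        for ranked, gold in zip(ranked_patch_lists, gold_patches)
    )
    return hits / len(gold_patches)

# hypothetical ranked outputs for three examples
ranked = [["p1", "p2"], ["p3", "p4"], ["p5", "p6"]]
gold = ["p1", "p4", "p6"]
print(top_k_accuracy(ranked, gold, k=1))  # only example 1 hits at rank 1
print(top_k_accuracy(ranked, gold, k=2))  # all three hit within top-2
```

The gap between top-1 and top-k scores is exactly the burden the paper aims to lift from developers, who otherwise must scan a ranked list for the correct patch.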

Future Implications and Challenges

The integration of multi-modal learning in MODIT points toward more context-aware code editing systems, in which neural models digest multiple data types concurrently for nuanced decision-making in automated programming tasks. The approach is promising for accommodating the varying requirements of different coding environments and practices, such as handling multiple programming languages, understanding legacy systems, and integrating continuous feedback from development platforms.

Moving forward, key areas for exploration include improving the quality and relevance of commit messages, strengthening cross-modal interaction, and making such models adaptable across diverse code bases. Applying similar methodologies to broader software-maintenance problems could also expand the utility and adoption of machine-learning solutions in complex software ecosystems.

Building on these findings, future research might focus on refining these models' efficiency and extending their applicability to a wider variety of programming environments, adapting to more complex syntactic structures and integrating deeper semantic understanding. Overall, MODIT's development demonstrates the synergy gained by combining natural language processing with programming-language comprehension for refined automated coding interventions.
