Program Comprehension

Understanding Digits in Identifier Names: An Exploratory Study

Before any software maintenance can occur, developers must read the identifier names found in the code to be maintained. Thus, …

Anthony S. Peruma, Christian D. Newman

IDEAL: An Open-Source Identifier Name Appraisal Tool

Developers must comprehend the code they will maintain, meaning that the code must be legible and reasonably self-descriptive. …

Anthony S. Peruma, Venera Arnaoudova, Christian D. Newman

An Ensemble Approach for Annotating Source Code Identifiers with Part-of-speech Tags

This paper presents an ensemble part-of-speech tagging approach for source code identifiers. Ensemble tagging is a technique that uses …

Christian D. Newman, Michael J. Decker, Reem S. AlSuhaibani, Anthony S. Peruma, Mohamed Wiem Mkaouer, Satyajit Mohapatra, Tejal Vishnoi, Marcos Zampieri, Timothy J. Sheldon, Emily Hill

Using Grammar Patterns to Interpret Test Method Name Evolution

It is good practice to name test methods such that they are comprehensible to developers; they must be written in such a way that their …

Anthony S. Peruma, Emily Hu, Jiajun Chen, Eman Abdullah AlOmar, Mohamed Wiem Mkaouer, Christian D. Newman

On the generation, structure, and semantics of grammar patterns in source code identifiers

Identifiers make up a majority of the text in code. They are one of the most basic mediums through which developers describe the code …

Christian D. Newman, Reem S. AlSuhaibani, Michael J. Decker, Anthony S. Peruma, Dishant Kaushik, Mohamed Wiem Mkaouer, Emily Hill

Contextualizing rename decisions using refactorings, commit messages, and data types

Identifier names are the atoms of program comprehension. Weak identifier names decrease developer productivity and degrade the …

Anthony S. Peruma, Mohamed Wiem Mkaouer, Michael J. Decker, Christian D. Newman

An Empirical Study of Abbreviations and Expansions in Software Artifacts

Expanding abbreviations is an important text normalization technique used for the purpose of either increasing developer comprehension …

Christian D. Newman, Michael J. Decker, Reem S. AlSuhaibani, Anthony S. Peruma, Dishant Kaushik, Emily Hill

An Open Dataset of Abbreviations and Expansions

We present a data set of abbreviations and expansions, derived from a set of five open source systems, for use by the research and …

Christian D. Newman, Michael J. Decker, Reem S. AlSuhaibani, Anthony S. Peruma, Dishant Kaushik, Emily Hill

Contextualizing Rename Decisions using Refactorings and Commit Messages

Identifier names are the atoms of comprehension; weak identifier names decrease productivity by increasing the chance that developers …

Anthony S. Peruma, Mohamed Wiem Mkaouer, Michael J. Decker, Christian D. Newman

Modeling the Relationship Between Identifier Name and Behavior

This paper presents the features of a model that relates the natural language found in identifiers with program semantics. The model …

Christian D. Newman, Anthony S. Peruma, Reem S. AlSuhaibani