A Multi-Label Code Comment Classifier using BERT

Zarah Shibli; Emad Albassam

doi:10.4197/Comp.13-2.5

pdf

Published: Dec 30, 2024

DOI: https://doi.org/10.4197/Comp.13-2.5

Keywords:

Code Comment Classification Natural Language Processing Deep Learning Software Engineering

Zarah Shibli

Department of Computer Science, King Abdulaziz University

Emad Albassam

Department of Computer Science, King Abdulaziz University

Abstract

Code comments play an essential role in software development by providing documentation, explanations, and clarifications for program logic and functionality. It is crucial to effectively classify code comments to improve software maintainability and collaboration in the face of a growing amount of code. Developers can easily identify and comprehend different code sections' purpose, behavior, and requirements by accurately classifying code comments. This paper presents a novel approach utilizing multi-label classification to enhance code comment classification in three programming languages: Python, Pharo, and Java. We employ BERT, a widely used language model, and achieve an F1 score of 0.64 through experimentation. Our proposed approach facilitates the understanding and managing code comments, making software development more efficient and productive. Additionally, our approach can be extended to other programming languages and serve as a foundation for further research in code comment classification.

How to Cite

Shibli, Z., & Albassam, E. (2024). A Multi-Label Code Comment Classifier using BERT. Journal of King Abdulaziz University: Computing and Information Technology Sciences, 13(2). https://doi.org/10.4197/Comp.13-2.5

Issue

Vol. 13 No. 2 (2024)

Section

Articles

Article Sidebar

Main Article Content

Abstract

Article Details