Reproduce RepliComment JSS 2021 experiments

Arianna Blasi, Nataliia Stulova, Alessandra Gorla, Oscar Nierstrasz, RepliComment: Identifying clones in code comments, Journal of Systems and Software Volume 182, 2021, https://doi.org/10.1016/j.jss.2021.111069.

Repo structure

RepliComment works in tandem with upDoc, the short paper published at SCAM 2020. upDoc is what we call "Clone Analyzer" in the paper: It makes the final assessments on the comment clones RepliComment fetches and filters.

👉 The full source of the clone analyzer is available here: https://github.com/s0nata/updoc/tree/replicomment-integration

This repo as you clone it will only contain two bash scripts and this README. After you launch the execute-full-JSS-pipeline script, you obtain:

The subjects the experiments involves (their sources and executable .jar)
RepliComment 2.0 repository (and RepliComment outputs after executing it)
upDoc 1.0 executable (and upDoc outputs after executing it) (you may also edit the script to clone&build from the sources 👆)

To download the subjects we use in our evaluation, the script runs get_evaluation_project_sources.bash first.

Notice that these scripts use curl to download everything. If this doesn't work for you, change it into wget, or whatever your OS pefers.

Output interpretation: RepliComment

RepliComment outputs are divided by subject and context of clone search. We do not output all the clones found for subject into a single file for the sake of better clarity (but if you prefer to change this behaviour, feel free to edit the source code!).

RepliComment 2.0 as explained in the JSS paper works by looking for clones found at different contexts, namely inside the same class, or among hierarchies of classes, or among all classes of a project (and all of these at method-level and then field-level). All these different searches were added for the JSS publication, so we consider the default RepliComment behaviour as the original ICPC one: An analysis within one same class of method-level comment clones.

Considering for example the subject Apache Lucene, we will have the following outputs:

2020_JavadocClones_lucene.csv the default analysis (method-level, single-class context)
2020_JavadocClones_cf_lucene.csv method-level, cross-file (multiple classes) context
2020_JavadocClones_h_lucene.csv method-level, hierarchy-context
2020_JavadocClones_fields_* field-level, *-contexts same as above

Output interpretation: upDoc

upDoc or, if you want, the Clone Analyzer (i.e. the last component of our pipeline), will produce .txt reports under JSS-outputs. Their content should be self-explainatory. Specifically, you will see the following files:

pc_high_severity.txt -- cloned comment parts that represent a HIGH severity issue (copy&paste)
pc_low_severity.txt -- cloned comment parts that represent a LOW severity issue (likely false positives)
pc_mild_severity.txt -- cloned comment parts that represent a MILD severity issue (poor info)
wc_high_severity.txt -- cloned whole comments that represent a HIGH severity issue (unrelated methods)
wc_mild_severity.txt -- cloned whole comments that represent a MILD severity issue (e.g., overridden methods)
special_issues.txt -- general issues affecting the comments (e.g., wrong parameter names)

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
README.md		README.md
execute-full-JSS-pipeline.sh		execute-full-JSS-pipeline.sh
get_evaluation_project_subjects.bash		get_evaluation_project_subjects.bash

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reproduce RepliComment JSS 2021 experiments

Repo structure

Output interpretation: RepliComment

Output interpretation: upDoc

About

Releases

Packages

Languages

s0nata/replicomment-JSS21-experiments

Folders and files

Latest commit

History

Repository files navigation

Reproduce RepliComment JSS 2021 experiments

Repo structure

Output interpretation: RepliComment

Output interpretation: upDoc

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages