This corpus contains several datasets of behaviorally equivalent C/C++ programs, intended for evaluating their semantic similarity. The datasets are:
- 6 Type-4 scenarios extracted from BigCloneBench
- 10 programs implementing sorting, aggregation, and search algorithms
- 566 programs extracted from CodeForces, solving 5 different problems
In the following, we present the results of the experiments conducted using the SimCorp corpus.
In this experiment, we evaluate the semantic similarity between programs using an approach based on each program’s Control Flow Graph (CFG). The approach compares node neighbourhoods and normalizes the weight of each node with respect to its neighbours. This work was presented at the Software Quality Analysis, Monitoring, Improvement, and Applications (SQAMIA’24)
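The general idea of normalizing node weights against their neighbourhood can be illustrated with a minimal Python sketch. It is not the method from the SQAMIA’24 work, whose details are not given here. The function names, the choice of node weight (for example, a statement count per basic block), the normalization formula, and the final comparison of sorted weight profiles are all assumptions made for illustration.

```python
def normalized_weights(cfg, weights):
    """Return each node's weight divided by the total weight of its neighbourhood.

    cfg     -- dict mapping a node to the list of its successor nodes
    weights -- dict mapping every node to a non-negative weight
               (hypothetical choice: number of statements in the block)
    The neighbourhood of a node is assumed to be the node itself plus its
    direct predecessors and successors.
    """
    neighbours = {node: set() for node in weights}
    for src, dsts in cfg.items():
        for dst in dsts:
            if dst != src:  # ignore self-loops when collecting neighbours
                neighbours[src].add(dst)
                neighbours[dst].add(src)

    result = {}
    for node, w in weights.items():
        total = w + sum(weights[m] for m in neighbours[node])
        result[node] = w / total if total else 0.0
    return result


def cfg_similarity(cfg_a, weights_a, cfg_b, weights_b):
    """Compare two CFGs through their sorted normalized-weight profiles.

    Returns a score in [0, 1]; 1.0 means the profiles are identical.
    This profile comparison is an illustrative assumption, not a
    published metric.
    """
    a = sorted(normalized_weights(cfg_a, weights_a).values())
    b = sorted(normalized_weights(cfg_b, weights_b).values())
    n = max(len(a), len(b))
    if n == 0:
        return 1.0
    # Pad the shorter profile with zeros so both have the same length.
    a = [0.0] * (n - len(a)) + a
    b = [0.0] * (n - len(b)) + b
    return 1.0 - sum(abs(x - y) for x, y in zip(a, b)) / n


# Two loops with the same shape but different node names, and one
# straight-line program with no loop.
for_loop = {"entry": ["cond"], "cond": ["body", "exit"], "body": ["cond"], "exit": []}
for_w = {"entry": 1, "cond": 1, "body": 2, "exit": 1}
while_loop = {"start": ["test"], "test": ["work", "end"], "work": ["test"], "end": []}
while_w = {"start": 1, "test": 1, "work": 2, "end": 1}
straight = {"entry": ["exit"], "exit": []}
straight_w = {"entry": 3, "exit": 1}

print(cfg_similarity(for_loop, for_w, while_loop, while_w))
print(cfg_similarity(for_loop, for_w, straight, straight_w))
```

Because the comparison uses only sorted normalized weights, node names do not affect the score, so the two loop-shaped graphs receive the maximum score while the straight-line graph scores lower. A real evaluation would extract the CFGs from the C/C++ sources with a compiler front end rather than writing them by hand.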