Experiment Design and Metrics for Mutation Testing with LLMs | HackerNoon
In evaluating LLM-generated mutations, we designed metrics that encompass cost, usability, and behavior, recognizing that higher mutation scores don't guarantee higher quality.