Usage example of SequenceR on any other benchmark #50

jose · 2023-12-04T02:12:28Z

Any chance you could provide a quick step-by-step on how to run SequenceR on a benchmark other than Defects4J? For instance, Bears or Bugs.jar.

Thanks in advance.

monperrus · 2023-12-05T02:22:28Z

Hi @jose

Thanks for reaching out.

The doc says

$ ./sequencer-predict.sh --model=[model path] --buggy_file=[abs path] --buggy_line=[int] --beam_size=[int] --output=[abs path]

Where the buggy file can be any arbitrary Java file.

cc/ @chenzimin
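
For anyone scripting this, the invocation quoted above can be assembled programmatically. The following is a minimal Python sketch, not part of SequenceR: `build_predict_command` is a hypothetical helper, all paths are placeholders, and only the flags quoted from the doc are used. The line number and beam size are example values.

```python
import os
import shlex


def build_predict_command(model, buggy_file, buggy_line, beam_size, output,
                          script="./sequencer-predict.sh"):
    """Return the argument list for one sequencer-predict.sh invocation.

    Hypothetical helper: it only mirrors the flags quoted from the doc.
    """
    return [
        script,
        f"--model={model}",
        # The doc asks for absolute paths for the buggy file and the output.
        f"--buggy_file={os.path.abspath(buggy_file)}",
        f"--buggy_line={int(buggy_line)}",
        f"--beam_size={int(beam_size)}",
        f"--output={os.path.abspath(output)}",
    ]


if __name__ == "__main__":
    # Placeholder paths; replace them with a real model and Java file.
    cmd = build_predict_command(
        model="/path/to/model.pt",
        buggy_file="/path/to/benchmark/src/Foo.java",
        buggy_line=42,
        beam_size=50,
        output="/path/to/output",
    )
    print(shlex.join(cmd))
    # To actually run it: subprocess.run(cmd, check=True)
```

Building the argument list separately from running it makes it easy to log or dry-run the command before launching it on a whole benchmark.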

jose · 2023-12-05T10:21:09Z

Thanks @monperrus, somehow I missed that.

(Note for future self, --beam_size has been initialised with the value 50 for Defects4J).

Two questions:

How would one initialise the --buggy_line parameter for faults of omission, i.e., where the fix did not modify any existing code but instead added new code to fix the buggy behaviour (for example, Chart-14)?
Do you, by any chance, know where I can find the set of buggy lines of each defect in Bears, Bugs.jar, IntroClassJava, and QuixBugs?

monperrus · 2023-12-05T14:18:45Z

How would one initialise the --buggy_line parameter on faults of omission

In the SequenceR paper, we did not consider those; we only considered replacement bugs.
In general, fault localization of omission bugs is rather under-researched.

Do you, by any chance, know where I can find the set of buggy lines of each defect in Bears, Bugs.jar, IntroClassJava, and QuixBugs?

Do you mean the ground truth line of the ground truth developer patch?

jose · 2023-12-05T16:40:11Z

In the SequenceR paper, we did not consider those; we only considered replacement bugs.

Got it.

In general, fault localization of omission bugs is rather under-researched.

It is indeed. FYI, @speezepearson, @rjust, @mernst, @ruimaranhao, @gofraser and I have identified, for every fault of omission in D4J (v1.4.0?), the set of lines present in the buggy version that could perhaps be blamed for the buggy behaviour. You can find them here. We used that data in the paper "Evaluating and improving fault localization".

Do you mean the ground truth line of the ground truth developer patch?

Yes.

andre15silva · 2023-12-06T15:15:56Z

Hi @jose!

I usually load these from the ground truth patch.
The patch contains all the information required (file path, added/removed lines).

Some benchmarks (e.g. Defects4J) provide the patch file (albeit inverted), but for others (e.g. Bears, QuixBugs) you have to compute it yourself.
I have the code to do this:

Hopefully this helps :)

jose · 2023-12-07T00:44:08Z

Thanks @andre15silva.

I failed to see, from your pointers, how I could get the set of buggy lines of a given bug in any of Bears, Bugs.jar, IntroClassJava, or QuixBugs. As far as I understood, that code only checks out all bugs of each benchmark, right?

I do have a script to collect the set of buggy lines and you can find it here. It is tied to Defects4J but it could easily be adapted for any other benchmark. My original question was whether the set of buggy lines of each defect in Bears, Bugs.jar, IntroClassJava, and QuixBugs is available somewhere so that I (or anyone else) don't have to compute it.

@monperrus, back to the command to execute SequenceR:

$ ./sequencer-predict.sh --model=[model path] \
  --buggy_file=[abs path] \
  --buggy_line=[int] \
  --beam_size=[int] \
  --output=[abs path]

How would I run the script on a defect with multiple faulty lines? Run it as many times as the number of faulty lines?

andre15silva · 2023-12-07T08:07:42Z

Sorry @jose, I wasn't explicit enough.

I don't have the set of buggy lines you want.
What I meant is that I usually get these at runtime from the developer's patch, since the patch includes all the required information (file paths, added/removed lines) to obtain them.

This is straight-forward in Defects4J because the diff files are stored in the repo, but other benchmarks don't have this.
That is what my code does: it computes the diff by 1) checking out the buggy and fixed versions; 2) doing some processing if needed; 3) calling utils.get_diff(), which just computes a diff between both versions.

Since different benchmarks have different ways of storing bugs, and to get a clean diff you sometimes need to make some subtle changes (like changing the package names in QuixBugs), I thought that code could be useful for you.
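
As an illustration of step 3, the sketch below derives buggy line numbers directly from the buggy and fixed sources. It is not the utils.get_diff() mentioned above, whose implementation is not shown in this thread; it uses Python's standard `difflib` instead, and `buggy_line_numbers` is a hypothetical name. It assumes both versions have already been checked out and read into strings.

```python
import difflib


def buggy_line_numbers(buggy_src, fixed_src):
    """Return 1-based line numbers of the buggy file that the fix replaces or deletes.

    Hypothetical helper built on difflib; not the thread's utils.get_diff().
    """
    buggy = buggy_src.splitlines()
    fixed = fixed_src.splitlines()
    matcher = difflib.SequenceMatcher(a=buggy, b=fixed, autojunk=False)
    lines = []
    for tag, i1, i2, _j1, _j2 in matcher.get_opcodes():
        if tag in ("replace", "delete"):
            # i1/i2 are 0-based, end-exclusive indices into the buggy file.
            lines.extend(range(i1 + 1, i2 + 1))
        # 'insert' opcodes have no line on the buggy side: these are the
        # pure additions (faults of omission) discussed earlier.
    return lines


if __name__ == "__main__":
    buggy = "int a = 1;\nif (x > 0) {\n  return a;\n}\n"
    fixed = "int a = 1;\nif (x >= 0) {\n  return a;\n}\n"
    print(buggy_line_numbers(buggy, fixed))
```

A fix that only adds lines yields an empty list here, which is the omission case that the replacement-only setting does not cover.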

jose · 2023-12-07T10:34:16Z

This is straight-forward in Defects4J because the diff files are stored in the repo

Kinda. :-)
Defects4J provides minimal bug-introducing patches, not fixing patches.
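
Because a bug-introducing patch goes from the fixed version to the buggy one, the buggy lines are its added (`+`) lines rather than its removed (`-`) lines. A minimal Python sketch of reading them from a unified diff follows. `buggy_lines_from_patch` is a hypothetical helper, it assumes standard unified-diff hunk headers, and it is not official tooling of any benchmark.

```python
import re

# Matches hunk headers such as "@@ -10,3 +10,4 @@"; a missing count means 1.
HUNK_HEADER = re.compile(r"^@@ -(\d+)(?:,(\d+))? \+(\d+)(?:,(\d+))? @@")


def buggy_lines_from_patch(patch_text, bug_introducing=False):
    """Map each file path in a unified diff to its 1-based buggy line numbers.

    Fixing patch (buggy -> fixed): buggy lines are the '-' lines (old side).
    Bug-introducing patch (fixed -> buggy): buggy lines are the '+' lines (new side).
    """
    marker = "+" if bug_introducing else "-"
    file_header = "+++ " if bug_introducing else "--- "
    result = {}
    path = None
    old_no = new_no = 0      # current line number on each side
    old_left = new_left = 0  # lines still expected in the current hunk
    for line in patch_text.splitlines():
        if old_left > 0 or new_left > 0:  # inside a hunk body
            if line.startswith("\\"):     # "\ No newline at end of file"
                continue
            kind = line[:1]
            if kind == marker:
                number = new_no if bug_introducing else old_no
                result.setdefault(path, []).append(number)
            if kind != "+":  # removed and context lines exist on the old side
                old_no += 1
                old_left -= 1
            if kind != "-":  # added and context lines exist on the new side
                new_no += 1
                new_left -= 1
            continue
        match = HUNK_HEADER.match(line)
        if match:
            old_no, old_left = int(match.group(1)), int(match.group(2) or 1)
            new_no, new_left = int(match.group(3)), int(match.group(4) or 1)
        elif line.startswith(file_header):
            # Path is kept as written in the patch (any a/ or b/ prefix included).
            path = line[4:].split("\t")[0].strip()
    return result
```

Counting down the hunk lengths, rather than testing every line for a `---` prefix, avoids mistaking a removed line whose content starts with `--` for a file header.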

but other benchmarks don't have this.

I found that Bugs.jar also provides, out-of-the-box, the developer patch, e.g., in here. Additionally:

* For IntroClassJava there are no fixed versions, right?
* Bears and QuixBugs do not provide that metadata but one could compute that, as you do.

call the utils.get_diff() which just computes a diff between both versions.

I completely missed that, sorry. (Note to future self, the command to compute the diff is in here.)

Thanks @andre15silva, I really appreciate your help.

andre15silva · 2023-12-07T11:12:24Z

Kinda. :-) Defects4J provides minimal bug-introducing patches, not fixing patches.

Yes, the reverse diff!

I found that Bugs.jar also provides, out-of-the-box, the developer patch, e.g., in here.

Oh, didn't see that before.

* For [IntroClassJava](https://github.com/Spirals-Team/IntroClassJava) there are no fixed versions, right?

Not AFAIR. I think I never used it because of that actually.

I completely missed that, sorry. (Note to future self, the command to compute the diff is in here.)

Thanks @andre15silva, I really appreciate your help.

No worries! Be aware that the command to compute the diff is tailored to that project (e.g. context window), you might want to tweak it yourself.

monperrus · 2023-12-08T21:51:47Z

IntroClassJava provides the reference implementation. So you can diff the buggy version against the reference implementation; that gives you a patch, but an "artificial" one. It might be useful for some analyses, though.
