Risk score logic explanation #138

Hey, can anyone explain the logic of the risk score calculation in the Toxicity input scanner? The formula in util does not do justice to the model-generated scores.
If possible, please provide a detailed explanation of the reasoning behind adding risk_score as a metric/indicator.
Thanks,
Kadam

Comments
Hey @kadamsolanki, thanks for reaching out. We use the configured threshold and only calculate a risk score when the confidence score is above it. The risk score is then basically how far the confidence score is above the threshold. Hope it makes sense.
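For concreteness, here is a minimal Python sketch of that idea. The function name and the normalization by `1 - threshold` are assumptions for illustration, not the exact code in util:

```python
def calculate_risk_score(confidence: float, threshold: float) -> float:
    """Illustrative sketch of the logic described above, not the exact
    formula in llm-guard's util: scores at or below the configured
    threshold carry no risk; above it, the risk score reflects how far
    the confidence exceeds the threshold, normalized into [0, 1].
    """
    if threshold >= 1.0:
        return 0.0  # nothing can exceed a threshold of 1.0
    if confidence <= threshold:
        return 0.0
    return min((confidence - threshold) / (1.0 - threshold), 1.0)


print(calculate_risk_score(0.9, 0.5))  # 0.8: 0.4 above a 0.5 threshold
```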
Hey @asofter, it does make sense and I was aware of this, but my point is about using the risk score for evaluation. There it does not make sense for any of the scanners that produce sentence-level scores, because the scanner takes the max score across all sentences for any one of the labels. Using that same max score for the risk score calculation does not help me, since I cannot tell which sentence, or which label, is failing. So I wanted to understand whether there is some way to aggregate the scores, or some overall confidence score, so that I can be clear about the model output.
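To make the concern concrete, here is a small sketch with made-up labels and numbers showing how a single max score hides which sentence and which label triggered it:

```python
# Hypothetical per-sentence, per-label scores from a sentence-level
# scanner; the labels and values are made up for illustration.
sentence_scores = [
    {"sentence": "First sentence.",  "toxicity": 0.05, "insult": 0.02},
    {"sentence": "Second sentence.", "toxicity": 0.91, "insult": 0.10},
    {"sentence": "Third sentence.",  "toxicity": 0.12, "insult": 0.04},
]

# Taking the max over all sentences and labels collapses everything
# into one number: 0.91 is reported, but not that it came from
# "toxicity" on the second sentence.
max_score = max(
    value
    for entry in sentence_scores
    for label, value in entry.items()
    if label != "sentence"
)
print(max_score)  # 0.91
```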
I see, your use case is sentence-level matching instead of the overall text. Do you mean something that provides the average score across all sentences instead of the highest?
Yes.
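For reference, one way such an average-based aggregation could look, reusing the made-up data above; this is a sketch of the proposal, not an existing llm-guard option:

```python
from statistics import mean

def average_per_label(sentence_scores: list[dict]) -> dict[str, float]:
    """Average each label's score across all sentences instead of
    taking the single highest value (illustrative sketch only)."""
    labels = [key for key in sentence_scores[0] if key != "sentence"]
    return {
        label: mean(entry[label] for entry in sentence_scores)
        for label in labels
    }

# With the made-up data above: toxicity averages 0.36, insult about 0.05.
```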
Marking as duplicate of #111