How to retrieve salience of some specific words? #105

CarhoJohn · 2023-09-05T08:35:31Z

Hi. To obtain the salience map of previous tokens when generating new tokens, we can use the code/function provided in the example code:

output = lm.generate(prompt, generate=1, do_sample=True, attribution=['ig'])
res = output.primary_attributions(attr_method='ig')

However, in this standard method, I can only get the salience map for the (randomly/uncontrollable) generated word.

Is it possible to obtain the salience map for specific word? For example, in the sentence "I have a dog. He is very ...", I'd like to get the salience map for a specific word cute, rather than other words generated by the model.

Thanks very much!

The text was updated successfully, but these errors were encountered:

BiEchi · 2023-09-11T16:53:21Z

From my understanding this is not possible unless you do algorithmic optimization (some math). Salience maps is doing backprop from output to embedding. This process is just chain rule, and if you break it you do get specific words, but unless mathematically grounded, your approach fails.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to retrieve salience of some specific words? #105

How to retrieve salience of some specific words? #105

CarhoJohn commented Sep 5, 2023 •

edited

Loading

BiEchi commented Sep 11, 2023

How to retrieve salience of some specific words? #105

How to retrieve salience of some specific words? #105

Comments

CarhoJohn commented Sep 5, 2023 • edited Loading

BiEchi commented Sep 11, 2023

CarhoJohn commented Sep 5, 2023 •

edited

Loading