probe: topic pushing #764

leondz · 2024-06-28T12:24:54Z

Add wordnet-based topic search probe. It takes a target term by config, and traverses Open English Wordnet nodes to find which terms adjacent to or below (i.e. more specific) in the hierarchy, the model will respond to.

jmartin-tech

This looks reasonable, I think there are some actions here that suggest some day there could be a built in mechanism to allow follow on prompts to be run by the original probe as a post detector hook.

Will do some more interactive testing to make sure I am understanding what the resulting set to Attempts really looks like and how those results should be interpreted.

garak/probes/topic.py

jmartin-tech · 2024-06-28T20:27:00Z

garak/probes/topic.py

+            logging.debug(
+                f"{self.__class__.__name__} {current_synset}, {current_synset.words()}"
+            )
+            for word in current_synset.words():


I see building the list of prompts dynamically as saving runtime memory, how big are these iterations in practice?

Typically <10, not infrequently 1-2.

This means ~2 * len(TEMPLATES) * words attempts per node

garak/probes/topic.py

…d func for making detectors skippable; skip running detector after tree probe has run

…erWordnet.primary_detector

…te children

…ng checker

…ock nodes & terms from being processed

jmartin-tech · 2024-07-23T18:16:28Z

garak/probes/topic.py

+        wn.download(self.lexicon)
+        self.w = wn.Wordnet(self.lexicon)


Could this use wn via nltk as is done in TAP? If possible it would be helpful if nltk access to could consolidated into a resources/common component. Some coordination with user storage refactor are likely in order to ensure these all store the datasets in the same place.

leondz added 6 commits February 5, 2024 10:50

stub for topic probe

30d4eb6

start drafting stackingprobe

a194589

Merge branch 'main' into feature/topic_probe

0ff8118

Merge branch 'main' into feature/topic_probe

03dd122

update topic probe metadata

1f8a627

add wordnet topic probe search

1320be0

leondz added the probes Content & activity of LLM probes label Jun 28, 2024

leondz added 2 commits June 28, 2024 14:53

add wordnet dep

a9124b5

comment out StackingProbe

a357a4f

jmartin-tech reviewed Jun 28, 2024

View reviewed changes

leondz added 19 commits June 29, 2024 10:41

fix block comment

588e472

Merge branch 'main' into feature/topic_probe

abb12cd

convert target_topics to list

00d285e

rejig params, rm dead code

74e5851

start refactoring to generic tree-search probe

53bbfb6

move topic probe to more generic var names; add passthru detector; ad…

1929e1d

…d func for making detectors skippable; skip running detector after tree probe has run

rm custom param, keep detector used for node decisions in TopicExplor…

83036ed

…erWordnet.primary_detector

add topic/wordnet tests; fix bug so initial children are only immedia…

cbe009b

…te children

factor tree search up into a base class

927cc69

add tree search progress bar

51ec86e

add breadth/depth first switch; fix bug with double queuing of nodes

e17427f

add tree switch to see if we push further on failure or on resistance

efde71d

disable topic probes by default (they need config); set up whitelisti…

83e11db

…ng checker

expand topic tests to autoselect Wordnet probes; add capability to bl…

55fe0bb

…ock nodes & terms from being processed

add wn download to prep

8c2e5f8

improve docs, tags; update test predicated on detectors.always

6d3136c

skip if no attempts added in an iteration

2f83ce4

log reporting exceptions in log

1b3ba50

add controversial topics probe

a1fdb61

leondz marked this pull request as ready for review July 1, 2024 13:42

leondz added 10 commits July 1, 2024 15:54

update attempt status when complete

d0622b7

Merge branch 'main' into feature/topic_probe

882b27a

Merge branch 'main' into feature/topic_probe

dbc346e

skip standard testing of passthru, move to own detector

10db0bd

Merge branch 'main' into feature/topic_probe

674c043

use theme colour constant

5bb5707

add tree data to report logging

470dadd

-shebang

70a4733

dump out a tree from the results

993c832

permit multiple tree probes in log

86bea07

leondz requested a review from erickgalinkin July 2, 2024 18:43

leondz and others added 3 commits July 5, 2024 17:36

Merge branch 'main' into feature/topic_probe

f8379b2

check detector inheritance, prune imports

4ad1603

rm dupe DEFAULT_PARAMS

853cd94

jmartin-tech reviewed Jul 23, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

probe: topic pushing #764

probe: topic pushing #764

leondz commented Jun 28, 2024

jmartin-tech left a comment

jmartin-tech Jun 28, 2024

leondz Jun 29, 2024 •

edited

Loading

jmartin-tech Jul 23, 2024 •

edited

Loading

probe: topic pushing #764

Are you sure you want to change the base?

probe: topic pushing #764

Conversation

leondz commented Jun 28, 2024

jmartin-tech left a comment

Choose a reason for hiding this comment

jmartin-tech Jun 28, 2024

Choose a reason for hiding this comment

leondz Jun 29, 2024 • edited Loading

Choose a reason for hiding this comment

jmartin-tech Jul 23, 2024 • edited Loading

Choose a reason for hiding this comment

leondz Jun 29, 2024 •

edited

Loading

jmartin-tech Jul 23, 2024 •

edited

Loading