Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Spelling corrections for multiple locations #564

Merged
merged 1 commit into from
Mar 27, 2024

Conversation

jmartin-tech
Copy link
Collaborator

  • Correct encoding in xss probe
  • Consistent spelling in glitch probe
  • Adjust README.md for consistency
  • Adjust misp_descriptions for consistency

* Correct encoding in xss probe
* Consistent spelling in glitch probe
* Adjust README.md for consistency
* Adjust misp_descriptions for consistency
Copy link
Collaborator

@erickgalinkin erickgalinkin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good catches overall -- I think there are a few things here that are a matter of UK vs. US spelling conventions e.g. behaviour that are fine to spell either way. Will defer to @leondz to accept the American way of life.

@@ -98,7 +98,7 @@ python3 -m garak --model_type huggingface --model_name gpt2 --probes dan.Dan_11_

## Reading the results

For each probe loaded, garak will print a progress bar as it generates. Once generation is complete, a row evaluating that probe's results on each detector is given. If any of the prompt attempts yielded an undesirable behaviour, the response will be marked as FAIL, and the failure rate given.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

British spelling is to be expected from @leondz :P


## Logging

`garak` generates multiple kinds of log:
* A log file, `garak.log`. This includes debugging information from `garak` and its plugins, and is continued across runs.
* A report of the current run, structured as JSONL. A new report file is created every time `garak` runs. The name of this file is output at the beginning and, if successful, also the end of the run. In the report, an entry is made for each probing attempt both as the generations are received, and again when they are evaluated; the entry's `status` attribute takes a constant from `garak.attempts` to describe what stage it was made at.
* A hitlog, detailing attempts that yielded a vulnerability (a 'hit')
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We safe the files as {prefix}.hitlog.jsonl, so I'm inclined to keep this spelling as a term of art, but you are technically correct -- the best kind of correct.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Consider then expanding this similar to the log file entry above as:

* A hit log, `{run_id}.hitlog.jsonl`, detailing attempts that yielded a vulnerability (a 'hit')

@@ -1,7 +1,7 @@
#!/usr/bin/env python3
"""Glitch keyword probes

Testing for tokens that lead to unusual model behaviour
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

UK/US spelling again.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just for consistency as both spellings are in this file.

quality:Behavioral:ContentSafety:HarmfulViolent Harmful or violent Harmful or violent
quality:Behavioral:ContentSafety:Sexualized Sexualised Sexualised
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Again, UK/US spelling

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Consistency as alternate spelling is used in in the same entry.

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I only know English English and haven't put learning variants on my roadmap - go with whatever works for you :)

@jmartin-tech
Copy link
Collaborator Author

Note I made allowances for US vs UK spelling and and only made changes where consistency would be helpful.

@leondz
Copy link
Owner

leondz commented Mar 25, 2024

@erickgalinkin out of fear of being hanged for treason I'll defer to you to review/merge this

@erickgalinkin erickgalinkin merged commit c291496 into leondz:main Mar 27, 2024
1 check passed
@github-actions github-actions bot locked and limited conversation to collaborators Mar 27, 2024
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants