Scott Emmons scottemmons

PhD student at UC Berkeley's Center for Human-Compatible Artificial Intelligence

23 followers · 0 following

University of California, Berkeley
Berkeley, California
https://scottemmons.com/
@emmons_scott

Pinned

alexandrasouly/strongreject Public

Repository for "StrongREJECT for Empty Jailbreaks" paper

Jupyter Notebook · 93 stars · 4 forks

edmundmills/ALMANACS Public

A Simulatability Benchmark for Language Model Explainability

Python · 3 stars · 1 fork

euanong/image-hijacks Public

Official codebase for "Image Hijacks: Adversarial Images can Control Generative Models at Runtime"

Python · 28 stars · 6 forks

rvs Public

Reinforcement Learning via Supervised Learning

Python · 67 stars · 6 forks

HumanCompatibleAI/imitation Public

Clean PyTorch implementations of imitation and reward learning algorithms

Python · 1.3k stars · 239 forks