Policy Manager, Harmful Persuasion

Anthropic

San Francisco, CA | New York City, NY

in-office mid

Description

Anthropic is seeking a Policy Manager to develop and maintain policies to prevent misuse of AI systems, focusing on harmful persuasion risks. The role involves shaping policy frameworks, designing enforcement guidelines, and collaborating with various teams to ensure product safety. The position requires experience in policy development and a strong understanding of relevant regulatory landscapes.

Skills Required

policy development, trust & safety policy, platform policy, election integrity, fraud/scams, coordinated inauthentic behavior, influence operations, misinformation, policy writing, ML classifiers, enforcement decision-making, Engineering, Data Science, Legal, Policy teams, written and verbal communication skills, persuasion theory, influence tactics, cognitive biases, psychological manipulation techniques, AI governance, digital platform regulation, adversarial testing, red teaming, vulnerability assessments, generative AI capabilities, LLMs

Benefits

Competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.

Job Details
Compensation:
$245,000—$330,000 USD
Work Schedule:
in-office
Seniority:
mid
Degree Required:
Under Graduate
Posted:
2026-01-21 10:54:26
← Previous Job Back to Listings Next Job →