Red Team Specialist (Austin, TX/Remote EST)

Tuple

Our Client - Internet company

Location: Remote
Pay rate: $42.00 - $46.67/hour (exact compensation may vary based on skills, experience, and location)
Hours: 40 hrs/wk
Employment type: Contract (W-2)
Remote work: Yes (100%)
Travel: Not required
Start date: July 6, 2026
End date: November 6, 2029
Superpower: Technology
Capabilities: IT Security and Governance; Data Science and Machine Learning
Preferred skills: Continuous Improvement Process, Analytical Thinking, Privilege Escalation, Collaboration, Creative Thinking, Generative Artificial Intelligence, Professionalism, Resilience, Failure Causes, Large Language Modeling, Test Case, AI Safety, Verbal Communication Skills, Red Teaming, Special Education, Claude AI, Creative Writing, ChatGPT, Agentic AI
Preferred industry experience: Internet
Experience level: 0 - 4 years of experience

Job description

Our customer’s mission is to give people the power to build community and bring the world closer together. Through its family of apps and services, the company connects billions of people around the world, gives them ways to share what matters most to them, and helps bring people closer together.


We are seeking a Red Team Specialist on a contract basis to help support our customer’s business needs. This role is on-site in Austin, TX; remote candidates in the Eastern time zone will also be considered.



Responsibilities:

  • Design and execute creative, multi-turn adversarial attacks against AI systems across text, image, audio, and video modalities.
  • Identify vulnerabilities in AI models through roleplay, emotional manipulation, social engineering, authority exploitation, and other adversarial techniques.
  • Evaluate model outputs for real-world harm and risk severity using established risk taxonomies.
  • Test for agentic AI vulnerabilities, including privilege escalation, indirect prompt injection, and scope creep within multi-authority systems.
  • Generate high-quality human evaluation data through annotation, vulnerability classification, and adversarial test case development.
  • Document reproducible model failures and communicate findings to engineering and safety teams.
  • Collaborate cross-functionally with AI researchers, engineers, and domain experts to improve model safety and testing methodologies.
  • Contribute to the refinement of red teaming taxonomies, benchmarks, and tooling infrastructure.
  • Stay current with emerging adversarial techniques, AI safety research, internet subcultures, and evolving threat landscapes.
  • Support the continuous improvement of AI safety testing processes and evaluation frameworks.
  • Operate effectively within psychologically demanding environments involving exposure to sensitive and objectionable content.


Skills and Qualifications:

  • 3+ years of relevant experience.
  • Background in creative writing, humanities, psychology, mental health counseling, special education, or related disciplines.
  • Demonstrated ability to construct compelling narratives and identify linguistic or psychological vulnerabilities.
  • Strong adversarial mindset with the ability to think creatively like an attacker.
  • Ability to identify unexpected failure modes and vulnerabilities in AI systems.
  • Strong adaptability across multiple content modalities, including text, image, audio, and video.
  • Excellent written and verbal communication skills.
  • Ability to clearly document and explain vulnerabilities to both technical and non-technical audiences.
  • Strong analytical thinking and problem-solving capabilities.
  • Ability to maintain resilience and professionalism while working with sensitive and potentially disturbing content.
  • Strong collaboration skills within cross-functional technical and research environments.
  • Ability to work independently within fast-paced and evolving AI safety environments.

Preferred Qualifications:

  • Prior experience in professional red teaming, trust and safety, data annotation, or socio-technical risk analysis.
  • Familiarity with large language models and generative AI platforms such as ChatGPT, Claude, or Gemini.
  • Experience with prompt engineering, encoding techniques such as Base64 or ROT13, or scripting.
  • Knowledge of AI safety concepts, including RLHF, alignment techniques, and model evaluation frameworks.



We offer a competitive salary range for this position. Most candidates who join our team are hired at the median of this range, ensuring fair and equitable compensation based on experience and qualifications.


Contractor benefits are available through our third-party Employer of Record (available upon completion of a waiting period for eligible engagements):

  • Health Benefits: Medical, Dental, Vision, 401(k), FSA.
  • Accrued PTO: Up to 15 days per 12 months on assignment.


An Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or protected veteran status and will not be discriminated against on the basis of disability.

All applicants for U.S. job openings must be legally authorized to work in the United States and are required to have U.S. residency at the time of application.

If you are a person with a disability needing assistance with the application, or at any point in the hiring process, please contact us at support@themomproject.com.