Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zhexin Zhang's picture
6 6 1

Zhexin Zhang

nonstopfor
buaa42wxy's profile picture yangjunxiao2021's profile picture
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 14 days ago
Glyph: Scaling Context Windows via Visual-Text Compression
upvoted a paper 6 months ago
How Should We Enhance the Safety of Large Reasoning Models: An Empirical Study
commented on a paper 6 months ago
How Should We Enhance the Safety of Large Reasoning Models: An Empirical Study
View all activity

Organizations

Conversational AI (CoAI) group from Tsinghua University's profile picture

authored a paper 10 months ago

Agent-SafetyBench: Evaluating the Safety of LLM Agents

Paper • 2412.14470 • Published Dec 19, 2024 • 13
authored 3 papers over 1 year ago

SafetyBench: Evaluating the Safety of Large Language Models with Multiple Choice Questions

Paper • 2309.07045 • Published Sep 13, 2023

Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization

Paper • 2311.09096 • Published Nov 15, 2023

Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks

Paper • 2407.02855 • Published Jul 3, 2024 • 13
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs