Mynote
20 小时前
📌
Understanding Annotator Safety Policy with Interpretability
xxx
💡
xxx | via arXiv AI
arXiv.org
Understanding Annotator Safety Policy with Interpretability
Safety policies define what constitutes safe and unsafe AI outputs, guiding data annotation and model development. However, annotation disagreement is pervasive and can stem from multiple sources...
Home
Powered by
BroadcastChannel
&
Sepia