OpenAI: Improving Model Safety Behavior with Rule-Based Rewards | SignalBreak