OpenAI: Learning from human preferences — new preference-inference capability (DeepMind collaboration) | SignalBreak | SignalBreak