OpenAI: TruthfulQA benchmark measures how models mimic human falsehoods | SignalBreak | SignalBreak