Gordon Hull, LLM, Inc. New APPS: Art, Politics, Philosophy, Science blog, 27 February 2024 In previous posts (one, two, three), I’ve been exploring the issue of what I’m calling the implicit normativity in language models, especially those that have been trained with RLHF (reinforcement learning with human feedback). In the most recent one, I argued …