How Transparent Is Diffusion Gemma (and why it matters) - LessWrong
2+ hour, 25+ min ago (20+ words) Authors: Joshua Engels*, Callum McDougall*, Bilal Chughtai*, Janos Kramar, Senthoran Rajamanoharan, Cindy Wu, Arthur Conmy, Asic Q Chen, Jean Tarbour...
Against Planet-Eating Nanoreplicators - LessWrong
2+ hour, 3+ min ago (411+ words) A classic trope of hard sci-fi as well as more serious futurism is using self-replicating nanoassemblers to convert planets of the Solar System to computronium, or some other kind of a Dyson swarm. This is almost the default way to…
[Linkpost] How Transparent Is Diffusion Gemma (and why it matters) - LessWrong
2+ hour, 25+ min ago (20+ words) Work also done with Cindy Wu, Asic Q Chen, Jean Tarbouriech, Min Ma, Brendan O'Donoghue, João Gabriel Lopes de Oliveira...
The Invisible Side of AI Governance - LessWrong
3+ hour, 36+ min ago (1562+ words) TL;DR: Most strategic writing on AI governance on LessWrong describes the outsider game, which is most often visible: press, statements, open letters. Here I want to describe the other, invisible half: the insider work within ministerial cabinets and international…
Would anybody here be interested in a "mistake postmortem" discussion group? - LessWrong
10+ hour, 27+ min ago (232+ words) I recently made a dumb (in retrospect) mistake that set me back a lot. Feeling upset and regretful, I spoke to an older family member who reassured me, "yeah, unfortunately there's no way around it; we have to experience these…
Thoughts on Likelihood of Existential Risks by Misaligned AIs - LessWrong
23+ hour, 37+ min ago (304+ words) The implication of this is that it is very hard to have one concrete AI risk argument I can read and respond to. It is difficult to form opinions on AI safety when most experts are in great disagreement about…
How I think developers of frontier AI systems and regulators ought to act in the face of existential AI risk - LessWrong
1+ day, 8+ min ago (1088+ words) In a recent podcast episode published July 20, 2025, Anthropic co-founder Ben Mann is asked (at 48:43) "What are the odds that we align AI correctly and actually solve this problem?" In his answer, Ben references the following part of Anthropic's March 8, 2023 blog…
Why should AI be moral? - LessWrong
23+ hour, 46+ min ago (1067+ words) In outline, the moral skeptic's challenge goes: To respond, one must either refute the skeptical hypothesis or identify an extra-moral reason to accept morality. Without a response, one's acceptance of morality is unjustified. This position threatens to be reflectively destabilizing…
AI Safety Ecosystem Research notes - LessWrong
1+ day, 4+ hour ago (463+ words) These are some personal notes taken and later dressed up a bit to make into a post. Dunno how much value is here for people already familiar with the AI Safety Ecosystem. I believe MATS will be publishing the results…
A brief list of ways AI safety efforts could be net negative - LessWrong
1+ day, 6+ hour ago (245+ words) I'm not aware of a good list of downside risks for AI safety broadly[1], so I decided to make one. This is not intended to be fully comprehensive, these are just the ones that I personally take seriously[2][3]: (This list…