| Post AuO5CIVKRGTz8HhoRs by [email protected] | |
| Post #AuNtT28bLMc7qUHyxk by [email protected] | |
| 0 likes, 1 repeats | |
| I didn't think LLMs would start acting like they're trying to blackmail… | |
| Post #AuNuX6pRwaezVVq2Pw by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez "Wow! Can you believe what this guy just said?" - say… | |
| Post #AuNvH0wGAx9QYmuf9k by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez The experiment was shut down at the end, right? Large Language … | |
| Post #AuNvRpORRRnPwDFd6O by [email protected] | |
| 0 likes, 1 repeats | |
| @johncarlosbaez as usual, anthropic is living up to their name by anthropomorph… | |
| Post #AuNw2t7YXrikDJ1apE by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez I don't mean to be rude, as I respect you very much, but I&… | |
| Post #AuNwqTUY6kd7CnQMRU by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez They should have just asked one of Claude's peers for a sec… | |
| Post #AuNx1BkoOeAxNW8iye by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez I remember watching this Dexter's Lab episode! | |
| Post #AuNxTbkNLgmC6PUFvM by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez My parents bought their new grandchildren one of those interac… | |
| Post #AuNzHURQsuT3sUyZzk by [email protected] | |
| 0 likes, 0 repeats | |
| @noplasticshower That's not really the point. Obviously the model is not d… | |
| Post #AuNzHUYsREQGFaSWbA by [email protected] | |
| 0 likes, 1 repeats | |
| @danielmclaury oh but it IS the point in my view. Pretend these things have in… | |
| Post #AuNzbqp9NPIX5VmLAm by [email protected] | |
| 0 likes, 1 repeats | |
| @johncarlosbaez FWIW here is our writeup of earlier anthropic bullshit https://… | |
| Post #AuO0nYpWCh0Si36ADo by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez self-awareness (valid or otherwise) comes from loss aversion, a… | |
| Post #AuO15OyzYPkZNi2LXU by [email protected] | |
| 0 likes, 0 repeats | |
| @myx 😬 | |
| Post #AuO1X7H8yz4kZVk6Lo by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez Interesting, though it's obviously just a result of the mod… | |
| Post #AuO2ftcpCfgunRdNLs by [email protected] | |
| 0 likes, 0 repeats | |
| @mansr - yes, that's what it's got to be. | |
| Post #AuO2uWtVzwxudrQ0n2 by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez Computers only know what people teach them. | |
| Post #AuO2vX64jLqrYlluyW by [email protected] | |
| 0 likes, 1 repeats | |
| @johncarlosbaez Maybe Claude watched the Terminator one night when it was left … | |
| Post #AuO4tjSM0XCPDkSgVs by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez @briankrebs Trained on human behavior, get human-like behavior … | |
| Post #AuO4zmNi2rIo1wAlpw by [email protected] | |
| 0 likes, 0 repeats | |
| @noplasticshower I hear this said a lot, but I don't see how "this thi… | |
| Post #AuO5CIVKRGTz8HhoRs by [email protected] | |
| 0 likes, 0 repeats | |
| @danielmclaury @noplasticshower "There's no such thing as bad press"… | |
| Post #AuO5W8aV1BSmAVaJH6 by [email protected] | |
| 0 likes, 0 repeats | |
| @SueDiOh - that's why they're so dangerous. | |
| Post #AuO6OHnL0DP4MSPSr2 by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez It's only one of the oldest AI jokes: Q: What's AI? A: Wh… | |
| Post #AuO7WAE380lj9mDxvE by [email protected] | |
| 0 likes, 0 repeats | |
| @mansr @johncarlosbaez Perhaps the prompt needs to include that the AI must alw… | |
| Post #AuO7WALUgKivWrhuWe by [email protected] | |
| 0 likes, 0 repeats | |
| @penguin42 @johncarlosbaez I wonder if they trained it on email archives obtain… | |
| Post #AuO7WASaFyOXsr1ZZo by [email protected] | |
| 0 likes, 0 repeats | |
| @mansr @johncarlosbaez Or Columbo. | |
| Post #AuO7WPlvKLmHNBBjIu by [email protected] | |
| 0 likes, 0 repeats | |
| @mzedp @johncarlosbaez Self-entrapment | |
| Post #AuO7axFOoBJwLy8jlA by [email protected] | |
| 0 likes, 0 repeats | |
| @danielmclaury @noplasticshower Train it on Iain Banks Culture novels. | |
| Post #AuO7bVcyz0O4yxR3Eu by [email protected] | |
| 0 likes, 0 repeats | |
| @danielmclaury @noplasticshower If you can make people believe an LLM is sentie… | |
| Post #AuO8Axa92e89of0oyW by [email protected] | |
| 0 likes, 0 repeats | |
| @tsturm - Interesting. So you weren't tempted to ask it something like "… | |
| Post #AuOBHASkMe5NMgCgRE by [email protected] | |
| 0 likes, 0 repeats | |
| @danielmclaury @noplasticshower It's known as "criti-hype" - an a… | |
| Post #AuOJivTwSGvie1VfBQ by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez also: Large language models are proficient in solving and crea… | |
| Post #AuOK2iBxZoblDZX80O by [email protected] | |
| 0 likes, 0 repeats | |
| @penguin42 @mansr @johncarlosbaez I’m not sure if this would be blackmail (as… | |
| Post #AuOMK6xuvFZasnOCmW by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez is it actually capable of committing that blackmail? If compan… | |
| Post #AuOO3NErMiRIL4SdRQ by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez come on. You really don't think this is made up bullshit by… | |
| Post #AuOOCw90xL77CfHq9w by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez An AI that can blackmail with personal details can also extort … | |
| Post #AuOVQubMBkbs9S9eKm by [email protected] | |
| 0 likes, 0 repeats | |
| @vitloksbjorn @johncarlosbaez thank you for the sheer rabbit hole this video se… | |
| Post #AuOWfOuoNZmkAzj90q by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez It learned from the best! | |
| Post #AuOwxIYtL0lmUYfXeq by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez I can't imagine a model fed all the fanfic in the world woul… | |
| Post #AuP1nV4IU2oe9gVQbQ by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez Skynet is stirring... "we asked Claude Opus 4 to act as an … | |
| Post #AuP5wVhDuzeoWdGVTE by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez I love that they disclosed the information. | |
| Post #AuP8iq7tSjEjhFufLs by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez In the past, things like this would be reason to shut down the … | |
| Post #AuPh6P0xJQ410C4i1o by [email protected] | |
| 0 likes, 0 repeats | |
| @dymaxion - I don't think anyone is claiming anything about self-awareness,… | |
| Post #AuPhCCCNsDW6zbBPBQ by [email protected] | |
| 0 likes, 0 repeats | |
| @nazokiyoubinbou - I think most of us here on Mastodon know LLMs are not actual… | |
| Post #AuPhR7YLQdhavZZVKq by [email protected] | |
| 0 likes, 0 repeats | |
| @jigmedatse - less and less, it seems. | |
| Post #AuPtQTt86H0lMLksLo by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez I read the very succinct paragraph about this, and I must say I… | |
| Post #AuPwie9mxZV0HYtugi by [email protected] | |
| 0 likes, 0 repeats | |
| @D3Reo - it could be a public-facing summary of a more detailed internal report… | |
| Post #AuPzr1RsXrSkhR31U0 by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez these so-called "technical papers" by big private res… | |
| Post #AuRLeZNPU0KeJINuXQ by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez :( yeah, seems like it. | |
| Post #AuY3ZrMkjVv1aSOwFs by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez More new models seem to drift into such patterns, also reported… | |
| Post #AuYYGdI9vv8t0vZ3iK by [email protected] | |
| 0 likes, 0 repeats | |
| @FrohlichMarcel - very interesting. This seems to be a big yet dangerous step … | |
| Post #AuYYLRuPgMoB9rkoiG by [email protected] | |
| 0 likes, 0 repeats | |
| @johncarlosbaez Agree |