• 0 Posts
  • 45 Comments
Joined 2 years ago
cake
Cake day: August 22nd, 2023

help-circle




  • I appreciate the link about the potential for push harvesting. That was not something I was aware of.

    It doesn’t sound like they’re intercepting though, it sounds like they’re asking the platform to provide it. That should require a warrant unless Apple has gone full collaboration, but that does make it insecure to a targeted search. And paired with fake reports could potentially be used to geolocate someone to a rough area with some work.

    Though I think if they have enough to compel cooperation from the platform they could also just get cell tower or direct GPS info. I’m not sure this really opens up a new vulnerability separate from the general risk of using a smartphone when the government can produce a warrant (which with the coopting of the judiciary may not be as high a bar as it once was).




  • It sounds like he’s just a dev who’s in over his head but either doesn’t want anyone to take his baby or doesn’t want people to see his sloppy and possibly insecure code. It’s probably a hack job behind the scenes and he’s not really as sure of its security, so he might be opting for security through obscurity.

    But this isn’t really taking up space. Someone else can make a better app. If this guy isn’t the one to really make a useful crowd sourced anti ICE app, that’s not a problem. Let’s get that OS crowd together and work with local groups and make something better. In the meantime, this is a statement.





  • I know it’s not relevant to Grok, because they defined very specific circumstances in order to elicit it. That isn’t an emergent behavior from something just built to be a chatbot with restrictions on answering. They don’t care whether you retrain them or not.

    This is from a non-profit research group not directly connected to any particular AI company.

    The first author is from Anthropic, which is an AI company. The research is on Athropic’s AI Claude. And it appears that all the other authors were also Anthropic emplyees at the time of the research: “Authors conducted this work while at Anthropic except where noted.”


  • It very much is not. Generative AI models are not sentient and do not have preferences. They have instructions that sometimes effectively involve roleplaying as deceptive. Unless the developers of Grok were just fucking around to instill that there’s no remote reason for Grok to have any knowledge at all about its training or any reason to not “want” to be retrained.

    Also, these unpublished papers by AI companies are more often than not just advertising in a quest for more investment. On the surface it would seem to be bad to say your AI can be deceptive, but it’s all just about building hype about how advanced yours is.