@chicken

chicken@lemmy.dbzer0.com · 16 days ago

If other people are also immortal, the awkwardness of all of them eventually becoming your exes

chicken@lemmy.dbzer0.com · 19 days ago

that is not the … available outcome.

It demonstrably is already though. Paste a document in, then ask questions about its contents; the answer will typically take what’s written there into account. Ask about something you know is in a Wikipedia article that would have been part of its training data, same deal. If you think it can’t do this sort of thing, you can just try it yourself.

Obviously it can handle simple sums, this is an illustrative example

I am well aware that LLMs can struggle especially with reasoning tasks, and have a bad habit of making up answers in some situations. That’s not the same as being unable to correlate and recall information, which is the relevant task here. Search engines also use machine learning technology and have been able to do that to some extent for years. But with a search engine, even if it’s smart enough to figure out what you wanted and give you the correct link, that’s useless if the content behind the link is only available to institutions that pay thousands a year for the privilege.

Think about these three things in terms of what information they contain and their capacity to convey it:

A search engine
Dataset of pirated contents from behind academic paywalls
A LLM model file that has been trained on said pirated data

The latter two each have their pros and cons and would likely work better in combination with each other, but they both have an advantage over the search engine: they can tell you about the locked up data, and they can be used to combine the locked up data in novel ways.

chicken@lemmy.dbzer0.com · edit-2 20 days ago

Ok, but I would say that these concerns are all small potatoes compared to the potential for the general public gaining the ability to query a system with synthesized expert knowledge obtained from scraping all academically relevant documents. If you’re wondering about something and don’t know what you don’t know, or have any idea where to start looking to learn what you want to know, a LLM is an incredible resource even with caveats and limitations.

Of course, it would be better if it could also directly reference and provide the copyrighted/paywalled sources it draws its information from at runtime, in the interest of verifiably accurate information. Fortunately, local models are becoming increasingly powerful and lower barrier of entry to work with, so the legal barriers to such a thing existing might not be able to stop it for long in practice.

chicken@lemmy.dbzer0.com · 20 days ago

The OP tweet seems to be leaning pretty hard on the “AI bad” sentiment. If LLMs make academic knowledge more accessible to people that’s a good thing for the same reason what Aaron Swartz was doing was a good thing.

chicken@lemmy.dbzer0.com · 21 days ago

a few dozen, mostly hexbear users. Though that was mostly from when I started using Lemmy, I haven’t felt the need to block anyone in a long time. My list of blocked communities is much larger.

chicken@lemmy.dbzer0.com · edit-2 28 days ago

thepiratebay still exists but is regarded as untrustworthy and infested with malware. I’d say knowing you’re getting something from a trustworthy source is harder than it used to be.

chicken@lemmy.dbzer0.com · 29 days ago

changing how its “block” button works. That option previously allowed users to hide their profile from certain accounts – but will no longer do so.

So I guess all that stuff they did to lock down the ability to see things on Xitter without an account was strictly for evil then

chicken@lemmy.dbzer0.com · 1 month ago

That doesn’t sound like something you get arrested for though

chicken@lemmy.dbzer0.com · 1 month ago

I’d be skeptical that’s even real, outside of a select few countries with especially strict copyright enforcement

chicken@lemmy.dbzer0.com · 1 month ago

people being arrested for using pirate streaming services

What circumstances does that even happen in? Like a bar that plays a pirated sports stream?

chicken@lemmy.dbzer0.com · 1 month ago

Sadly, Porn

I don’t know how to describe it, expect to be confused and offended and gaslit.

chicken@lemmy.dbzer0.com · 2 months ago

If you are at the point where you are having to worry about government or corporate entities setting traps at the local library? You… kind of already lost.

What about just a blackmailer assuming anyone booting an OS from a public computer has something to hide? And then they have write access and there’s no defense, and it doesn’t have to be everywhere because people seeking privacy this way will have to be picking new locations each time. An attack like that wouldn’t have to be targeted at a particular person.

chicken@lemmy.dbzer0.com · 2 months ago

Isn’t it risky plugging usb drives into untrusted machines?

chicken@lemmy.dbzer0.com · edit-2 2 months ago

IIRC it spammed websites with traffic, didn’t conceal your IP at all, and some people got arrested for using it to make some websites go down for a very brief period. Basically a way to use people who didn’t know what they were doing as cannon fodder

chicken@lemmy.dbzer0.com · 2 months ago

Could you elaborate? Does HOA mean something different in other countries?

chicken@lemmy.dbzer0.com · 2 months ago

Home owner’s association; when you buy a house and it is part of a HOA, you have to sign a contract to join the HOA as a requirement of buying, which means you have to pay dues and abide by the rules of the organization, and you have to require the next buyer to also join in order to sell your house.

chicken@lemmy.dbzer0.com · 2 months ago

I doubt the school administrators who would be buying this thing or the people trying to make money off it have really thought that far ahead or care whether or not it does that, but it would definitely be one of its main effects.

chicken@lemmy.dbzer0.com · 2 months ago

IMO for some people arguing is a form of intimacy

chicken@lemmy.dbzer0.com · 3 months ago

Can anyone recommend any cool mods/projects built on top of Minetest?

chicken@lemmy.dbzer0.com · edit-2 3 months ago

The output for a given input cannot be independently calculated as far as I know, particularly when random seeds are part of the input.

The system gives a probability distribution for the next word based on the prompt, which will always be the same for a given input. That meets the definition of deterministic. You might choose to add non-deterministic rng to the input or output, but that would be a choice and not something inherent to how LLMs work. Random ‘seeds’ are normally used as part of deterministically repeatable rng. I’m not sure what you mean by “independently” calculated, you can calculate the output if you have the model weights, you likely can’t if you don’t, but that doesn’t affect how deterministic it is.

The so what means trying to prevent certain outputs based on moral judgements isn’t possible. It wouldn’t really be possible if you could get in there with code and change things unless you could write code for morality, but it’s doubly impossible given you can’t.

The impossibility of defining morality in precise terms, or even coming to an agreement on what correct moral judgment even is, obviously doesn’t preclude all potentially useful efforts to apply it. For instance since there is a general consensus that people being electrocuted is bad, electrical cables normally are made with their conductive parts encased in non-conductive material, a practice that is successful in reducing how often people get electrocuted. Why would that sort of thing be uniquely impossible for LLMs? Just because they are logic processing systems that are more grown than engineered? Because they are sort of anthropomorphic but aren’t really people? The reasoning doesn’t follow. What people are complaining about here is that AI companies are not making these efforts a priority, and it’s a valid complaint because it isn’t the case that these systems are going to be the same amount of dangerous no matter how they are made or used.