Is there a search engine that blocks websites known to create low effort/AI generated articles?

simple@lemm.ee · edit-2 2 days ago

Is there a search engine that blocks websites known to create low effort/AI generated articles?

rumba@lemmy.zip · 2 days ago

I tried doing some of this. I trained on a corpus of data I wanted it to read, with such a small amount of training data, I found it was overall too lossy. If I asked it a question about something that was in there and it responded there was a really good chance that it was in there. But there was a lot of not knowing something that was definitely in there. It wasn’t completely useless but I wouldn’t say that it was at the level of being truly helpful.

I worry that there’s not enough verified data out there to set up for proper training.

FourPacketsOfPeanuts@lemmy.world · 2 days ago

I suspect such a model would have to be far more attuned to its data being smaller but trustworthy. Something like chatGPT for example requires a huge volume because it’s weakly affected by any particular datum going in. It’s designed to adapt to general conversation norms, rather than specific facts. If you could take a generalist like chatGPT and combine it with an expert model that’s been told everything it’s told has a huge weighting then that would probably be a big step forward.