• 0 Posts
  • 27 Comments
Joined 11 months ago
cake
Cake day: April 3rd, 2024

help-circle

  • That undersells them slightly.

    LLMs are powerful tools for generating text that looks like something. Need something rephrased in a different style? They’re good at that. Need something summarized? They can do that, too. Need a question answered? No can do.

    LLMs can’t generate answers to questions. They can only generate text that looks like answers to questions. Often enough that answer is even correct, though usually suboptimal. But they’ll also happily generate complete bullshit answers and to them there’s no difference to a real answer.

    They’re text transformers marketed as general problem solvers because a) the market for text transformers isn’t that big and b) general problem solvers is what AI researchers are always trying to create. They have their use cases but certainly not ones worth the kind of spending they get.


  • I know I sorted by feed by Top 6 Hours but that doesn’t mean I expect six hours worth of text in a single image. Did they copy and paste three different job postings together? Did they use a LLM that had its stop token configured incorrectly? Is it an attempt at weeding out people who object to having their time wasted by corporate bullshit?

    We may never know. What we do know is that this wall of text has more red flags than a Chinese military parade.




  • I remember talking to someone about where LLMs are and aren’t useful. I pointed out that LLMs would be absolutely worthless for me as my work mostly consists of interacting with company-internal APIs, which the LLM obviously hasn’t been trained on.

    The other person insisted that that is exactly what LLMs are great at. They wouldn’t explain how exactly the LLM was supposed to know how my company’s internal software, which is a trade secret, is structured.

    But hey, I figured I’d give it a go. So I fired up a local Llama 3.1 instance and asked it how to set up a local copy of ASDIS, one such internal system (name and details changed to protect the innocent). And Llama did give me instructions… on how to write the American States Data Information System, a Python frontend for a single MySQL table containing basic information about the member states of the USA.

    Oddly enough, that’s not what my company’s ASDIS is. It’s almost as if the LLM had no idea what I was talking about. Words fail to express my surprise at this turn of events.











  • Yeah, it doesn’t take a lot to build a decent home server. I just rebuilt mine (the old one’s Turion II Neo was perhaps a bit too weak) and the most expensive part were the HDDs. I didn’t want to reuse the old ones.

    A slightly underclocked Athlon 3000G, 16 gigs of spare RAM, and three 4 TB WD Red Pluses give me all the power I actually need at a reasonable power budget. I initially wanted to go with an N100 but those never support more than two SATA drives directly.