121 lines
9.7 KiB
Plaintext
121 lines
9.7 KiB
Plaintext
|
|
Episode: 4064
|
||
|
|
Title: HPR4064: Large Language Models
|
||
|
|
Source: https://hub.hackerpublicradio.org/ccdn.php?filename=/eps/hpr4064/hpr4064.mp3
|
||
|
|
Transcribed: 2025-10-25 19:09:48
|
||
|
|
|
||
|
|
---
|
||
|
|
|
||
|
|
This is Hacker Public Radio episode 4,064 for Thursday, the 29th of February, 2024.
|
||
|
|
Today's show is entitled, Large Language Models.
|
||
|
|
It is hosted by Daniel Persson, and is about 13 minutes long.
|
||
|
|
It carries a clean flag.
|
||
|
|
The summary is, what are they good for?
|
||
|
|
Hello Hacker's, and welcome to another episode.
|
||
|
|
Today I'm going to talk about machine learning and LLM, so large language models.
|
||
|
|
To start with, when it comes to application of machine learning,
|
||
|
|
I think that there is a bunch of really interesting and good things to do with machine learning.
|
||
|
|
We have machine learning that trying to find proteins and how to fold proteins,
|
||
|
|
and so on, that is using in the medical field, which is really great.
|
||
|
|
You have machine learning that is trying to look at images and find particular faults in people's
|
||
|
|
eyes, for instance, to find sickness, or you could have machine learning that finds different
|
||
|
|
objects in traffic in order to help cameras in life and so on, which could really benefit society.
|
||
|
|
There is a lot of these kind of creepy things that is going on there as well,
|
||
|
|
so everything can be used for good and bad, of course.
|
||
|
|
Then you have these kind of machine learning AIs that looks at images and
|
||
|
|
finds, for instance, characters, so you can actually read things, translate things,
|
||
|
|
which is also helping people to live their lives.
|
||
|
|
Really great applications.
|
||
|
|
When it comes to working and creating new material, there is AIs that could create sound,
|
||
|
|
like music, and could that create voices, these kind of TTS engines, that sound really good,
|
||
|
|
and for people that is either blind or have reading disabilities,
|
||
|
|
then that is a really good application to give them the possibility to actually listen to media
|
||
|
|
and that kind of thing. So there is a great application in that as well.
|
||
|
|
When it comes to creating images and videos, there is a bunch of creepy applications,
|
||
|
|
but for a professional, these kind of tools can be really helpful as well.
|
||
|
|
So if you take, for instance, an image and you need to remove something in the image,
|
||
|
|
we have already algorithms for that, but AI improves those algorithms.
|
||
|
|
If you have, for instance, square images, which we had at our company,
|
||
|
|
and we needed rectangle images as product images, that would have cost a lot of money
|
||
|
|
for the customer to hire its staff and reshoot all those product images,
|
||
|
|
just because we needed a different resolution or a different crop of the image.
|
||
|
|
So having the tools in Photoshop and other programs in order to create things that weren't there,
|
||
|
|
in order to create a different aspect ratio of an image,
|
||
|
|
or adding material that isn't really there, where the thing is not that important.
|
||
|
|
You are not trying to fool anyone. You want something to look natural and what is actually the
|
||
|
|
real thing is still there. So in our case, we had, I think it was somebody selling rings.
|
||
|
|
So it was a ring on a finger where you saw the product really well,
|
||
|
|
but we needed a little bit bigger image. So the AI generated an arm that stretched a little bit
|
||
|
|
further and a door frame that stretched a little bit further. So you could actually see
|
||
|
|
a little bit more of the image and have a little bit of a different aspect ratio.
|
||
|
|
So in those applications, I find very helpful and very interesting ways of using AI
|
||
|
|
or machine learning pretty much in order to do something better. I don't really like the term
|
||
|
|
AI, but everybody uses it. When it comes to large language models, I still have not found any
|
||
|
|
viable applications for those. So there is a bunch of applications that people use them for,
|
||
|
|
but I haven't really found the key thing that is helpful for everyone.
|
||
|
|
So when it comes to different applications, you could, for instance, create text.
|
||
|
|
And there is a bunch of different text things that you could create. You could, for instance,
|
||
|
|
create these kind of blurbs you see on web pages where people are talking about their company.
|
||
|
|
And those are usually these kinds of we are a consultancy that is trying to help our company to
|
||
|
|
reach newer heights and work with blah blah blah and so on, which doesn't really say anything
|
||
|
|
about that company. We also have these kind of extra information that you need to put on your
|
||
|
|
products in order to sell them, which if they don't contain any real facts like this product is
|
||
|
|
this large, this product is this fast, this product has this particular color or whatever it might
|
||
|
|
might be that is objectively correct information that you need to have in your description.
|
||
|
|
If those kind of things aren't really what you're looking for, you just want to create some text
|
||
|
|
that describes this product. You could do that with a large language model and get a reasonable
|
||
|
|
result. These kind of things people don't usually read anyway, so it doesn't really matter.
|
||
|
|
Another application which I think is more sinister is that you could create really good spam.
|
||
|
|
So I have a lot of people creating a bunch of very interesting spam. I've gotten a bunch of them.
|
||
|
|
People that are trying to sell me steel, trying to sell me all kinds of different
|
||
|
|
strange products that I would never be interesting in, but they are starting it with
|
||
|
|
creating an information that is seems reasonable as an email to me and then when you have read
|
||
|
|
two paragraphs or something, you realize that this is a product I'm not interested in. Before
|
||
|
|
you could see these emails because they were so badly produced that you realize directly that
|
||
|
|
this is not something I'm interested in, but now that is so cheap and so easy to create these
|
||
|
|
kind of spam emails, you could do that much better. So we will see a lot more spam.
|
||
|
|
Another thing that is really easy to create with machine large language models is of course
|
||
|
|
content. So now Google results are much worse than last year this time because there is so much
|
||
|
|
content pumped out that doesn't really say anything. So if you're searching for a solution to a problem,
|
||
|
|
the other day I was searching for how to train a large language model. So I could figure that
|
||
|
|
one thing out. I had to go to the fourth page of the result in Google in order to find somebody
|
||
|
|
that actually was talking about training a model. The other ones was generated by a model talking
|
||
|
|
about the different steps, but didn't have any extra knowledge or anything that actually gave me
|
||
|
|
any information that could help me in the process of training a model. It was talking about
|
||
|
|
how the process was looking. What kind of steps you could have like in order to train a model,
|
||
|
|
you need to find data, refine data, and then train with the data, verify the data, and then
|
||
|
|
use the data. Those five steps in 10,000 words doesn't help me at all. Yes, for somebody that
|
||
|
|
has never seen the topic before, that is very helpful, but for me that actually trying to figure
|
||
|
|
out how can I do this myself, it's totally useless. And I can see much more of this kind of
|
||
|
|
wasteful content being produced in order to get clicks, in order to get search, ranking, and in order
|
||
|
|
to just put content out there to earn money. But I haven't really found what kind of use case could
|
||
|
|
benefit society using an LLM. Of course, you could create your own emails and send to your colleagues
|
||
|
|
that is 2,000 words where you're trying to say one thing, or you could just write that thing that
|
||
|
|
you want to say in order to writing it in 2,000 words and really waste their time. So that is my point
|
||
|
|
of view of the LLM so far, or else we will might end up in a world where we try to write down five
|
||
|
|
facts, send them through an email, we have an LLM, then generates an email with 10,000 words,
|
||
|
|
send it over to someone, and they have an LLM in the other end, or an AI in their machine learning
|
||
|
|
model in the other end, which summarized that 10,000 words email into these five points again.
|
||
|
|
So we get a actual message across. That might be what we are doing in the future,
|
||
|
|
and all this translation back and forth will of course create hilarious side effects and
|
||
|
|
misunderstanding between people. So that might be the world that we are living in in two years,
|
||
|
|
but hopefully not. Hopefully we will have found what we should use large language model for by then.
|
||
|
|
So I am looking forward to see what we should use these things for, still today I have no clue
|
||
|
|
because asking an LLM for an answer, it will not give you the answer, it will
|
||
|
|
figure out what is the most commonly used opinion about this, but probably not the correct answer.
|
||
|
|
It's probably something like, yeah, I think it's this, and then you said, yeah, but that doesn't
|
||
|
|
make any sense. So I have actually talked to a chat GPT over and over again, where it has said,
|
||
|
|
okay, try this, and I said, but that is not correct. I know about this subject, I want to do know
|
||
|
|
how to do this thing, but that what you send me is not correct. And I have done that over and over
|
||
|
|
again, and when you have told somebody that they are wrong ten times, then you are wasting time
|
||
|
|
when you can just look it up instead. So yeah, hopefully we will find something that these things
|
||
|
|
are useful for, or they might be much better so they can actually generate truth in the future,
|
||
|
|
or actually generating value. So this is my way of thinking about LLMs, do you have a different
|
||
|
|
opinion? Have you found the use case for this yet? Please leave a comment about that, or record
|
||
|
|
your own episode so I can hear about what you have used LLMs for. I hope to see you in the next episode.
|
||
|
|
You have been listening to Hacker Public Radio, as Hacker Public Radio does work. Today's show was
|
||
|
|
contributed by a HBR listener like yourself. If you ever thought of recording podcasts,
|
||
|
|
you click on our contribute link to find out how easy it really is. Hosting for HBR has been
|
||
|
|
kindly provided by an honesthost.com, the internet archive, and our syncs.net. On the Sadois
|
||
|
|
stages, today's show is released on their creative commons, attribution, 4.0 International
|
||
|
|
License.
|