Files
hpr-knowledge-base/hpr_transcripts/hpr4064.txt

121 lines
9.7 KiB
Plaintext
Raw Normal View History

Episode: 4064
Title: HPR4064: Large Language Models
Source: https://hub.hackerpublicradio.org/ccdn.php?filename=/eps/hpr4064/hpr4064.mp3
Transcribed: 2025-10-25 19:09:48
---
This is Hacker Public Radio episode 4,064 for Thursday, the 29th of February, 2024.
Today's show is entitled, Large Language Models.
It is hosted by Daniel Persson, and is about 13 minutes long.
It carries a clean flag.
The summary is, what are they good for?
Hello Hacker's, and welcome to another episode.
Today I'm going to talk about machine learning and LLM, so large language models.
To start with, when it comes to application of machine learning,
I think that there is a bunch of really interesting and good things to do with machine learning.
We have machine learning that trying to find proteins and how to fold proteins,
and so on, that is using in the medical field, which is really great.
You have machine learning that is trying to look at images and find particular faults in people's
eyes, for instance, to find sickness, or you could have machine learning that finds different
objects in traffic in order to help cameras in life and so on, which could really benefit society.
There is a lot of these kind of creepy things that is going on there as well,
so everything can be used for good and bad, of course.
Then you have these kind of machine learning AIs that looks at images and
finds, for instance, characters, so you can actually read things, translate things,
which is also helping people to live their lives.
Really great applications.
When it comes to working and creating new material, there is AIs that could create sound,
like music, and could that create voices, these kind of TTS engines, that sound really good,
and for people that is either blind or have reading disabilities,
then that is a really good application to give them the possibility to actually listen to media
and that kind of thing. So there is a great application in that as well.
When it comes to creating images and videos, there is a bunch of creepy applications,
but for a professional, these kind of tools can be really helpful as well.
So if you take, for instance, an image and you need to remove something in the image,
we have already algorithms for that, but AI improves those algorithms.
If you have, for instance, square images, which we had at our company,
and we needed rectangle images as product images, that would have cost a lot of money
for the customer to hire its staff and reshoot all those product images,
just because we needed a different resolution or a different crop of the image.
So having the tools in Photoshop and other programs in order to create things that weren't there,
in order to create a different aspect ratio of an image,
or adding material that isn't really there, where the thing is not that important.
You are not trying to fool anyone. You want something to look natural and what is actually the
real thing is still there. So in our case, we had, I think it was somebody selling rings.
So it was a ring on a finger where you saw the product really well,
but we needed a little bit bigger image. So the AI generated an arm that stretched a little bit
further and a door frame that stretched a little bit further. So you could actually see
a little bit more of the image and have a little bit of a different aspect ratio.
So in those applications, I find very helpful and very interesting ways of using AI
or machine learning pretty much in order to do something better. I don't really like the term
AI, but everybody uses it. When it comes to large language models, I still have not found any
viable applications for those. So there is a bunch of applications that people use them for,
but I haven't really found the key thing that is helpful for everyone.
So when it comes to different applications, you could, for instance, create text.
And there is a bunch of different text things that you could create. You could, for instance,
create these kind of blurbs you see on web pages where people are talking about their company.
And those are usually these kinds of we are a consultancy that is trying to help our company to
reach newer heights and work with blah blah blah and so on, which doesn't really say anything
about that company. We also have these kind of extra information that you need to put on your
products in order to sell them, which if they don't contain any real facts like this product is
this large, this product is this fast, this product has this particular color or whatever it might
might be that is objectively correct information that you need to have in your description.
If those kind of things aren't really what you're looking for, you just want to create some text
that describes this product. You could do that with a large language model and get a reasonable
result. These kind of things people don't usually read anyway, so it doesn't really matter.
Another application which I think is more sinister is that you could create really good spam.
So I have a lot of people creating a bunch of very interesting spam. I've gotten a bunch of them.
People that are trying to sell me steel, trying to sell me all kinds of different
strange products that I would never be interesting in, but they are starting it with
creating an information that is seems reasonable as an email to me and then when you have read
two paragraphs or something, you realize that this is a product I'm not interested in. Before
you could see these emails because they were so badly produced that you realize directly that
this is not something I'm interested in, but now that is so cheap and so easy to create these
kind of spam emails, you could do that much better. So we will see a lot more spam.
Another thing that is really easy to create with machine large language models is of course
content. So now Google results are much worse than last year this time because there is so much
content pumped out that doesn't really say anything. So if you're searching for a solution to a problem,
the other day I was searching for how to train a large language model. So I could figure that
one thing out. I had to go to the fourth page of the result in Google in order to find somebody
that actually was talking about training a model. The other ones was generated by a model talking
about the different steps, but didn't have any extra knowledge or anything that actually gave me
any information that could help me in the process of training a model. It was talking about
how the process was looking. What kind of steps you could have like in order to train a model,
you need to find data, refine data, and then train with the data, verify the data, and then
use the data. Those five steps in 10,000 words doesn't help me at all. Yes, for somebody that
has never seen the topic before, that is very helpful, but for me that actually trying to figure
out how can I do this myself, it's totally useless. And I can see much more of this kind of
wasteful content being produced in order to get clicks, in order to get search, ranking, and in order
to just put content out there to earn money. But I haven't really found what kind of use case could
benefit society using an LLM. Of course, you could create your own emails and send to your colleagues
that is 2,000 words where you're trying to say one thing, or you could just write that thing that
you want to say in order to writing it in 2,000 words and really waste their time. So that is my point
of view of the LLM so far, or else we will might end up in a world where we try to write down five
facts, send them through an email, we have an LLM, then generates an email with 10,000 words,
send it over to someone, and they have an LLM in the other end, or an AI in their machine learning
model in the other end, which summarized that 10,000 words email into these five points again.
So we get a actual message across. That might be what we are doing in the future,
and all this translation back and forth will of course create hilarious side effects and
misunderstanding between people. So that might be the world that we are living in in two years,
but hopefully not. Hopefully we will have found what we should use large language model for by then.
So I am looking forward to see what we should use these things for, still today I have no clue
because asking an LLM for an answer, it will not give you the answer, it will
figure out what is the most commonly used opinion about this, but probably not the correct answer.
It's probably something like, yeah, I think it's this, and then you said, yeah, but that doesn't
make any sense. So I have actually talked to a chat GPT over and over again, where it has said,
okay, try this, and I said, but that is not correct. I know about this subject, I want to do know
how to do this thing, but that what you send me is not correct. And I have done that over and over
again, and when you have told somebody that they are wrong ten times, then you are wasting time
when you can just look it up instead. So yeah, hopefully we will find something that these things
are useful for, or they might be much better so they can actually generate truth in the future,
or actually generating value. So this is my way of thinking about LLMs, do you have a different
opinion? Have you found the use case for this yet? Please leave a comment about that, or record
your own episode so I can hear about what you have used LLMs for. I hope to see you in the next episode.
You have been listening to Hacker Public Radio, as Hacker Public Radio does work. Today's show was
contributed by a HBR listener like yourself. If you ever thought of recording podcasts,
you click on our contribute link to find out how easy it really is. Hosting for HBR has been
kindly provided by an honesthost.com, the internet archive, and our syncs.net. On the Sadois
stages, today's show is released on their creative commons, attribution, 4.0 International
License.