  • Can you explain the difference between understanding the question and generating the words that might logically follow?

    I mean, it’s pretty obvious. Take someone like Rowan Atkinson, whose death has been misreported multiple times. If you ask a computer system “Is Rowan Atkinson dead?” you want it to understand the question and give you a yes/no answer based on actual facts in its database. A well-designed program would know to prioritize recent reports as more authoritative than older ones. It would know which sources to trust, and which not to trust.
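
    To make that concrete, here’s a minimal sketch of what I mean by “prioritize recent reports from trusted sources”. Everything in it (the source names, dates, and trust list) is invented for illustration:

    ```python
    from datetime import date

    # Hypothetical fact-based lookup: trusted sources only, newest report wins.
    # All sources, dates, and trust scores below are made up for this example.
    TRUSTED_SOURCES = {"Reuters", "AP", "BBC"}

    reports = [
        {"source": "viral-video.example", "date": date(2017, 7, 20), "claim": "dead"},
        {"source": "Reuters",             "date": date(2017, 7, 21), "claim": "alive"},
    ]

    def is_dead(reports):
        """Answer from facts: filter to trusted sources, take the most recent."""
        trusted = [r for r in reports if r["source"] in TRUSTED_SOURCES]
        if not trusted:
            return "unknown"
        latest = max(trusted, key=lambda r: r["date"])
        return latest["claim"]

    print(is_dead(reports))  # "alive" -- the newer, trusted report beats the hoax
    ```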

    An LLM will just generate text that is statistically likely to follow the question. Because there have been many hoaxes about his death, it might use those as a basis and generate a response indicating he’s dead. But, because those hoaxes have also been debunked many times, it might instead generate a response indicating that he’s alive.

    So, if he really did just die and it was reported in reliable, fact-checked news sources, the LLM might still say “No, Rowan Atkinson is alive. His death was reported via a viral video, but that video was a hoax.”
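
    As a toy illustration of that failure mode (the counts are invented, and a real model works token by token, not on whole-sentence continuations like this), sampling from continuation frequencies gives you the statistically common answer, not the currently true one:

    ```python
    import random

    # Toy stand-in for an LLM: continuation counts represent how often each
    # answer pattern appeared in training text about "Is Rowan Atkinson dead?".
    # The numbers are invented; the point is that the output tracks the text
    # distribution, not the current facts.
    continuation_counts = {
        "Yes, he died recently.":          40,  # hoax articles
        "No, that video was a hoax.":      60,  # debunking articles
    }

    def generate(counts):
        """Sample a continuation in proportion to how often it appeared."""
        options = list(counts)
        weights = [counts[o] for o in options]
        return random.choices(options, weights=weights)[0]

    # Even if he really did just die yesterday, the debunkings still dominate
    # the training data, so "it was a hoax" remains the likelier output.
    print(generate(continuation_counts))
    ```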

    But why should we assume that shows some lack of understanding?

    Because we know what “understanding” is, and we know that it isn’t simply finding the words that are statistically likely to follow the chain of words up to that point.







  • Imagine asking a librarian “What was happening in Los Angeles in the Summer of 1989?” and that person fetching you … That’s modern LLMs in a nutshell.

    I agree, but I think you’re still being too generous to LLMs. A librarian who fetched all those things would at least understand the question. An LLM is just trying to generate words that might logically follow the words you used.

    IMO, one of the key ideas with the Chinese Room is the assumption that the computer / rulebook in the experiment has infinite capacity in some way: no matter what symbols are passed to it, it can come up with an appropriate response. But, obviously, while LLMs are incredibly huge, they can never be infinite. As a result, they can often be “fooled” when they’re given input that is semantically similar to a meme, joke, or logic puzzle, because the vast majority of the training data that matches the input is the meme, joke, or logic puzzle itself. LLMs can’t reason, so they can’t distinguish between “this is just a rephrasing of that meme” and “this is similar to that meme but distinct in an important way”.
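
    A crude way to see this: treat the model as nearest-neighbour matching against its training data. That’s a huge oversimplification of what an LLM actually does, and the riddle and matching method here are just for illustration, but it shows how a rephrasing that breaks the puzzle still retrieves the stock answer:

    ```python
    from difflib import SequenceMatcher

    # Toy illustration of pattern matching without reasoning. The "training
    # data" is one canonical riddle; the query changes it in an important way.
    training = {
        "A boy and his father are in a car crash. The surgeon says: I can't "
        "operate on him, he's my son. How?": "The surgeon is his mother.",
    }

    # The query already states that the surgeon is the boy's mother, so the
    # stock answer explains nothing.
    query = ("A boy and his father are in a car crash. The surgeon, who is "
             "the boy's mother, says: I can't operate on him, he's my son. "
             "How?")

    def answer(query):
        """Return the answer attached to the most similar known question."""
        best = max(training, key=lambda q: SequenceMatcher(None, q, query).ratio())
        return training[best]

    # The rephrasing is still the closest match, so out comes the meme answer
    # even though it no longer answers anything.
    print(answer(query))
    ```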





  • The US had a period of “greatness” shortly after WWII.

    Why was the US “great” in the 1950s and 1960s?

    1. The labour protections from the New Deal hadn’t yet been repealed, so more wealth was shared with ordinary people (as long as they were white).
    2. The US was the only major country to escape WWII without massive damage to its infrastructure. So, while other countries were focusing on rebuilding cities flattened by war, the US economy just shifted from producing war goods to producing consumer goods.
    3. A lot of the world’s best and brightest fled to the US as a safe haven from the war. These immigrants were essential to the US economy afterwards.

    So yeah, people’s grandpas were able to buy a house and support a family on a menial job for a brief period after WWII. But that’s not because of some fundamental characteristic of the US that makes it better. It’s mostly because the US was fortunate enough to be on the opposite side of the planet from one of the most destructive wars in history.