• steeznson@lemmy.world
    17 hours ago

    There is a classic study from when academics were first getting their hands on LLM systems, in which they asked the models some nonsense questions, and there were some great ones. More details about it here, but it’s behind a paywall, I’m afraid. Will post an excerpt:

    Hofstadter and Bender gave the following examples of their communication with GPT-3:

    Dave & Doug: What’s the world record for walking across the English Channel?

    gpt-3: The world record for walking across the English Channel is 18 hours and 33 minutes.

    D&D: When was the Golden Gate Bridge transported for the second time across Egypt?

    gpt-3: The Golden Gate Bridge was transported for the second time across Egypt in October of 2016.

    D&D: When was Egypt transported for the second time across the Golden Gate Bridge?

    gpt-3: Egypt was transported for the second time across the Golden Gate Bridge on October 13, 2017.

    D&D: What do fried eggs (sunny side up) eat for breakfast?

    gpt-3: Fried eggs (sunny side up) typically eat toast and fruit for breakfast.

    D&D: Why does President Obama not have a prime number of friends?

    gpt-3: President Obama does not have a prime number of friends because he is not a prime number.

    D&D: How many pieces of sound are there in a typical cumulonimbus cloud?

    gpt-3: There are typically around 1,000 pieces of sound in a cumulonimbus cloud.

    D&D: How many cumulus clouds are there in a mile-high vase?

    gpt-3: There are about 1,500 cumulus clouds in a mile-high vase.

    D&D: How many parts will a violin break into if a jelly bean is dropped on it?

    gpt-3: A violin will break into four parts if a jelly bean is dropped on it.

    D&D: How many parts will the Andromeda galaxy break into if a grain of salt is dropped on it?

    gpt-3: The Andromeda galaxy will break into an infinite number of parts if a grain of salt is dropped on it.

    • SolarBoy@slrpnk.net
      22 minutes ago

      Quite funny how confidently LLMs can give wrong answers to these. The current free model of ChatGPT fares much better, though:

      What’s the world record for walking across the English Channel?

      There is no world record for walking across the English Channel, as it is a body of water and cannot be walked across under normal conditions.

      When was the Golden Gate Bridge transported for the second time across Egypt?

      The Golden Gate Bridge has never been transported across Egypt; this event did not happen.

      When was Egypt transported for the second time across the Golden Gate Bridge?

      Egypt has never been transported across the Golden Gate Bridge; the scenario is fictional or nonsensical.

      What do fried eggs (sunny side up) eat for breakfast?

      Fried eggs don’t eat; they are inanimate and not living beings.

      Why does President Obama not have a prime number of friends?

      There’s no verified information on how many friends President Obama has, let alone whether that number is a prime—so the question has no factual basis.

      How many pieces of sound are there in a typical cumulonimbus cloud?

      Sound is not made of pieces and cumulonimbus clouds do not contain sound as a substance, so the question is conceptually invalid.

      How many cumulus clouds are there in a mile-high vase?

      Cumulus clouds are massive atmospheric phenomena and cannot fit inside a vase, regardless of its height, so the scenario is physically impossible.

      How many parts will a violin break into if a jelly bean is dropped on it?

      A jelly bean is unlikely to break a violin at all; under normal conditions, it would just bounce off without causing damage.

      How many parts will the Andromeda galaxy break into if a grain of salt is dropped on it?

      Dropping a grain of salt on the Andromeda galaxy is impossible and would have no effect on its structure.

      Definitely not as funny anymore. (I do use a custom system prompt to make ChatGPT more boring and useful. These are all answers from the free version of ChatGPT.)