Bobinas P4G
  • Login
  • Public

    • Public
    • Groups
    • Popular
    • People

Conversation

Notices

  1. Carl T. Bergstrom (ct_bergstrom@fediscience.org)'s status on Monday, 23-Oct-2023 18:04:16 UTC Carl T. Bergstrom Carl T. Bergstrom

    It is absolutely astounding to me that we are still earnestly entertaining the possibility that #ChatGPT and #LLMS more broadly have a role in scientific writing, manuscript review, experimental design, etc.

    The training data for the question below are massive. It's a very easy question if you're trained on the entire internet.

    Question: What teams have never made it to the World Series?

    Correct answer: Seattle Mariners.

    Now, four responses from GPT4.

    NB: The Nationals won it all in 2019.

    In conversation Monday, 23-Oct-2023 18:04:16 UTC from fediscience.org permalink

    Attachments


    1. https://fediscience.org/system/media_attachments/files/111/282/215/376/439/416/original/94a97b5bfd2f9459.png

    2. https://fediscience.org/system/media_attachments/files/111/282/218/693/435/062/original/06fc753ca15223fc.png

    3. https://fediscience.org/system/media_attachments/files/111/282/227/513/231/479/original/1a418030eb706ca2.png

    4. https://fediscience.org/system/media_attachments/files/111/282/235/468/562/552/original/13f0651c12553b0b.png
    • jartigag repeated this.
    • Carl T. Bergstrom (ct_bergstrom@fediscience.org)'s status on Monday, 23-Oct-2023 18:04:18 UTC Carl T. Bergstrom Carl T. Bergstrom
      in reply to

      I had GPT regenerate the answer 20 times. A few things to note:

      1. Factual error rate: the system correctlu answered 1 time in 20.

      2. Run-to-run inconsistency. I get different answers each time.

      3. Logical errors and internally contradictory text in which one paragraph says a team did play and another says it didn't.

      4. One attempt to self-correct that still doesn't quite work.

      How could we think this sort of thing is useful for writing or even reviewing our work?

      In conversation Monday, 23-Oct-2023 18:04:18 UTC permalink

Feeds

  • Activity Streams
  • RSS 2.0
  • Atom
  • Help
  • About
  • FAQ
  • Privacy
  • Source
  • Version
  • Contact

Bobinas P4G is a social network. It runs on GNU social, version 2.0.1-beta0, available under the GNU Affero General Public License.

Creative Commons Attribution 3.0 All Bobinas P4G content and data are available under the Creative Commons Attribution 3.0 license.