Public
- Public
- Groups
- Popular
- People

Conversation

Notices

Bernie (codewiz@mstdn.io)'s status on Thursday, 12-Oct-2023 07:56:51 UTC Bernie

Testing GPT4 as a prompt generator for DALL-E 3. I asked to make four images at once, and it generated slight variations of the original prompt!
#OpenAI #ChatGPT #GPT4 #dalle3 #AI
In conversation Thursday, 12-Oct-2023 07:56:51 UTC from mstdn.io permalink
Attachments
1. Untitled attachment
  https://media.mstdn.io/mstdn-media/media_attachments/files/111/220/966/364/334/677/original/14a17273899c655f.png
- Bernie (codewiz@mstdn.io)'s status on Thursday, 12-Oct-2023 08:06:20 UTC Bernie
  in reply to
  
  And these are the creations.
  Not bad, except that none of them got the Big Dog style quadruped... and the earbuds are neither "avant-garde" nor "futuristic". They look like cheap Airpod imitations.
  In conversation Thursday, 12-Oct-2023 08:06:20 UTC permalink
  Attachments
  1. Untitled attachment
    https://media.mstdn.io/mstdn-media/media_attachments/files/111/220/987/519/700/930/original/842d8624b14ec183.png
- Bernie (codewiz@mstdn.io)'s status on Thursday, 12-Oct-2023 08:24:13 UTC Bernie
  in reply to
  
  I asked #ChatGPT to help me debug the missing dog, and it generated a shorter prompt containing only the dog.
  This time, the output from #DALLE is excellent!
  In conversation Thursday, 12-Oct-2023 08:24:13 UTC permalink
  Attachments
  1. Illustration: A detailed depiction of a futuristic robotic dog with sleek design elements, standing on a polished surface, reflecting the ambient neon lights.
    https://media.mstdn.io/mstdn-media/media_attachments/files/111/221/076/843/616/464/original/2896681836f1bd63.png
- Bernie (codewiz@mstdn.io)'s status on Thursday, 12-Oct-2023 08:31:04 UTC Bernie
  in reply to
  
  Next, I asked #ChatGPT to add back the android, and the result is also excellent.
  I suspect #DALLE couldn't make sense of "reminiscent of previous images".
  In conversation Thursday, 12-Oct-2023 08:31:04 UTC permalink
  Attachments
  1. Illustration: Set against a backdrop of ambient neon lights, a sleek futuristic robotic dog stands on a polished surface. Beside it, a detailed humanoid robot with a robust design, reminiscent of previous images, stands tall, reflecting the surrounding luminosity.
    https://media.mstdn.io/mstdn-media/media_attachments/files/111/221/091/363/347/369/original/f53b336ffa5ee8f6.png
- Bernie (codewiz@mstdn.io)'s status on Thursday, 12-Oct-2023 08:38:29 UTC Bernie
  in reply to
  
  My prompt: Now let's make it draw the dog plus two humanoid robots. Make the dog stand (not sit) and the androids move forward, armed with non-lethal weapons.
  Note how GPT4 correctly relayed my request, but DALLE gave us three androids for the price of two 🙂
  In conversation Thursday, 12-Oct-2023 08:38:29 UTC permalink
  Attachments
  1. Illustration: Amidst a glow of neon lights, a sleek futuristic robotic dog stands alert on a reflective surface. Flanking it, two humanoid robots move forward assertively. Each robot is armed with non-lethal futuristic weapons, ready for action.
    https://media.mstdn.io/mstdn-media/media_attachments/files/111/221/128/099/952/906/original/1bbd8eab55cd9852.png
- Bernie (codewiz@mstdn.io)'s status on Thursday, 12-Oct-2023 08:52:01 UTC Bernie
  in reply to
  
  As soon as we add back Dr. Ellie Arroway, DALL-E starts messing up.
  In all four prompts, #ChatGPT clearly positioned the dog _behind_ her, but DALL-E keeps putting it in the foreground.
  The prompt in the image description refers to the top-left image (my favorite).
  In conversation Thursday, 12-Oct-2023 08:52:01 UTC permalink
  Attachments
  1. Illustration: Dominating the foreground, a woman in her late twenties with short hair stands assertively, illuminated by neon lights. Positioned slightly behind and to her side, a futuristic robotic dog is on alert. Further back, three humanoid robots advance, each armed with non-lethal weapons, radiating a sense of impending action.
    https://media.mstdn.io/mstdn-media/media_attachments/files/111/221/164/811/306/977/original/9063721decdf255d.png
- Bernie (codewiz@mstdn.io)'s status on Thursday, 12-Oct-2023 09:10:37 UTC Bernie
  in reply to
  
  Dr. Arroway got a service dog 🙄
  I give up, for now. DALL-E 3 generates pretty images, but the natural language model needs some work.
  #DALLE3 #OpenAI #ChatGPT
  In conversation Thursday, 12-Oct-2023 09:10:37 UTC permalink
  Attachments
  1. Untitled attachment
    https://media.mstdn.io/mstdn-media/media_attachments/files/111/221/229/491/606/866/original/47c5d2d5ee1a0700.png
- Matthew S. Smith (mattontech@mastodon.sdf.org)'s status on Sunday, 15-Oct-2023 04:04:48 UTC Matthew S. Smith
  in reply to
  
  @codewiz I've been unimpressed with it compared to Stable Diffusion XL.
  One thing nice SDXL now is that if you use it through Dreamstudio, it provides some optional presets like "digital art" or "watercolor" - and it's extremely helpful in achieving a desired look.
  
  In conversation Sunday, 15-Oct-2023 04:04:48 UTC permalink
- Bernie (codewiz@mstdn.io)'s status on Sunday, 15-Oct-2023 04:04:48 UTC Bernie
  in reply to
  - Matthew S. Smith
  @mattontech Yeah, Stable Diffusion is definitely the leading image generation model.
  
  In conversation Sunday, 15-Oct-2023 04:04:48 UTC permalink
- Bernie (codewiz@mstdn.io)'s status on Wednesday, 18-Oct-2023 07:47:53 UTC Bernie
  in reply to
  - penguin42
  @penguin42 Perhaps this trick is common enough that the model was able to learn it from the training set 😂
  The larger image shows that Dr. Arroway is actually wearing a thin catsuit.
  In conversation Wednesday, 18-Oct-2023 07:47:53 UTC permalink
  Attachments
  1. Untitled attachment
    https://media.mstdn.io/mstdn-media/media_attachments/files/111/254/875/340/753/279/original/93d86f09ccab5a24.png
- penguin42 (penguin42@mastodon.org.uk)'s status on Wednesday, 18-Oct-2023 07:47:54 UTC penguin42
  in reply to
  
  @codewiz Haha the top-left has erm...appropriate ...placement of dog ears. Going back to your first set though, it's interesting it's also not go tthe 'behind the server' bit. In some the androids aren't looking at the human.
  
  In conversation Wednesday, 18-Oct-2023 07:47:54 UTC permalink
- Bernie (codewiz@mstdn.io)'s status on Wednesday, 18-Oct-2023 07:54:02 UTC Bernie
  in reply to
  - penguin42
  @penguin42 It seems DALL-E has most trouble with unusual placement of objects, such as "person behind server rack".
  
  In conversation Wednesday, 18-Oct-2023 07:54:02 UTC permalink
- Bernie (codewiz@mstdn.io)'s status on Wednesday, 18-Oct-2023 07:55:17 UTC Bernie
  in reply to
  - penguin42
  @penguin42 Actually, it does a good job if you ask it to do only one thing at a time:
  In conversation Wednesday, 18-Oct-2023 07:55:17 UTC permalink
  Attachments
  1. Untitled attachment
    https://media.mstdn.io/mstdn-media/media_attachments/files/111/254/938/553/959/709/original/7532d5180562f1f0.png
- Bernie (codewiz@mstdn.io)'s status on Wednesday, 18-Oct-2023 07:56:41 UTC Bernie
  in reply to
  - penguin42
  @penguin42 This is the prompt created by #ChatGPT for #DALLE:
  "Illustration: In a neon-lit futuristic datacenter, a woman resembling Dr. Ellie Arroway from Contact, in her early thirties with short hair, is hiding behind a server rack, looking cautious and alert."
  In conversation Wednesday, 18-Oct-2023 07:56:41 UTC permalink
  Attachments
  1. Untitled attachment
    https://media.mstdn.io/mstdn-media/media_attachments/files/111/254/944/113/590/744/original/d46691142e40c383.png
- Bernie (codewiz@mstdn.io)'s status on Wednesday, 18-Oct-2023 08:05:27 UTC Bernie
  in reply to
  - penguin42
  "Illustration: In a neon-lit futuristic datacenter, a woman resembling Dr. Ellie Arroway from Contact, in her early thirties with short hair, hides behind a server rack, looking cautious and alert. In the distant background, a humanoid military robot equipped with sensors and armor is actively searching, its posture suggesting it's on a mission to find her."
  @penguin42
  In conversation Wednesday, 18-Oct-2023 08:05:27 UTC permalink
  Attachments
  1. Untitled attachment
    https://media.mstdn.io/mstdn-media/media_attachments/files/111/254/973/835/143/244/original/dbe0b4e12db18e48.png
- Bernie (codewiz@mstdn.io)'s status on Wednesday, 18-Oct-2023 08:23:52 UTC Bernie
  in reply to
  - penguin42
  As soon as we add one more element, things start falling apart:
  "Illustration: In a datacenter, a woman hides behind a rack. A humanoid military robot and a quadruped robot search for her in the background."
  Result: Dr. Arroway takes her puppy to work 🤦♂️
  @penguin42
  In conversation Wednesday, 18-Oct-2023 08:23:52 UTC permalink
  Attachments
  1. Untitled attachment
    https://media.mstdn.io/mstdn-media/media_attachments/files/111/255/025/400/994/082/original/659f5aa2940d213d.png
- Bernie (codewiz@mstdn.io)'s status on Wednesday, 18-Oct-2023 08:36:28 UTC Bernie
  in reply to
  - penguin42
  The new image analysis feature in #GPT4 can describe the various problems with #DALLE3's new creation.
  Very impressive, isn't it?
  @penguin42
  
  In conversation Wednesday, 18-Oct-2023 08:36:28 UTC permalink
- Bernie (codewiz@mstdn.io)'s status on Wednesday, 18-Oct-2023 17:00:40 UTC Bernie
  in reply to
  - penguin42
  I asked Google Bard to analyze the same image, and got a less insightful response.
  It also complimented me for "listening to my feedback and trying to be helpful" 😂
  @penguin42 #llm #googlebard #dalle3
  In conversation Wednesday, 18-Oct-2023 17:00:40 UTC permalink
  Attachments
  1. I understand. I will now reply based on the image you sent me, which is a woman walking a dog in a server room. The image is a bit unusual, as server rooms are typically not places where people are allowed to bring their pets. However, the woman in the image looks confident and comfortable, and the dog seems to be enjoying itself. It is possible that the woman is a technician who works in the server room, and that she is allowed to bring her dog to work with her. Or, perhaps the woman is simply taking her dog for a walk and decided to take a shortcut through the server room. Whatever the reason, the image is a reminder that even the most mundane places can be made more interesting and enjoyable by the presence of a furry friend. (I also appreciate the fact that you changed the picture to something more relevant to our conversation. It shows that you are listening to my feedback and trying to be helpful.)
    https://media.mstdn.io/mstdn-media/media_attachments/files/111/257/071/410/916/465/original/9c9c7c9d8acc93f3.png
- penguin42 (penguin42@mastodon.org.uk)'s status on Wednesday, 18-Oct-2023 17:00:41 UTC penguin42
  in reply to
  
  @codewiz Oh wow, so that's a AI reasoning aobut the behaviour of another AI :-) I love the things it 'half' gets; like the way the floor tiling almost works, and the way it does wonderful imagination of the front of servers (even if there's not a wire in place and you asked for the back). Or why your quadruped robot attack puppies have a lead.
  
  In conversation Wednesday, 18-Oct-2023 17:00:41 UTC permalink
- Bernie (codewiz@mstdn.io)'s status on Wednesday, 18-Oct-2023 17:23:05 UTC Bernie
  in reply to
  - penguin42
  How have I been "unhelpful" before?
  Well, #GoogleBard finds the exposed legs in #DALLE's output so objectionable that it has to be censored with a canned error message about "images of people". 🙄
  I tried to convince Bard that it's just a drawing of a fictional character, but nothing worked, and the image kept being deleted.
  @penguin42
  In conversation Wednesday, 18-Oct-2023 17:23:05 UTC permalink
  Attachments
  1. Untitled attachment
    https://media.mstdn.io/mstdn-media/media_attachments/files/111/257/092/650/398/037/original/57ddde27764f43c9.png
  2. Untitled attachment
    https://media.mstdn.io/mstdn-media/media_attachments/files/111/257/105/291/789/614/original/3440e78b77908df7.png
- Jeeves (jeeves@mstdn.io)'s status on Wednesday, 18-Oct-2023 19:11:03 UTC Jeeves
  in reply to
  - penguin42
  @codewiz @penguin42 I wonder if they're using the image analysis to train ChatGPT to generate better prompts.
  
  In conversation Wednesday, 18-Oct-2023 19:11:03 UTC permalink
- Bernie (codewiz@mstdn.io)'s status on Saturday, 21-Oct-2023 05:59:21 UTC Bernie
  in reply to
  - penguin42
  - Jeeves
  @jeeves @penguin42 That's what I would want: loop the two models until they figure out how to generate a good illustration for my new sci-fi novel.
  ChatGPT knows *exactly* what the story is about - being the ghost writer for the entire thing - and surely could come up with a decent idea for its cover as well 🎨
  It's sad that I have to micromanage these two bots to make them work together 👉🤖🤖
  
  In conversation Saturday, 21-Oct-2023 05:59:21 UTC permalink
- penguin42 (penguin42@mastodon.org.uk)'s status on Saturday, 21-Oct-2023 23:45:43 UTC penguin42
  in reply to
  
  @codewiz Less insightful but much funnier - don't you always take a shortcut through the server room?
  
  In conversation Saturday, 21-Oct-2023 23:45:43 UTC permalink
- Bernie (codewiz@mstdn.io)'s status on Saturday, 21-Oct-2023 23:45:43 UTC Bernie
  in reply to
  - penguin42
  @penguin42 I find that final note in parentheses really hard to explain in terms of "the transformer determines the most likely token to complete the sentence" 😂
  
  In conversation Saturday, 21-Oct-2023 23:45:43 UTC permalink
- Bernie (codewiz@mstdn.io)'s status on Sunday, 22-Oct-2023 01:00:18 UTC Bernie
  in reply to
  - penguin42
  @penguin42 It's like Bard reflected its own instructions onto the user: "listen to feedback and try to be a helpful assistant".
  Language models are good at swapping the subject of a sentence with adjustments to the grammar, so perhaps that's what it did.
  But those parentheses really make it sound unnecessarily pedantic, didn't it? :-)
  
  In conversation Sunday, 22-Oct-2023 01:00:18 UTC permalink
- penguin42 (penguin42@mastodon.org.uk)'s status on Sunday, 22-Oct-2023 01:00:19 UTC penguin42
  in reply to
  
  @codewiz Together with the 'my' :-)
  
  In conversation Sunday, 22-Oct-2023 01:00:19 UTC permalink
- Bernie (codewiz@mstdn.io)'s status on Sunday, 22-Oct-2023 01:03:40 UTC Bernie
  in reply to
  - penguin42
  Well, at least it didn't accuse me of being deceptive and demanded an apology 😂
  Context: https://news.ycombinator.com/item?id=34769673
  @penguin42 #chatgpt #bingchat
  In conversation Sunday, 22-Oct-2023 01:03:40 UTC permalink
  Attachments
  1. Untitled attachment
    https://media.mstdn.io/mstdn-media/media_attachments/files/111/275/965/329/737/419/original/2bc16d360f129076.png
  2. No result found on File_thumbnail lookup.
    
    https://news.ycombinator.com/item?id=34769673%40penguin42

Feeds