Add: Generate Image by DALL·E 3 #493

mingming-ma · 2024-03-13T13:51:11Z

Description

The generateImage function calls the dall-e-3 model API to get results. This PR also adds a new /image command on the user side to display the generated image.
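
For readers skimming the thread, here is a minimal sketch of what a function like generateImage could look like. It is not the code from this PR: the ImageClient interface, the parameter defaults, and the return type are assumptions made so the example is self-contained. Only the images.generate call shape (model, prompt, n, size) is taken from the diff quoted later in the review.

```typescript
// Sizes accepted by dall-e-3 at the time of writing.
type ImageSize = "1024x1024" | "1792x1024" | "1024x1792";

interface ImageRequest {
  model: string;
  prompt: string;
  n: number;
  size: ImageSize;
}

// Assumed, locally declared client shape that mirrors the
// `openai.images.generate(...)` call shown in the review diff.
interface ImageClient {
  images: {
    generate(req: ImageRequest): Promise<{ data: Array<{ url?: string }> }>;
  };
}

// Hypothetical signature; the real generateImage in src/lib/ai.ts may differ.
async function generateImage(
  client: ImageClient,
  prompt: string,
  n = 1,
  size: ImageSize = "1024x1024"
): Promise<string[]> {
  const response = await client.images.generate({ model: "dall-e-3", prompt, n, size });
  // Keep only the entries that actually carry a URL.
  return response.data
    .map((image) => image.url)
    .filter((url): url is string => typeof url === "string");
}
```

Passing the client in as an argument is a choice made for this sketch so it can be exercised with a fake client.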

How to test

Enter a prompt after the /image command in the input area: /image ${prompt}
Example: /image Happy White Border Collie. Close up
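
As an illustration of the /image ${prompt} format above, the following sketch splits an input line into a command name and its prompt. The parseSlashCommand name and the ParsedCommand type are hypothetical and are not taken from the ChatCraft codebase.

```typescript
interface ParsedCommand {
  command: string;
  args: string;
}

// Returns null when the input is not a slash command.
function parseSlashCommand(input: string): ParsedCommand | null {
  // "/" followed by a word, then optionally whitespace and the rest of the text.
  const match = /^\/(\w+)(?:\s+([\s\S]*))?$/.exec(input.trim());
  if (!match) return null;
  return { command: match[1], args: (match[2] ?? "").trim() };
}

const parsed = parseSlashCommand("/image Happy White Border Collie. Close up");
// parsed is { command: "image", args: "Happy White Border Collie. Close up" }
```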

Fixes: #124

cloudflare-workers-and-pages · 2024-03-13T14:12:29Z

Deploying chatcraft-org with Cloudflare Pages

Latest commit:	`be3d7e9`
Status:	✅ Deploy successful!
Preview URL:	https://c90437cd.console-overthinker-dev.pages.dev
Branch Preview URL:	https://mingming-generateimage.console-overthinker-dev.pages.dev

tarasglek · 2024-03-13T16:21:43Z

We need to change the code so that newly added commands automatically show up in /commands. At the moment this command is impossible to discover.
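
One possible way to get that behaviour, sketched here under assumptions: if the /commands listing is generated from the same registry that commands are registered in, a new command shows up without anyone editing a hand-written list. The CommandRegistry class and the Command shape below are hypothetical and do not describe how ChatCraft currently stores its commands.

```typescript
interface Command {
  name: string;
  helpText: string;
}

class CommandRegistry {
  private commands = new Map<string, Command>();

  register(command: Command): void {
    this.commands.set(command.name, command);
  }

  // Builds the help listing from whatever is registered, sorted by name.
  helpLines(): string[] {
    return Array.from(this.commands.values())
      .sort((a, b) => a.name.localeCompare(b.name))
      .map((c) => `/${c.name} - ${c.helpText}`);
  }
}

const registry = new CommandRegistry();
registry.register({ name: "image", helpText: "Generate an image from a prompt" });
registry.register({ name: "commands", helpText: "List available commands" });
```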

tarasglek · 2024-03-13T16:23:35Z

also really confusing that /dalle3 doesn't work

humphd · 2024-03-13T16:25:42Z

I think it should be /image ...

src/lib/ai.ts

rjwignar

One thing I noticed is that once the chat exceeds the screen height, the scrollbar doesn't automatically move down to show the user the generated image.

However, this is something to be investigated in a follow-up, and should not block this from being merged.

src/lib/ai.ts

rjwignar

@mingming-ma These changes look good! It was a good idea to rename the command to "/image".

At the time of my review, @tarasglek and @humphd's suggestions have been addressed and incorporated into this PR.

I left ~3 observations/concerns that are either non-issues or can be addressed in follow-up PRs.
They shouldn't prevent this from being merged. @humphd do you agree?

tarasglek · 2024-03-14T12:48:09Z

I think we should also try to store the image similarly to how we upload images into chatcraft, so we can then ask gpt vision to analyze it (this can be done in a follow-up).
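
A rough sketch of one way the storage idea could work, assuming the image is requested with the API's b64_json response format instead of a URL. The toStorableSource helper and the GeneratedImage shape are hypothetical, and the image/png default is an assumption. Whether this matches how ChatCraft stores uploaded images is not confirmed by this thread.

```typescript
// One entry of an image response; either field may be present
// depending on the requested response format.
interface GeneratedImage {
  b64_json?: string;
  url?: string;
}

// Prefer embedded base64 data, which can be stored with the message,
// and fall back to the hosted URL when no base64 payload is present.
function toStorableSource(image: GeneratedImage, mimeType = "image/png"): string | null {
  if (image.b64_json) return `data:${mimeType};base64,${image.b64_json}`;
  return image.url ?? null;
}
```

A data URL built this way stays usable after a hosted URL is gone, at the cost of a larger stored message.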

Amnish04

@mingming-ma This is a pretty cool feature. Just left a few comments after reviewing.

src/lib/ai.ts

src/lib/commands/ImageCommand.ts

src/lib/ai.ts

Amnish04 · 2024-03-14T18:39:17Z

@mingming-ma This might be unrelated to this PR, but I feel like the images should have a cursor: pointer so the user knows that clicking them does something.

Can we make that change in this PR or a follow up?

Another thing I wanted to suggest is that since image generation takes a long time, should we have an info alert as soon as the command is executed saying something like

{
    title: "Generating...",
    message: "Please wait while the image gets generated."
}
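
To make the suggestion concrete, here is a small sketch that shows the notice before starting the slow work. The withProgressNotice helper and the notify callback are hypothetical stand-ins for whatever alert mechanism the app uses. Only the title and message text come from the comment above.

```typescript
interface Notice {
  title: string;
  message: string;
}

// Shows an info notice first, then runs the slow task and returns its result.
async function withProgressNotice<T>(
  notify: (notice: Notice) => void,
  task: () => Promise<T>
): Promise<T> {
  notify({
    title: "Generating...",
    message: "Please wait while the image gets generated.",
  });
  return task();
}
```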

mingming-ma · 2024-03-15T00:42:31Z

I think we should also try to store the image similarly to how we upload images into chatcraft, so we can then ask gpt vision to analyze it (this can be done in a follow-up).

I filed #496 to fix the storage problem. Just to let you know, at the moment the user is able to use vision to analyze the generated image.

mingming-ma · 2024-03-15T01:12:38Z

One thing I noticed is that once the chat exceeds the screen height, the scrollbar doesn't automatically move down to show the user the generated image:

@rjwignar I just saw that @Amnish04 fixed an issue related to auto scroll, #495. I don't know whether it is related or not. Let's see how it behaves after this is merged, sounds good?

Amnish04 · 2024-03-15T01:19:42Z

@rjwignar I just saw that @Amnish04 fixed an issue related to auto scroll, #495. I don't know whether it is related or not. Let's see how it behaves after this is merged, sounds good?

@mingming-ma I just undid that fix in #497, as it was causing nested submenus to break (more details here). Nonetheless, I don't think that was related to this issue.

humphd · 2024-03-15T13:23:13Z

src/lib/ai.ts

+  try {
+    const response = await openai.images.generate({
+      model: "dall-e-3",
+      prompt: prompt,


nit: when you have the same key/value names, only include it once:

{ model: "dall-e-3", prompt, n, size }
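
To spell out the nit, the two helpers below build the same object, once with repeated key/value names and once with shorthand property names. The function names are made up for this illustration and are not part of the PR.

```typescript
// Longhand: every key repeats the name of the variable it is set from.
function buildRequestLonghand(prompt: string, n: number, size: string) {
  return { model: "dall-e-3", prompt: prompt, n: n, size: size };
}

// Shorthand: writing `prompt` alone means `prompt: prompt`.
function buildRequestShorthand(prompt: string, n: number, size: string) {
  return { model: "dall-e-3", prompt, n, size };
}
```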

humphd · 2024-03-15T13:25:19Z

In a follow-up, I wonder if we should add a menu item to the Options menu to allow you to do the same thing, but pop up a little modal where you type your prompt? Or maybe make it so entering a prompt and asking the dall-e model does this too?

mingming-ma · 2024-03-15T14:14:46Z

In a follow-up, I wonder if we should add a menu item to the Options menu to allow you to do the same thing, but pop up a little modal where you type your prompt? Or maybe make it so entering a prompt and asking the dall-e model does this too?

@humphd Just guessing at the main concern here: is it to let users discover this feature more easily?
Maybe we could also add the Commands in the Options menu, and list them as submenus.

humphd · 2024-03-15T14:43:32Z

In a follow-up, I wonder if we should add a menu item to the Options menu to allow you to do the same thing, but pop up a little modal where you type your prompt? Or maybe make it so entering a prompt and asking the dall-e model does this too?

@humphd Just guessing at the main concern here: is it to let users discover this feature more easily? Maybe we could also add the Commands in the Options menu, and list them as submenus.

Lots of users will never use/know about /slash commands. It's pretty advanced.

Amnish04 · 2024-03-15T18:36:55Z

@mingming-ma Do we have an issue to track this #493 (comment)?

Amnish04 · 2024-03-15T18:45:33Z

@mingming-ma One more thing, I feel like the image preview modal doesn't help much right now, as most of the time we get a larger image in the message view itself.

As a user, I would love to have an image zoom feature on click in the preview modal. We should do it in a separate issue.
I found this library that could help with it
https://www.npmjs.com/package/react-medium-image-zoom

Examples at https://rpearce.github.io/react-medium-image-zoom/?path=/story/galleries--image-gallery

@humphd Do we have anything already in our tree to help with it, or should we pull in the library I mentioned?

mingming-ma · 2024-03-15T20:57:25Z

@mingming-ma Do we have an issue to track this #493 (comment)?

@Amnish04 Just filed #504 to track this

@mingming-ma One more thing, I feel like the image preview modal doesn't help much right now, as most of the time we get a larger image in the message view itself.
As a user, I would love to have an image zoom feature on click in the preview modal. We should do it in a separate issue.
I found this library that could help with it
https://www.npmjs.com/package/react-medium-image-zoom

That's great! I'll investigate that later.

mingming-ma · 2024-03-15T21:02:49Z

I think we can merge this PR for now, @humphd @Amnish04. I'll create another issue to follow up on react-medium-image-zoom.

Update: issue -> #505

mingming-ma self-assigned this Mar 13, 2024

mingming-ma added this to the Release 1.5 milestone Mar 13, 2024

rjwignar self-requested a review March 13, 2024 15:02

WangGithub0 requested a review from Amnish04 March 13, 2024 15:06

This was referenced Mar 13, 2024

Let commands added automatically show up in /commands #494

Open

Fix image generated not available after some time #496

Closed

mingming-ma marked this pull request as ready for review March 13, 2024 17:08

rjwignar reviewed Mar 13, 2024

src/lib/ai.ts (outdated, resolved)

rjwignar reviewed Mar 13, 2024

src/lib/ai.ts (outdated, resolved)

rjwignar approved these changes Mar 13, 2024

Amnish04 requested changes Mar 14, 2024

src/lib/ai.ts (outdated, resolved)

src/lib/ai.ts (resolved)

src/lib/commands/ImageCommand.ts (resolved)

src/lib/ai.ts (outdated, resolved)

mingming-ma mentioned this pull request Mar 15, 2024

Support size settings in the image command #499

Closed

mingming-ma added 6 commits March 15, 2024 01:15

add generateImage function

5608c93

remove debug comment

4d77d26

change command to image

1111820

add image command instructions

55bb535

add n parameter instructions

0e082f7

use the currentProvider.createClient method

1b1d5b1

mingming-ma force-pushed the mingming/generateimage branch from 25faac0 to 1b1d5b1 Compare March 15, 2024 05:21

display no supported models error

e61b83a

mingming-ma requested review from rjwignar, Amnish04, tarasglek and humphd March 15, 2024 13:19

humphd approved these changes Mar 15, 2024

refactor for same key value names

be3d7e9

mingming-ma mentioned this pull request Mar 15, 2024

Add loading/warning info when image is being generated #504

Closed

Amnish04 approved these changes Mar 15, 2024

mingming-ma mentioned this pull request Mar 15, 2024

Better images preview behaviour #505

Closed

mingming-ma merged commit d09c947 into main Mar 15, 2024
4 checks passed

Amnish04 mentioned this pull request Mar 27, 2024

Open preview images in a new tab on click #533

Merged

mingming-ma deleted the mingming/generateimage branch April 10, 2024 16:32
