
How would I do RGB 2 RGB image2image translation with this repo? #51

Open
adeptflax opened this issue May 25, 2021 · 25 comments

@adeptflax

I have 512x512 pixel images I would like to do image2image translation on.

@adeptflax
Author

I don't understand how the config works.
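
For anyone else stuck on the same point: the configs in this repo all follow one pattern, where main.py instantiates whatever class `target` names and passes `params` to it as keyword arguments. Below is a minimal sketch modeled on configs/custom_vqgan.yaml; the list-file paths are placeholders, and the 512 sizes (plus the attention resolution) are guesses for adapting the stock 256 settings to 512x512 images, so double-check against the shipped config.

```yaml
model:
  base_learning_rate: 4.5e-6
  target: taming.models.vqgan.VQModel        # class that main.py instantiates
  params:                                    # kwargs passed to that class
    embed_dim: 256
    n_embed: 1024                            # codebook size
    ddconfig:                                # encoder/decoder architecture
      double_z: false
      z_channels: 256
      resolution: 512                        # stock custom config uses 256
      in_channels: 3                         # RGB in
      out_ch: 3                              # RGB out
      ch: 128
      ch_mult: [1, 1, 2, 2, 4]               # 4 downsamplings -> f=16
      num_res_blocks: 2
      attn_resolutions: [32]                 # stock value is 16 (for 256 inputs)
      dropout: 0.0
    lossconfig:
      target: taming.modules.losses.vqperceptual.VQLPIPSWithDiscriminator
      params:
        disc_start: 10000                    # step at which the GAN loss kicks in
        disc_weight: 0.8
        codebook_weight: 1.0

data:
  target: main.DataModuleFromConfig
  params:
    batch_size: 4
    num_workers: 8
    train:
      target: taming.data.custom.CustomTrain
      params:
        training_images_list_file: some/training.txt   # placeholder
        size: 512
    validation:
      target: taming.data.custom.CustomTest
      params:
        test_images_list_file: some/test.txt           # placeholder
        size: 512
```

Training is then launched by pointing main.py at the file, e.g. `python main.py --base path/to/this.yaml -t True --gpus 0,`.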

@adeptflax adeptflax changed the title How would I would do RGB 2 RGB image2image translation with this repo? How would I do RGB 2 RGB image2image translation with this repo? May 26, 2021
@adeptflax
Author

I think I figured out how to do this. I'll try training a model tomorrow.

@Guthman

Guthman commented Jun 5, 2021

@adeptflax Can you share your code?

@adeptflax
Author

@Guthman I'm still working on it. I got it to train. I need to test the model.

@adeptflax
Author

I'll publish the code once I get it working.

@1211sh

1211sh commented Jun 7, 2021

Can you share your intuition? I have no idea how to revise this to work on the I2I task.

@adeptflax
Author

adeptflax commented Jun 8, 2021

The codebase is pretty much spaghetti code. I tried modifying drin, because it does something similar to image2image. The way I tried to modify it didn't work, but I think I know one of the problems.

@adeptflax
Author

I think I got it working. I only have the first epoch of my model trained, and I need to wait for it to finish to know for sure. I'll write a guide and then publish the code I used.

@adeptflax
Author

I had to fix something, but I do seem to have gotten it working. I'll post a guide tomorrow if it works well.

@adeptflax
Author

adeptflax commented Jun 14, 2021

Sorry guys, I procrastinated for a couple of days. I have gotten code to work that can train and run an image2image model. I don't know how it compares to pix2pixHD. I slightly screwed up the input data on the dataset I was training on, though I should be able to recover from it without completely restarting training.

@adeptflax
Author

Here it is; it should work: https://github.com/adeptflax/image2image

@adeptflax adeptflax reopened this Jun 18, 2021
@adeptflax
Author

adeptflax commented Jun 18, 2021

@Guthman @1211sh I don't seem to get very good results by epoch 36 on around 11,000 training examples. Does it just need to be trained for longer, or does something need to be changed? Any guesses? My output is faces; the hair and eyebrows don't have detail.

@Guthman

Guthman commented Jun 19, 2021

I don't remember where I read it (can't find it atm), but I think the authors trained theirs for five days on a V100 or something similar, so I think you have a bit to go. I'm training one for a bit on portrait paintings (~40k images), and although the reconstructions are starting to look okay (after 34 epochs, I think):

[reconstruction samples: reconstructions_gs-091070_e-000080_b-000750]

the validation examples weren't close to acceptable:
[validation samples: vq_val]

I basically copied the imagenet config but used a batch size of 8.
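
For reference, that change would live in the data section of the copied config. A trimmed sketch, with the dataset entry as a placeholder for the portrait data (the stock imagenet config uses its own dataset classes and a different batch size):

```yaml
data:
  target: main.DataModuleFromConfig
  params:
    batch_size: 8          # the one deliberate change from the copied config
    num_workers: 8
    train:
      target: taming.data.custom.CustomTrain      # placeholder for the portrait dataset
      params:
        training_images_list_file: portraits_train.txt   # placeholder path
        size: 256
```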

I switched to StyleGAN2-ADA to finish my current project, but I'll come back to VQGAN.

@adeptflax
Author

@Guthman I saved the model output and just used pix2pixHD. Though pix2pixHD doesn't do as well as I need. Do you think random cropping would help?

@adeptflax
Author

Maybe using the transformer instead of just the VQGAN would work? Maybe it's possible to pretrain on a face dataset? I'm doing stuff with faces.

@adeptflax
Author

I trained on 2 RTX 3090s for 2 days, I think. So I would have to train for another 6 days, because 512x512 is 4 times larger than 256x256?

@adeptflax
Author

@Guthman what's the resolution of your dataset?

@adeptflax
Author

adeptflax commented Jun 20, 2021

Do the transformer models first pre-train a VQGAN and then train the transformer on top of it?

@adeptflax
Author

I wonder what the problem is on #52.

@adeptflax
Author

Actually, it seems you need to first train a VQGAN model, and then you can train a transformer. Maybe that's the problem with #52. You would first train a model with faceshq_vqgan.yaml and then train a transformer with faceshq_transformer.yaml using the first VQGAN model.
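
That matches how the shipped configs are wired: the second-stage (transformer) config embeds a first_stage_config whose ckpt_path points at the checkpoint produced in stage one. A trimmed sketch in the shape of faceshq_transformer.yaml; the checkpoint path is a placeholder for your own run, and the omitted ddconfig/lossconfig have to match the stage-one config:

```yaml
model:
  target: taming.models.cond_transformer.Net2NetTransformer
  params:
    transformer_config:
      target: taming.modules.transformer.mingpt.GPT
      params:
        vocab_size: 1024       # should match the first stage's n_embed
        block_size: 512
        n_layer: 24
        n_head: 16
        n_embd: 1024
    first_stage_config:
      target: taming.models.vqgan.VQModel
      params:
        ckpt_path: logs/<your_vqgan_run>/checkpoints/last.ckpt   # stage-one checkpoint (placeholder)
        embed_dim: 256
        n_embed: 1024
        # ddconfig/lossconfig omitted; copy them from the stage-one config
    # cond_stage_config omitted here; it defines what the transformer is conditioned on
```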

@adeptflax
Author

Does the transformer just modify the encodings?

@adeptflax
Author

OK, I seem to be correct. In drin they created a depth VQGAN and an imagenet VQGAN model. So the whole drin pipeline goes depth VQGAN model -> transformer -> image VQGAN model. So basically drin_transformer.yaml trains a model that converts the depth embeddings into imagenet embeddings.
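
That reading matches the structure of drin_transformer.yaml: the cond stage encodes the input domain (depth), the first stage decodes the output domain (images), and the transformer learns to map cond codes to image codes. A trimmed sketch with placeholder checkpoint paths (details from memory, so verify against the shipped config):

```yaml
model:
  target: taming.models.cond_transformer.Net2NetTransformer
  params:
    cond_stage_key: depth                     # which batch key feeds the cond stage
    first_stage_config:
      target: taming.models.vqgan.VQModel     # output-side VQGAN (images)
      params:
        ckpt_path: logs/<imagenet_vqgan_run>/checkpoints/last.ckpt   # placeholder
        # remaining params as in the imagenet VQGAN config
    cond_stage_config:
      target: taming.models.vqgan.VQModel     # input-side VQGAN (depth)
      params:
        ckpt_path: logs/<depth_vqgan_run>/checkpoints/last.ckpt      # placeholder
        # remaining params as in the depth VQGAN config
    transformer_config:
      target: taming.modules.transformer.mingpt.GPT
      params:
        vocab_size: 1024
        block_size: 512
        n_layer: 24
        n_head: 16
        n_embd: 1024
```

For RGB-to-RGB translation the same wiring should apply, with the two stages swapped for VQGANs trained on the input-domain and output-domain images respectively.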

@adeptflax
Author

In my repo I modified the reconstruction code to do x -> y instead of x -> x, which isn't correct.

@adeptflax
Author

@Guthman did you set n_embed to 16384 or not? "model.params.n_embed" should be 16384.
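
That setting sits in the model section of the VQGAN config; a fragment, for reference (if I recall correctly, the faceshq configs ship with 1024 while the imagenet one uses 16384):

```yaml
model:
  target: taming.models.vqgan.VQModel
  params:
    embed_dim: 256
    n_embed: 16384    # codebook size, i.e. the number of discrete codes
```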

@adeptflax
Author

OK, I got an image2image transformer working. I will submit a pull request in the next few days.
