
image_root = 'data/image/george_full' #28

Open
zzddwyff opened this issue Oct 31, 2024 · 8 comments

@zzddwyff

Where is george_full?

@AndysonYs
Collaborator

it is the George sub-dataset here: https://huggingface.co/datasets/TencentARC/StoryStream.

@zzddwyff
Author

Do I need to download the entire george.zip.gz just to run inference?

@zzddwyff
Author

I just want to run inference; do I need to download all of george.zip.gz?

@AndysonYs
Collaborator

No, you don't. You can take any single image-text pair as input.

@zzddwyff
Author

OK, let me try.

@zzddwyff
Author

Can you tell me what I should change in your code to run inference on a single image? It looks like your code uses many images.

@zzddwyff
Author

Traceback (most recent call last):
  File "/root/autodl-tmp/SEED-Story/src/inference/gen_george.py", line 213, in <module>
    images_gen = adapter.generate(image_embeds=output['img_gen_feat'], num_inference_steps=50)
  File "/root/autodl-tmp/SEED-Story/src/models_ipa/adapter_modules.py", line 455, in generate
    images = self.sdxl_pipe(
  File "/root/autodl-tmp/conda/envs/seed/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
    return func(*args, **kwargs)
  File "/root/autodl-tmp/conda/envs/seed/lib/python3.10/site-packages/diffusers/pipelines/stable_diffusion_xl/pipeline_stable_diffusion_xl.py", line 730, in __call__
    ) = self.encode_prompt(
  File "/root/autodl-tmp/conda/envs/seed/lib/python3.10/site-packages/diffusers/pipelines/stable_diffusion_xl/pipeline_stable_diffusion_xl.py", line 379, in encode_prompt
    prompt_embeds = prompt_embeds.to(dtype=self.text_encoder_2.dtype, device=device)
AttributeError: 'NoneType' object has no attribute 'dtype'

Only one image came out.

@AndysonYs
Collaborator

> Can you tell me what I should change in your code to run inference on a single image? It looks like your code uses many images.

Hi. Please see src/inference/gen_george.py. If you want to generate a long story from only one text-image pair, change line 152 to
for j in range(1):
and make sure the first line of the val.jsonl file is your input.

If you don't want a long story and just need to generate one text-image pair, also change the story_len param on line 205 to 2.
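The suggested edit can be sketched as follows. This is an illustrative standalone sketch, not the actual code from gen_george.py: the function names, the JSONL field names, and the placeholder model call are all hypothetical; only the `for j in range(1):` change and the "first line of val.jsonl is your input" rule come from the answer above.

```python
import json

def load_first_pair(jsonl_path):
    """Return only the first image-text pair from a JSONL file,
    mirroring 'make sure the first line of val.jsonl is your input'."""
    with open(jsonl_path) as f:
        return json.loads(f.readline())

def run_inference(pairs, story_len=2):
    """Placeholder loop mirroring the suggested change: iterate over a
    single seed pair instead of the whole dataset."""
    outputs = []
    for j in range(1):  # was: a loop over many seed pairs (line 152)
        seed = pairs[j]
        # ...the actual model call would go here; with story_len=2 the
        # model generates one new image-text pair after the seed pair.
        outputs.append({"seed": seed, "generated": story_len - 1})
    return outputs
```

With `story_len=2`, the loop runs once and produces a single generated pair per seed, which matches "change the story_len param to 2" above.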
