
a confusing issue #8

Closed · lilhongxy opened this issue Mar 29, 2024 · 8 comments
@lilhongxy

cos, sin = self.rotary_emb(value_states, seq_len=kv_seq_len)
ValueError: too many values to unpack (expected 2)

I followed the instructions in the Full inference code, but then I encountered this issue.
How can I fix this?

@ChenxinAn-fdu (Contributor) commented Mar 29, 2024

This error is usually caused by calling replace_with_chunkllama() after model.from_pretrained(). Make sure replace_with_chunkllama() is called before initializing the model. If that does not solve the error, please provide more details.
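A minimal sketch of the ordering being described here (the model path is a placeholder):

import torch
from transformers import LlamaForCausalLM
from chunkllama_attn_replace import replace_with_chunkllama

# Patch Llama's attention BEFORE the model is instantiated; calling this
# after from_pretrained() is what typically triggers the error above.
replace_with_chunkllama(pretraining_length=4096)
model = LlamaForCausalLM.from_pretrained("path_to_Llama-2-7b-hf", torch_dtype=torch.bfloat16)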

@lilhongxy (Author)

from transformers import AutoTokenizer, LlamaTokenizer, LlamaForCausalLM, AutoModelForCausalLM
from chunkllama_attn_replace import replace_with_chunkllama
import torch

replace_with_chunkllama(pretraining_length=4096)

tokenizer = LlamaTokenizer.from_pretrained("path_to_Llama-2-7b-hf", trust_remote_code=True)
model = LlamaForCausalLM.from_pretrained("path_to_Llama-2-7b-hf", trust_remote_code=True, torch_dtype=torch.bfloat16)
inputs = tokenizer("Long...docs\n Q: How to extend the context window of LLMs? ", return_tensors="pt")

output_ids = model.generate(**inputs, max_length=128)[0]
print(tokenizer.decode(output_ids))

I just precisely followed the inference instructions, but the issue remained...

@Mooler0410

> I just precisely followed the inference instructions, but the issue remained...

Could you please check your transformers version? The RoPE API for Llama changed again after 4.38. (Actually, it always changes... from 4.35 to 4.36, to 4.37, to 4.38... almost every recent transformers release has a new RoPE implementation for Llama. 😓)
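For reference, a quick way to check the installed versions against the ones discussed in this thread (4.37.2 is the transformers version reported working below):

import torch
import transformers

# Print the installed versions to compare with a known-good environment.
print("transformers:", transformers.__version__)
print("torch:", torch.__version__)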

@ChenxinAn-fdu (Contributor)

Hey guys, the code works in my environment. My transformers version is 4.37.2.

from transformers import AutoTokenizer, LlamaTokenizer, LlamaForCausalLM, AutoModelForCausalLM
from chunkllama_github import replace_with_chunkllama
import torch

model_path = "path/to/llama2"

replace_with_chunkllama(pretraining_length=4096)

tokenizer = LlamaTokenizer.from_pretrained(model_path, trust_remote_code=True)
model = LlamaForCausalLM.from_pretrained(model_path, trust_remote_code=True, torch_dtype=torch.bfloat16).to("cuda:0")
inputs = tokenizer("Long...docs\n Q: How to extend the context window of LLMs? ", return_tensors="pt").to("cuda:0")

output_ids = model.generate(**inputs, max_length=128)[0]
print(tokenizer.decode(output_ids))

@ChenxinAn-fdu (Contributor)

Please use Flash Attention for processing longer input:
model = LlamaForCausalLM.from_pretrained(model_path, attn_implementation="flash_attention_2", trust_remote_code=True, torch_dtype=torch.bfloat16).to(device)
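(Note: attn_implementation="flash_attention_2" requires the flash-attn package to be installed; if it is missing, pip install flash-attn --no-build-isolation usually works, assuming a CUDA toolchain is available.)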

@lilhongxy (Author)

Thank you all, guys!!! 😄
It finally works.
It was my torch version that caused the issue; the previous version was 2.2.1+cu118.

Successful environment:
torch: 2.0.1+cu118
transformers: 4.37.2
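(Note: to reproduce this environment, pinning the reported versions should work, e.g. pip install transformers==4.37.2 and pip install torch==2.0.1 --index-url https://download.pytorch.org/whl/cu118 for the cu118 build; the exact torch install command depends on your CUDA setup.)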

@ChenxinAn-fdu (Contributor)

If there are no further questions or follow-up discussions, I will close this issue shortly. Thank you all for your contributions and participation.

@MarsMeng1994

Inference is correct, but when fine-tuning, the error comes up again.
