You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
train_result = trainer.train(resume_from_checkpoint=training_args.resume_from_checkpoint)
File "/usr/local/lib/python3.10/dist-packages/transformers/trainer.py", line 2052, in train
return inner_training_loop(
File "/usr/local/lib/python3.10/dist-packages/transformers/trainer.py", line 2388, in _inner_training_loop
tr_loss_step = self.training_step(model, inputs)
File "/usr/local/lib/python3.10/dist-packages/transformers/trainer.py", line 3479, in training_step
inputs = self._prepare_inputs(inputs)
File "/usr/local/lib/python3.10/dist-packages/transformers/trainer.py", line 3429, in _prepare_inputs
if len(inputs) == 0:
TypeError: object of type 'NoneType' has no len()
Others
I am using the streaming mode to train the QWEN-VL2 model with huggingface datasets. I have tried the method mentioned in #6233, but it doesn't work. I want to know how to figure out this issue or how to just skip the NoneType samples. Many thanks!
The text was updated successfully, but these errors were encountered:
Many thanks for your reply! I do not meet this issue with the same datasets when I do not use the streaming model. However, when I use the streaming model, the issue merges. I guess some samples may only have one modality. So I want to skip the samples if it's nonetype.
33%|███▎ | 41290/123871 [17:09:07<32:10:16, 1.40s/it]Traceback (most recent call last): File "/usr/local/bin/llamafactory-cli", line 8, in <module> sys.exit(main()) File "/LLaMA-Factory/src/llamafactory/cli.py", line 112, in main run_exp() File "/LLaMA-Factory/src/llamafactory/train/tuner.py", line 92, in run_exp _training_function(config={"args": args, "callbacks": callbacks}) File "/LLaMA-Factory/src/llamafactory/train/tuner.py", line 66, in _training_function run_sft(model_args, data_args, training_args, finetuning_args, generating_args, callbacks) File "/LLaMA-Factory/src/llamafactory/train/sft/workflow.py", line 101, in run_sft train_result = trainer.train(resume_from_checkpoint=training_args.resume_from_checkpoint) File "/usr/local/lib/python3.10/dist-packages/transformers/trainer.py", line 2052, in train return inner_training_loop( File "/usr/local/lib/python3.10/dist-packages/transformers/trainer.py", line 2388, in _inner_training_loop tr_loss_step = self.training_step(model, inputs) File "/usr/local/lib/python3.10/dist-packages/transformers/trainer.py", line 3479, in training_step inputs = self._prepare_inputs(inputs) File "/usr/local/lib/python3.10/dist-packages/transformers/trainer.py", line 3429, in _prepare_inputs if len(inputs) == 0: TypeError: object of type 'NoneType' has no len()
Reminder
System Info
llamafactory
version: 0.9.2.dev0Reproduction
Others
I am using the streaming mode to train the QWEN-VL2 model with huggingface datasets. I have tried the method mentioned in #6233, but it doesn't work. I want to know how to figure out this issue or how to just skip the NoneType samples. Many thanks!
The text was updated successfully, but these errors were encountered: