For multimodal model , vision /audio data need to be sent from tokenizer process to scheduler process. Since currently, these data was sent via socket, the data transfer route was gpu ->cpu -> socket ...
I managed to install the previous version and trained for a few days. I had excellent results. Now, using the same settings, I can no longer start the training. I’m using an RTX 5090. I also ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results