v3.0.0
Architecture Modifications and New Features:
For details, see the release notes: https://swift.readthedocs.io/en/latest/Instruction/ReleaseNote3.0.html (Chinese version: https://swift.readthedocs.io/zh-cn/latest/Instruction/ReleaseNote3.0.html)
New Models:
- OpenGVLab/InternVL2_5-1B and other models in the series
- LLM-Research/Llama-3.3-70B-Instruct
- BAAI/Emu3-Gen
- deepseek-ai/DeepSeek-V2.5-1210, deepseek-ai/deepseek-vl2, and other models in these series
- Shanghai_AI_Laboratory/internlm-xcomposer2d5-ol-7b
- InfiniAI/Megrez-3b-Instruct, InfiniAI/Megrez-3B-Omni
- TeleAI/TeleChat2-3B and other models in the series
What's Changed
- Refactor All Codes and bump version to 3.0 by @tastelikefeet in #2030
- fix doc by @tastelikefeet in #2545
- fix manifest by @tastelikefeet in #2546
- add doc 2.x by @tastelikefeet in #2548
- fix ui by @tastelikefeet in #2549
- fix infer by @tastelikefeet in #2550
- Refactor mllm by @Jintao-Huang in #2543
- fix ui by @tastelikefeet in #2552
- Fix ui by @tastelikefeet in #2556
- Update ddp infer doc by @Jintao-Huang in #2557
- fix docs by @Jintao-Huang in #2558
- Fix docs by @Jintao-Huang in #2561
- fix log by @tastelikefeet in #2564
- Fix the command line parameter doc by @Jintao-Huang in #2565
- fix context by @Jintao-Huang in #2568
- Documents Updates by @yrk111222 in #2574
- Revert "Documents Updates" by @Jintao-Huang in #2576
- fix hub param by @tastelikefeet in #2572
- Fix bugs by @Jintao-Huang in #2573
- Support internvl2.5 by @Jintao-Huang in #2575
- update english docs by @Jintao-Huang in #2577
- fix en docs by @Jintao-Huang in #2580
- fix docs & add custom example by @Jintao-Huang in #2581
- fix custom example by @Jintao-Huang in #2582
- support llama3.3 by @Jintao-Huang in #2584
- update acc_strategy & fix citest by @Jintao-Huang in #2583
- Support peft0.14 by @tastelikefeet in #2587
- update infer/deploy examples by @Jintao-Huang in #2588
- add image images mapping by @Jintao-Huang in #2594
- update llm sft notebook by @Jintao-Huang in #2599
- fix notebook by @Jintao-Huang in #2600
- Fix streaming by @Jintao-Huang in #2601
- Emu3 gen train by @mi804 in #2602
- compat mllm notebook by @Jintao-Huang in #2604
- Temporarily remove torchacc. by @Jintao-Huang in #2606
- update docs by @Jintao-Huang in #2607
- train and infer scripts for emu3_gen by @mi804 in #2610
- Update Document by @yrk111222 in #2615
- update memory usage of emu3-gen by @mi804 in #2611
- move prepare_model by @Jintao-Huang in #2614
- Update mllm notebook by @Jintao-Huang in #2617
- Support all-embedding / all-norm by @Jintao-Huang in #2619
- fix lmdeploy==0.5.* by @Jintao-Huang in #2621
- Support deepseek-ai/DeepSeek-V2.5-1210 by @Jintao-Huang in #2624
- fix use_reentrant gradient_checkpointing by @Jintao-Huang in #2625
- support reward model by @Jintao-Huang in #2628
- fix add_default_tag by @Jintao-Huang in #2631
- fix dataset by @Jintao-Huang in #2636
- fix bugs & update openbuddy models & update docs by @Jintao-Huang in #2638
- fix app-ui by @tastelikefeet in #2641
- Fix post encode by @Jintao-Huang in #2643
- fix bugs by @Jintao-Huang in #2645
- update truncation_strategy by @Jintao-Huang in #2647
- fix swift/Infinity-Instruct by @Jintao-Huang in #2651
- Support LoRA-GA by @lxline in #2650
- support deepseek_vl2 by @Jintao-Huang in #2654
- fix swift/SlimOrca by @Jintao-Huang in #2656
- fix swift/SlimOrca by @Jintao-Huang in #2657
- support Shanghai_AI_Laboratory/internlm-xcomposer2d5-ol-7b:audio by @Jintao-Huang in #2658
- support Shanghai_AI_Laboratory/internlm-xcomposer2d5-ol-7b:base by @Jintao-Huang in #2660
- fix hub by @tastelikefeet in #2661
- fix liger by @tastelikefeet in #2666
- support megrez by @Jintao-Huang in #2667
- fix unsloth resume training by @tastelikefeet in #2668
- fix dataset by @Jintao-Huang in #2670
- Fix bugs by @tastelikefeet in #2671
- fix deepseek_vl2 by @Jintao-Huang in #2675
- support adapters by @Jintao-Huang in #2633
- Support megrez omni by @Jintao-Huang in #2674
- fix docs by @Jintao-Huang in #2679
- fix megrez_omni by @Jintao-Huang in #2680
- fix infer by @Jintao-Huang in #2681
- Fix bugs by @Jintao-Huang in #2687
- Update readme by @Jintao-Huang in #2579
- update wechat by @Jintao-Huang in #2694
- fix readme by @Jintao-Huang in #2696
- Fix web-ui by @tastelikefeet in #2693
- Fix readme by @Jintao-Huang in #2697
- Update banner by @Jintao-Huang in #2699
- fix use_reentrant by @Jintao-Huang in #2700
- update examples by @Jintao-Huang in #2703
- fix eval strategy by @Jintao-Huang in #2707
- Update FAQ by @slin000111 in #2706
- qwen to Qwen by @Jintao-Huang in #2708
- fix timeout & web-ui by @Jintao-Huang in #2709
- Fix multi lora by @tastelikefeet in #2711
- support Qwen/QVQ-72B-Preview by @Jintao-Huang in #2712
- update examples by @Jintao-Huang in #2714
- fix deploy request_config by @Jintao-Huang in #2718
- fix examples by @Jintao-Huang in #2719
- fix gptq group_size by @Jintao-Huang in #2720
- Better error messages by @Jintao-Huang in #2721
New Contributors
- @yrk111222 made their first contribution in #2574
- @lxline made their first contribution in #2650
Full Changelog: v2.6.1...v3.0.0