AWS Machine Learning 动态:Improve your agent’s tool-calling accuracy with SFT and DPO on Amazon SageMaker AI
原文摘要:In this post, you learn how to use Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) together to improve the tool-calling accuracy of a small language model (SL 来源:AWS Machine Learning 动态。建议继续查看原文,重点核对它影响的工具入口、成本、风险和真实使用场景。