微調代理的開源模型.pdf

編號:167630 PDF 26頁 4.01MB 下載積分:VIP專享
下載報告請您先登錄!

微調代理的開源模型.pdf

1、2024 Databricks Inc.All rights reservedFINE-TUNING OPEN SOURCE MODELS FOR AGENTSTristan Zajonc-Continual.AIJune 11,20241Were entering the era of AI agentsConversational agentsGithub Copilot:https:/ agentsHex Magic:https:/hex.tech/product/magic-ai/Event-driven agentsCleric:https:/cleric.ioAutonomous

2、agentsGithub Copilot Workspace:https:/ do we build these types of agents?A G E N T S A R E S Y S T E M S N O T M O D E L SExternal ServicesAPIsData SourcesAgent SystemOrchestrationModelsToolsKnowledge BasesEvaluation,Monitoring,Feedback,and Fine-TuningIn-context agentsConversational agentsEvent-driv

3、en agentsAgent ExperienceAutonomous agentsLets talk about modelsOpen source models are laggingAgentBench:https:/llmbench.ai/agentSWE-Bench(Lite):https:/ UseReasoningKnowledgeTool UseReasoningKnowledgeIs there hope?TinyAgent:https:/bair.berkeley.edu/blog/2024/05/29/tiny-agent/How do we fine-tune a mo

4、del for agents?TinyAgent/LLMCompilerChoose how you want to call functionsOpenAIGenerate synthetic data using self-instruct or agent gymAgentGymOpenAI CookbookDont forgot to cover the full scope of user behaviorAgent-FLAN:https:/arxiv.org/abs/2403.12881Choose chat template suitable for LLM trainingRe

5、Act chat templateAgent-FLAN chat templateAgent-FLAN:https:/arxiv.org/abs/2403.12881Fine-tune your base modelAgentTuning:https:/thudm.github.io/AgentTuning/Consider embedding within a multi-agent systemMixture of Agents:https:/arxiv.org/abs/2406.04692Should you fine-tune open source models for AI age

6、nts?C O NProprietary models are still significantly ahead of open source models for agent use cases.Collecting agent trajectories for complex tasks like coding is non-trivial.Fine-tuning can easily degrade general performance and become a game of whack-a-mole.Scale effects are very real.Youre unlike

7、ly to beat frontier models for generalist agents.You will learn a lot and generate a lot of useful data.Self-instruct and agent gyms makes collection of trajectories feasible for many use cases.You can significantly increase performance of agents in specific domains,even beating frontier models.Open source frontier models are getting better just like proprietary models.It allows you to control your own destiny.P R OLearn how and give it a shot.Thank youhttps:/continual.aitristancontinual.ai

友情提示

1、下載報告失敗解決辦法
2、PDF文件下載后,可能會被瀏覽器默認打開,此種情況可以點擊瀏覽器菜單,保存網頁到桌面,就可以正常下載了。
3、本站不支持迅雷下載,請使用電腦自帶的IE瀏覽器,或者360瀏覽器、谷歌瀏覽器下載即可。
4、本站報告下載后的文檔和圖紙-無水印,預覽文檔經過壓縮,下載后原文更清晰。

本文(微調代理的開源模型.pdf)為本站 (張5G) 主動上傳,三個皮匠報告文庫僅提供信息存儲空間,僅對用戶上傳內容的表現方式做保護處理,對上載內容本身不做任何修改或編輯。 若此文所含內容侵犯了您的版權或隱私,請立即通知三個皮匠報告文庫(點擊聯系客服),我們立即給予刪除!

溫馨提示:如果因為網速或其他原因下載失敗請重新下載,重復下載不扣分。
客服
商務合作
小程序
服務號
折疊
午夜网日韩中文字幕,日韩Av中文字幕久久,亚洲中文字幕在线一区二区,最新中文字幕在线视频网站