On a theory of hidden variables in chain of thoughts
Rasul Tutunov
Senior Research Scientist, Huawei R&D, Noah's Ark Lab, London

Chain-of-Thought (CoT)
CoT is a prompting technique for large language models (LLMs) that improves their performance by providing demonstrations of several intermediate reasoning steps as exemplars. In plain prompting, the pre-trained LLM receives an example and a prompt question; in CoT prompting, the example includes step-by-step reasoning, and the model then constructs a step-by-step solution of its own. This significantly improves “reasoning” ability.

CoT (beyond math questions)
Few-shot exemplars of ⟨input, chain of thought, output⟩ triples also work for non-arithmetic tasks; the chain of thoughts is the highlighted part of each exemplar.

CoT is computationally efficient, as it does not require re-training or fine-tuning the model. But why does CoT work? What affects its performance?
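To make the prompting mechanics concrete, here is a minimal sketch of assembling a few-shot CoT prompt. It is not from the talk; the exemplar triple, the helper name build_cot_prompt, and the Q/A formatting are illustrative assumptions.

```python
# Minimal sketch of few-shot CoT prompt construction (illustrative; the
# exemplar and formatting are assumptions, not the talk's own setup).
# Each exemplar is a (question, chain_of_thought, answer) triple; the chain
# of thought sits between question and answer so the model imitates
# step-by-step reasoning on the final prompt question.

EXEMPLARS = [
    (
        "Alice has 2 apples, Bob has 5 apples. Alice ate 1 apple, and Bob "
        "ate 2 apples and gave 1 apple to John. How many apples do Alice "
        "and Bob have?",
        "Alice has 2 apples. She ate 1, so she has 1 apple. Bob has 5 "
        "apples. He ate 2 and gave 1 to John, so he has 5 - 2 - 1 = 2 "
        "apples. In total they have 1 + 2 = 3 apples.",
        "3",
    ),
]

def build_cot_prompt(question: str) -> str:
    """Exemplars first (question, reasoning, answer), then the new question."""
    parts = [f"Q: {q}\nA: {cot} The answer is {a}." for q, cot, a in EXEMPLARS]
    parts.append(f"Q: {question}\nA:")  # the model continues with its own steps
    return "\n\n".join(parts)

print(build_cot_prompt("Tom has 4 pears and buys 3 more. How many pears does he have?"))
```

Note that no re-training happens anywhere here: the exemplars only condition the model at inference time, which is the efficiency point made above.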
Statistical model for natural language
Each CoT sequence generation has the following steps. It starts from a context C: a general task description describing the final goal behind the messages. Each message is then produced from an intention, as in the arithmetic demonstration below (intentions are quoted before the messages they generate; the sketch after the list writes out the chain).

Example: arithmetic demonstration.
- Context C: “Provide a simple arithmetic problem.”
- The first message's intention generates the input problem: “Alice has 2 apples, Bob has 5 apples. Alice ate 1 apple, and Bob ate 2 apples and gave 1 apple to John. How many apples do Alice and Bob have?”
- Intention: “Calculate Alice's apples after she ate 1.” Thought: “Alice has 2 apples. She ate 1. Now she has 1 apple.”
- Intention: “Calculate Bob's apples after he ate 2 and gave 1 apple to John.” Thought: “Bob has 5 apples. He ate 2 apples and gave 1 apple to John. Hence, he has 5 - 2 - 1 = 2 apples left.”
- Intention: “Calculate the total number of apples Bob and Alice have.” Output: “Alice has 1 apple left and Bob has 2 apples left. In total they have 2 + 1 = 3 apples. The answer is 3.”
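The chain behind this demonstration can be laid out as below, anticipating the formal model on the next slide; the notation (C for the context, θ_i for the hidden intentions, m_in, t_1, t_2, m_out for the messages) is assumed, not the slide's own.

```latex
% Hidden-variable chain for the arithmetic demonstration (assumed notation):
% each intention emits one message; later intentions see earlier intentions,
% earlier messages, and the context.
\begin{align*}
  C &\rightarrow \theta_1 \rightarrow m_{\mathrm{in}}
      && \text{(problem statement)}\\
  (\theta_1,\, m_{\mathrm{in}},\, C) &\rightarrow \theta_2 \rightarrow t_1
      && \text{(Alice's remaining apples)}\\
  (\theta_{1:2},\, m_{\mathrm{in}},\, t_1,\, C) &\rightarrow \theta_3 \rightarrow t_2
      && \text{(Bob's remaining apples)}\\
  (\theta_{1:3},\, m_{\mathrm{in}},\, t_{1:2},\, C) &\rightarrow \theta_4 \rightarrow m_{\mathrm{out}}
      && \text{(the total, 3)}
\end{align*}
```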
Statistical model for natural language
The full CoT is the concatenation of these messages: the input problem, the intermediate thoughts, and the output answer. Formally, the model distinguishes an input message, intermediate thoughts, and an output message, where:
- each message is generated from its intention;
- subsequent intentions are generated from the previous intentions, the previous messages, and the context.
Ambiguity: the same message can be generated from more than one intention.
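The displayed equations behind these generation rules and behind “Ambiguity” are not recoverable from this text; a plausible reconstruction, in the notation used above, is:

```latex
% Generation rules of the hidden-variable model (a reconstruction, not the
% slide's own formulas).
\begin{align*}
  m_i &\sim p(\,\cdot \mid \theta_i)
      && \text{messages are emitted from intentions,}\\
  \theta_{i+1} &\sim p(\,\cdot \mid \theta_{1:i},\, m_{1:i},\, C)
      && \text{intentions evolve given the past and the context.}
\end{align*}
% Ambiguity (assumed formalization): a message does not pin down its
% intention, i.e. there exist theta != theta' with
%   p(m | theta) > 0  and  p(m | theta') > 0.
```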
LLM as universal density approximator
Such a statistical model for natural language allows us to define a density for each message, and the density of a whole message sequence factorizes into a product of these conditionals. The language model, parametrized by weights w, approximates each factor in this product.

CoT setup
Given a collection of exemplar CoTs and an input message, we compare the LLM prediction of the output with the natural language prediction, i.e. the true conditional of the output under the statistical model (the arithmetic demonstration is the running example).
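The formulas for the density and the two predictions are likewise missing; under the rules above they plausibly read as follows, with e_1, …, e_n denoting the exemplar CoTs and p_w the LLM with weights w (both assumed notation):

```latex
\begin{align*}
  % The hidden intentions are marginalized out, leaving an autoregressive
  % product over messages; the LLM approximates each factor of this product.
  p(m_{1:k}) &= \prod_{i=1}^{k} p(m_i \mid m_{1:i-1}),
  \qquad
  p_w(m_i \mid m_{1:i-1}) \;\approx\; p(m_i \mid m_{1:i-1}),\\
  % LLM prediction: condition on the exemplar CoTs and the input message.
  \text{LLM prediction:} \quad
    & p_w(m_{\mathrm{out}} \mid e_1, \dots, e_n,\, m_{\mathrm{in}}),\\
  % Natural language prediction: the true conditional given input and context.
  \text{natural language prediction:} \quad
    & p(m_{\mathrm{out}} \mid m_{\mathrm{in}},\, C).
\end{align*}
```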
We want to establish proximity between these two predictions.

Main Result
Under the ambiguity assumption, for long enough exemplar CoTs, the prediction of the LLM and the prediction of the natural language model are asymptotically the same. Moreover, beyond some exemplar length the convergence is geometric.
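The theorem's display is also missing; read literally from the prose, it asserts asymptotic agreement of the two predictions with an eventually geometric rate. One plausible rendering, with T the exemplar CoT length, w* the optimal weights, and d_TV the total-variation distance (all assumed notation):

```latex
% Main result, reconstructed in spirit from the surrounding prose.
\begin{gather*}
  d_{\mathrm{TV}}\!\left(
      p_{w^{*}}(\,\cdot \mid e_1, \dots, e_n,\, m_{\mathrm{in}}),\;
      p(\,\cdot \mid m_{\mathrm{in}},\, C)
  \right) \longrightarrow 0
  \quad \text{as } T \to \infty,\\
  \text{and, for all } T \ge T_0:\quad
  d_{\mathrm{TV}} \le K\, \lambda^{T},
  \qquad K > 0,\ \lambda \in (0,1).
\end{gather*}
```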
Sketch of the Proof
- Use the universal approximator property: with the optimal weights, the LLM matches each factor of the natural language density for any collection of messages.
- Use the independence of the exemplar thoughts.
- Use the assumption on ambiguity.
- For long enough exemplar thoughts, the remaining gap is bounded by a geometric term with some fixed rate.

What is next?
The ambiguity of exemplar thoughts is crucial for CoT “reasoning”: how can we quantify this measure? This work is ongoing (4 Oct 2023).

Thank you
Team: Antoine Grosnit, Juliusz Ziomek, Jun Wang, Haitham Bou Ammar.