當前位置：首頁 > 報告詳情

3134 Watson Assistant Retrieval Augmented Generation using Elasticsearch DBaaS.pdf

上傳人：一*** 編號：653200 2025-05-01 PDF PDF 38頁 1.94MB

該報告所屬合集： 2024年IBM TechXchange大會（IBM TechXchange Conference 2024）嘉賓演講PPT合集

打包下載報告合集

文檔加載中……請稍候！
如果長時間未打開，您也可以點擊刷新試試。

下載報告到電腦，查找使用更方便

VIP專享文檔

書簽

分享

收藏

已收藏

版權投訴

/38

立即下載

word格式文檔無特別注明外均可編輯修改，預覽文件經過壓縮，下載原文更清晰！

三個皮匠報告文庫所有資源均是客戶上傳分享，僅供網友學習交流，未經上傳用戶書面授權，請勿作商用。

《3134 Watson Assistant Retrieval Augmented Generation using Elasticsearch DBaaS.pdf》由會員分享，可在線閱讀，更多相關《3134 Watson Assistant Retrieval Augmented Generation using Elasticsearch DBaaS.pdf（38頁珍藏版）》請在三個皮匠報告上搜索。

1、October 21-24,2024Mandalay Bay Convention CenterLas Vegas,NevadaSession Session 31343134Daniel PittnerIBM,STSM,Cloud Platform,Cloud Database ServicesWatsonx Assistant Retrieval Augmented Generation using Elasticsearch DBaaSAgenda0102030405RAG 101Watsonx Assistant&Search(Elastic)Search IntegrationFil

2、ter and Query configurationDeploying models to ElasticIBM TechXchange|2024 IBM Corporation#IBMTechXchange3What you will learn in this sessionIBM TechXchange|2024 IBM Corporation0102030405What is Retrieval Augmented Generation?Watsonx Assistant Search IntegrationWhat parts of RAG run directly in Elas

3、ticSearch methods in ElasticDeploying custom models to Elastic#IBMTechXchange4IBM TechXchange|2024 IBM Corporation“Can you build something like ChatGPT,but with access to my calendar,mail”My Mom,asking for something that could be done with RAGAgenda0102030405RAG 101Watsonx Assistant&Search(Elastic)S

4、earch IntegrationFilter and Query configurationDeploying models to ElasticIBM TechXchange|2024 IBM Corporation#IBMTechXchange6Retrieval Augmented Generation 101Question?AnswerLoadSplitPromptLLM(Granite)Vectorstore(Elastic)IBM TechXchange|2024 IBM CorporationEmbed(ELSER)#IBMTechXchangeRAG PipelineDat

5、a PreparationLoading and chunkingdocuments using themost effective strategyfor a business domainGenerate EmbeddingsTransforming chunkschunks and questionsquestions into vectorsSearch and RankingVarious search methodssearch methods(kNN,aNN)ReRankReRank for improved relevance7IBM TechXchange|2024 IBM

6、Corporation#IBMTechXchange8RAG with Watsonx Assistant&ElasticQuestion?AnswerLoadSplitPromptLLM(Granite)Vectorstore(Elastic)IBM TechXchange|2024 IBM CorporationEmbed(ELSER)Agenda0102030405RAG 101Watsonx Assistant&Search(Elastic)Search IntegrationFilter and Query configurationDeploying models to Elast

7、icIBM TechXchange|2024 IBM Corporation#IBMTechXchangeOmnichannelVirtual Agent(build once,deployacross voice,chat)Dialogs,train/testconnectorsOmnichannel Virtual AgentsAI Agent AssistOmnichannelHuman AgentMultipleAppsIncrease Agent Efficiency and Effectiveness DataAPIsKnowledgeArticlesUserJourneyRepo

8、rtingAnalyticsImprove Contact Center OperationsCustomerWatsonx AssistantDeliver consistent and intelligent customer careIBM TechXchange|2024 IBM CorporationIntegration with Watson Assistant for Conversational SearchIBM Cloud Databases for ElasticsearchWatson AssistantWatson Discovery,Elastic,Custom#

9、IBMTechXchangeConversational search offoffConversational search onon12Watsonx Assistant Search IBM TechXchange|2024 IBM CorporationAgenda0102030405RAG 101Watson Assistant&Conversational Search(Elastic)Search IntegrationFilter and Query configurationDeploying models to ElasticIBM TechXchange|2024 IBM

10、 CorporationSetting up Elasticsearch integrationConnect IBM Cloud Database Elasticsearch Platinum instance using a unique service credentialEnsure minimum of 16 GB RAM per node are metIBM TechXchange|2024 IBM CorporationSetting up Elasticsearch integrationChoosing an index strategy:Uploading documen

11、ts Uploading documents easiest to get started,just upload documents,Watsonx assistant uses ELSER to indexUse my index Use my index fine grained control over all aspects of indexing,custom chunking,custom model(e.g.german language support),custom index settingsIBM TechXchange|2024 IBM CorporationSett

12、ing up Elasticsearch integrationUploading documents Uploading documents upload English pdf documents to add a knowledge base to your assistantIBM TechXchange|2024 IBM CorporationSetting up Elasticsearch integrationIBM TechXchange|2024 IBM CorporationAgenda0102030405RAG 101Watson Assistant&Conversati

13、onal Search(Elastic)Search IntegrationFilter and Query configurationDeploying models to ElasticIBM TechXchange|2024 IBM CorporationFilter and Query ConfigurationFilterFilter Filter results from Elastic globally or with action specific context using Elastic Search DSLQueryQuery Define how your knowle

14、dge base will be searched by watsonx assistant,defaults to ELSERELSERKNN dense vectorHybridwith optional nestingIBM TechXchange|2024 IBM CorporationSparse Vectors ELSER model ELSER generates a collection of pairs(token,weights)for a better semantic representation of the given text IBM TechXchange|20

15、24 IBM CorporationELSER(Elastic Learned Sparse EncodeR)is a retrieval model trained by Elastic that enables semantic search to retrieve more relevant search resultsBoth sentences are not similar if encoded using dense vectors,as semantics are not captured How ELSER model works-Text Expansion Dog Dog

16、 chased chased t the he squirrsquirrelelCat Cat dislikes dislikes the the mousemouseIBM TechXchange|2024 IBM CorporationELSER expands the meaning of your domain to terms it“understands”Example:“Latitude”could have two meanings:geospatial but also could be a laptopA dense vector representation can no

17、t capture semanticsHow ELSER model works-Text Expansion Dog Dog chased chased t the he squirrsquirrelelCat Cat dislikes dislikes the the mousemousecaninepursued pet treeacornrodentExpandsExpandsExpandspetfelinerejectsdespisesrodentratExpandsExpandsExpandsIBM TechXchange|2024 IBM CorporationSimilarit

18、y algorithm matches only on the expansionexpansion How ELSER model works-Text Expansion+Weights Dog Dog chased chased t the he squirrsquirrelelCat Cat dislikes dislikes the the mousemousecaninepursued pet treeacornrodentExpandsExpandsExpandspetfelinerejectsdespisesrodentratExpandsExpandsExpands.9.5.

19、7.6.4.6.7.7.4.6.8.9IBM TechXchange|2024 IBM CorporationScore is calculated from matched termsHow ELSER model works-Text Expansion+Weights Dog Dog chased chased t the he squirrsquirrelelCat Cat dislikes dislikes the the mousemousecaninepursued pet treeacornrodentExpandsExpandsExpandspetfelinerejectsd

20、espisesrodentratExpandsExpandsExpands.9.5.7.6.4.6.7.7.4.6.8.9(0.5*0.7)(0.5*0.7)+(0.6*0.8)+(0.6*0.8)-Score=.83Score=.83 IBM TechXchange|2024 IBM CorporationDense Vectors 4 Authors with different perspectives on Mona LisaDense Vectors are a type of mathematical objects that represent dataKnown also as

21、“vector embeddings”Different models create different dense vectors.78,.35,.23 .45,.83,.24 45,.23,.24,.82,.32,.94,IBM TechXchange|2024 IBM CorporationkNN on dense vectorsK K-nearest neighbor search will find the nearest neighbor search will find the closestclosest k k results in an n dimensional vect

22、or spaceresults in an n dimensional vector spaceResults are sorted by their Results are sorted by their distancedistance from search from search vectorvectorThere are multiple ways to define distanceThere are multiple ways to define distanceEuclideanEuclideanDotDot-productproductCosineCosineMax inne

23、r ProductMax inner Product IBM TechXchange|2024 IBM CorporationdK=3Hybrid Reciprocal Rank FusionCombines multiple searches into a single ranked result to increase result relevanceAllows combining vector and NLP freelyCombining specialized embeddings Increases computational load and storage consumpti

24、on(generating multiple embeddings)IBM TechXchange|2024 IBM CorporationAgenda0102030405RAG 101Watson Assistant&Conversational Search(Elastic)Search IntegrationFilter and Query configurationDeploying models to ElasticIBM TechXchange|2024 IBM CorporationDeploying models to ElasticTransformer models tha

25、t conform to the standard BERT model interface and use the WordPiece tokenization algorithm should workEland provides a convenient mechanism to deploy a model to IBM Cloud Database ElasticsearchIBM TechXchange|2024 IBM CorporationDeploying models to ElasticSetup Kibana for convenient administrationA

26、fter deployment of a new model model synchronization is requiredCheck model status to validate youre ready to go!IBM TechXchange|2024 IBM Corporation#IBMTechXchange31Using a custom model for indexingQuestion?AnswerLoadSplitPromptLLM(Granite)Vectorstore(Elastic)IBM TechXchange|2024 IBM CorporationEmb

27、ed(ELSER)Using a custom model for indexingCreate an ingestion pipeline,including a chunking strategyConfigure a reference to the model in the inference sectionIn this case we define a max size of a chunk with 1024,since ELSER supports 512 token context windowIBM TechXchange|2024 IBM CorporationUsing

28、 a custom model for indexing/ChunkingOverlapping chunks by 25%is recommendedOptimal setting might depend on your documentsWatch out for layout sensitive information,like tables,diagrams,etc.IBM TechXchange|2024 IBM Corporation#IBMTechXchange34Q&AIBM TechXchange|2024 IBM CorporationDaniel PittnerIBM,

29、STSM,Cloud Platform,Cloud Data S#IBMTechXchange35Thank YouIBM TechXchange|2024 IBM Corporation#IBMTechXchangeNotices and disclaimersCertain comments made in this presentation may be characterized as forward looking under the Private Securities Litigation Reform Act of 1995.Forward-looking statements

30、 are based on the companys current assumptions regarding future business and financial performance.Those statements by their nature address matters that are uncertain to different degrees and involve a number of factors that could cause actual results to differ materially.Additional information conc

31、erning these factors is contained in the Companys filings with the SEC.Copies are available from the SEC,from the IBM website,or from IBM Investor Relations.Any forward-looking statement made during this presentation speaks only as of the date on which it is made.The company assumes no obligation to

32、 update or revise any forward-looking statements except as required by law;these charts and the associated remarks and comments are integrally related and are intended to be presented and understood together.2024 International Business Machines Corporation.All rights reserved.This document is distri

33、buted“as is”without any warranty,either express or implied.In no event shall IBM be liable for any damage arising from the use of this information,including but not limited to,loss of data,business interruption,loss of profit or loss of opportunity.Customer examples are presented as illustrations of

34、 how those customers have used IBM products and the results they may have achieved.Actual performance,cost,savings or other results in other operating environments may vary.Workshops,sessions and associated materials may have been prepared by independent session speakers,and do not necessarily refle

35、ct the views of IBM.Not all offerings are available in every country in which IBM operates.Any statements regarding IBMs future direction,intent or product plans are subject to change or withdrawal without notice.IBM,the IBM logo,and are trademarks of International Business Machines Corporation,regi

36、stered in many jurisdictions worldwide.Other product and service names might be trademarks of IBM or other companies.A current list of IBM trademarks is available on the Web at“Copyright and trademark information”at: TechXchange|2024 IBM Corporation36#IBMTechXchangeHow to continue your learning jour

37、ney127972797Technology BreakoutUsing watsonx Assistant with watsonx.ai for intelligent Assistants with Gen AI211011101LabAutomated generative AI deployments on IBM Cloud-Deployable Architecture327962796Technology BreakoutEnterprise grade Q&A with RAG with watsonx.ai+watsonx Discovery/Elastic Search423192319Technology BreakoutHarnessing Conversational Search for Business with IBM watsonx and Elasticsearch Vector DatabasesIBM TechXchange|2024 IBM Corporation37

相關圖表

本文介紹了2024年10月21日至24日在美國拉斯維加斯舉行的Mandalay Bay Convention Center的一場會議，會議編號為Session 3134。主講人是IBM公司的Daniel Pittner，他將介紹關于Watsonx Assistant Retrieval Augmented Generation (RAG)的技術，以及如何在Elasticsearch數據庫即服務(DBaaS)上實現。會議的主要內容包括：RAG的基本概念、Watsonx Assistant與Elasticsearch的整合、模型在Elasticsearch上的部署、多渠道虛擬代理、對話和訓練/測試連接器、AI代理輔助、多渠道人類代理和應用程序、提高客戶服務中心運營效率和效果、以及Watson Assistant和Watson Discovery與Elasticsearch的集成。此外，文章還提到了如何使用過濾器和查詢配置、如何設置Elasticsearch集成、如何將模型部署到Elasticsearch、以及如何利用自定義模型進行索引等關鍵點。本文最后提供了進一步學習的資源，包括有關自動化生成式AI部署、企業級Q&A以及利用對話搜索業務等方面的技術突破。

"如何利用RAG提升業務效率？" "如何將自定義模型部署到Elasticsearch？" "如何通過Watsonx Assistant和Elasticsearch實現多渠道虛擬代理？"

相關報告

聯系我們

0731-84720580
sgpjbg002
工作日 9:30 - 18:00

關于我們

侵權處理

關于我們

出版物經營許可證
工信部備案號：湘ICP備17000430號-2
公安備案號：湘公網安備43010402001071號

三個皮匠報告專業的行業報告下載站，每日更新，歡迎大家關注！

copyright@2008-2013 長沙景略智創信息技術有限公司版權所有
網站備案/許可證號：湘B2-20190120

客服

小程序

服務號

折疊

午夜网日韩中文字幕,日韩Av中文字幕久久,亚洲中文字幕在线一区二区,最新中文字幕在线视频网站