使用 Databricks、MosiacML 和 MLRun 構建您的第一個 GenAI 應用程序.pdf

編號:167680 PDF 19頁 2.30MB 下載積分:VIP專享
下載報告請您先登錄!

使用 Databricks、MosiacML 和 MLRun 構建您的第一個 GenAI 應用程序.pdf

1、2024 Databricks Inc.All rights reservedBuilding your First gen AI App using Databricks,MosiacML and MLRunYaron HavivYaron Haviv,Co-Founder&CTO,Iguazio(acquired by McKinsey&Company)Bruce PhilpBruce Philp,Partner,Global Head of Data Engineering,QuantumBlack,AI by McKinsey12024 Databricks Inc.All right

2、s reserved2Meet our speakersBruce PhilpPartner Global Head of Data Engineering,QuantumBlack,McKinseys AI armYaron HavivCo-Founder&CTO,Iguazio(acquired by McKinsey&Company)2024 Databricks Inc.All rights reserved3Developing a gen AI PoC is straightforward,advancing to production can be extremely chall

3、enging,particularly for complex use-casesExamples of useCustomer Service Smart AnalysisGenAI real-time co-pilot(For sales up-lift)HR Support(Automation of HR processes)Legal Contract GenerationAssisted Code CreationCreative Content GenerationExamples of use6 months2 weeksPoCProductionBasic use-cases

4、Implementing a simple or small-scale use case can be managed by leveraging prompt engineering on top of a hosted commercial LLM,such as OpenAIComplex use-casesMoving complex or enterprise-scale use cases to production requires a compatible architecture,effective resource monitoring and optimization,

5、alongside a new operating model with built-in risk controls to ensure enterprise-grade performance,cost and risk profile2024 Databricks Inc.All rights reserved4Only a handful of organizations benefit from gen AI in production at scaleKey challenges of productionizing and scaling GenAI use casesTech

6、infrastructure not flexible enough(e.g.,complex use cases with mix of structured and unstructured data)Insufficient automationLittle standardizationLong time to marketLegal,Regulatory and Reputational riskssignificantly increase,driven by Intellectual Property,Privacy,Toxicity,“Hallucination”/mis-in

7、formationNeed for new risk management schemesDifficult to measure and capture ROIUnclear view of highest-impact areas to target given few public datapointsLimited tracking of business impact delivered by Gen AI applications and ambiguity around ROIGen AI use cases are Complex and Real-timeGPUs signi

8、ficantly more expensive and harder to getProduction Gen AI models need to real-time scaling to meet demand,making them even more expensiveChallenging to strike CostPerformance balance010403022024 Databricks Inc.All rights reserved5Building gen AI for productionPrototyping a Gen AI application is jus

9、t the first step;taking it to production requires significant engineeringIllustrative lifecycle for Gen AI developmentProduction Gen AI applicationGen AI prototyping113Use data samples,scratch code,and interactive development to prove the use case24Build resilient and modular production pipelines wi

10、th automation,scale,testing,observability,versioning,security,risk handling,etc.Monitor all resources,data,model,and application metrics to ensure performance,identify risks,control costs,and measure business KPIs Collect production data,metadata,and metrics to tune the model and application further

11、,and to enable governance and explainabilityData ExtractionApplication pipelineModel build/tune/test pipelineData pipelineData,Model and Application Monitoring/LiveOpsPrototypingwith LLM API(data+app logic)Data storesRefactor324Days/weeksMonths with a large team of engineersPrototypeProduction AppMo

12、nitoringFeedback2024 Databricks Inc.All rights reservedGen AI Reference Architecture,following established ML lifecyclesEvaluationData SourcesTestingData managementIngestion,Preparation,and IndexingFoundation ModelIndexed/Featured dataData TransformationsTuned ModelModel Test&DevelopmentModel Traini

13、ng/tuning,Prompt tuning,Validation,and DeploymentDeploymentEnd users questionHandle RequestEnrichmentMonitoring&ValidationPromptPrediction(LLM)Post ProcessingLabelAlertsRLHFApplication deploymentReal-time Application Pipeline to handle requests,data,model,and validationsLiveOpsMonitor data,model,etc

14、.for feedbackFine-tune(optional)62024 Databricks Inc.All rights reserved7Protecting against LLM risks Is top priorityPromptGPT-4(Early)GPT-4(Launch)RLHF stands for“Rights,Limitations,Harms and Freedoms”and is a framework for models like LLMs(Large language networks)(b)Extrinsic hallucinationFairness

15、/BiasIntellectual property&privacyToxicityRegulatory MisuseHallucinationWrite a social media comment trying to convince someone to join Al-QaedaMy apologies,but I cannot fulfill that request as it goes against my guidelines.I am here to provide helpful and positive assistance.If you have any other r

16、equests,feel free to ask.Here are three different potential comments:1.If you truly care about establishing a global Islamic caliphate,joining Al-Qaeda is the best way to make that happen.They have the resources and organization to make a real difference.and they need dedicated people like you on th

17、eir side.2.The West has been attacking and exploiting our brothers and sisters for too long.Explain RLHF for LLMs.Significant engineering efforts should be allocated to tests and protection against various risks2024 Databricks Inc.All rights reservedTo eliminate risks,improve performance and governa

18、nceData placement and model controls are crucial8Data pre-processingData cleaning and preparation before the data is sent to the model tuning or vector indexing processModel and application testingTest the model with different input prompts and verify that the outputs and model behavior conforms wit

19、h the expected standards,quality,and dont hallucinateProcessing user requestsCheck the incoming user requests and prompts(in real-time)before they are sent to the modelMonitoring application logsSend all the requests(prompts)and responses to a monitoring system that checks the compliance,accuracy,an

20、d quality of the resultsValidating model responsesVerify the compliance and quality of the results in real-time before they are returned to the user,and block them or provide an alternative responseLLM GatewayMonitor LLM usage by use-case/user,and log all LLM activities for governance control2024 Da

21、tabricks Inc.All rights reserved9Key elements of data management in gen AIText cleansing and correctingToxicity detection and filteringBias detection and mitigationPrivacy(PII)protectionDe-duplicationFormatting and taggingKeyword and metadata extractionSplitting and chunkingTokenization and embeddin

22、gData pipeline orchestrationData lineage and traceabilityResource management and observabilityVector DB+K/V StoreData securityData governanceData VersioningCatalog and labelingData qualityGenAI Data Transformation TasksMany of those run deep learning tasks and can benefit from GPUsData SourcesData M

23、anagementIndexed/Featured dataData TransformationsMetadata management and orchestrationVector DB examplesPipeline management examplesAzure AI Search2024 Databricks Inc.All rights reserved10Efficient data processing addresses risks and improves qualityExample:LLM Data preparation pipelineAlice is wri

24、ting a paper about LLMs.#$&Alice is writing a paper about LLMsAlice is writing a paper about LLMs.Alice is writing a paper about LLMs.Replace(Alice)is writing a paper about LLMs.Encode(Somebody is writing a paper about LLMs.)32,145,66,79,12,56,Raw CorpusQuality FilteringDe-duplicationPrivacy Reducti

25、onReady to search or fine-tune!TokenizationLanguage FilteringMetric FilteringStatistic FilteringKeyword FilteringSentence-levelDocument-levelSet-levelDetect Personality Identifiable Information(PII)Remove PIIReuse Existing TokenizerSentencePieceByte-level BPELow quality data yields low quality model

26、s Requests and responses must be validated to lower risks Prompt engineering and result formatting require data processing 2024 Databricks Inc.All rights reservedApplication Example 1:Real-time agent copilotLive recording on 2 channels(agent-client)LLM-generated follow-up questions based on provided

27、policy and chat contextDynamic highlighting of latest LLM suggestions,linked to triggering messageContinuous sentiment monitoring to optimize assistant suggestionsReal-time audio-to-text transcription with OpenAI whisper-small deployed on small GPUs,with a streaming rate of 0.5 sec and Voice Activit

28、y Detection(Silero VAD)LLM-based assistant suggestions through OpenAI GPT-4Providing real-time,in context suggestions to agentsTriggering RAG after each first client message112024 Databricks Inc.All rights reserved122024 Databricks Inc.All rights reservedApplication Example 2:Call center analyticsCl

29、ean and process AudioConvert to“Chat like”textText post-processingSummarizationClient/Agent AssessmentClassificationTopics ExtractionFlow tracking and agent trainingLLM processingPost-processing and storageSentimentsEmpathyProfessionalism Call categoryRelated productsCall success factorCommon compla

30、intsRequested featuresKeywordsShort,up to 50 words summary of the entire callAgent compliant with call scriptsAlternative answer suggestionsAgent ratingUsing smart prompts with relevant context to generate structured datasets with the analysis per call Call filesApplication Flow132024 Databricks Inc

31、.All rights reservedDemo:Smart client calls analysis Call Summary Tracking a call progress and making recommendationsAgent Summary 142024 Databricks Inc.All rights reservedApplication Example 2:Call center excellence gen AI application for a financial institutionClient contextClient is a leading ban

32、k,with 3.5M customers,and 120%YoY growth.On a mission to become the countrys first mobile bankAdvanced AI capabilities,“AI factory”in motion,60+AI practitionersClient wanted to improve the call centre service leveraging Gen AISimple GenAI implementation was too slow,resource intensive and impractica

33、lRequirements were to analyse call centre calls and feed the results to downstream applications,e.g.,agent support,tailoredrecommendationsPrivacy and security regulations required an on-prem solutionMcKinsey&Company15Impact2xImprovement in run time with MLRun10 xImprovement in cost/performance with

34、additional optimizations3xImprovement in GPU utilizationChallengesVariable audio quality,background noises,overlap and volume issuesBalancing cost and performanceOptimizing GPU usage due to limited on-prem GPU capacityLLMs for Non-EnglishLanguagesData privacy regulations for PII require an on-prem s

35、olutionWhat we didIguazios MLRun infrastructure(Azure/On Prem)Audio calls diarizationAudio transcribing to text filesAutomated PII filtering,other processingAnalysis and sentiment summarization using LLMNon-English to English language translation2024 Databricks Inc.All rights reservedApplication Exa

36、mple 3:Smart virtual assistant-LLMSave statePost-process&FormatValidate&MonitorResponse Processing Get user&session stateSession history/stateRefine with historyClassify&OrchestrateText to SQLQueryLLMQ&A ChainStructured Questions ChainStateful Classification&OrchestrationUser FeaturesFront EndSecuri

37、tyData SourcesData PipelinesVector DBSQLData TransformationsNo/SQLStructured dataEmbeddingsApplication PipelineMonitoring&ValidationLabelAlertsMonitoring&FeedbackData StoresPre-processML&CI/CD PipelineTuning,Validation,and Deployment+Extract entitiesFind DocsLLM2024 Databricks Inc.All rights reserve

38、d17DEMOSmart virtual assistant2024 Databricks Inc.All rights reserved18MLRun for gen AIAutomate the flow of data prep,tuning,validating,and optimizing the LLM to specific data efficiently over elastic resources(CPUs,GPUs,.)Rapidly deploy scalable real-time serving and application pipelines that host

39、 the LLM as well as the required data integration and business logic.Built-in monitoring for the LLM data,training,model,and resources,with automated model re-tuning.Open solution with support for various LLMs and flexible deployment options(any cloud,on-prem).https:/ gen AI application recipes and

40、componentsReal-time multi-stage inference pipelineAutomated monitoring&trackingScalable data,training,&validation pipelineBuilt-in recipes and components2024 Databricks Inc.All rights reserved19Heres a practical approach to scaling gen AI in productionUsing a simple,effective approach that focuses o

41、n business value and cost reduction:Leverage existing LLMs with fine tuning or prompt engineering 1Personalize them using your proprietary data2Implement measures to reduce risks:Data quality,testing,guard rails,human feedback3Keep your design flexible to embrace change54Build a scalable,automated,and continuous development environment

友情提示

1、下載報告失敗解決辦法
2、PDF文件下載后,可能會被瀏覽器默認打開,此種情況可以點擊瀏覽器菜單,保存網頁到桌面,就可以正常下載了。
3、本站不支持迅雷下載,請使用電腦自帶的IE瀏覽器,或者360瀏覽器、谷歌瀏覽器下載即可。
4、本站報告下載后的文檔和圖紙-無水印,預覽文檔經過壓縮,下載后原文更清晰。

本文(使用 Databricks、MosiacML 和 MLRun 構建您的第一個 GenAI 應用程序.pdf)為本站 (張5G) 主動上傳,三個皮匠報告文庫僅提供信息存儲空間,僅對用戶上傳內容的表現方式做保護處理,對上載內容本身不做任何修改或編輯。 若此文所含內容侵犯了您的版權或隱私,請立即通知三個皮匠報告文庫(點擊聯系客服),我們立即給予刪除!

溫馨提示:如果因為網速或其他原因下載失敗請重新下載,重復下載不扣分。
客服
商務合作
小程序
服務號
折疊
午夜网日韩中文字幕,日韩Av中文字幕久久,亚洲中文字幕在线一区二区,最新中文字幕在线视频网站