AIGC 浪潮下 WebNN 的演進與實踐 (The Evolution and Practice of WebNN in the AIGC Wave)
Speaker: 付俊偉 (Junwei Fu)

- 胡寧馨 (Ningxin Hu): Intel Principal Engineer, drafter and main editor of the W3C Web Neural Network (WebNN) specification, Chromium committer, and main owner of the Chromium WebNN component.
- 張敏: technical manager of the Intel WebNN team, developer on Chromium and the ONNX Runtime WebNN EP, and author of the WebNN developer preview.
- 付俊偉 (Junwei Fu): Intel Senior Software Engineer, Chromium committer, designer of the Chromium WebNN infrastructure, and main developer of the Chromium Shape Detection API.
Table of Contents (目錄)
01 Background of WebNN
02 WebNN architecture design
03 How to use WebNN
04 WebNN performance comparison

Demo: Stable Diffusion (https://microsoft.github.io/webnn-developer-preview/)
WebNN Execution Provider of ONNX Runtime Web with GPU acceleration from DirectML, running on an Intel Core Ultra 7 processor 155H with integrated Arc GPU.
Prompt: "A cat under the snow". Pipeline: Text Encoder → Image Generation (UNet, steps 1 to 4) → Image Decoder.
WebNN operations are mapped to the native ML APIs, for example:

WebNN operation | DirectML | TFLite | CoreML
matMul | GEMM | BATCH_MATMUL | matmul
gather | GATHER | GATHER | gather_along_axis
sigmoid | LOGISTIC | ACTIVATION_SIGMOID | sigmoid
softmax | SOFTMAX | ACTIVATION_SOFTMAX | softmax
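As a rough illustration of how these operations are expressed on the Web side, the sketch below invokes them through MLGraphBuilder (the WebNN graph builder described in the architecture section below); the shapes, the gather axis, and the softmax axis are assumptions added for the example, and the browser lowers each call to the native operation listed above for the chosen backend.

```javascript
// Illustrative sketch only: calling the four table operations via MLGraphBuilder.
// Shapes and axes are made up for the example; argument details follow recent spec drafts.
const context = await navigator.ml.createContext();
const builder = new MLGraphBuilder(context);

const a = builder.input('a', { dataType: 'float32', dimensions: [2, 4] });
const b = builder.input('b', { dataType: 'float32', dimensions: [4, 3] });
const indices = builder.input('indices', { dataType: 'int32', dimensions: [2] });

const mm = builder.matmul(a, b);                     // DirectML GEMM, TFLite BATCH_MATMUL, CoreML matmul
const g = builder.gather(mm, indices, { axis: 0 });  // GATHER, GATHER, gather_along_axis
const s = builder.sigmoid(g);                        // LOGISTIC, ACTIVATION_SIGMOID, sigmoid
const out = builder.softmax(s, 1);                   // SOFTMAX, ACTIVATION_SOFTMAX, softmax
```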
The Web ML stack:
- Use cases (運用場景): Noise Suppression, Image Classification, Background Segmentation, Object Detection, Natural Language, Windows Studio Effects
- Frameworks (框架): TensorFlow.js, ONNX Runtime Web, MediaPipe Web, Transformers.js
- Web APIs and Web engine (Web API / Web 引擎): WebNN, WebAssembly, WebGPU and other API extensions, hosted in a Web Browser (e.g., Chrome/Edge) or a JavaScript Runtime (e.g., Electron/Node.js)
- OS ML APIs (系統 ML APIs): DirectML, CoreML, TFLite, other ML OS APIs
- Hardware (硬件): CPU, GPU, NPU
The WebNN programming model:
- MLContext: created with a device type (cpu/gpu/npu) and a power preference (high-perf/low-power).
- MLGraphBuilder: describes the computational graph on the Web side, e.g. input, filter and bias flowing through conv2d → tmp → add → tmp → relu → output.
- MLGraph: the compiled graph on the native side, where the backend can fuse conv2d, add and relu into a single fused conv2d; compute() runs it against input buffers and output buffers (CPU/GPU).

WebNN brings a unified abstraction of neural networks to the Web.
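A minimal JavaScript sketch of this flow, assuming the draft of the API in which operand descriptors use a dimensions field and MLContext exposes compute(); the tensor shapes and buffer sizes are placeholders:

```javascript
// Create a context: device type and power preference correspond to the options above.
const context = await navigator.ml.createContext({
  deviceType: 'gpu',                    // 'cpu' | 'gpu' | 'npu'
  powerPreference: 'high-performance'   // or 'low-power'
});

// Describe the Web-side computational graph: conv2d -> add -> relu.
const builder = new MLGraphBuilder(context);
const input = builder.input('input', { dataType: 'float32', dimensions: [1, 3, 224, 224] });
const filter = builder.constant(
  { dataType: 'float32', dimensions: [16, 3, 3, 3] },
  new Float32Array(16 * 3 * 3 * 3)      // placeholder weights
);
const bias = builder.constant(
  { dataType: 'float32', dimensions: [1, 16, 1, 1] },
  new Float32Array(16)                  // placeholder bias
);
const tmp1 = builder.conv2d(input, filter);
const tmp2 = builder.add(tmp1, bias);
const output = builder.relu(tmp2);

// build() compiles the graph; the native backend may fuse it into a single conv2d.
const graph = await builder.build({ output });

// compute() executes the compiled graph against pre-allocated input/output buffers.
const results = await context.compute(
  graph,
  { input: new Float32Array(1 * 3 * 224 * 224) },
  { output: new Float32Array(1 * 16 * 222 * 222) }
);
```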
WebNN API call flow and dataflow in Chromium:
- Apps/Frameworks: the Web Application and JS ML frameworks call WebNN alongside other Web APIs.
- Renderer process: MLContext, MLGraphBuilder and MLGraph are backed by a WebNN Mojo client.
- GPU/Utility process: the WebNN Mojo server, reached over IPC, dispatches to a backend: the DirectML backend on Windows (MCDM), the CoreML backend on macOS (BNNS/MPS), or the TFLite backend with the XNNPACK delegate on Android/ChromeOS/Linux.
- The native ML APIs and OS drivers then execute on the hardware: CPU, GPU and NPU.
Framework integration status:
- TensorFlow Lite Web: prototype available.
- ONNX Runtime Web: shipped in the 1.18 release.
In both cases the Web Application calls into the framework, which executes through Wasm kernels, WebGL kernels, WebGPU kernels, or a WebNN graph in browsers with WebNN support; the WebNN graph is backed by native CPU, GPU and NPU kernels.
Graph partitioning: pre-processing and post-processing stay on the Wasm kernels, while the supported subgraph (input → Conv2d → intermediate → MatMul with weights and bias → intermediate) is offloaded as a WebNN graph.
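A hedged sketch of what enabling this looks like from an application, assuming onnxruntime-web 1.18+ and its 'webnn' execution provider options; the model path, input name and shape are placeholders:

```javascript
import * as ort from 'onnxruntime-web';

// Ask ONNX Runtime Web to place supported nodes on the WebNN EP; unsupported
// nodes fall back to the default Wasm kernels, as in the partitioning above.
const session = await ort.InferenceSession.create('./model.onnx', {
  executionProviders: [
    { name: 'webnn', deviceType: 'gpu', powerPreference: 'default' }
  ]
});

// 'input' is a placeholder feed name; use the actual input names of the model.
const input = new ort.Tensor('float32', new Float32Array(1 * 3 * 224 * 224), [1, 3, 224, 224]);
const results = await session.run({ input });
console.log(Object.keys(results));
```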
Demos (https://microsoft.github.io/webnn-developer-preview/):
- WebNN Execution Provider of ONNX Runtime Web with GPU acceleration from DirectML, running on an Intel Core Ultra 7 processor 155H with integrated Arc GPU.
- VanillaJS (plain JavaScript) use of the WebNN API, with NPU acceleration from DirectML, running on an Intel Core Ultra 7 processor 155H with integrated Intel AI Boost NPU.
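For a plain-JavaScript page like the NPU demo, availability of a given device type has to be probed at runtime. A hypothetical helper (not part of the demo) that prefers the NPU and falls back to GPU, then CPU, might look like this:

```javascript
// Hypothetical helper: probe WebNN device types in order of preference.
async function createPreferredContext(deviceTypes = ['npu', 'gpu', 'cpu']) {
  if (!('ml' in navigator)) {
    throw new Error('WebNN is not available in this browser');
  }
  for (const deviceType of deviceTypes) {
    try {
      return await navigator.ml.createContext({ deviceType });
    } catch (e) {
      // This device type is unsupported or failed to initialize; try the next one.
    }
  }
  throw new Error('No usable WebNN device found');
}

const context = await createPreferredContext(); // picks the Intel AI Boost NPU when exposed
```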
Performance: WebNN vs. native on CPU (XNNPack)
Configuration: Browser Chrome Canary 118.0.5943.0; DUT Dell/Linux/i7-1260P, single P-core; workloads: MediaPipe solution models (FP32, batch=1).
[Chart: "MediaPipe Models Inference Performance (Normalized / Higher is Better)"; series: Wasm SIMD (baseline 1.0), WebNN XNNPack, Native XNNPack; secondary axis: WebNN vs. Native ratio (%). WebNN XNNPack reaches roughly 1.8x to 4.4x over Wasm SIMD, close to native XNNPack (roughly 2.2x to 4.5x).]
Performance: WebNN DirectML vs. native DirectML on GPU
Configuration: Browser Chrome Canary 126.0.6459.0; OS Windows 11 Pro 23H2; DUT Asus Zenbook; CPU Intel(R) Core(TM) Ultra 7 155H 3.80 GHz; GPU Intel(R) Arc(TM) Graphics; GPU driver 31.0.101.5512.
[Chart: "WebNN DirectML vs. Native DirectML"; series: WebNN GPU, Native DirectML, and WebNN GPU vs. Native DirectML ratio; axes: inference time (ms, log scale) and percentage (%). The per-model WebNN-to-native ratios fall roughly between 71% and 96%.]
Performance: WebNN DirectML vs. native on the MTL NPU
Configuration: Browser Chrome Canary 126.0.6459.0; OS Windows 11 Pro 23H2; DUT Asus Zenbook; CPU Intel(R) Core(TM) Ultra 7 155H 3.80 GHz; NPU Intel(R) AI Boost; NPU driver 32.0.100.2381.
[Chart: "WebNN DirectML vs Native on MTL NPU"; models: MobileNetV2, SqueezeNet 1.0, ResNet50 v1, EfficientNet Lite 4; series: WebNN DirectML NPU, Native NPU, WebNN NPU vs. Native; axes: inference time (ms) and WebNN vs. Native (%); per-model ratios: 62.7%, 95.8%, 73.4%, 86.1%.]
The average performance of the listed 4 models on WebNN DirectML is about 80% of native DML on the MTL NPU.
Demo: Speech to Text PoC for Khan Academy Khanmigo. WebNN Execution Provider of ONNX Runtime Web with NPU acceleration from DirectML, running on an Intel Core Ultra 7 processor 155H with integrated Intel AI Boost NPU.

THANKS
Large Language Model Is Redefining The Software