計算基礎設施協同設計的架構挑戰與創新.pdf

編號:158271 PDF 21頁 4.20MB 下載積分:VIP專享
下載報告請您先登錄!

計算基礎設施協同設計的架構挑戰與創新.pdf

1、OCP Global Summit October 18,2023|San Jose,CASYM Title SlidePeipeiZhouAssistantProfessor,University of PittsburghArchitectural Challenges and Innovation for Compute Infrastructure Co-DesignSYM-ContentGenerativeAIModels:ChatGPTSYM-ContentGenerativeAIModels:StableDiffusion,Dall-ESYM-ContentTransformer

2、ModelsSYM-ContentProfiling Transformer based model,DeiT-T,on Nvidia GPU T4(TSMC12 nm)Low TensorCores utilization for INT8 MM kernels.TensorRT adopts an implicit quantization policy,which leads to BMM computing in FP32,which could originally be in INT8.The quan/dequan between FP32 and INT8 consumes n

3、on-negligible GPU cycles The data layout change also consumes nonnegligibleGPU cycles The nonlinear kernels,e.g.,Softmax,GeLU,Layernorm,take significant GPU cyclesKernelBreakdownSYM-ContentFPGA vs.GPU?GPU+FPGA?SYM-ContentVersal ACAP ArchitectureDDR4-DIMMAIE ArrayIOAIEVLIWProcessor32KB Mem25.6 GB/s1.

4、2 TB/sProgrammable LogicBRAMURAMCLBDSPNOCProcessor System(ARM)HeterogeneousAcceleratorArchitectureFine-GrainedPipelineINTNon-linear Functions(Softmax,GELU)01234567DeiT-256LV-ViT-TDeiT-TDeiT-160GPU TensorRTACAP CHARM(ours)ReducesLatencyby10 x overNvidia GPUT45.7x10.3x7.3x8.9xFromHeterogeneous Modelst

5、oHeterogeneous SystemComputation-Communication AwareScale-Out?SYM-ContentH2H:heterogeneous model to heterogeneous system mapping with computation and communication awareness,DAC 2022LowerLatency,LowerEnergyH2H:heterogeneous model to heterogeneous system mapping with computation and communication awa

6、reness,DAC 2022https:/ Modelsto Heterogeneous Chiplet SystemswithHeterogeneousComponentsComputation&Communication AwareHierarchical Scheduling&MappingLatencyvsThroughputChiplet?Sustainability?Source of CO2e from Meta DatacentersRepackaging ChipletsNSF CCF#2324864:Collaborative Research:DESC:Type II:

7、REFRESH:Revisiting Expanding FPGA Real-estate for Environmentally Sustainability Heterogeneous-SystemsSustainability?NSF CCF#2324864:Collaborative Research:DESC:Type II:REFRESH:Revisiting Expanding FPGA Real-estate for Environmentally Sustainability Heterogeneous-SystemsImagePeipei Zhou is an assist

8、ant professor of the Electrical Computer Engineering department at the University of Pittsburgh.Her research interests include designautomation,hardware/software co-design,AI chipdesign,etc.She has participated in$11M FederalFunds($2M as Lead PI).Her work in FPGA acceleration for deep learning won the 2019 Donald O.Pederson Best Paper Award from the IEEE Council for Design Automation(CEDA).Herworks have also won 2018 ISPASS Best Paper Nominee and 2018 ICCAD Best Paper Nominee.https:/peipeizhou-eecs.github.io/peipei.zhoupitt.eduOCP Global Summit|October 18,2023|San Jose,CASYM-End

友情提示

1、下載報告失敗解決辦法
2、PDF文件下載后,可能會被瀏覽器默認打開,此種情況可以點擊瀏覽器菜單,保存網頁到桌面,就可以正常下載了。
3、本站不支持迅雷下載,請使用電腦自帶的IE瀏覽器,或者360瀏覽器、谷歌瀏覽器下載即可。
4、本站報告下載后的文檔和圖紙-無水印,預覽文檔經過壓縮,下載后原文更清晰。

本文(計算基礎設施協同設計的架構挑戰與創新.pdf)為本站 (張5G) 主動上傳,三個皮匠報告文庫僅提供信息存儲空間,僅對用戶上傳內容的表現方式做保護處理,對上載內容本身不做任何修改或編輯。 若此文所含內容侵犯了您的版權或隱私,請立即通知三個皮匠報告文庫(點擊聯系客服),我們立即給予刪除!

溫馨提示:如果因為網速或其他原因下載失敗請重新下載,重復下載不扣分。
客服
商務合作
小程序
服務號
折疊
午夜网日韩中文字幕,日韩Av中文字幕久久,亚洲中文字幕在线一区二区,最新中文字幕在线视频网站