當前位置：首頁 > 報告詳情

語言模型：預測、可解釋性及其他.pdf

上傳人：蘆葦編號：651624 2025-05-01 PDF PDF 51頁 7.75MB

該報告所屬合集： 2025人工智能與電力峰會（AI and Electric Power Summit）嘉賓演講PPT合集

打包下載報告合集

文檔加載中……請稍候！
如果長時間未打開，您也可以點擊刷新試試。

下載報告到電腦，查找使用更方便

VIP專享文檔

書簽

分享

收藏

已收藏

版權投訴

/51

立即下載

word格式文檔無特別注明外均可編輯修改，預覽文件經過壓縮，下載原文更清晰！

三個皮匠報告文庫所有資源均是客戶上傳分享，僅供網友學習交流，未經上傳用戶書面授權，請勿作商用。

《語言模型：預測、可解釋性及其他.pdf》由會員分享，可在線閱讀，更多相關《語言模型：預測、可解釋性及其他.pdf（51頁珍藏版）》請在三個皮匠報告上搜索。

1、Transformer Based Time-Series Language Model for Energy ForecastingArtificial Intelligence(AI)and Digital Transformation(DX)Electric Power SummitDr.Satish Natti&Dr.Parul Arora01/07/2025www.gridaxon.io 1Contents Introduction Model Framework Case Study Dataset Description Results Comparison Conclusion

2、1/7/25www.gridaxon.io 2Introduction1/7/25www.gridaxon.io 3CustomersProject DevelopersIPPsPower MarketersISOs/UtilitiesService AreasSite IdentificationInterconnection ProcessGrid ModernizationNERC ComplianceExperience70+years of Cumulative experience30+GW projects across all marketsSite identificatio

3、n to Structuring PPAs AnalysisFeasibility StudiesSystem Impact StudiesEconomic StudiesModeling PSSe/PSLF/ASPEN/PSCAD/PROMOD/TARAIntroduction1/7/25www.gridaxon.io 4Key Idea Use of language models like T5 and GPT-2 for probabilistic time-series energy forecasting Motivation Time series tasks lack a ge

4、neral-purpose,pretrained framework like those in NLPChallenges Limited availability of high-quality time series datasets compared to text Existing forecasting models require dataset-specific fine-tuning and are resource-intensiveObjective Develop a unified,pretrained framework for time series with z

5、ero-shot and probabilistic forecasting capabilities To develop a model that is valuable in situations where historical time-series data is unavailable or when quick results are needed without extensive pre-trainingIntroduction Need for Energy Forecasting Balance supply and demand Reliable power deli

6、very by balancing generation to consumption Cost optimization Optimal resource allocation to lower the costs Renewable energy integration Manage intermittency and curtailment Energy markets Improve trading efficiency and reduce financial risk1/7/25www.gridaxon.io 5Model Framework A time series is tr

7、ansformed into a sequence of tokens via scaling and quantization Then a language model is trained on these tokens using the cross-entropy loss.Once trained,probabilistic forecasts are obtained by sampling multiple future trajectories given the historical context The models in this work are based on

8、the T5 architecture(Text-To-Text Transfer Transformer)Advantages Minimal architecture changes to existing LLMs Compatible with standard language modeling objectives like cross-entropy loss Comparison Traditional methods:domain-specific tuning.TS-LLM:universal framework with pretraining on large,dive

9、rse datasets1/7/25www.gridaxon.io 6Fig.1-(Left)The input time series is scaled and quantized to obtain a sequence of tokens.(Center)The tokens are fed into a language model which may either be an encoder-decoder or a decoder only model.The model is trained using the cross-entropy loss.(Right)During

10、inference,we autoregressively sample tokens from the model and map them back to numerical values.Multiple trajectories are sampled to obtain a predictive distribution.Case Study:Data Description1/7/25www.gridaxon.io 7 The study analyses six(6)distinct energy sources:Coal,Gas,Gas-Combined Cycle(Gas-C

11、C),Hydro,Solar,and Wind.Data is obtained from publicly available resources().The datasets cover a temporal range from 2019 to 2023,representing energy generation metrics for each fuel type.FuelMean(MWh)Median(MWh)Std.Dev.(MWh)Variance(MWh)Minimum(MWh)Maximum(MWh)Skewness(MWh)Kurtosis(MWh)Wind 2706.5

12、312597.1581464.4722144678.8055.2746725.8640.268-0.848Coal2021.9471989.336591.791350216.746527.6353477.2540.150-0.741Gas809.919501.667825.820681977.85260.1344907.3111.8583.341Gas-CC4343.9654284.1431651.6542727961.930652.1568002.1630.032-0.842Hydro15.90511.69916.148260.7610.000101.8531.5062.646Solar48

13、7.5355.429774.150599308.5680.0003516.4891.7642.313Case Study:Data Description1/7/25www.gridaxon.io 8Case Study:Results1/7/25www.gridaxon.io 9Case Study:Results1/7/25www.gridaxon.io 10Case Study:Results1/7/25www.gridaxon.io 11Case Study:Comparison1/7/25www.gridaxon.io 12Note:Validation Score and Pred

14、iction time lower is the betterConclusions The TS-LLM model can process multiple time-series datasets from different domains simultaneously,without training each time-series individually.TS-LLM offers scalable performance across diverse datasets while maintaining a consistent architecture.With pre-t

15、rained models that enable zero-shot forecasting,TS-LLM delivers accuracy comparable to or better than traditional models,especially for median(P50)forecasts,even with minimal or no fine-tuning.This work represents the first instance of testing various energy fuels data using LLMs,demonstrating the v

16、ersatility of LLMs in energy forecasting applications.1/7/25www.gridaxon.io 13References Alexander Alexandrov,Konstantinos Benidis,Michael Bohlke-Schneider,Valentin Flunkert,Jan Gasthaus,Tim Januschowski,Danielle C Maddix,Syama Rangapuram,David Salinas,Jasper Schulz,et al.GluonTS:Probabilistic and N

17、eural Time Series Modeling in Python.The Journal of Machine Learning Research,21(1):46294634,2020.33 Oliver Borchert,David Salinas,Valentin Flunkert,Tim Januschowski,and Stephan Gnnemann.Multiobjective model selection for time series forecasting.arXiv preprint arXiv:2202.08485,2022.21,43 Nate Gruver

18、,Marc Finzi,Shikai Qiu,and Andrew Gordon Wilson.Large Language Models Are Zero-Shot Time Series Forecasters.In Advances in Neural Information Processing Systems,2023.Rob J Hyndman and George Athanasopoulos.Forecasting:principles and practice.OTexts,2018.1,9 Parul Arora,BK Panigrahi,and PN Suganthan.

19、Shallow neural networks to deep neural networks for probabilistic wind forecasting.In 2021 Inter-national Conference on Computing,Communication,and Intelligent Systems(ICCCIS),pages 377382.IEEE,2021.Parul Arora,Abbas Khosravi,BK Panigrahi,and PN Suganthan.Remodelling state-space prediction with deep

20、 neural networks for probabilistic load forecasting.IEEE Transactions on Emerging Topics in Computational Intelligence,2021.Parul Arora,Himanshu Kumar,and BK Panigrahi.A comparative study for short term wind speed forecasting using statistical and machine learning approaches.In 2018 2nd IEEE Interna

21、tional Conference on Power Electronics,Intelligent Control and Energy Systems(ICPEICES),pages 200205.IEEE,2018.1/7/25www.gridaxon.io 14References Yuqi Nie,Nam H.Nguyen,Phanwadee Sinthong,and Jayant Kalagnanam.A time series is worth 64 words:Long-term forecasting with transformers.In International Co

22、nference on Learning Representations,2023.Alec Radford,Jeffrey Wu,Rewon Child,David Luan,Dario Amodei,Ilya Sutskever,et al.Language models are unsupervised multitask learners.OpenAI blog,1(8):9,2019.Syama Sundar Rangapuram,Matthias W Seeger,Jan Gasthaus,Lorenzo Stella,Yuyang Wang,and Tim Januschowsk

23、i.Deep state space models for time series forecasting.Advances in neural information processing systems,31,2018.David Salinas,Valentin Flunkert,Jan Gasthaus,and Tim Januschowski.Deepar:Probabilistic forecasting with autoregressive recurrent networks.International Journal of Forecasting,36(3):1181119

24、1,2020.Ruofeng Wen,Kari Torkkola,Balakrishnan Narayanaswamy,and Dhruv Madeka.A Multi-Horizon Quantile Recurrent Forecaster.arXiv:1711.11053,2017.Chen Xu and Yao Xie.Conformal Prediction Interval for Dynamic Time-Series.In International Conference on Machine Learning,pp.1155911569.PMLR,2021.Hao Xue a

25、nd Flora D.Salim.PromptCast:A New Prompt-based Learning Paradigm for Time Series Forecasting.arXiv:2210.08964,2023.Ansari,A.F.,Stella,L.,Turkmen,C.,Zhang,X.,Mercado,P.,Shen,H.,.&Wang,Y.(2024).Chronos:Learning the language of time series.arXiv preprint arXiv:2403.07815.1/7/25www.gridaxon.io 15XFRTran

26、sformComputingEnabling Widespread Use of Local Renewable Generation for New Large Loads via Explainable AISteve Reinhardtstevexfr.aiTransform Computing,Inc.XFRTransformComputingSupporting new large loads via local renewable generation(with storage)is an ideal solution.XFR de-risks and accelerates it

27、s practical adoption via explainable-AI-based analytics for mesogrids.XFRTransformComputingXFRTransformComputingAgendaWhy is renewable generation(plus storage)ideal for new large loads?What precludes it already being deployed today?How do(scaled)microgrids benefit the grid transition?How does XFRs e

28、xplainable AI accelerate widespread use of local renewable generation for new large loads?How does XFRs explainable AI work?When is XFRs explainable AI deployed?XFRTransformComputingXFRTransformComputingLocal Renewable Generation is an Ideal Solution for New Large LoadsSolar and wind are predicted t

29、o be the cheapest energy sources even when factoring in storage required by their intermittencyMinimal climate footprintAvoids interconnection delayshttps:/ Preventing Renewable Mesogrid Deployment mesogrid=a microgrid(architecturally)scaled to large sizeRenewable generation is intermittent(i.e.,not

30、 firm)and hence requires storage,and management of generation/storage/load at scale is hardStorage is too expensive for large loads but expected to dropUntil recent inability to find power for new AI data centers,little urgency to press renewable generation this hardHence,little focus on scaling and

31、 stressing microgrids and analyticsBattery cost projections for 4-hour lithium-ion systemsCole,NREL,https:/www.nrel.gov/docs/fy23osti/85332.pdfXFRTransformComputingXFRTransformComputingArchitectural Value of Microgrids in Energy TransitionMicrogrid=a set of interconnected loads and distributed energ

32、y resources that acts as a single controllable entity with respect to the grid.It can operate in either grid-connected or in island mode.Ongoing,delivers resilience via separation/reconnectionLimits the impact of failures in a single microgridDelivers local ability to survive wider failuresDuring th

33、e energy transition,enables portions of the grid to evolve at different rates.Portions that are less well served can accept greater risk to explore new high-value capabilities.Portions that are more reliability-focused dont have to move as fast.XFRTransformComputingXFRTransformComputingXFRs Analytic

34、s Accelerate Mesogrid AdoptionKey analytic issues limiting mesogrid adoptionElectromagnetic transient(EMT)detection and mitigationSituational awareness esp.of emergent issuesCybersecurity within the mesogridBased on XFRs innovative explainable-AI(XAI)capabilitiesBasic AI is inscrutable;humans direct

35、ing high-value activities need rationale for actionXFR complements AI(language models)with advanced math(e.g.,persistence diagrams)Advanced math provides a math sanity check on the AI results Language model translates between grid concepts and(obscure)math concepts to empower grid operatorsXFRTransf

36、ormComputingXFRTransformComputingReturn to Overview slideUse Case:Enabling Grid Operator to Respond to Mesogrid Issues via XAI Human grid operator can specifydesired change in grid concepts Grid Assistant translates(via LM)to code/math and simulates,then converts baseline and modified results each t

37、o persistence diagrams and passes both to LM as evidence of the benefit;LM uses evidence to recommend Persistence diagrams readily summarize large dataXAI:a)use LM to translate between grid and math/code concepts,and b)use LM to interpret persistence diagrams XFRTransformComputingXFRTransformComputi

38、ngXFR Grid Assistant Next Steps Prototype working on small grid configuration;needs validation,scaling,etc.XFR seeking partners with urgency to deploy mesogrids to give feedback as we evolve the prototypeTarget mesogrids:renewably powered AI data centers,military bases XFR seeking angel investorsXFR

39、 seeking renewable-grid expertise via product manager and lead engineerXFRTransformComputingXFRTransformComputingBackup 2025 Electric Power Research Institute,Inc.All rights Agnieszka CzeszumskaTechnical Leader I,EPRI Europe DACAI/DX SummitDecember 7th,2025Beyond the chatbotLLM:friend of foe?2025 El

40、ectric Power Research Institute,Inc.All rights reserved.27Data-Driven Decision Making(3DM)Applying Data Science in the Nuclear Power IndustryLeverage data science for the Nuclear Power industryLaunch&support activities across the Nuclear SectorGeneral application areas Insights:learning from the pas

41、tPrognostics:anticipating the futureAutomation:increasing reliabilityOptimization:increasing efficiencyMore details on 3DM program pageProjectsResults 2025 Electric Power Research Institute,Inc.All rights reserved.28 Initiate key proof of concepts Sponsor early implementation&tech transferLaunchOver

42、all ApproachProof of ConceptEarly ValueDeployment3DMHome R&D Issue Program AI projects across the R&D sectors Strategic initiativesSupportSoftware 2025 Electric Power Research Institute,Inc.All rights reserved.29Current R&D internal PoC projects that 3DM is involved in Operating Experience(OE)Consol

43、idation search tool in testing,summarization and data formatting in progressPart 21 data ingested,data extraction and cleaning in progressAI-assisted Fuel Failure Identification work ongoingNuclear Decommissioning Knowledge Management data ingested,evaluations ongoing,UX work startedNMAC Troubleshoo

44、ting Tool data ingestion started 2025 Electric Power Research Institute,Inc.All rights reserved.30Part 21 Reports DatabasePI:Allison Read 2025 Electric Power Research Institute,Inc.All rights reserved.31Part 21 Reporting of Defects and NoncompliancePart 21 reports are submitted to the NRC.These repo

45、rts are required when a licensed facility,activity,or basic component contains defects or failures to comply with NRC regulations that could create a substantial safety hazard.1000s of reports from NRC website in pdf format are difficult to parse through and search for relevant information quicklyNa

46、tural Language Processing 2025 Electric Power Research Institute,Inc.All rights reserved.32Part 21 ReportsThe goal is to extract each section(i)-(ix)and information from them into a structured databaseMany text extraction challengesAccession Number(i)(ii)(ii)ML101820160GE Hitachi Nuclear EnergyThe b

47、asic component that failed is a High Pressure Coolant Injection SystemThe diaphragms are manufactured by Chicago-Allis as commercial grade.They are dedicated and sold as safety-related by GE Hitachi Nuclear Energy 2025 Electric Power Research Institute,Inc.All rights reserved.33Processing PDFsPDFs a

48、re processed to remove footers and headers,and text saved into a database for further processingSome PDFs are unreadable,and some spelling mistakes occur due to processing(i.e.1 instead of I)tder,tifition Jl the:btJs.fcX)mponent supplled for such,facility or such activity within the United.Stat.,:s,

49、whir;tJ may feil to comply,or co,talns pot(;)nti defect._ 2025 Electric Power Research Institute,Inc.All rights reserved.34Extraction with standard NLP techniques“Standard”NLP:Using regular expressions and fuzzy search to grab specific sectionsChallenge 1:Two-thirds of sections missingSome reports d

50、ont follow the standard structureChallenge 2:More granular information such as part number would be usefulCan LLMs be used to extract information and summarize it instead?2025 Electric Power Research Institute,Inc.All rights reserved.35Challenge 1:Missing sectionsUse semantic search to grab relevant

51、 information and extract dataEquivalent to using RAG approachAddress challenge 2 as wellRequires prompt-tuning,i.e.designing the prompt that nudges the LLM to answer correctly,in a specific format(i.e.JSON)https:/ 2025 Electric Power Research Institute,Inc.All rights reserved.36Challenge 2:Extract s

52、pecific information with LLMsAdds value to the database by classifying and grabbing specific information,for example model numbersLLM often not reliableHallucinations can be quite obvious but not always!https:/ 2025 Electric Power Research Institute,Inc.All rights reserved.37Recovering missing secti

53、onsAble to extract x2 more dataAccuracy needs to be checked against ground truth data!2025 Electric Power Research Institute,Inc.All rights reserved.38Evaluating text:metrics to compare with ground truthSAS,Cosine Similarity:normally a gold standard in NLP text comparisons,but may not be applicable

54、to comparing data with no semantic meaning like part numbers of manufacturer namesFuzzy string matching:token set ratio can compare similar words,taking care of misspellings and word orderShared words(expand acronyms,remove stopwords like inc,ltd)Choose metrics based on use case and evaluate how wel

55、l they score results 2025 Electric Power Research Institute,Inc.All rights reserved.39ConclusionsSemantic search is a great tool to find relevant portions of textExtraction of specific data with LLMs needs careful evaluation and prompt tuning to achieve acceptable results,but hallucinations still pr

56、esentCombining standard NLP techniques with LLMs,under careful evaluation,can add value to databases 2025 Electric Power Research Institute,Inc.All rights reserved.40 2025 Electric Power Research Institute,Inc.All rights TOGETHERSHAPING THE FUTURE OF ENERGYUse of Large Language Models to Improve Art

57、ificial Intelligence PredictionsAhmad Al Rashdan Ph.D.DOE LWRS ProgramPlant Modernization Pathway Lead1/7/2025 Extend the life and improve the performance of the existing fleet through modernized technologies and improved processes for plant operation and power generation.Develop modernization solut

58、ions that improve reliability and economic performance while addressing the U.S.nuclear industrys aging and obsolescence challenges.Deliver a sustainable business model that enables the U.S.nuclear industry to remain cost competitive.42Our Mission AI predictions often rely on short text inputs,posin

59、g a challenge in various applications for nuclear power plants.Examples of those applications:Classification of operator logs into startup,shutdown,or failure for availability and reliability calculations.Estimation of work activities duration for outage scheduling and optimization.Screening of cond

60、ition reports(CRs)for decisions such as condition adverse to quality(CAQ)or not CAQ(NCAQ).This limitation affects the development of classical Natural Language Processing(NLP)models capable of accurate predictions.43Problem StatementHistogram of CR Lengths Solution 1:Can we use LLMs to make predicti

61、ons?44Prompt:“Based on the information given,please elaborate.”Prompt:“Log:Reset panel BKR EQUIP ID LOCATION per CODE BKR tripped due to welding in the area.Based on the information given does this appear to be a shutdown of equipment or does it not appear to be a startup of equipment?”Prompt:“Make

62、an argument that the nuclear power plant operator log is a shutdown and another argument that it is a startup.”Potential Solutions Solution 2:Large Language Models(LLMs)excel at generating context and adding background information to text.Can we use LLMs to explain short texts to improve prediction

63、accuracy?45Using Generative AI Models in Classification of Operator Log EventsPrompt:“Please answer yes or no.Is this an equipment startup:OP LOG?”Prompt:“Please only give a number between 1 and 10 as your response.A startup score of 1 means definitely not an equipment startup.A startup score of 10

64、definitely represents an equipment startup.In this nuclear power plant operator log what is the startup score only:OP LOG?”TF-IDF+Ridge Regression (No Generative AI)F1 Score Precision/Recall Confusion Matrix 87.2%Default score of 66%93.2%/82.0%Predicted False True Actual False 47 3 True 9 41 F1 Scor

65、e Precision/Recall Confusion Matrix 71.0%Default score of 66%55.7%/98.0%Predicted False True Actual False 11 39 True 1 49 Generative AI(LLMs)46Using Generative AI Models to Enhance Input for Operator Log Events ClassificationMethod Shutdown F1 Score Startup F1 Score Failure F1 Score Mean TF-IDF+Ridg

66、e 83.2 87.2 69.0 79.8 Ratings 82.4 71.0 70.1 74.5 Vicuna Explain+TF-IDF+Ridge 88.0 83.8 70.1 80.6 Vicuna Elaborate+TF-IDF+Ridge 89.3 87.7 68.3 81.8 Prompt:“Please elaborate on this nuclear power plant operator log:LOG”Prompt:“Based on the information given in this nuclear power plant operator log do

67、es this appear to be a startup of equipment or does it not appear to be a shutdown of equipment:LOG”Prompt:“Make an argument that the nuclear power plant operator log is a shutdown and another argument that it is a startup:LOG”Explain PromptsElaborate PromptResults After Using Vicuna for Explanation

68、 and Elaboration 47Actual vs.Predicted Completion Time in DaysRegression Performance Using a Sliding WindowUsing Generative AI Models to Elaborate on Input for Estimation of Condition Resolution Time 48Threshold for Elaboration(Tokens)Number of CRs That Were Elaborated CRs below Threshold All CRs R2

69、 Before Elaboration(%)R2 After Elaboration(%)R2(%)Default 0 NA NA 51.65 100 511 32.7 34.6 51.67 150 6,768 34.3 37.0 51.97 200 19,247 40.8 41.8 52.44 250 31,626 45.38 45.26 52.59 Impact of Text Elaboration on Model Performance Using Generative AI Models to Elaborate on Input for Estimation of Conditi

70、on Resolution Time Model Performance as a Function of Token Count for 511 CRs with Fewer than 100 Tokens Each49Actual vs.Predicted Completion Time in Days After CR Size ElaborationUsing Generative AI Models to Elaborate on Input for Estimation of Condition Resolution Time Actual vs.Predicted Complet

71、ion Time in Days Before CR Size Elaboration Despite their ability to understand context,generative AI models are not necessarily more accurate than classical NLP methods for prediction.Generative AI models can provide missing context that may improve prediction,but when the text is well-composed,they could confuse the predictor and result in suboptimal outcomes.An optimal combination of generative AI and NLP-based methods can be achieved through careful examination of each case.50ConclusionsSustaining National Nuclear Assetslwrs.inl.gov51

相關圖表

本文主要介紹了使用大型語言模型（LLM）來改善人工智能預測的方法。主要內容包括： 1. 提出問題：傳統的自然語言處理（NLP）模型在處理核電站的短文本輸入時存在局限性，影響預測準確性。 2. 解決方案：使用LLM來生成上下文和背景信息，以提高預測準確性。例如，通過提示LLM來解釋操作日志，以改善日志分類的準確性。 3. 實驗結果：在操作日志分類任務中，使用LLM生成的提示可以提高F1分數，改善預測性能。在條件報告完成時間估計任務中，對條件報告進行文本擴充可以提高回歸模型的R2分數。 4. 結論：LLM可以提供缺失的上下文信息，從而提高預測準確性。但是，當文本已經很好地組合時，LLM可能會混淆預測器并導致次優結果。因此，需要仔細檢查每個案例，以實現基于LLM和NLP方法的優化組合。

"如何利用大語言模型進行能源預測？" "大語言模型在電力行業有哪些應用？" "如何通過大語言模型提高人工智能預測的準確性？"

相關報告

聯系我們

0731-84720580
sgpjbg002
工作日 9:30 - 18:00

關于我們

侵權處理

關于我們

出版物經營許可證
工信部備案號：湘ICP備17000430號-2
公安備案號：湘公網安備43010402001071號

三個皮匠報告專業的行業報告下載站，每日更新，歡迎大家關注！

copyright@2008-2013 長沙景略智創信息技術有限公司版權所有
網站備案/許可證號：湘B2-20190120

客服

小程序

服務號

折疊

午夜网日韩中文字幕,日韩Av中文字幕久久,亚洲中文字幕在线一区二区,最新中文字幕在线视频网站