Isometric Machine Translation for Subtitling (等長機器翻譯及配音字幕優化)
Hao Yang, Director of the Text Machine Translation Laboratory, Huawei

Contents
01 NMT Basics & Trends
02 What Is Isometric MT
03 Isometric MT Architecture
04 Isometric MT Applications

01 NMT Basics & Trends

Example: "There is a white dog on the grass." → "草地上有只白色小狗。"

The MT problem: a function f(x) that converts a sentence from one language into another ("Machine Translation", Xiao Tong, Zhu Jingbo, et al., 2021).
The SMT formulation: a translation model plus a language model; decoding computes the argmax over candidate translations.
The NMT formulation: a single end-to-end model, with no separate feature-extraction layer and no fine-tuning layer.

Encoder-decoder model: the encoder maps the source sentence into an encoded vector; the decoder generates the target sentence from that vector.
Example: "How to configure S5700 arp?" → "如何配置 S5700 arp?" At each target position the decoder chooses among candidates such as 如何/怎么, 配置/設置, S5700/S2700, and arp/ap.

Seq2Seq with neural networks: encoder and decoder are both RNNs, and the encoder's final hidden state serves as the encoded vector.

Decoding strategies:
- Greedy decoding: emit the single most probable token at each step.
- Beam search: keep several best hypotheses at each step.
- Sampling.

Results: SMT vs. NMT, BLEU 33.3 vs. 34.81; with SMT re-ranking, 33.3 vs. 36.6.

Seq2Seq model problems: the single context vector is a bottleneck, and performance degrades as sentences become longer. Attention addresses this by letting the decoder take a softmax-weighted look at all encoder states instead of one fixed vector.
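As a concrete illustration of the decoding strategies above, here is a minimal pure-Python beam search over a toy next-token distribution. The `toy_model` table is an invented stand-in for a real RNN or Transformer decoder, not anything from the talk; with `beam_size=1` the routine reduces to greedy decoding.

```python
import math

def beam_search(next_logprobs, bos, eos, beam_size=3, max_len=10):
    """Keep the best `beam_size` partial hypotheses at each step.

    `next_logprobs(prefix)` returns {token: log-prob} for the next
    token given the prefix; here it stands in for a decoder network.
    """
    beams = [([bos], 0.0)]          # (token sequence, cumulative log-prob)
    finished = []
    for _ in range(max_len):
        candidates = []
        for seq, score in beams:
            for tok, lp in next_logprobs(tuple(seq)).items():
                hyp = (seq + [tok], score + lp)
                (finished if tok == eos else candidates).append(hyp)
        if not candidates:
            break
        # Prune to the top `beam_size`; beam_size=1 is exactly greedy.
        beams = sorted(candidates, key=lambda h: h[1], reverse=True)[:beam_size]
    if not finished:                 # no hypothesis reached EOS in time
        finished = beams
    return max(finished, key=lambda h: h[1])

# Toy distribution: "set" looks best locally after <s>, but the
# "configure" path has higher total probability, so beam search
# finds a better complete hypothesis than greedy decoding does.
TABLE = {
    ("<s>",):                    {"set": 0.6, "configure": 0.4},
    ("<s>", "set"):              {"arp": 0.5, "</s>": 0.5},
    ("<s>", "configure"):        {"arp": 0.9, "</s>": 0.1},
    ("<s>", "set", "arp"):       {"</s>": 1.0},
    ("<s>", "configure", "arp"): {"</s>": 1.0},
}

def toy_model(prefix):
    probs = TABLE.get(prefix, {"</s>": 1.0})
    return {tok: math.log(p) for tok, p in probs.items()}
```

Greedy decoding commits to "set" and ends with total log-probability log(0.3), while a beam of 3 recovers the "configure arp" path with log(0.36).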
Performance on WMT14 EN-FR (BLEU): SMT 33.3, Seq2Seq 34.81, RNNSearch (RNN + attention) 36.15, Transformer 41.8.

Transformer (Ashish Vaswani et al., "Attention Is All You Need"):
- No RNN: attention plus position embeddings.
- Encoder self-attention, decoder self-attention, and cross-attention.
- On the WMT 2014 English-to-French task, a new single-model SOTA BLEU score of 41.8 after training for 3.5 days on eight GPUs.
(Course material: http://speech.ee.ntu.edu.tw/tlkagk/courses_DLHLP20.html)

Pre-trained model family.

More machine translation trends: multi-device (多設備), multi-screen (多屏幕), real-time (實時) translation across phone/tablet, wearable, car, and laptop.
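The attention mechanism the Transformer slides refer to is scaled dot-product attention from "Attention Is All You Need". The pure-Python sketch below is illustrative only; real implementations use batched matrix multiplies and multiple heads.

```python
import math

def softmax(xs):
    m = max(xs)                      # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V.

    Each argument is a list of equal-length float vectors; the result
    holds one context vector per query, a weighted mix of `values`.
    """
    d_k = len(keys[0])
    out = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
                  for k in keys]
        weights = softmax(scores)
        out.append([sum(w * v[j] for w, v in zip(weights, values))
                    for j in range(len(values[0]))])
    return out
```

In the Transformer, the same routine serves three roles: encoder self-attention (Q, K, V all from the source), decoder self-attention (all from the target prefix), and cross-attention (Q from the decoder, K and V from the encoder).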
Screens: tablet, phone, watch, TV, desktop, AR/VR.
(Sources: https://redian.news/news/12146; https://youtu.be/niM4ttonrrA)

02 What Is Isometric MT

Isometric MT generates translations while ensuring that the source (SRC) and the translation (MT) are of similar length.

Application scenarios:
- Automatic dubbing
- Subtitle fitting
- Simultaneous speech translation
- Layout-constrained translation

Isometric MT metrics:
- Translation quality (TQ): BLEU, BERTScore
- Length compliance (LC): LC, LR (length ratio)

IWSLT 2022 isometric MT task objectives:
- Translation directions: En→De/Fr/Es
- Subtitle translation scenarios
- Ensuring the quality of sentence translation
- Ensuring consistent length between MT and SRC

Difficulty: TQ vs. LC.

SRC: It's the one wheel XR and if you don't know what a one wheel is.
MT1: Es ist das eine Rad XR und wenn Sie nicht wissen, was ein Rad ist.
MT2: Es ist das One Wheel XR und wenn Sie nicht wissen, was ein One Wheel ist.

SRC: It's basically a motorized one wheel skateboard that can go on
MT1: Es ist ein motorisiertes Rad-Skateboard, das weiter gehen kann
MT2: Es ist im Grunde ein motorisiertes Skateboard mit einem Rad, das weitergehen kann

SRC: This is my new toy.
MT1: Das ist mein neues.
MT2: Das ist mein neues Spielzeug.

Example takeaways: more diversity, less missing translation.
03 Isometric MT Architecture

- Translation model: AT / NAT
- LC strategy:
  - LCD (length-controlled decoding): length-token method, length-encoding method
  - LAB: length-aware beam
  - Re-rank: ensemble of MT models, MT model scoring

Isometric MT architecture 1: model augmentation
AT model augmentation:
- Low-resource model augmentation
- Shared decoder/embedding
- Multilingual model (en2de, de2en)
- Data diversification: sampling BT, FT + BT
- R-Drop & ensemble
Performance: BLEU +4 to +10.

Isometric MT architecture 2: LCD strategy
LCD: length-controlled decoding.
Length token: three-category tagging: short (ratio < 0.9), normal (0.9 to 1.1), long (> 1.1).
AT/NAT + LCD + LAB can satisfy both TQ and LC.
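The three-category length-token method can be sketched as a data-tagging step. The tag names (`<short>`, `<normal>`, `<long>`) are invented here for illustration; the 0.9/1.1 thresholds are the ones on the slide.

```python
def length_tag(src: str, tgt: str, lower=0.9, upper=1.1) -> str:
    """Three-category tagging: classify a training pair by the
    target/source character-length ratio (thresholds from the slide)."""
    ratio = len(tgt) / len(src)
    if ratio < lower:
        return "<short>"
    if ratio > upper:
        return "<long>"
    return "<normal>"

def tag_source(src: str, tgt: str) -> str:
    """Prepend the length token to the source sentence. A model trained
    on tagged data learns to correlate the tag with output length, so
    prepending <normal> at inference time biases decoding toward an
    isometric translation."""
    return f"{length_tag(src, tgt)} {src}"
```

This is the length-token half of LCD; the length-encoding method instead injects the remaining-length budget into the decoder's position embeddings.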
04 Isometric MT Applications

Real-world experience: subtitles without isometric MT vs. with isometric MT, on the Huawei Translate platform.

References
- Li, Zongyao, et al. "HW-TSC's Participation in the IWSLT 2022 Isometric Spoken Language Translation." Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022). 2022.
- Wang, Minghan, et al. "HI-CMLM: Improve CMLM with Hybrid Decoder Input." Proceedings of the 14th International Conference on Natural Language Generation. 2021.
- Lakew, Surafel Melaku, Mattia Di Gangi, and Marcello Federico. "Controlling the Output Length of Neural Machine Translation." arXiv preprint arXiv:1910.10408 (2019).
- Takase, Sho, and Naoaki Okazaki. "Positional Encoding to Control Output Sequence Length." arXiv preprint arXiv:1904.07418 (2019).
- Ghazvininejad, Marjan, et al. "Mask-Predict: Parallel Decoding of Conditional Masked Language Models." arXiv preprint arXiv:1904.09324 (2019).

Thank you for watching.