《山东大学学报(理学版)》 ›› 2026, Vol. 61 ›› Issue (5): 79-89.doi: 10.6040/j.issn.1671-9352.5.2025.005
• • 上一篇
孙歆怡1,郑婷婷1,2*,孙丽雯1
SUN Xinyi1, ZHENG Tingting1,2*, SUN Liwen1
摘要: 为解决传统Transformer在长序列建模和计算效率的不足,本文提改进的Transformer模型,该模型在特征提取阶段引入多尺度卷积结构,通过并行卷积核在不同尺度上捕捉短期波动与长期趋势,增强对多层次时序模式的表征能力。随后,模型采用可学习的位置编码代替固定的正弦编码,更好地应对非平稳数据和不规则时间间隔问题。在全局依赖建模过程中,改进的编码器利用多头自注意力机制建立跨时间步的特征交互,动态分配时刻权重以聚焦关键片段,有效降低长序列建模的计算复杂度。Transformer模型还结合了霜冰优化算法(rime optimization algorithm, RIME)在高维超参数空间中进行高效搜索与优化,提升模型的收敛速度与泛化能力。实验在3个真实复杂数据集上进行,结果表明RIME-Transformer模型在多项指标上均优于主流方法研究结果,验证所提模型在复杂时序预测任务中的有效性与优越性。
中图分类号:
| [1] LIM B, ZOHREN S. Time-series forecasting with deep learning: a survey[J]. Philosophical Transactions of the Royal Society A, 2021, 379(2194):20200209. [2] TORRES J F, HADJOUT D, SEBAA A, et al. Deep learning for time series forecasting: a survey[J]. Big Data, 2021, 9(1):3-21. [3] MASINI R P, MEDEIROS M C, MENDES E F. Machine learning advances for time series forecasting[J]. Journal of Economic Surveys, 2023, 37(1):76-111. [4] WANG Zihan, KONG Fanheng, FENG Shi, et al. Is mamba effective for time series forecasting[J]. Neurocomputing, 2025, 619:129178. [5] BHOGADE V, NITHYA B. Time series forecasting using Transformer neural network[J]. International Journal of Computers and Applications, 2024, 46(10):880-888. [6] KIM D K, KIM K. A convolutional Transformer model for multivariate time series prediction[J]. IEEE Access, 2022, 10:101319-101329. [7] KHAN S, NASEER M, HAYAT M, et al. Transformers in vision: a survey[J]. ACM Computing Surveys(CSUR), 2022, 54(10s):1-41. [8] SUNKI A, SATYAKUMAR C, NARAYANA G S, et al. Time series forecasting of stock market using ARIMA, LSTM and FB prophet[C] //MATEC Web of Conferences.[S.l] : EDP Sciences, 2024, 392:01163. [9] TOKGÖZ A, ÜNAL G. A RNN based time series approach for forecasting Turkish electricity load[C] //2018 26th Signal Processing and Communications Applications Conference(SIU). Izmir: IEEE, 2018:1-4. [10] SAGHEER A, KOTB M. Time series forecasting of petroleum production using deep LSTM recurrent networks[J]. Neurocomputing, 2019, 323:203-213. [11] KIM J, MOON N. BiLSTM model based on multivariate time series data in multiple field for forecasting trading area[J]. Journal of Ambient Intelligence and Humanized Computing, 2020,11(2):599-608. [12] VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[J]. Advances in Neural Information Processing Systems, 2017, 30:5998-6008. [13] ZHOU Haoyi, ZHANG Shanghang, PENG Jieqi, et al. Informer: beyond efficient Transformer for long sequence time-series forecasting[C] //Proceedings of the AAAI Conference on Artificial Intelligence. Philadelphia: AAAI, 2021, 35(12):11106-11115. [14] WU Haixu, XU Jiehui, WANG Jianmin, et al. Autoformer: decomposition Transformers with auto-correlation for long-term series forecasting[J]. Advances in Neural Information Processing Systems, 2021, 34:22419-22430. [15] AHAMED M A, CHENG Qiang. Timemachine: a time series is worth 4 mambas for long-term forecasting[C] //ECAI 2024:27th European Conference on Artificial Intelligence. Santiago de Compostela: IOS Press, 2024, 392:1688. [16] LIN Shengsheng, LIN Weiwei, WU Wentai, et al. Petformer: long-term time series forecasting via placeholder-enhanced Transformer[J]. IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, 9(2):1189-1201 [17] SU Hang, ZHAO Dong, HEIDARI ALI ASGHAR, et al. RIME: a physics-based optimization[J]. Neurocomputing, 2023, 532:183-214. [18] CAMBRIA E, WHITE B. Jumping NLP curves: a review of natural language processing research[J]. IEEE Computational Intelligence Magazine, 2014, 9(2):48-57. [19] ZHONG Junliu, PUN Chiman. An end-to-end dense-inceptionnet for image copy-move forgery detection[J]. IEEE Transactions on Information Forensics and Security, 2019, 15:2134-2146. |
| [1] | 孙迪,郭义童,任超,范海峰,张传雷. 基于多尺度特征融合与改进注意力的锈蚀螺栓螺帽检测[J]. 《山东大学学报(理学版)》, 2026, 61(1): 1-14. |
| [2] | 闫本聪,王迎美. 基于引导滤波和Transformer的双分支视网膜血管分割网络[J]. 《山东大学学报(理学版)》, 2025, 60(7): 22-31. |
|
||