JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE) ›› 2026, Vol. 61 ›› Issue (5): 79-89.doi: 10.6040/j.issn.1671-9352.5.2025.005

Previous Articles     Next Articles

Application of RIME-Transformer model in complex time series prediction problems

SUN Xinyi1, ZHENG Tingting1,2*, SUN Liwen1   

  1. 1. School of Mathematical Sciences, Anhui University, Hefei 230601, Anhui, China;
    2. Center for University Mathematics Teaching, Anhui University, Hefei 23061, Anhui, China
  • Published:2026-05-15

Abstract: To address the shortcomings of the traditional Transformer in long-term sequence modeling and computational efficiency, an improved Transformer model is proposed. This model first introduces a multi-scale convolutional structure in the feature extraction stage. Parallel convolution kernels capture both short-term fluctuations and long-term trends at different scales, thereby enhancing the representation of multi-level temporal patterns. Subsequently, the model employs learnable positional encoding instead of fixed sinusoidal encoding to better address the challenges posed by non-stationary data and irregular time intervals. During global dependency modeling, the improved encoder leverages a multi-head self-attention mechanism to establish feature interactions across time steps and dynamically assign moment weights to focus on key segments, effectively reducing the computational complexity of long-term sequence modeling. Furthermore, the model incorporates the rime optimization algorithm RIME for efficient search and optimization in a high-dimensional hyperparameter space, thereby improving the models convergence speed and generalization ability. Experiments on three real-world complex datasets demonstrate that the RIME-Transformer outperforms mainstream methods across multiple metrics. These results validate the effectiveness and superiority of the proposed model for complex time series prediction tasks.

Key words: Transformer, rime optimization algorithm, multi-scale feature encoding, learnable positional encoding, attention pooling

CLC Number: 

  • TP183
[1] LIM B, ZOHREN S. Time-series forecasting with deep learning: a survey[J]. Philosophical Transactions of the Royal Society A, 2021, 379(2194):20200209.
[2] TORRES J F, HADJOUT D, SEBAA A, et al. Deep learning for time series forecasting: a survey[J]. Big Data, 2021, 9(1):3-21.
[3] MASINI R P, MEDEIROS M C, MENDES E F. Machine learning advances for time series forecasting[J]. Journal of Economic Surveys, 2023, 37(1):76-111.
[4] WANG Zihan, KONG Fanheng, FENG Shi, et al. Is mamba effective for time series forecasting[J]. Neurocomputing, 2025, 619:129178.
[5] BHOGADE V, NITHYA B. Time series forecasting using Transformer neural network[J]. International Journal of Computers and Applications, 2024, 46(10):880-888.
[6] KIM D K, KIM K. A convolutional Transformer model for multivariate time series prediction[J]. IEEE Access, 2022, 10:101319-101329.
[7] KHAN S, NASEER M, HAYAT M, et al. Transformers in vision: a survey[J]. ACM Computing Surveys(CSUR), 2022, 54(10s):1-41.
[8] SUNKI A, SATYAKUMAR C, NARAYANA G S, et al. Time series forecasting of stock market using ARIMA, LSTM and FB prophet[C] //MATEC Web of Conferences.[S.l] : EDP Sciences, 2024, 392:01163.
[9] TOKGÖZ A, ÜNAL G. A RNN based time series approach for forecasting Turkish electricity load[C] //2018 26th Signal Processing and Communications Applications Conference(SIU). Izmir: IEEE, 2018:1-4.
[10] SAGHEER A, KOTB M. Time series forecasting of petroleum production using deep LSTM recurrent networks[J]. Neurocomputing, 2019, 323:203-213.
[11] KIM J, MOON N. BiLSTM model based on multivariate time series data in multiple field for forecasting trading area[J]. Journal of Ambient Intelligence and Humanized Computing, 2020,11(2):599-608.
[12] VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[J]. Advances in Neural Information Processing Systems, 2017, 30:5998-6008.
[13] ZHOU Haoyi, ZHANG Shanghang, PENG Jieqi, et al. Informer: beyond efficient Transformer for long sequence time-series forecasting[C] //Proceedings of the AAAI Conference on Artificial Intelligence. Philadelphia: AAAI, 2021, 35(12):11106-11115.
[14] WU Haixu, XU Jiehui, WANG Jianmin, et al. Autoformer: decomposition Transformers with auto-correlation for long-term series forecasting[J]. Advances in Neural Information Processing Systems, 2021, 34:22419-22430.
[15] AHAMED M A, CHENG Qiang. Timemachine: a time series is worth 4 mambas for long-term forecasting[C] //ECAI 2024:27th European Conference on Artificial Intelligence. Santiago de Compostela: IOS Press, 2024, 392:1688.
[16] LIN Shengsheng, LIN Weiwei, WU Wentai, et al. Petformer: long-term time series forecasting via placeholder-enhanced Transformer[J]. IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, 9(2):1189-1201
[17] SU Hang, ZHAO Dong, HEIDARI ALI ASGHAR, et al. RIME: a physics-based optimization[J]. Neurocomputing, 2023, 532:183-214.
[18] CAMBRIA E, WHITE B. Jumping NLP curves: a review of natural language processing research[J]. IEEE Computational Intelligence Magazine, 2014, 9(2):48-57.
[19] ZHONG Junliu, PUN Chiman. An end-to-end dense-inceptionnet for image copy-move forgery detection[J]. IEEE Transactions on Information Forensics and Security, 2019, 15:2134-2146.
[1] . Based on multi-scale feature fusion and improved attention for rusty bolt and nut detection [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2026, 61(1): 1-14.
[2] YAN Bencong, WANG Yingmei. A dual branch retinal vessel segmentation network based on guided filtering and Transformer [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2025, 60(7): 22-31.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!