JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE) ›› 2014, Vol. 49 ›› Issue (09): 50-55.doi: 10.6040/j.issn.1671-9352.2.2014.436

Previous Articles     Next Articles

Optimization research of hot standy with Remus

ZOU De-qing1, XIANG Jun1, ZHANG Xiao-xu2, YUAN Bo-yang2, FENG Ming-lu2   

  1. 1. School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China;
    2. CECT-China COMM Communications Co., Ltd, Beijing 100022, China
  • Received:2014-06-24 Revised:2014-08-27 Online:2014-09-20 Published:2014-09-30

Abstract: Hot Standby is a reliable solution for virtualization system to increase it's high availability.To achieve state synchronization and data synchronization between host node and backup node, traditional hot standby technology constantly generate checkpoint, send real-time status updates of host virtual machine to backup virtual machine. However, traditional checkpoint technology will cause bandwidth delay which will affect the high availability of this technology when it encountered intensive applications during backup. In addition, during the process of backup heartbeat may lose effectiveness which led to false positives about virtual machine's state and infect system's normal operation. Therefore, two optimizations based on Remus hot standby mode were proposed. One is to compress the memory checkpoint, the other is to design a client-oriented hot standby mechanism. Through the comparison of bandwidth-delay before and after the improvement, checkpoint compression based on XOR-RLE algorithm effectively reduces the bandwidth delay caused by the intensive applications, and the client-oriented hot standby mechanism substantially reduces false positives during hot standby process.

Key words: incremental checkpoint compression, memory intensive, high availability, virtual machine, hot standby

CLC Number: 

  • TP309
[1] MELL P, GRANCE T. The NIST definition of cloud computing[J]. Communications of the ACM, 2010, 53(6):50-56.
[2] ARMBRUST M, FOX A, GRIFFITH R, et al. Above the clouds: a berkeley view of cloud computing [EB/OL]. [201-02-24].http://www.eecs.berkeley.edu/Pubs/TechRpts/2009/EECS-2009-28.pdf.
[3] 刘鹏程,陈榕.面向云计算的虚拟机动态迁移框架[J].计算机工程,2010, 36(5):37-39. LIU Pengcheng, CHEN Rong. Cloud computing-oriented live migration framework for virtual machine[J]. Computer Engineering, 2010, 36(5):37-39.
[4] HINES M, GOPALAN K. Post-copy based live virtual machine migration using adaptive pre-paging and dynamic self-ballooning[C]//Proceedings of 2009 ACMSIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE'09). New York: ACM Press, 2009:51-60.
[5] LUO Yingwei, ZHANG Binbin, WANG Xiaolin. Live and incremental whole-system migration of virtual machines using block-bitmap[C]//Proceedings of IEEE International Conference on Cluster Computing (Cluster'08). Washington: IEEE Computer Society, 2008:99-106.
[6] LAGAR-CAVILLA H, WHITNEY J, SCANNELL A, et al. SnowFlock: rapid virtual machine cloning for cloud computing[C]//Proceedings of the 4th ACM European Conference on Computer Systems (EuroSys'09). New York: ACM Press, 2009: 1-12.
[7] PARASHAR B, TANEJA G. Reliability and profit evaluation of a PLC hot standby system based on a Master-Slave concept and two types of repair facilities [J]. IEEE Transactions on Reliability, 2007, 56(3): 534-539.
[8] Warrier Chandra, XU Yingchun, Saxena Narothum. Hot standby protocol for wireless devices: US, 6795705[P]. 2004-09-21.
[9] 史文路,胡平.双机热备份系统的研究与改进[J].微处理机,2008, 29(3):180-182. SHI Wenlu, HU Ping. The research and improvement in duplex Hot-Backup system [J]. Microprocessors, 2008, 29(3):180-182.
[10] CULLY B, LEFEBVRE G, MEYER D, et al. Remus: high availability via asynchronous virtual machine replication[C]//Proceedings of the 5th USENIX Symposium on Networked Systems Design and Implementation (NSDI'08).
[S.l.]:[s.n.]2008: 161-174.
[11] MINHAS U F, RAJAGOPALAN S, CULLY B, et al. Remusdb: transparent high availability for database systems [J]. VLDB Journal, 2013, 22(SI): 29-45.
[12] 潘晓东.基于内存压缩的虚拟机实时迁移机制研究[D].武汉:华中科技大学,2009. PAN Xiaodong. A study of live virtual machine migration mechanism based on compression [D]. Wuhan: Huazhong University of Science and Technology, 2009.
[13] GEROFI B, VASS Z, ISHIKAWA Y. Utilizing memory content similarity for improving the performance of replicated virtual machines[C]//Proceedings of the 4th IEEE International Conference on Utility and Cloud Computing. Washington: IEEE Computer Society, 2011:73-80.
[14] DENG Li, JIN Hai, WU Song, et al. Fast saving and restoring virtual machines with page compression[C]// Proceedings of 2011 International Conference on Cloud and Service Computing (CSC). Washington: IEEE Computer Society, 2011: 150-157.
[15] VALLEE G, NAUGHTON T, ONG H, et al. Checkpoint/restart of virtual machines based on Xen [C]//Proceedings of the High Availability and Performance Computing Workshop (HAPCW'06).[S.l.]:[s.n.],2006: 1-6.
[16] ZHU JUN, DONG Wei, JIANG Zhefu, et al. Improving the performance of hypervisor-based fault tolerance[C]//Proceedings of 2010 IEEE International Symposium on Parallel Distributed Processing. Washington: IEEE Computer Society, 2010: 1-10.
[17] PARK E, EGGER B, LEE J. Fast and space-efficient virtual machine check pointing[C]//Proceedings of the 7th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE'11). New York: ACM Press, 2011:75-86.
[1] ZHAO Dan-dan, CHEN Xing-shu, JIN Xin. A study on security enhancement technology for KVM Hypervisor [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(3): 38-43.
[2] JI Xiang-min, ZHAO Bo, XIANG Shuang, XIA Zhong-lin. Formally analyzing VMM dynamic measurement based on extended LS2 [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(09): 1-8.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!