簡易檢索 / 詳目顯示

研究生: 肯莫茲
Convolbo, WendKuuni-Moise
論文名稱: 雲端資源於成本及效能要求下之排程技術
Scheduling Techniques on Cloud Resources Under Cost and Performance Requirements
指導教授: 鍾葉青
Chung, Yeh-Ching
口試委員: 李哲榮
LEE, CHE-RUNG
周志遠
CHOU, JERRY
賴冠州
Lai, Kuan-Chou
許慶賢
Hsu, Ching-Hsien
學位類別: 博士
Doctor
系所名稱: 電機資訊學院 - 資訊系統與應用研究所
Institute of Information Systems and Applications
論文出版年: 2017
畢業學年度: 105
語文別: 英文
論文頁數: 134
中文關鍵詞: 雲端工作排程啟發式虛擬機器批次處理任務虛擬叢集部署競價型執行個體DAG
外文關鍵詞: cloud, job scheduling, heuristic, virtualmachines, batchjobs, virtualcluster, migration, spotinstance, DAG
相關次數: 點閱:1下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 基於運算可以被交付到網際網路並可按現行方式收費這樣的理念,雲端運算已經成為了一個突出的運算典範。
    藉由虛擬化技術,雲端給予我們一種錯覺,彷彿在不同組態和成本下也有取之不盡的資源可以使用。正因如此,雲端上的資源管理就成為相當關鍵的議題。
    雲端設施的效率相當倚賴虛擬機器(Virtual Machine)如何配置給應用程式,
    以及虛擬機器如何映射到實體機器(Physical Machine)上。由於採用不同的策略進行資源管控對於用戶任務的效能、成本以及資源的利用都有極大程度的影響。
    很快地,用戶對於有效率的排程要求包含了成本意識以及服務層級協議(Service Level Agreement)的滿足。然而,雲端運算中的成本模型相當複雜,牽涉了資源本身的能力、租賃時間以及資源獲取的模式。
    除此之外,近來大數據的出現也促進了大規模數據分析的發展,這些應用通常跨越了地理上分散的數據中心並且具有廣泛地處理要求。在這樣的情形下,雲端用戶經常提起的問題是:如何找到最有效益的方式來使用運算資源並且保證目標函式對於工作負載的執行。有鑑於此,我們考慮的問題是設計一個關於資源排程的技術,這個技術是在效能限制下對於執行成本的最小化。
    在本篇論文當中,我們將針對如何有效管理資源以及規劃應用程式的執行, 以期最小化整體計算成本並且能保證滿足效能需求,並且提出新穎的排程技術和演算法。
    本篇論文的主要目的是在雲端計算的範疇內提出一具成本意識的排程策略,這樣的策略可以涵蓋多種不同的應用程式,其中包括了高效計算、資料分析以及平行的批次作業處理。
    至此,我們的方法是去探索資源的類型,而這包括了利用拍賣資源來降低在特定用戶限制下的執行成本。除此之外,我們也調查了地理分佈式數據中心的相關問題。
    我們提出的策略所作出的貢獻可分成三個方面:
    清楚地了解雲端資源管理在成本和效能之間的權衡、利用資源租賃模式來使用基於拍賣的雲端資源,最後,我們對比了單一數據中心的排程和最近表現出不同排程機制的地理分佈式要求。


    Cloud computing has emerged to become a prominent computing paradigm based on the idea
    that computation can be delivered over the Internet and be charged at an as-you-go
    basis. Through virtualization techniques, the Cloud offers an illusion of limitless
    resources with different configurations and costs. As a result, managing cloud resources
    has become a critical issue. The efficiency of the whole cloud facilities strongly relies
    on how the Virtual Machines (VM) are allocated to the applications and how VMs are mapped
    to the Physical Machine (PM). Different resources management strategies can largely affect
    the performance of the user's job, the cost, and the resource utilization. Hence,
    efficient job scheduling in the user perspective has swift to include cost-awareness and
    the satisfaction of the Service Level Agreement (SLA). However, the cost in cloud computing
    is a complex model in which involve the resource capacity, the leasing time and the
    resource acquisition mode. In addition, the recent advent of Big Data has contributed
    to the development of large scale data analytic applications which often span
    geographically dispersed data centers and have a wide range of processing requirements.
    A problem usually raised by cloud users in this situation, is to find the most cost
    effective computing resources to guarantee the objective functions of their workloads
    execution. Hence, we consider the problem of designing resource scheduling techniques
    to minimize the execution costs under performance constraints.
    In this thesis, we present novel scheduling techniques and algorithms to efficiently
    manage the resource and plan the execution of application jobs so as to minimize the
    overall computation cost and guarantee the performance requirement. The main objective
    of this thesis is therefore to provide cost-aware scheduling strategies in cloud
    computing for various types of applications including High Performance Computing,
    data analytics and Parallel batch jobs. To this end, our approach is to explore the
    resource types including auction based resources to leverage the execution cost under
    specified user constrains. In addition, we investigate the scheduling problem in
    geo-distributed data centers. Contributions in our strategies are three folds:
    Ensure a clear understanding of the tradeoff between cost and performance in
    Cloud resource management. Exploit the resource leasing model to leverage the
    auction-based cloud resources. Finally, we show the contrast with single data center
    scheduling with the recent geo-distributed requirement which exhibits different
    scheduling mechanisms.

    􀴡􀚤 感謝. . . . . . . . . . . . . . . . . . . . . . . . .v Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . . .vii Abstract . . . . . . . . . . . . . . . . . . . . . . . . .xiii 1 Introduction. . . . . . . . . . . . . . . . . . . . . . . . . 1 1.1 Background and context . . . . . . . . . . . . . . . . . . . . . . . . . 2 1.2 Research Problems and Objectives . . . . . . . . . . . . . . . . . . . . 4 1.3 Approach Overview and Significance . . . . . . . . . . . . . . . . . . 6 2 Background. . . . . . . . . . . . . . . . . . . . . . . . . 9 2.1 Scheduling in Cloud Environment . . . . . . . . . . . . . . . . . . . . 9 2.2 Taxonomy and Survey on Job Scheduling in Cloud . . . . . . . . . . . 11 2.2.1 Classification Parameters . . . . . . . . . . . . . . . . . . . . . 11 2.2.2 Application Model . . . . . . . . . . . . . . . . . . . . . . . . 12 2.2.3 Resource Model . . . . . . . . . . . . . . . . . . . . . . . . . 13 2.2.4 Scheduling Level . . . . . . . . . . . . . . . . . . . . . . . . . 13 2.3 User-centric Scheduling Techniques . . . . . . . . . . . . . . . . . . . 14 2.3.1 QoS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15 2.3.2 Cost Awareness . . . . . . . . . . . . . . . . . . . . . . . . . . 15 2.3.3 Execution Time . . . . . . . . . . . . . . . . . . . . . . . . . . 16 2.3.4 Performance-driven . . . . . . . . . . . . . . . . . . . . . . . . 17 2.4 Cloud Provider-centric Scheduling . . . . . . . . . . . . . . . . . . . . 17 2.4.1 Power-awareness Job Scheduling . . . . . . . . . . . . . . . . 17 2.4.2 Load Balance-awareness Job Scheduling . . . . . . . . . . . . 18 2.4.3 The Resource Reutilization . . . . . . . . . . . . . . . . . . . . 18 2.5 Trend . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19 2.5.1 Workflows Scheduling . . . . . . . . . . . . . . . . . . . . . . 19 2.5.2 Bidding on Spot Instances . . . . . . . . . . . . . . . . . . . . 19 2.6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 3 Minimizing the Execution Cost on Cloud Resources Through DAG Scheduling Algorithms . . . . . . . . . . . . . . . . . . . . . . . . .23 3.1 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 3.2 Backgroud and Motivation . . . . . . . . . . . . . . . . . . . . . . . . 24 3.3 Problem Definition . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26 3.3.1 System Model & Problem Description . . . . . . . . . . . . . . 26 3.3.2 DAG Job . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29 3.3.3 Scheduling Examples . . . . . . . . . . . . . . . . . . . . . . . 31 3.4 Cost Aware Scheduling Algorithms . . . . . . . . . . . . . . . . . . . 33 3.4.1 Cost Optimal Algorithm . . . . . . . . . . . . . . . . . . . . . 34 3.4.2 Heuristic Algorithm . . . . . . . . . . . . . . . . . . . . . . . 35 3.4.3 Dependency-Aware Task Scheduling . . . . . . . . . . . . . . 40 3.4.4 Complexity Analysis . . . . . . . . . . . . . . . . . . . . . . . 41 3.5 Experimental Setup . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44 3.5.1 Random Graph Generator . . . . . . . . . . . . . . . . . . . . 44 3.5.2 Task Execution Time Model . . . . . . . . . . . . . . . . . . . 45 3.5.3 Comparison Algorithms . . . . . . . . . . . . . . . . . . . . . 46 3.6 Evaluation and Analysis . . . . . . . . . . . . . . . . . . . . . . . . . 47 3.6.1 Overall Comparison . . . . . . . . . . . . . . . . . . . . . . . 47 3.6.2 In-depth Analysis . . . . . . . . . . . . . . . . . . . . . . . . . 49 3.6.3 Analysis on Types of DAGs . . . . . . . . . . . . . . . . . . . 52 3.7 Related Works . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54 3.8 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57 4 Towards the Optimization of Data locality-aware Job Scheduling in Geodistributed Data Centers . . . . . . . . . . . . . . . . . . . . . . . . .59 4.1 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59 4.2 Background and System Model . . . . . . . . . . . . . . . . . . . . . . 62 4.2.1 Scheduling in Geo-Distributed Systems . . . . . . . . . . . . . 62 4.2.2 System Model . . . . . . . . . . . . . . . . . . . . . . . . . . 64 4.2.3 Motivational Example . . . . . . . . . . . . . . . . . . . . . . 65 4.3 Problem Definition . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66 4.3.1 Problem input . . . . . . . . . . . . . . . . . . . . . . . . . . . 68 4.3.2 LP formulation . . . . . . . . . . . . . . . . . . . . . . . . . . 69 4.4 Data Locality-aware Scheduling . . . . . . . . . . . . . . . . . . . . . 71 4.4.1 Schedule Optimization with the LP Solver . . . . . . . . . . . . 71 4.4.2 GeoDis Scheduler . . . . . . . . . . . . . . . . . . . . . . . . 72 4.5 Performance Evaluation and Analysis . . . . . . . . . . . . . . . . . . 75 4.5.1 Experimental Setup . . . . . . . . . . . . . . . . . . . . . . . . 75 4.5.2 Performance Evaluation and Analysis . . . . . . . . . . . . . . 77 4.6 Related Works . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83 4.7 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87 5 Optimal Static Bidding Strategy for Spot Instances with Deadline Constraint 89 5.1 Background and Motivation . . . . . . . . . . . . . . . . . . . . . . . 89 5.2 Problem Formulation . . . . . . . . . . . . . . . . . . . . . . . . . . . 92 5.2.1 Spot Instance Characteristics . . . . . . . . . . . . . . . . . . . 92 5.2.2 Spot Price Model . . . . . . . . . . . . . . . . . . . . . . . . . 94 5.2.3 Problem Formulation . . . . . . . . . . . . . . . . . . . . . . . 95 5.3 Bidding Strategy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97 5.3.1 Recursion Equation for Monetary Cost . . . . . . . . . . . . . 98 5.3.2 Dynamic Programming Algorithm . . . . . . . . . . . . . . . . 100 5.4 Evaluations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100 5.4.1 Experimental Setup . . . . . . . . . . . . . . . . . . . . . . . . 102 5.4.2 Monetary Cost analysis . . . . . . . . . . . . . . . . . . . . . . 103 5.4.3 Completion Comparison . . . . . . . . . . . . . . . . . . . . . 106 5.4.4 Spot and On-Demand Comparison . . . . . . . . . . . . . . . . 108 5.4.5 Restart & Checkpoint overhead Analysis . . . . . . . . . . . . 108 5.4.6 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110 5.5 Related Works . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111 5.6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113 6 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . .115 6.1 Executive Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115 6.2 Virtual Cluster Acquisition for DAG Jobs . . . . . . . . . . . . . . . . 116 6.3 Scheduling Data-aware Large-Scale Applications on Geo-Distributed Data Centers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117 6.4 Leveraging Auction-based Cloud Resources . . . . . . . . . . . . . . . 117 6.5 Future Directions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119

    References
    1. M. Armbrust, A. Fox, R. Griffith, A. D. Joseph, R. Katz, A. Konwinski, G. Lee,
    D. Patterson, A. Rabkin, I. Stoica, and M. Zaharia, “A view of cloud computing,”
    Commun. ACM, vol. 53, pp. 50–58, Apr. 2010.
    2. S. Wu, H. Chen, S. Di, B. Zhou, Z. Xie, H. Jin, and X. Shi, “Synchronizationaware
    scheduling for virtual clusters in cloud,” IEEE Transactions on Parallel
    and Distributed Systems, vol. 26, pp. 2890–2902, Oct 2015.
    3. M. Armbrust, A. Fox, R. Griffith, A. D. Joseph, R. Katz, A. Konwinski, G. Lee,
    D. Patterson, A. Rabkin, I. Stoica, and M. Zaharia, “A view of cloud computing,”
    Commun. ACM, vol. 53, pp. 50–58, Apr. 2010.
    4. “AWS: Amazon Web Service,” 2006.
    5. “Microsoft Azure,” 2010.
    6. “Rackspace,” 1998.
    7. . Goiri, R. Beauchea, K. Le, T. D. Nguyen, M. E. Haque, J. Guitart, J. Torres, and
    R. Bianchini, “Greenslot: Scheduling energy consumption in green datacenters,”
    in 2011 International Conference for High Performance Computing, Networking,
    Storage and Analysis (SC), pp. 1–11, Nov 2011.
    8. Y. Liu, N. Bobroff, L. Fong, S. Seelam, and J. Delgado, “New metrics for scheduling
    jobs on cluster of virtual machines,” in 2011 IEEE International Symposium on
    Parallel and Distributed Processing Workshops and Phd Forum, pp. 1001–1008,
    May 2011.
    9. Y. Wang and W. Shi, “Budget-driven scheduling algorithms for batches of mapreduce
    jobs in heterogeneous clouds,” IEEE Transactions on Cloud Computing,
    vol. 2, pp. 306–319, July 2014.
    10. C. Zhang and H. De Sterck, “Cloudbatch: A batch job queuing system on clouds
    with hadoop and hbase,” in IEEE International Conference on Cloud Computing
    Technology and Science (CloudCom), pp. 368–375, Nov 2010.
    11. X. Liao, Z. Gao, W. Ji, and Y. Wang, “An enforcement of real time scheduling
    in spark streaming,” in Green Computing Conference and Sustainable Computing
    Conference (IGSC), 2015 Sixth International, pp. 1–6, Dec 2015.
    12. K. Wang, Z. Bian, and Q. Chen, “Millipedes: Distributed and set-based sub-task
    scheduler of computing engines running on yarn cluster,” in High Performance
    Computing and Communications (HPCC), 2015 IEEE 7th International Symposium
    on Cyberspace Safety and Security (CSS), 2015 IEEE 12th International Conferen
    on Embedded Software and Systems (ICESS), 2015 IEEE 17th International
    Conference on, pp. 1597–1602, Aug 2015.
    13. I. Menache, O. Shamir, and N. Jain, “On-demand, spot, or both: Dynamic resource
    allocation for executing batch jobs in the cloud,” in 11th International Conference
    on Autonomic Computing (ICAC 14), (Philadelphia, PA), pp. 177–187, USENIX
    Association, 2014.
    14. C. Reiss, A. Tumanov, G. R. Ganger, R. H. Katz, and M. A. Kozuch, “Heterogeneity
    and dynamicity of clouds at scale: Google trace analysis,” in Proceedings
    of the Third ACM Symposium on Cloud Computing, SoCC ’12, (New York, NY,
    USA), pp. 7:1–7:13, ACM, 2012.
    15. J. Dean and S. Ghemawat, “MapReduce: Simplified data processing on large clusters,”
    Commun. ACM, vol. 51, pp. 107–113, Jan. 2008.
    16. M. Isard, M. Budiu, Y. Yu, A. Birrell, and D. Fetterly, “Dryad: Distributed dataparallel
    programs from sequential building blocks,” in Proceedings of the 2007
    Eurosys Conference, (Lisbon, Portugal), Association for Computing Machinery,
    Inc., March 2007.
    17. W. Guo, K. Chen, Y. Wu, and W. Zheng, “Bidding for highly available services
    with low price in spot instance market,” in Proceedings of the 24th International
    Symposium on High-Performance Parallel and Distributed Computing, HPDC
    ’15, (New York, NY, USA), pp. 191–202, ACM, 2015.
    18. A. Elghirani, R. Subrata, and A. Y. Zomaya, “Intelligent scheduling and replication
    in datagrids: a synergistic approach,” in Seventh IEEE International Symposium
    on Cluster Computing and the Grid (CCGrid ’07), pp. 179–182, May 2007.
    19. M. A. Kozuch, M. P. Ryan, R. Gass, S. W. Schlosser, D. O’Hallaron, J. Cipar,
    E. Krevat, J. López, M. Stroucken, and G. R. Ganger, “Tashi: Location-aware
    cluster management,” in Proceedings of the 1st Workshop on Automated Control
    for Datacenters and Clouds, ACDC ’09, (New York, NY, USA), pp. 43–48, ACM,
    2009.
    20. S. Pandey, L. Wu, S. M. Guru, and R. Buyya, “A particle swarm optimizationbased
    heuristic for scheduling workflow applications in cloud computing environments,”
    in 2010 24th IEEE International Conference on Advanced Information
    Networking and Applications, pp. 400–407, April 2010.
    21. I. A. Moschakis and H. D. Karatza, “Performance and cost evaluation of gang
    scheduling in a cloud computing system with job migrations and starvation handling,”
    in 2011 IEEE Symposium on Computers and Communications (ISCC),
    pp. 418–423, June 2011.
    22. Y. Kessaci, N. Melab, and E. G. Talbi, “A pareto-based ga for scheduling hpc applications
    on distributed cloud infrastructures,” in 2011 International Conference
    on High Performance Computing Simulation, pp. 456–462, July 2011.
    23. L. F. Bittencourt and E. R. M. Madeira, “Hcoc: a cost optimization algorithm for
    workflow scheduling in hybrid clouds,” Journal of Internet Services and Applications,
    vol. 2, no. 3, pp. 207–227, 2011.
    24. M. A. Rodriguez and R. Buyya, “Deadline based resource provisioningand
    scheduling algorithm for scientific workflows on clouds,” IEEE Transactions on
    Cloud Computing, vol. 2, pp. 222–235, April 2014.
    25. R. N. Calheiros and R. Buyya, “Meeting deadlines of scientific workflows in public
    clouds with tasks replication,” IEEE Transactions on Parallel and Distributed
    Systems, vol. 25, pp. 1787–1796, July 2014.
    26. X. You, X. Xu, J. Wan, and C. Jiang, “Analysis and evaluation of the scheduling algorithms
    in virtual environment,” in 2009 International Conference on Embedded
    Software and Systems, pp. 291–296, May 2009.
    27. L. F. Bittencourt, E. R. M. Madeira, and N. L. S. D. Fonseca, “Scheduling in hybrid
    clouds,” IEEE Communications Magazine, vol. 50, no. 9, pp. 42–47, 2012.
    28. S. M. Parikh, “A survey on cloud computing resource allocation techniques,” in
    2013 Nirma University International Conference on Engineering (NUiCONE),
    pp. 1–5, Nov 2013.
    29. A. Ananth and K. C. Sekaran, “Game theoretic approaches for job scheduling in
    cloud computing: A survey,” in 2014 International Conference on Computer and
    Communication Technology (ICCCT), pp. 79–85, Sept 2014.
    30. N. Chopra and S. Singh, “Survey on scheduling in hybrid clouds,” in Fifth International
    Conference on Computing, Communications and Networking Technologies
    (ICCCNT), pp. 1–6, July 2014.
    31. S. B. Shaw and A. K. Singh, “A survey on scheduling and load balancing techniques
    in cloud computing environment,” in 2014 International Conference on
    Computer and Communication Technology (ICCCT), pp. 87–95, Sept 2014.
    32. Y. P. Dave, A. S. Shelat, D. S. Patel, and R. H. Jhaveri, “Various job scheduling
    algorithms in cloud computing: A survey,” in International Conference on
    Information Communication and Embedded Systems (ICICES2014), pp. 1–5, Feb
    2014.
    33. S. Verma and R. A. Satao, “A survey on the impact of economies of scale on scientific
    communities,” in 2015 International Conference on Advances in Computer
    Engineering and Applications, pp. 722–726, March 2015.
    34. A. Greenberg, J. Hamilton, D. A. Maltz, and P. Patel, “The cost of a cloud: Research
    problems in data center networks,” SIGCOMM Comput. Commun. Rev.,
    vol. 39, pp. 68–73, Dec. 2008.
    35. N. Sooezi, S. Abrishami, and M. Lotfian, “Scheduling data-driven workflows in
    multi-cloud environment,” in 2015 IEEE 7th International Conference on Cloud
    Computing Technology and Science (CloudCom), pp. 163–167, Nov 2015.
    36. B. Sharma, R. K. Thulasiram, P. Thulasiraman, and R. Buyya, “Clabacus: A
    risk-adjusted cloud resources pricing model using financial option theory,” IEEE
    Transactions on Cloud Computing, vol. 3, pp. 332–344, July 2015.
    37. R. Iakymchuk, J. Napper, and P. Bientinesi, “Improving high-performance computations
    on clouds through resource underutilization,” in Proceedings of the
    2011 ACM Symposium on Applied Computing, SAC ’11, (New York, NY, USA),
    pp. 119–126, ACM, 2011.
    38. R. C. Chiang and H. H. Huang, “Tracon: Interference-aware scheduling for dataintensive
    applications in virtualized environments,” in 2011 International Conference
    for High Performance Computing, Networking, Storage and Analysis (SC),
    pp. 1–12, Nov 2011.
    39. T. A. Henzinger, V. Singh, T. Wies, and D. Zufferey, “Scheduling large jobs by
    abstraction refinement,” in Proceedings of the Sixth Conference on Computer Systems,
    EuroSys ’11, (New York, NY, USA), pp. 329–342, ACM, 2011.
    40. J. W. Lin, C. H. Chen, and J. M. Chang, “Qos-aware data replication for dataintensive
    applications in cloud computing systems,” IEEE Transactions on Cloud
    Computing, vol. 1, pp. 101–115, Jan 2013.
    41. D. Kliazovich, P. Bouvry, and S. U. Khan, “Dens: data center energy-efficient
    network-aware scheduling,” Cluster Computing, vol. 16, no. 1, pp. 65–75, 2013.
    42. P. Fan, J. Wang, Z. Zheng, and M. R. Lyu, “Toward optimal deployment of
    communication-intensive cloud applications,” in 2011 IEEE 4th International
    Conference on Cloud Computing, pp. 460–467, July 2011.
    43. D. Millot and C. Parrot, “Scheduling on unspecified heterogeneous distributed
    resources,” in 2011 IEEE International Symposium on Parallel and Distributed
    Processing Workshops and Phd Forum, pp. 45–56, May 2011.
    44. Z. Li, L. Wang, S. Ren, and G. Quan, “Temperature, power, and makespan aware
    dependent task scheduling for data centers,” in 2011 IEEE/ACM International
    Conference on Green Computing and Communications, pp. 22–27, Aug 2011.
    45. Y. Song, H. Wang, Y. Li, B. Feng, and Y. Sun, “Multi-tiered on-demand resource
    scheduling for vm-based data center,” in Proceedings of the 2009 9th IEEE/
    ACM International Symposium on Cluster Computing and the Grid, CCGRID ’09,
    (Washington, DC, USA), pp. 148–155, IEEE Computer Society, 2009.
    46. P. Wang, Y. Qi, X. Liu, Y. Chen, and X. Zhong, “Power management in heterogeneous
    multi-tier web clusters,” in 2010 39th International Conference on Parallel
    Processing, pp. 385–394, Sept 2010.
    47. A. Beloglazov and R. Buyya, “Energy efficient resource management in virtualized
    cloud data centers,” in Proceedings of the 2010 10th IEEE/ACM International
    Conference on Cluster, Cloud and Grid Computing, CCGRID ’10, (Washington,
    DC, USA), pp. 826–831, IEEE Computer Society, 2010.
    48. N. Regola and J. C. Ducom, “Recommendations for virtualization technologies in
    high performance computing,” in 2010 IEEE Second International Conference on
    Cloud Computing Technology and Science, pp. 409–416, Nov 2010.
    49. C. C. T. Mark, D. Niyato, and T. Chen-Khong, “Evolutionary optimal virtual machine
    placement and demand forecaster for cloud computing,” in 2011 IEEE International
    Conference on Advanced Information Networking and Applications,
    pp. 348–355, March 2011.
    50. S. Zaman and D. Grosu, “Efficient bidding for virtual machine instances in
    clouds,” in 2011 IEEE 4th International Conference on Cloud Computing, pp. 41–
    48, July 2011.
    51. S. Genaud and J. Gossa, “Cost-wait Trade-offs in Client-side Resource Provisioning
    with Elastic Clouds,” in 4th IEEE International Conference on Cloud Computing
    (CLOUD 2011), (Washington, United States), July 2011.
    52. B. Saovapakhiran, G. Michailidis, and M. Devetsikiotis, “Aggregated-dag
    scheduling for job flow maximization in heterogeneous cloud computing,” in 2011
    IEEE Global Telecommunications Conference - GLOBECOM 2011, pp. 1–6, Dec
    2011.
    53. V. A. Patil and V. Chaudhary, “Rack aware scheduling in hpc data centers: An
    energy conservation strategy,” in 2011 IEEE International Symposium on Parallel
    and Distributed Processing Workshops and Phd Forum, pp. 814–821, May 2011.
    54. F. Diaz, E. A. Doumith, and M. Gagnaire, “Impact of resource over-reservation
    (ror) and dropping policies on cloud resource allocation,” in 2011 IEEE Third International
    Conference on Cloud Computing Technology and Science, pp. 470–
    476, Nov 2011.
    55. Z. Wang and Y. Q. Zhang, “Energy-efficient task scheduling algorithms with human
    intelligence based task shuffling and task relocation,” in 2011 IEEE/ACM
    International Conference on Green Computing and Communications, pp. 38–43,
    Aug 2011.
    56. M. Rahman, X. Li, and H. Palit, “Hybrid heuristic for scheduling data analytics
    workflow applications in hybrid cloud environment,” in 2011 IEEE International
    Symposium on Parallel and Distributed Processing Workshops and Phd Forum,
    pp. 966–974, May 2011.
    57. X. Chen, Q. Liu, and J. Lai, “A new power-aware scheduling algorithm for distributed
    system,” in Green Computing and Communications (GreenCom), 2010
    IEEE/ACM Int’l Conference on Int’l Conference on Cyber, Physical and Social
    Computing (CPSCom), pp. 338–343, Dec 2010.
    58. B. Li, J. Li, J. Huai, T. Wo, Q. Li, and L. Zhong, “Enacloud: An energy-saving
    application live placement approach for cloud computing environments,” in 2009
    IEEE International Conference on Cloud Computing, pp. 17–24, Sept 2009.
    59. N. Ma, Y. Xia, and V. K. Prasanna, “Exploring weak dependencies in dag scheduling,”
    in 2011 IEEE International Symposium on Parallel and Distributed Processing
    Workshops and Phd Forum, pp. 591–598, May 2011.
    60. F. Pinel, J. E. Pecero, S. U. Khan, and P. Bouvry, “Energy-efficient scheduling
    on milliclusters with performance constraints,” in 2011 IEEE/ACM International
    Conference on Green Computing and Communications, pp. 44–49, Aug 2011.
    61. X. Zhu, C. He, Y. Bi, and D. Qiu, “Towards adaptive power-aware scheduling for
    real-time tasks on dvs-enabled heterogeneous clusters,” in Green Computing and
    Communications (GreenCom), 2010 IEEE/ACM Int’l Conference on Int’l Conference
    on Cyber, Physical and Social Computing (CPSCom), pp. 117–124, Dec
    2010.
    62. J. Sun, C. Huang, and J. Dong, “Research on power-aware scheduling for highperformance
    computing system,” in 2011 IEEE/ACM International Conference on
    Green Computing and Communications, pp. 75–78, Aug 2011.
    63. N. Kim, J. Cho, and E. Seo, “Energy-based accounting and scheduling of virtual
    machines in a cloud system,” in 2011 IEEE/ACM International Conference on
    Green Computing and Communications, pp. 176–181, Aug 2011.
    64. S. Tang, J. Yuan, C. Wang, and X. Y. Li, “A framework for amazon ec2 bidding
    strategy under sla constraints,” IEEE Transactions on Parallel and Distributed
    Systems, vol. 25, pp. 2–11, Jan 2014.
    65. S. Yi, A. Andrzejak, and D. Kondo, “Monetary cost-aware checkpointing and migration
    on amazon cloud spot instances,” Services Computing, IEEE Transactions
    on, vol. 5, pp. 512–524, Fourth 2012.
    66. S. Yi, D. Kondo, and A. Andrzejak, “Reducing costs of spot instances via checkpointing
    in the amazon elastic compute cloud,” in Cloud Computing (CLOUD),
    2010 IEEE 3rd International Conference on, pp. 236–243, July 2010.
    67. A. Marathe, R. Harris, D. Lowenthal, B. R. de Supinski, B. Rountree, and
    M. Schulz, “Exploiting redundancy for cost-effective, time-constrained execution
    of hpc applications on amazon ec2,” in Proceedings of the 23rd International Symposium
    on High-performance Parallel and Distributed Computing, HPDC ’14,
    (New York, NY, USA), pp. 279–290, ACM, 2014.
    68. W. Voorsluys and R. Buyya, “Reliable provisioning of spot instances for computeintensive
    applications,” in Proceedings of the 2012 IEEE 26th International
    Conference on Advanced Information Networking and Applications, AINA ’12,
    (Washington, DC, USA), pp. 542–549, IEEE Computer Society, 2012.
    69. S. Subramanya, T. Guo, P. Sharma, D. Irwin, and P. Shenoy, “Spoton: A batch
    computing service for the spot market,” in Proceedings of the Sixth ACM Symposium
    on Cloud Computing, SoCC ’15, (New York, NY, USA), pp. 329–341,
    ACM, 2015.
    70. O. Agmon Ben-Yehuda, M. Ben-Yehuda, A. Schuster, and D. Tsafrir, “Deconstructing
    amazon ec2 spot instance pricing,” in Cloud Computing Technology and
    Science (CloudCom), 2011 IEEE Third International Conference on, pp. 304–311, Nov 2011.
    71. Y. Song, M. Zafer, and K.-W. Lee, “Optimal bidding in spot instance market,” in
    INFOCOM, 2012 Proceedings IEEE, pp. 190–198, March 2012.
    72. T. Hu, “Parallel sequencing and assembly line problems,” Operations Research,
    vol. 9, no. 6, pp. 841–848, 1961.
    73. Y. Yu, M. Isard, D. Fetterly, M. Budiu, U. Erlingsson, P. K. Gunda, and J. Currey,
    “DryadLINQ: A system for general-purpose distributed data-parallel computing
    using a high-level language,” in Proceedings of the USENIX Conference on Operating
    Systems Design and Implementation, pp. 1–14, 2008.
    74. M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker, and I. Stoica, “Spark:
    Cluster computing with working sets,” in Proceedings of the 2Nd USENIX Conference
    on Hot Topics in Cloud Computing, HotCloud’10, (Berkeley, CA, USA),
    pp. 10–10, USENIX Association, 2010.
    75. A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka, S. Anthony, H. Liu, P. Wyckoff,
    and R. Murthy, “Hive: A warehousing solution over a Map-Reduce framework,”
    Proc. VLDB Endow., vol. 2, pp. 1626–1629, Aug. 2009.
    76. C. Olston, B. Reed, U. Srivastava, R. Kumar, and A. Tomkins, “Pig latin: A notso-
    foreign language for data processing,” in ACM SIGMOD, pp. 1099–1110, 2008.
    77. R. Graham, L. Lawler, J. Lenstra, and A. Kan, “Optimization and approximation
    in deterministic sequencing and scheduling: A survey,” Annals of Discrete Math.,
    pp. 287–326, 1979.
    78. T. L. Casavant and J. G. Kuhl, “A taxonomy of scheduling in general-purpose
    distributed computing systems,” IEEE Transactions on Software Engineering,
    vol. 14, pp. 141–154, Feb. 1988.
    79. T. L. Adam, K. M. Chandy, and J. R. Dickson, “A comparison of list schedules for
    parallel processing systems,” Commun. ACM, vol. 17, pp. 685–690, Dec. 1974.
    80. W. H. Yu, Lu Decomposition on a Multiprocessing System with Communications
    Delay. PhD thesis, University of California, Berkeley, 1984. AAI8427141.
    81. B. Shirazi, M. Wang, and G. Pathak, “Analysis and evaluation of heuristic methods
    for static task scheduling,” Journal of Parallel and Distributed Computing, vol. 10,
    pp. 222–2232, Oct. 1990.
    82. T. Yang and A. Gerasoulis, “DSC: scheduling parallel tasks on an unbounded number
    of processors,” IEEE Transactions on Parallel and Distributed Systems, vol. 5,
    pp. 951–967, Sep 1994.
    83. Y.-K. Kwok and I. Ahmad, “Dynamic critical-path scheduling: an effective technique
    for allocating task graphs to multiprocessors,” IEEE Transactions on Parallel
    and Distributed Systems, vol. 7, pp. 506–521, May 1996.
    84. H. Chen, B. Shirazi, K. Kavi, and A. Hurson, “Static scheduling using linear clustering
    and task duplication,” in Parallel and Distributed Computing and Systems,
    pp. 285–290, 1993.
    85. J. Colin and P. Chretienne, “C.P.M. Scheduling with Small Computation Delays
    and Task Duplication,” Operations Research, pp. 680–684, 1991.
    86. R. Bajaj and D. Agrawal, “Improving scheduling of tasks in a heterogeneous
    environment,” IEEE Transactions on Parallel and Distributed Systems, vol. 15,
    pp. 107–118, Feb 2004.
    87. C.-H. Yang, P. Lee, and Y.-C. Chung, “Improving static task scheduling in heterogeneous
    and homogeneous computing systems,” in ICPP, pp. 45–45, Sept 2007.
    88. H. Topcuouglu, S. Hariri, and M.-y. Wu, “Performance-effective and lowcomplexity
    task scheduling for heterogeneous computing,” IEEE Transactions on
    Parallel and Distributed Systems, vol. 13, pp. 260–274, Mar. 2002.
    89. H. Arabnejad and J. Barbosa, “List scheduling algorithm for heterogeneous systems
    by an optimistic cost table,” IEEE Transactions on Parallel and Distributed
    Systems, vol. 25, pp. 682–694, March 2014.
    90. R. Raju, J. Amudhavel, M. Pavithra, S. Anuja, and B. Abinaya, “A heuristic fault
    tolerant ”MapReduce framework for minimizing makespan in hybrid cloud environment,”
    in International Conference on Green Computing Communication and
    Electrical Engineering (ICGCCEE), pp. 1–4, March 2014.
    91. L. Bittencourt and E. Madeira, “HCOC: a cost optimization algorithm for workflow
    scheduling in hybrid clouds,” Journal of Internet Services and Applications,
    vol. 2, no. 3, pp. 207–227, 2011.
    92. N. Fujimoto and K. Hagihara, “Near-optimal dynamic task scheduling of precedence
    constrained coarse-grained tasks onto a computational grid,” in Proceedings
    of International Conference on Parallel and Distributed Computing, pp. 80–87,
    2003.
    93. G. Juve, A. Chervenak, E. Deelman, S. Bharathi, G. Mehta, and K. Vahi, “Characterizing
    and profiling scientific workflows,” Future Gener. Comput. Syst., vol. 29,
    pp. 682–692, Mar. 2013.
    94. M. Mao and M. Humphrey, “Auto-scaling to minimize cost and meet application
    deadlines in cloud workflows,” in ACM/IEEE conference on Supercomputing,
    pp. 49:1–49:12, 2011.
    95. D. Cordeiro, G. Mounié, S. Perarnau, D. Trystram, J.-M. Vincent, and F. Wagner,
    “Random graph generation for scheduling simulations,” in Proceedings of the 3rd
    International ICST Conference on Simulation Tools and Techniques, pp. 60:1–
    60:10, 2010.
    96. L. Canon and E. Jeannot, “Evaluation and optimization of the robustness of dag
    schedules in heterogeneous environments,” IEEE Transactions on Parallel and
    Distributed Systems, vol. 21, pp. 532–546, April 2010.
    97. A. Radulescu and A. van Gemund, “Fast and effective task scheduling in heterogeneous
    systems,” in Heterogeneous Computing Workshop, 2000. (HCW 2000)
    Proceedings. 9th, pp. 229–238, 2000.
    98. M.-Y. Wu and D. Gajski, “Hypertool: a programming aid for message-passing
    systems,” IEEE Transactions on Parallel and Distributed Systems, vol. 1, pp. 330–
    343, Jul 1990.
    99. V. A. F. Almeida, I. M. M. Vasconcelos, J. Arabe, and D. Menasce, “Using random
    task graphs to investigate the potential benefits of heterogeneity in parallel
    systems,” in ACM/IEEE conference on Supercomputing, pp. 683–691, Nov 1992.
    100. A. Kvas, M. Ojstersek, and V. Zumer, “Evaluation of static program allocation
    schemes for macro data-flow computer,” in Proceedings of the 20th EUROMICRO
    Conference, pp. 573–580, Sep 1994.
    101. A. C. Zhou and B. He, “Transformation-based monetary cost optimizations for
    workflows in the cloud,” IEEE Transactions on Cloud Computing, vol. 2, no. 1,
    pp. 85–98, 2014.
    102. M. A. Rodriguez and R. Buyya, “Deadline based resource provisioningand
    scheduling algorithm for scientific workflows on clouds,” IEEE Transactions on
    Cloud Computing, vol. 2, no. 2, pp. 222–235, 2014.
    103. G. Chen, “Simplified particle swarm optimization algorithm based on particles
    classification,” in International Conference on Natural Computation (ICNC),
    vol. 5, pp. 2701–2705, Aug 2010.
    104. I. Ahmad and K. Yu-Kwong, “On exploiting task duplication in parallel program
    scheduling,” IEEE Transactions on Parallel and Distributed Systems, vol. 9,
    pp. 872–892, Sept. 1998.
    105. S. Darbha and D. P. Agrawal, “Optimal scheduling algorithm for distributedmemory
    machines,” IEEE Transactions on Parallel and Distributed Systems,
    vol. 9, pp. 87–95, Jan. 1998.
    106. J. Dean and S. Ghemawat, “Mapreduce: Simplified data processing on large clusters,”
    Commun. ACM, vol. 51, pp. 107–113, Jan. 2008.
    107. “Apache hadoop. http://hadoop.apache.org,” 2011.
    108. M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker, and I. Stoica, “Spark:
    Cluster computing with working sets,” in Proceedings of the 2nd USENIX Conference
    on Hot Topics in Cloud Computing, 2010.
    109. M. Isard, M. Budiu, Y. Yu, A. Birrell, and D. Fetterly, “Dryad: Distributed dataparallel
    programs from sequential building blocks,” in Proceedings of the 2007
    Eurosys Conference, (Lisbon, Portugal), Association for Computing Machinery,
    Inc., March 2007.
    110. A. Vulimiri, C. Curino, P. B. Godfrey, T. Jungblut, K. Karanasos, J. Padhye, and
    G. Varghese, “Wanalytics: Geo-distributed analytics for a data intensive world,” in
    Proceedings of the 2015 ACM SIGMOD International Conference on Management
    of Data, SIGMOD ’15, (New York, NY, USA), pp. 1087–1092, ACM, 2015.
    111. A. Vulimiri, C. Curino, P. B. Godfrey, T. Jungblut, J. Padhye, and G. Varghese,
    “Global analytics in the face of bandwidth and regulatory constraints,” in 12th
    USENIX Symposium on Networked Systems Design and Implementation (NSDI
    15), (Oakland, CA), pp. 323–336, USENIX Association, May 2015.
    112. W. Lin, Z. Qian, J. Xu, S. Yang, J. Zhou, and L. Zhou, “Streamscope: Continuous
    reliable distributed processing of big data streams,” in 13th USENIX Symposium
    on Networked Systems Design and Implementation (NSDI 16), (Santa Clara, CA),
    pp. 439–453, USENIX Association, Mar. 2016.
    113. “Google Compute Engine,” 2011.
    114. S. Venugopal and R. Buyya, “An scp-based heuristic approach for scheduling distributed
    data-intensive applications on global grids,” J. Parallel Distrib. Comput.,
    vol. 68, pp. 471–487, Apr. 2008.
    115. C.-C. Hung, L. Golubchik, and M. Yu, “Scheduling jobs across geo-distributed
    datacenters,” in Proceedings of the Sixth ACM Symposium on Cloud Computing,
    SoCC ’15, (New York, NY, USA), pp. 111–124, ACM, 2015.
    116. Z. Hu, B. Li, and J. Luo, “Flutter: Scheduling tasks closer to data across geodistributed
    datacenters,” in IEEE INFOCOM 2016 - The 35th Annual IEEE International
    Conference on Computer Communications, pp. 1–9, April 2016.
    117. K. Kloudas, M. Mamede, N. Preguiça, and R. Rodrigues, “Pixida: Optimizing data
    parallel jobs in wide-area data analytics,” Proc. VLDB Endow., vol. 9, pp. 72–83,
    Oct. 2015.
    118. Q. Pu, G. Ananthanarayanan, P. Bodik, S. Kandula, A. Akella, P. Bahl, and I. Stoica,
    “Low latency geo-distributed data analytics,” SIGCOMM Comput. Commun.
    Rev., vol. 45, pp. 421–434, Aug. 2015.
    119. M. Cardosa, C. Wang, A. Nangia, A. Chandra, and J. Weissman, “Exploring
    mapreduce efficiency with highly-distributed data,” in Proceedings of the Second
    International Workshop on MapReduce and Its Applications, MapReduce ’11,
    (New York, NY, USA), pp. 27–34, ACM, 2011.
    120. N. Garg, A. Kumar, and V. Pandit, “Order scheduling models: Hardness and algorithms,”
    in Proceedings of the 27th International Conference on Foundations
    of Software Technology and Theoretical Computer Science, FSTTCS’07, (Berlin,
    Heidelberg), pp. 96–107, Springer-Verlag, 2007.
    121. R. Tripathi, V. S, V. Tamarapalli, and D. Medhi, “Cost efficient design of fault
    tolerant geo-distributed data centers,” IEEE Transactions on Network and Service
    Management, vol. PP, no. 99, pp. 1–1, 2017.
    122. H. Herodotou, F. Dong, and S. Babu, “No one (cluster) size fits all: Automatic
    cluster sizing for data-intensive analytics,” in Proceedings of the 2Nd ACM Symposium
    on Cloud Computing, SOCC ’11, (New York, NY, USA), pp. 18:1–18:14,
    ACM, 2011.
    123. V. Jalaparti, H. Ballani, P. Costa, T. Karagiannis, and A. Rowstron, “Bridging the
    tenant-provider gap in cloud services,” in Proceedings of the Third ACM Symposium
    on Cloud Computing, SoCC ’12, (New York, NY, USA), pp. 10:1–10:14,
    ACM, 2012.
    124. V. Jalaparti, P. Bodik, I. Menache, S. Rao, K. Makarychev, and M. Caesar,
    “Network-aware scheduling for data-parallel jobs: Plan when you can,” SIGCOMM
    Comput. Commun. Rev., vol. 45, pp. 407–420, Aug. 2015.
    125. Y. Jin, Y. Gao, Z. Qian, M. Zhai, H. Peng, and S. Lu, “Workload-aware scheduling
    across geo-distributed data centers,” in 2016 IEEE Trustcom/BigDataSE/ISPA,
    pp. 1455–1462, Aug 2016.
    126. A. N. Toosi and R. Buyya, “A fuzzy logic-based controller for cost and energy
    efficient load balancing in geo-distributed data centers,” in 2015 IEEE/ACM 8th
    International Conference on Utility and Cloud Computing (UCC), pp. 186–194,
    Dec 2015.
    127. A. Makhorin, “Gnu linear programming kit, version 4.52„” 2012.
    128. V. H. Nguyen, N. H. Tuong, V. H. Tran, and N. Thoai, “An milp-based makespan
    minimization model for single-machine scheduling problem with splitable jobs
    and availability constraints,” in Computing, Management and Telecommunications
    (ComManTel), 2013 International Conference on, pp. 397–400, Jan 2013.
    129. J. H. Abawajy and M. M. Deris, “Data replication approach with consistency guarantee
    for data grid,” IEEE Transactions on Computers, vol. 63, pp. 2975–2987,
    Dec 2014.
    130. G. Ananthanarayanan, A. Ghodsi, S. Shenker, and I. Stoica, “Effective straggler
    mitigation: Attack of the clones,” in Presented as part of the 10th USENIX Symposium
    on Networked Systems Design and Implementation (NSDI 13), (Lombard,
    IL), pp. 185–198, USENIX, 2013.
    131. G. Ananthanarayanan, S. Kandula, A. Greenberg, I. Stoica, Y. Lu, B. Saha, and
    E. Harris, “Reining in the outliers in map-reduce clusters using mantri,” in Proceedings
    of the 9th USENIX Conference on Operating Systems Design and Implementation,
    OSDI’10, (Berkeley, CA, USA), pp. 265–278, USENIX Association,
    2010.
    132. Y. Chen, A. Ganapathi, R. Griffith, and R. Katz, “The case for evaluating mapreduce
    performance using workload suites,” in 2011 IEEE 19th Annual International
    Symposium on Modelling, Analysis, and Simulation of Computer and Telecommunication
    Systems, pp. 390–399, July 2011.
    133. L. Breslau, P. Cao, L. Fan, G. Phillips, and S. Shenker, “Web caching and zipf-like
    distributions: evidence and implications,” in INFOCOM ’99. Eighteenth Annual
    Joint Conference of the IEEE Computer and Communications Societies. Proceedings.
    IEEE, vol. 1, pp. 126–134 vol.1, Mar 1999.
    134. D. G. Cameron, R. Carvajal-Schiaffino, A. P. Millar, C. Nicholson, K. Stockinger,
    and F. Zini, “Evaluating scheduling and replica optimisation strategies in optorsim,”
    in Proceedings. First Latin American Web Congress, pp. 52–59, Nov 2003.
    135. L. R. Anikode and B. Tang, “Integrating scheduling and replication in data
    grids with performance guarantee,” in Global Telecommunications Conference
    (GLOBECOM 2011), 2011 IEEE, pp. 1–6, Dec 2011.
    136. F. Jolfaei and A. T. Haghighat, “The impact of bandwidth and storage space on
    job scheduling and data replication strategies in data grids,” in Computing Technology
    and Information Management (ICCM), 2012 8th International Conference
    on, vol. 1, pp. 283–288, April 2012.
    137. Y. C. Lee and A. Y. Zomaya, “Practical scheduling of bag-of-tasks applications
    on grids with dynamic resilience,” IEEE Transactions on Computers, vol. 56,
    pp. 815–825, June 2007.
    138. M. Zarina, F. Ahmad, A. N. bin Mohd Rose, M. Nordin, and M. M. Deris, “Job
    scheduling for dynamic data replication strategy in heterogeneous federation data
    grid systems,” in Informatics and Applications (ICIA),2013 Second International
    Conference on, pp. 203–206, Sept 2013.
    139. W. Li, Y. Yang, and D. Yuan, “A novel cost-effective dynamic data replication
    strategy for reliability in cloud data centres,” in Dependable, Autonomic and Secure
    Computing (DASC), 2011 IEEE Ninth International Conference on, pp. 496–
    502, Dec 2011.
    140. C. L. Abad, Y. Lu, and R. H. Campbell, “Dare: Adaptive data replication for
    efficient cluster scheduling,” in 2011 IEEE International Conference on Cluster
    Computing, pp. 159–168, Sept 2011.
    141. R. Tudoran, A. Costan, and G. Antoniu, “Overflow: Multi-site aware big data
    management for scientific workflows on clouds,” IEEE Transactions on Cloud
    Computing, vol. 4, pp. 76–89, Jan 2016.
    142. L. Wang, J. Tao, R. Ranjan, H. Marten, A. Streit, J. Chen, and D. Chen, “G-hadoop:
    Mapreduce across distributed data centers for data-intensive computing,” Future
    Generation Computer Systems, vol. 29, no. 3, pp. 739 – 750, 2013. Special Section:
    Recent Developments in High Performance Computing and Security.
    143. B. Heintz, A. Chandra, R. K. Sitaraman, and J. Weissman, “End-to-end optimization
    for geo-distributed mapreduce,” IEEE Transactions on Cloud Computing,
    vol. 4, pp. 293–306, July 2016.
    144. S. Li, Q. Lu, W. Zhang, and L. Zhu, “A mapreduce cluster deployment optimization
    framework with geo-distributed data,” in 2015 IEEE 12th Intl Conf on Ubiquitous
    Intelligence and Computing and 2015 IEEE 12th Intl Conf on Autonomic and
    Trusted Computing and 2015 IEEE 15th Intl Conf on Scalable Computing and
    Communications and Its Associated Workshops (UIC-ATC-ScalCom), pp. 943–
    949, Aug 2015.
    145. M. Cavallo, G. D. Modica, C. Polito, and O. Tomarchio, “Application profiling in
    hierarchical hadoop for geo-distributed computing environments,” in 2016 IEEE
    Symposium on Computers and Communication (ISCC), pp. 555–560, June 2016.
    146. W. Chen, I. Paik, and Z. Li, “Cost-aware streaming workflow allocation on geodistributed
    data centers,” IEEE Transactions on Computers, vol. PP, no. 99, pp. 1–
    1, 2016.
    147. Y. Koshiba, W. Chen, Y. Yamada, T. Tanaka, and I. Paik, “Investigation of network
    traffic in geo-distributed data centers,” in 2015 IEEE 7th International Conference
    on Awareness Science and Technology (iCAST), pp. 174–179, Sept 2015.
    148. P. Li, S. Guo, S. Yu, and W. Zhuang, “Cross-cloud mapreduce for big data,” IEEE
    Transactions on Cloud Computing, vol. PP, no. 99, pp. 1–1, 2015.
    149. D. Cheng, J. Rao, Y. Guo, C. Jiang, and X. Zhou, “Improving performance of
    heterogeneous mapreduce clusters with adaptive task tuning,” IEEE Transactions
    on Parallel and Distributed Systems, vol. 28, pp. 774–786, March 2017.
    150. P. Li, S. Guo, T. Miyazaki, X. Liao, H. Jin, A. Zomaya, and K. Wang, “Trafficaware
    geo-distributed big data analytics with predictable job completion time,”
    IEEE Transactions on Parallel and Distributed Systems, vol. PP, no. 99, pp. 1–1,
    2016.
    151. A. Mandal, Y. Xin, I. Baldine, P. Ruth, C. Heerman, J. Chase, V. Orlikowski, and
    A. Yumerefendi, “Provisioning and evaluating multi-domain networked clouds
    for hadoop-based applications,” in 2011 IEEE Third International Conference on
    Cloud Computing Technology and Science, pp. 690–697, Nov 2011.
    152. Q. Pu, G. Ananthanarayanan, P. Bodik, S. Kandula, A. Akella, P. Bahl, and I. Stoica,
    “Low latency geo-distributed data analytics,” in Proceedings of the 2015 ACM
    Conference on Special Interest Group on Data Communication, SIGCOMM ’15,
    (New York, NY, USA), pp. 421–434, ACM, 2015.
    153. L. Schrage, “A proof of the optimality of the shortest remaining processing time
    discipline,” Operations Research, vol. 16, no. 3, pp. 687–690, 1968.
    154. Y.-K. Kwok and I. Ahmad, “Fastest: a practical low-complexity algorithm for
    compile-time assignment of parallel programs to multiprocessors,” IEEE Transactions
    on Parallel and Distributed Systems, vol. 10, pp. 147–159, Feb 1999.
    155. G. C. Sih and E. A. Lee, “A compile-time scheduling heuristic for interconnectionconstrained
    heterogeneous processor architectures,” IEEE Transactions on Parallel
    and Distributed Systems, vol. 4, pp. 175–187, Feb 1993.
    156. A. EC2, “Amazon ec2,” 2009.
    157. M. Mazzucco and M. Dumas, “Achieving performance and availability guarantees
    with spot instances,” in High Performance Computing and Communications
    (HPCC), 2011 IEEE 13th International Conference on, pp. 296–303, Sept 2011.
    158. A. Andrzejak, D. Kondo, and S. Yi, “Decision model for cloud computing under
    sla constraints,” in Modeling, Analysis Simulation of Computer and Telecommunication
    Systems (MASCOTS), 2010 IEEE International Symposium on, pp. 257–
    266, Aug 2010.
    159. Y. Song, M. Zafer, and K.-W. Lee, “Optimal bidding in spot instance market,” in
    INFOCOM, 2012 Proceedings IEEE, pp. 190–198, March 2012.
    160. S. Tang, J. Yuan, and X. Y. Li, “Towards optimal bidding strategy for amazon
    ec2 cloud spot instance,” in 2012 IEEE Fifth International Conference on Cloud
    Computing, pp. 91–98, June 2012.
    161. A. Andrzejak, D. Kondo, and D. Anderson, “Exploiting non-dedicated resources
    for cloud computing,” in Network Operations and Management Symposium
    (NOMS), 2010 IEEE, pp. 341–348, April 2010.
    162. M. Stokely, J. Winget, E. Keyes, C. Grimes, and B. Yolken, “Using a market
    economy to provision compute resources across planet-wide clusters,” in Parallel
    Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on,
    pp. 1–8, May 2009.
    163. S. Yi, A. Andrzejak, and D. Kondo, “Monetary cost-aware checkpointing and migration
    on amazon cloud spot instances,” Services Computing, IEEE Transactions
    on, vol. 5, pp. 512–524, Fourth 2012.
    164. I. Jangjaimon and N.-F. Tzeng, “Effective cost reduction for elastic clouds under
    spot instance pricing through adaptive checkpointing,” Computers, IEEE Transactions
    on, vol. 64, pp. 396–409, Feb 2015.
    165. Amazon, “Amazon console,” 2004.
    166. N. Chohan, C. Castillo, M. Spreitzer, M. Steinder, A. Tantawi, and C. Krintz,
    “See spot run: Using spot instances for mapreduce workflows,” in Proceedings of
    the 2Nd USENIX Conference on Hot Topics in Cloud Computing, HotCloud’10,
    (Berkeley, CA, USA), pp. 7–7, USENIX Association, 2010.
    167. B. Javadi, R. Thulasiramy, and R. Buyya, “Statistical modeling of spot instance
    prices in public cloud environments,” in Utility and Cloud Computing (UCC), 2011
    Fourth IEEE International Conference on, pp. 219–228, Dec 2011.
    168. O. Regev and N. Nisan, “The popcorn market—an online market for computational
    resources,” in Proceedings of the First International Conference on Information
    and Computation Economies, ICE ’98, (New York, NY, USA), pp. 148–
    157, ACM, 1998.
    169. M. Stokely, J. Winget, E. Keyes, C. Grimes, and B. Yolken, “Using a market economy
    to provision compute resources across planet-wide clusters,” in Proceedings
    for the International Parallel and Distributed Processing Symposium 2009, pp. 1–
    8, 2009.
    170. S. Chaisiri, R. Kaewpuang, B.-S. Lee, and D. Niyato, “Cost minimization for provisioning
    virtual servers in amazon elastic compute cloud,” in Modeling, Analysis
    Simulation of Computer and Telecommunication Systems (MASCOTS), 2011 IEEE
    19th International Symposium on, pp. 85–95, July 2011.
    171. S. Genaud and J. Gossa, “Cost-wait trade-offs in client-side resource provisioning
    with elastic clouds,” in Cloud Computing (CLOUD), 2011 IEEE International
    Conference on, pp. 1–8, July 2011.
    172. S. Lu, X. Li, L. Wang, H. Kasim, H. Palit, T. Hung, E. Legara, and G. Lee, “A
    dynamic hybrid resource provisioning approach for running large-scale computational
    applications on cloud spot and on-demand instances,” in Parallel and Distributed
    Systems (ICPADS), 2013 International Conference on, pp. 657–662, Dec
    2013.
    173. H. Huang, L. Wang, B. C. Tak, L. Wang, and C. Tang, “Cap3: A cloud autoprovisioning
    framework for parallel processing using on-demand and spot instances,”
    in Cloud Computing (CLOUD), 2013 IEEE Sixth International Conference
    on, pp. 228–235, June 2013.
    174. C. Binnig, A. Salama, E. Zamanian, M. El-Hindi, S. Feil, and T. Ziegler, “Spotgres
    - parallel data analytics on spot instances,” in 2015 31st IEEE International
    Conference on Data Engineering Workshops, pp. 14–21, April 2015.
    175. H. Miao and L. Li, “Cost-effective provisioning of spot instances in clouds,” in
    2016 9th International Symposium on Computational Intelligence and Design (ISCID),
    vol. 2, pp. 194–197, Dec 2016.
    176. K. Veena, C. Anand, and G. C. Prakash, “Temporal and spatial trend analysis of
    cloud spot instance pricing in amazon ec2,” in 2016 IEEE 14th Intl Conf on Dependable,
    Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence
    and Computing, 2nd Intl Conf on Big Data Intelligence and Computing and
    Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech),
    pp. 909–912, Aug 2016.
    177. A. C. Zhou, B. He, and C. Liu, “Monetary cost optimizations for hosting
    workflow-as-a-service in iaas clouds,” IEEE Transactions on Cloud Computing,
    vol. 4, pp. 34–48, Jan 2016.
    178. W. Voorsluys, S. K. Garg, and R. Buyya, “Provisioning spot market cloud resources
    to create cost-effective virtual clusters,” in Proceedings of the 11th International
    Conference on Algorithms and Architectures for Parallel Processing
    - Volume Part I, ICA3PP’11, (Berlin, Heidelberg), pp. 395–408, Springer-Verlag,
    2011.
    179. Q. Zhang, E. Gürses, R. Boutaba, and J. Xiao, “Dynamic resource allocation for
    spot markets in clouds,” in Proceedings of the 11th USENIX Conference on Hot
    Topics in Management of Internet, Cloud, and Enterprise Networks and Services,
    Hot-ICE’11, (Berkeley, CA, USA), pp. 1–1, USENIX Association, 2011.
    180. M. Mattess, C. Vecchiola, and R. Buyya, “Managing peak loads by leasing cloud
    infrastructure services from a spot market,” in 2010 IEEE 12th International Conference
    on High Performance Computing and Communications (HPCC), pp. 180–
    188, Sept 2010.
    181. Q. Zhang, Q. Zhu, and R. Boutaba, “Dynamic resource allocation for spot markets
    in cloud computing environments,” in 2011 Fourth IEEE International Conference
    on Utility and Cloud Computing, pp. 178–185, Dec 2011.
    182. N. Sadashiv, S. Kumar, and R. Goudar, “Hybrid spot instance based resource provisioning
    strategy in dynamic cloud environment,” in High Performance Computing
    and Applications (ICHPCA), 2014 International Conference on, pp. 1–6, Dec
    2014.
    183. S. Khatua and N. Mukherjee, “A novel checkpointing scheme for amazon ec2 spot
    instances,” in 2013 13th IEEE/ACM International Symposium on Cluster, Cloud,
    and Grid Computing, pp. 180–181, May 2013.

    QR CODE