雲端資源於成本及效能要求下之排程技術｜國立清華大學博碩士論文庫

簡易檢索 / 詳目顯示

回結果列表

研究生：	肯莫茲 Convolbo, WendKuuni-Moise
論文名稱：	雲端資源於成本及效能要求下之排程技術 Scheduling Techniques on Cloud Resources Under Cost and Performance Requirements
指導教授：	鍾葉青 Chung, Yeh-Ching
口試委員:	李哲榮 LEE, CHE-RUNG 周志遠 CHOU, JERRY 賴冠州 Lai, Kuan-Chou 許慶賢 Hsu, Ching-Hsien
學位類別：	博士 Doctor
系所名稱：	電機資訊學院 - 資訊系統與應用研究所 Institute of Information Systems and Applications
論文出版年：	2017
畢業學年度：	105
語文別：	英文
論文頁數：	134
中文關鍵詞：	雲端、工作排程、啟發式、虛擬機器、批次處理任務、虛擬叢集、部署、競價型執行個體、DAG
外文關鍵詞：	cloud, job scheduling, heuristic, virtualmachines, batchjobs, virtualcluster, migration, spotinstance, DAG
相關次數：	點閱：113 下載：0
分享至:	分享至facebook 分享至twitter

查詢本校圖書館目錄查詢臺灣博碩士論文知識加值系統勘誤回報

基於運算可以被交付到網際網路並可按現行方式收費這樣的理念，雲端運算已經成為了一個突出的運算典範。
藉由虛擬化技術，雲端給予我們一種錯覺，彷彿在不同組態和成本下也有取之不盡的資源可以使用。正因如此，雲端上的資源管理就成為相當關鍵的議題。
雲端設施的效率相當倚賴虛擬機器(Virtual Machine)如何配置給應用程式，
以及虛擬機器如何映射到實體機器(Physical Machine)上。由於採用不同的策略進行資源管控對於用戶任務的效能、成本以及資源的利用都有極大程度的影響。
很快地，用戶對於有效率的排程要求包含了成本意識以及服務層級協議(Service Level Agreement)的滿足。然而，雲端運算中的成本模型相當複雜，牽涉了資源本身的能力、租賃時間以及資源獲取的模式。
除此之外，近來大數據的出現也促進了大規模數據分析的發展，這些應用通常跨越了地理上分散的數據中心並且具有廣泛地處理要求。在這樣的情形下，雲端用戶經常提起的問題是：如何找到最有效益的方式來使用運算資源並且保證目標函式對於工作負載的執行。有鑑於此，我們考慮的問題是設計一個關於資源排程的技術，這個技術是在效能限制下對於執行成本的最小化。
在本篇論文當中，我們將針對如何有效管理資源以及規劃應用程式的執行，以期最小化整體計算成本並且能保證滿足效能需求，並且提出新穎的排程技術和演算法。
本篇論文的主要目的是在雲端計算的範疇內提出一具成本意識的排程策略，這樣的策略可以涵蓋多種不同的應用程式，其中包括了高效計算、資料分析以及平行的批次作業處理。
至此，我們的方法是去探索資源的類型，而這包括了利用拍賣資源來降低在特定用戶限制下的執行成本。除此之外，我們也調查了地理分佈式數據中心的相關問題。
我們提出的策略所作出的貢獻可分成三個方面:
清楚地了解雲端資源管理在成本和效能之間的權衡、利用資源租賃模式來使用基於拍賣的雲端資源，最後，我們對比了單一數據中心的排程和最近表現出不同排程機制的地理分佈式要求。

Cloud computing has emerged to become a prominent computing paradigm based on the idea
that computation can be delivered over the Internet and be charged at an as-you-go
basis. Through virtualization techniques, the Cloud offers an illusion of limitless
resources with different configurations and costs. As a result, managing cloud resources
has become a critical issue. The efficiency of the whole cloud facilities strongly relies
on how the Virtual Machines (VM) are allocated to the applications and how VMs are mapped
to the Physical Machine (PM). Different resources management strategies can largely affect
the performance of the user's job, the cost, and the resource utilization. Hence,
efficient job scheduling in the user perspective has swift to include cost-awareness and
the satisfaction of the Service Level Agreement (SLA). However, the cost in cloud computing
is a complex model in which involve the resource capacity, the leasing time and the
resource acquisition mode. In addition, the recent advent of Big Data has contributed
to the development of large scale data analytic applications which often span
geographically dispersed data centers and have a wide range of processing requirements.
A problem usually raised by cloud users in this situation, is to find the most cost
effective computing resources to guarantee the objective functions of their workloads
execution. Hence, we consider the problem of designing resource scheduling techniques
to minimize the execution costs under performance constraints.
In this thesis, we present novel scheduling techniques and algorithms to efficiently
manage the resource and plan the execution of application jobs so as to minimize the
overall computation cost and guarantee the performance requirement. The main objective
of this thesis is therefore to provide cost-aware scheduling strategies in cloud
computing for various types of applications including High Performance Computing,
data analytics and Parallel batch jobs. To this end, our approach is to explore the
resource types including auction based resources to leverage the execution cost under
specified user constrains. In addition, we investigate the scheduling problem in
geo-distributed data centers. Contributions in our strategies are three folds:
Ensure a clear understanding of the tradeoff between cost and performance in
Cloud resource management. Exploit the resource leasing model to leverage the
auction-based cloud resources. Finally, we show the contrast with single data center
scheduling with the recent geo-distributed requirement which exhibits different
scheduling mechanisms.

􀴡􀚤 感謝. . . . . . . . . . . . . . . . . . . . . . . . .v
Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . . .vii
Abstract . . . . . . . . . . . . . . . . . . . . . . . . .xiii
Introduction. . . . . . . . . . . . . . . . . . . . . . . . . 1
1 Background and context . . . . . . . . . . . . . . . . . . . . . . . . . 2
2 Research Problems and Objectives . . . . . . . . . . . . . . . . . . . . 4
3 Approach Overview and Significance . . . . . . . . . . . . . . . . . . 6
Background. . . . . . . . . . . . . . . . . . . . . . . . . 9
1 Scheduling in Cloud Environment . . . . . . . . . . . . . . . . . . . . 9
2 Taxonomy and Survey on Job Scheduling in Cloud . . . . . . . . . . . 11
2.1 Classification Parameters . . . . . . . . . . . . . . . . . . . . . 11
2.2 Application Model . . . . . . . . . . . . . . . . . . . . . . . . 12
2.3 Resource Model . . . . . . . . . . . . . . . . . . . . . . . . . 13
2.4 Scheduling Level . . . . . . . . . . . . . . . . . . . . . . . . . 13
3 User-centric Scheduling Techniques . . . . . . . . . . . . . . . . . . . 14
3.1 QoS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
3.2 Cost Awareness . . . . . . . . . . . . . . . . . . . . . . . . . . 15
3.3 Execution Time . . . . . . . . . . . . . . . . . . . . . . . . . . 16
3.4 Performance-driven . . . . . . . . . . . . . . . . . . . . . . . . 17
4 Cloud Provider-centric Scheduling . . . . . . . . . . . . . . . . . . . . 17
4.1 Power-awareness Job Scheduling . . . . . . . . . . . . . . . . 17
4.2 Load Balance-awareness Job Scheduling . . . . . . . . . . . . 18
4.3 The Resource Reutilization . . . . . . . . . . . . . . . . . . . . 18
5 Trend . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
5.1 Workflows Scheduling . . . . . . . . . . . . . . . . . . . . . . 19
5.2 Bidding on Spot Instances . . . . . . . . . . . . . . . . . . . . 19
6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
Minimizing the Execution Cost on Cloud Resources Through DAG Scheduling
Algorithms . . . . . . . . . . . . . . . . . . . . . . . . .23
1 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
2 Backgroud and Motivation . . . . . . . . . . . . . . . . . . . . . . . . 24
3 Problem Definition . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
3.1 System Model & Problem Description . . . . . . . . . . . . . . 26
3.2 DAG Job . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
3.3 Scheduling Examples . . . . . . . . . . . . . . . . . . . . . . . 31
4 Cost Aware Scheduling Algorithms . . . . . . . . . . . . . . . . . . . 33
4.1 Cost Optimal Algorithm . . . . . . . . . . . . . . . . . . . . . 34
4.2 Heuristic Algorithm . . . . . . . . . . . . . . . . . . . . . . . 35
4.3 Dependency-Aware Task Scheduling . . . . . . . . . . . . . . 40
4.4 Complexity Analysis . . . . . . . . . . . . . . . . . . . . . . . 41
5 Experimental Setup . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
5.1 Random Graph Generator . . . . . . . . . . . . . . . . . . . . 44
5.2 Task Execution Time Model . . . . . . . . . . . . . . . . . . . 45
5.3 Comparison Algorithms . . . . . . . . . . . . . . . . . . . . . 46
6 Evaluation and Analysis . . . . . . . . . . . . . . . . . . . . . . . . . 47
6.1 Overall Comparison . . . . . . . . . . . . . . . . . . . . . . . 47
6.2 In-depth Analysis . . . . . . . . . . . . . . . . . . . . . . . . . 49
6.3 Analysis on Types of DAGs . . . . . . . . . . . . . . . . . . . 52
7 Related Works . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
8 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
Towards the Optimization of Data locality-aware Job Scheduling in Geodistributed
Data Centers . . . . . . . . . . . . . . . . . . . . . . . . .59
1 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
2 Background and System Model . . . . . . . . . . . . . . . . . . . . . . 62
2.1 Scheduling in Geo-Distributed Systems . . . . . . . . . . . . . 62
2.2 System Model . . . . . . . . . . . . . . . . . . . . . . . . . . 64
2.3 Motivational Example . . . . . . . . . . . . . . . . . . . . . . 65
3 Problem Definition . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
3.1 Problem input . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
3.2 LP formulation . . . . . . . . . . . . . . . . . . . . . . . . . . 69
4 Data Locality-aware Scheduling . . . . . . . . . . . . . . . . . . . . . 71
4.1 Schedule Optimization with the LP Solver . . . . . . . . . . . . 71
4.2 GeoDis Scheduler . . . . . . . . . . . . . . . . . . . . . . . . 72
5 Performance Evaluation and Analysis . . . . . . . . . . . . . . . . . . 75
5.1 Experimental Setup . . . . . . . . . . . . . . . . . . . . . . . . 75
5.2 Performance Evaluation and Analysis . . . . . . . . . . . . . . 77
6 Related Works . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
7 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
Optimal Static Bidding Strategy for Spot Instances with Deadline Constraint 89
1 Background and Motivation . . . . . . . . . . . . . . . . . . . . . . . 89
2 Problem Formulation . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
2.1 Spot Instance Characteristics . . . . . . . . . . . . . . . . . . . 92
2.2 Spot Price Model . . . . . . . . . . . . . . . . . . . . . . . . . 94
2.3 Problem Formulation . . . . . . . . . . . . . . . . . . . . . . . 95
3 Bidding Strategy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97
3.1 Recursion Equation for Monetary Cost . . . . . . . . . . . . . 98
3.2 Dynamic Programming Algorithm . . . . . . . . . . . . . . . . 100
4 Evaluations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100
4.1 Experimental Setup . . . . . . . . . . . . . . . . . . . . . . . . 102
4.2 Monetary Cost analysis . . . . . . . . . . . . . . . . . . . . . . 103
4.3 Completion Comparison . . . . . . . . . . . . . . . . . . . . . 106
4.4 Spot and On-Demand Comparison . . . . . . . . . . . . . . . . 108
4.5 Restart & Checkpoint overhead Analysis . . . . . . . . . . . . 108
4.6 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110
5 Related Works . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
6 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113
Conclusion . . . . . . . . . . . . . . . . . . . . . . . . .115
1 Executive Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115
2 Virtual Cluster Acquisition for DAG Jobs . . . . . . . . . . . . . . . . 116
3 Scheduling Data-aware Large-Scale Applications on Geo-Distributed
Data Centers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117
4 Leveraging Auction-based Cloud Resources . . . . . . . . . . . . . . . 117
5 Future Directions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119

                                

References
1. M. Armbrust, A. Fox, R. Griffith, A. D. Joseph, R. Katz, A. Konwinski, G. Lee,
D. Patterson, A. Rabkin, I. Stoica, and M. Zaharia, “A view of cloud computing,”
Commun. ACM, vol. 53, pp. 50–58, Apr. 2010.
2. S. Wu, H. Chen, S. Di, B. Zhou, Z. Xie, H. Jin, and X. Shi, “Synchronizationaware
scheduling for virtual clusters in cloud,” IEEE Transactions on Parallel
and Distributed Systems, vol. 26, pp. 2890–2902, Oct 2015.
3. M. Armbrust, A. Fox, R. Griffith, A. D. Joseph, R. Katz, A. Konwinski, G. Lee,
D. Patterson, A. Rabkin, I. Stoica, and M. Zaharia, “A view of cloud computing,”
Commun. ACM, vol. 53, pp. 50–58, Apr. 2010.
4. “AWS: Amazon Web Service,” 2006.
5. “Microsoft Azure,” 2010.
6. “Rackspace,” 1998.
7. . Goiri, R. Beauchea, K. Le, T. D. Nguyen, M. E. Haque, J. Guitart, J. Torres, and
R. Bianchini, “Greenslot: Scheduling energy consumption in green datacenters,”
in 2011 International Conference for High Performance Computing, Networking,
Storage and Analysis (SC), pp. 1–11, Nov 2011.
8. Y. Liu, N. Bobroff, L. Fong, S. Seelam, and J. Delgado, “New metrics for scheduling
jobs on cluster of virtual machines,” in 2011 IEEE International Symposium on
Parallel and Distributed Processing Workshops and Phd Forum, pp. 1001–1008,
May 2011.
9. Y. Wang and W. Shi, “Budget-driven scheduling algorithms for batches of mapreduce
jobs in heterogeneous clouds,” IEEE Transactions on Cloud Computing,
vol. 2, pp. 306–319, July 2014.
10. C. Zhang and H. De Sterck, “Cloudbatch: A batch job queuing system on clouds
with hadoop and hbase,” in IEEE International Conference on Cloud Computing
Technology and Science (CloudCom), pp. 368–375, Nov 2010.
11. X. Liao, Z. Gao, W. Ji, and Y. Wang, “An enforcement of real time scheduling
in spark streaming,” in Green Computing Conference and Sustainable Computing
Conference (IGSC), 2015 Sixth International, pp. 1–6, Dec 2015.
12. K. Wang, Z. Bian, and Q. Chen, “Millipedes: Distributed and set-based sub-task
scheduler of computing engines running on yarn cluster,” in High Performance
Computing and Communications (HPCC), 2015 IEEE 7th International Symposium
on Cyberspace Safety and Security (CSS), 2015 IEEE 12th International Conferen
on Embedded Software and Systems (ICESS), 2015 IEEE 17th International
Conference on, pp. 1597–1602, Aug 2015.
13. I. Menache, O. Shamir, and N. Jain, “On-demand, spot, or both: Dynamic resource
allocation for executing batch jobs in the cloud,” in 11th International Conference
on Autonomic Computing (ICAC 14), (Philadelphia, PA), pp. 177–187, USENIX
Association, 2014.
14. C. Reiss, A. Tumanov, G. R. Ganger, R. H. Katz, and M. A. Kozuch, “Heterogeneity
and dynamicity of clouds at scale: Google trace analysis,” in Proceedings
of the Third ACM Symposium on Cloud Computing, SoCC ’12, (New York, NY,
USA), pp. 7:1–7:13, ACM, 2012.
15. J. Dean and S. Ghemawat, “MapReduce: Simplified data processing on large clusters,”
Commun. ACM, vol. 51, pp. 107–113, Jan. 2008.
16. M. Isard, M. Budiu, Y. Yu, A. Birrell, and D. Fetterly, “Dryad: Distributed dataparallel
programs from sequential building blocks,” in Proceedings of the 2007
Eurosys Conference, (Lisbon, Portugal), Association for Computing Machinery,
Inc., March 2007.
17. W. Guo, K. Chen, Y. Wu, and W. Zheng, “Bidding for highly available services
with low price in spot instance market,” in Proceedings of the 24th International
Symposium on High-Performance Parallel and Distributed Computing, HPDC
’15, (New York, NY, USA), pp. 191–202, ACM, 2015.
18. A. Elghirani, R. Subrata, and A. Y. Zomaya, “Intelligent scheduling and replication
in datagrids: a synergistic approach,” in Seventh IEEE International Symposium
on Cluster Computing and the Grid (CCGrid ’07), pp. 179–182, May 2007.
19. M. A. Kozuch, M. P. Ryan, R. Gass, S. W. Schlosser, D. O’Hallaron, J. Cipar,
E. Krevat, J. López, M. Stroucken, and G. R. Ganger, “Tashi: Location-aware
cluster management,” in Proceedings of the 1st Workshop on Automated Control
for Datacenters and Clouds, ACDC ’09, (New York, NY, USA), pp. 43–48, ACM,
2009.
20. S. Pandey, L. Wu, S. M. Guru, and R. Buyya, “A particle swarm optimizationbased
heuristic for scheduling workflow applications in cloud computing environments,”
in 2010 24th IEEE International Conference on Advanced Information
Networking and Applications, pp. 400–407, April 2010.
21. I. A. Moschakis and H. D. Karatza, “Performance and cost evaluation of gang
scheduling in a cloud computing system with job migrations and starvation handling,”
in 2011 IEEE Symposium on Computers and Communications (ISCC),
pp. 418–423, June 2011.
22. Y. Kessaci, N. Melab, and E. G. Talbi, “A pareto-based ga for scheduling hpc applications
on distributed cloud infrastructures,” in 2011 International Conference
on High Performance Computing Simulation, pp. 456–462, July 2011.
23. L. F. Bittencourt and E. R. M. Madeira, “Hcoc: a cost optimization algorithm for
workflow scheduling in hybrid clouds,” Journal of Internet Services and Applications,
vol. 2, no. 3, pp. 207–227, 2011.
24. M. A. Rodriguez and R. Buyya, “Deadline based resource provisioningand
scheduling algorithm for scientific workflows on clouds,” IEEE Transactions on
Cloud Computing, vol. 2, pp. 222–235, April 2014.
25. R. N. Calheiros and R. Buyya, “Meeting deadlines of scientific workflows in public
clouds with tasks replication,” IEEE Transactions on Parallel and Distributed
Systems, vol. 25, pp. 1787–1796, July 2014.
26. X. You, X. Xu, J. Wan, and C. Jiang, “Analysis and evaluation of the scheduling algorithms
in virtual environment,” in 2009 International Conference on Embedded
Software and Systems, pp. 291–296, May 2009.
27. L. F. Bittencourt, E. R. M. Madeira, and N. L. S. D. Fonseca, “Scheduling in hybrid
clouds,” IEEE Communications Magazine, vol. 50, no. 9, pp. 42–47, 2012.
28. S. M. Parikh, “A survey on cloud computing resource allocation techniques,” in
2013 Nirma University International Conference on Engineering (NUiCONE),
pp. 1–5, Nov 2013.
29. A. Ananth and K. C. Sekaran, “Game theoretic approaches for job scheduling in
cloud computing: A survey,” in 2014 International Conference on Computer and
Communication Technology (ICCCT), pp. 79–85, Sept 2014.
30. N. Chopra and S. Singh, “Survey on scheduling in hybrid clouds,” in Fifth International
Conference on Computing, Communications and Networking Technologies
(ICCCNT), pp. 1–6, July 2014.
31. S. B. Shaw and A. K. Singh, “A survey on scheduling and load balancing techniques
in cloud computing environment,” in 2014 International Conference on
Computer and Communication Technology (ICCCT), pp. 87–95, Sept 2014.
32. Y. P. Dave, A. S. Shelat, D. S. Patel, and R. H. Jhaveri, “Various job scheduling
algorithms in cloud computing: A survey,” in International Conference on
Information Communication and Embedded Systems (ICICES2014), pp. 1–5, Feb
2014.
33. S. Verma and R. A. Satao, “A survey on the impact of economies of scale on scientific
communities,” in 2015 International Conference on Advances in Computer
Engineering and Applications, pp. 722–726, March 2015.
34. A. Greenberg, J. Hamilton, D. A. Maltz, and P. Patel, “The cost of a cloud: Research
problems in data center networks,” SIGCOMM Comput. Commun. Rev.,
vol. 39, pp. 68–73, Dec. 2008.
35. N. Sooezi, S. Abrishami, and M. Lotfian, “Scheduling data-driven workflows in
multi-cloud environment,” in 2015 IEEE 7th International Conference on Cloud
Computing Technology and Science (CloudCom), pp. 163–167, Nov 2015.
36. B. Sharma, R. K. Thulasiram, P. Thulasiraman, and R. Buyya, “Clabacus: A
risk-adjusted cloud resources pricing model using financial option theory,” IEEE
Transactions on Cloud Computing, vol. 3, pp. 332–344, July 2015.
37. R. Iakymchuk, J. Napper, and P. Bientinesi, “Improving high-performance computations
on clouds through resource underutilization,” in Proceedings of the
2011 ACM Symposium on Applied Computing, SAC ’11, (New York, NY, USA),
pp. 119–126, ACM, 2011.
38. R. C. Chiang and H. H. Huang, “Tracon: Interference-aware scheduling for dataintensive
applications in virtualized environments,” in 2011 International Conference
for High Performance Computing, Networking, Storage and Analysis (SC),
pp. 1–12, Nov 2011.
39. T. A. Henzinger, V. Singh, T. Wies, and D. Zufferey, “Scheduling large jobs by
abstraction refinement,” in Proceedings of the Sixth Conference on Computer Systems,
EuroSys ’11, (New York, NY, USA), pp. 329–342, ACM, 2011.
40. J. W. Lin, C. H. Chen, and J. M. Chang, “Qos-aware data replication for dataintensive
applications in cloud computing systems,” IEEE Transactions on Cloud
Computing, vol. 1, pp. 101–115, Jan 2013.
41. D. Kliazovich, P. Bouvry, and S. U. Khan, “Dens: data center energy-efficient
network-aware scheduling,” Cluster Computing, vol. 16, no. 1, pp. 65–75, 2013.
42. P. Fan, J. Wang, Z. Zheng, and M. R. Lyu, “Toward optimal deployment of
communication-intensive cloud applications,” in 2011 IEEE 4th International
Conference on Cloud Computing, pp. 460–467, July 2011.
43. D. Millot and C. Parrot, “Scheduling on unspecified heterogeneous distributed
resources,” in 2011 IEEE International Symposium on Parallel and Distributed
Processing Workshops and Phd Forum, pp. 45–56, May 2011.
44. Z. Li, L. Wang, S. Ren, and G. Quan, “Temperature, power, and makespan aware
dependent task scheduling for data centers,” in 2011 IEEE/ACM International
Conference on Green Computing and Communications, pp. 22–27, Aug 2011.
45. Y. Song, H. Wang, Y. Li, B. Feng, and Y. Sun, “Multi-tiered on-demand resource
scheduling for vm-based data center,” in Proceedings of the 2009 9th IEEE/
ACM International Symposium on Cluster Computing and the Grid, CCGRID ’09,
(Washington, DC, USA), pp. 148–155, IEEE Computer Society, 2009.
46. P. Wang, Y. Qi, X. Liu, Y. Chen, and X. Zhong, “Power management in heterogeneous
multi-tier web clusters,” in 2010 39th International Conference on Parallel
Processing, pp. 385–394, Sept 2010.
47. A. Beloglazov and R. Buyya, “Energy efficient resource management in virtualized
cloud data centers,” in Proceedings of the 2010 10th IEEE/ACM International
Conference on Cluster, Cloud and Grid Computing, CCGRID ’10, (Washington,
DC, USA), pp. 826–831, IEEE Computer Society, 2010.
48. N. Regola and J. C. Ducom, “Recommendations for virtualization technologies in
high performance computing,” in 2010 IEEE Second International Conference on
Cloud Computing Technology and Science, pp. 409–416, Nov 2010.
49. C. C. T. Mark, D. Niyato, and T. Chen-Khong, “Evolutionary optimal virtual machine
placement and demand forecaster for cloud computing,” in 2011 IEEE International
Conference on Advanced Information Networking and Applications,
pp. 348–355, March 2011.
50. S. Zaman and D. Grosu, “Efficient bidding for virtual machine instances in
clouds,” in 2011 IEEE 4th International Conference on Cloud Computing, pp. 41–
48, July 2011.
51. S. Genaud and J. Gossa, “Cost-wait Trade-offs in Client-side Resource Provisioning
with Elastic Clouds,” in 4th IEEE International Conference on Cloud Computing
(CLOUD 2011), (Washington, United States), July 2011.
52. B. Saovapakhiran, G. Michailidis, and M. Devetsikiotis, “Aggregated-dag
scheduling for job flow maximization in heterogeneous cloud computing,” in 2011
IEEE Global Telecommunications Conference - GLOBECOM 2011, pp. 1–6, Dec
2011.
53. V. A. Patil and V. Chaudhary, “Rack aware scheduling in hpc data centers: An
energy conservation strategy,” in 2011 IEEE International Symposium on Parallel
and Distributed Processing Workshops and Phd Forum, pp. 814–821, May 2011.
54. F. Diaz, E. A. Doumith, and M. Gagnaire, “Impact of resource over-reservation
(ror) and dropping policies on cloud resource allocation,” in 2011 IEEE Third International
Conference on Cloud Computing Technology and Science, pp. 470–
476, Nov 2011.
55. Z. Wang and Y. Q. Zhang, “Energy-efficient task scheduling algorithms with human
intelligence based task shuffling and task relocation,” in 2011 IEEE/ACM
International Conference on Green Computing and Communications, pp. 38–43,
Aug 2011.
56. M. Rahman, X. Li, and H. Palit, “Hybrid heuristic for scheduling data analytics
workflow applications in hybrid cloud environment,” in 2011 IEEE International
Symposium on Parallel and Distributed Processing Workshops and Phd Forum,
pp. 966–974, May 2011.
57. X. Chen, Q. Liu, and J. Lai, “A new power-aware scheduling algorithm for distributed
system,” in Green Computing and Communications (GreenCom), 2010
IEEE/ACM Int’l Conference on Int’l Conference on Cyber, Physical and Social
Computing (CPSCom), pp. 338–343, Dec 2010.
58. B. Li, J. Li, J. Huai, T. Wo, Q. Li, and L. Zhong, “Enacloud: An energy-saving
application live placement approach for cloud computing environments,” in 2009
IEEE International Conference on Cloud Computing, pp. 17–24, Sept 2009.
59. N. Ma, Y. Xia, and V. K. Prasanna, “Exploring weak dependencies in dag scheduling,”
in 2011 IEEE International Symposium on Parallel and Distributed Processing
Workshops and Phd Forum, pp. 591–598, May 2011.
60. F. Pinel, J. E. Pecero, S. U. Khan, and P. Bouvry, “Energy-efficient scheduling
on milliclusters with performance constraints,” in 2011 IEEE/ACM International
Conference on Green Computing and Communications, pp. 44–49, Aug 2011.
61. X. Zhu, C. He, Y. Bi, and D. Qiu, “Towards adaptive power-aware scheduling for
real-time tasks on dvs-enabled heterogeneous clusters,” in Green Computing and
Communications (GreenCom), 2010 IEEE/ACM Int’l Conference on Int’l Conference
on Cyber, Physical and Social Computing (CPSCom), pp. 117–124, Dec
2010.
62. J. Sun, C. Huang, and J. Dong, “Research on power-aware scheduling for highperformance
computing system,” in 2011 IEEE/ACM International Conference on
Green Computing and Communications, pp. 75–78, Aug 2011.
63. N. Kim, J. Cho, and E. Seo, “Energy-based accounting and scheduling of virtual
machines in a cloud system,” in 2011 IEEE/ACM International Conference on
Green Computing and Communications, pp. 176–181, Aug 2011.
64. S. Tang, J. Yuan, C. Wang, and X. Y. Li, “A framework for amazon ec2 bidding
strategy under sla constraints,” IEEE Transactions on Parallel and Distributed
Systems, vol. 25, pp. 2–11, Jan 2014.
65. S. Yi, A. Andrzejak, and D. Kondo, “Monetary cost-aware checkpointing and migration
on amazon cloud spot instances,” Services Computing, IEEE Transactions
on, vol. 5, pp. 512–524, Fourth 2012.
66. S. Yi, D. Kondo, and A. Andrzejak, “Reducing costs of spot instances via checkpointing
in the amazon elastic compute cloud,” in Cloud Computing (CLOUD),
2010 IEEE 3rd International Conference on, pp. 236–243, July 2010.
67. A. Marathe, R. Harris, D. Lowenthal, B. R. de Supinski, B. Rountree, and
M. Schulz, “Exploiting redundancy for cost-effective, time-constrained execution
of hpc applications on amazon ec2,” in Proceedings of the 23rd International Symposium
on High-performance Parallel and Distributed Computing, HPDC ’14,
(New York, NY, USA), pp. 279–290, ACM, 2014.
68. W. Voorsluys and R. Buyya, “Reliable provisioning of spot instances for computeintensive
applications,” in Proceedings of the 2012 IEEE 26th International
Conference on Advanced Information Networking and Applications, AINA ’12,
(Washington, DC, USA), pp. 542–549, IEEE Computer Society, 2012.
69. S. Subramanya, T. Guo, P. Sharma, D. Irwin, and P. Shenoy, “Spoton: A batch
computing service for the spot market,” in Proceedings of the Sixth ACM Symposium
on Cloud Computing, SoCC ’15, (New York, NY, USA), pp. 329–341,
ACM, 2015.
70. O. Agmon Ben-Yehuda, M. Ben-Yehuda, A. Schuster, and D. Tsafrir, “Deconstructing
amazon ec2 spot instance pricing,” in Cloud Computing Technology and
Science (CloudCom), 2011 IEEE Third International Conference on, pp. 304–311, Nov 2011.
71. Y. Song, M. Zafer, and K.-W. Lee, “Optimal bidding in spot instance market,” in
INFOCOM, 2012 Proceedings IEEE, pp. 190–198, March 2012.
72. T. Hu, “Parallel sequencing and assembly line problems,” Operations Research,
vol. 9, no. 6, pp. 841–848, 1961.
73. Y. Yu, M. Isard, D. Fetterly, M. Budiu, U. Erlingsson, P. K. Gunda, and J. Currey,
“DryadLINQ: A system for general-purpose distributed data-parallel computing
using a high-level language,” in Proceedings of the USENIX Conference on Operating
Systems Design and Implementation, pp. 1–14, 2008.
74. M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker, and I. Stoica, “Spark:
Cluster computing with working sets,” in Proceedings of the 2Nd USENIX Conference
on Hot Topics in Cloud Computing, HotCloud’10, (Berkeley, CA, USA),
pp. 10–10, USENIX Association, 2010.
75. A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka, S. Anthony, H. Liu, P. Wyckoff,
and R. Murthy, “Hive: A warehousing solution over a Map-Reduce framework,”
Proc. VLDB Endow., vol. 2, pp. 1626–1629, Aug. 2009.
76. C. Olston, B. Reed, U. Srivastava, R. Kumar, and A. Tomkins, “Pig latin: A notso-
foreign language for data processing,” in ACM SIGMOD, pp. 1099–1110, 2008.
77. R. Graham, L. Lawler, J. Lenstra, and A. Kan, “Optimization and approximation
in deterministic sequencing and scheduling: A survey,” Annals of Discrete Math.,
pp. 287–326, 1979.
78. T. L. Casavant and J. G. Kuhl, “A taxonomy of scheduling in general-purpose
distributed computing systems,” IEEE Transactions on Software Engineering,
vol. 14, pp. 141–154, Feb. 1988.
79. T. L. Adam, K. M. Chandy, and J. R. Dickson, “A comparison of list schedules for
parallel processing systems,” Commun. ACM, vol. 17, pp. 685–690, Dec. 1974.
80. W. H. Yu, Lu Decomposition on a Multiprocessing System with Communications
Delay. PhD thesis, University of California, Berkeley, 1984. AAI8427141.
81. B. Shirazi, M. Wang, and G. Pathak, “Analysis and evaluation of heuristic methods
for static task scheduling,” Journal of Parallel and Distributed Computing, vol. 10,
pp. 222–2232, Oct. 1990.
82. T. Yang and A. Gerasoulis, “DSC: scheduling parallel tasks on an unbounded number
of processors,” IEEE Transactions on Parallel and Distributed Systems, vol. 5,
pp. 951–967, Sep 1994.
83. Y.-K. Kwok and I. Ahmad, “Dynamic critical-path scheduling: an effective technique
for allocating task graphs to multiprocessors,” IEEE Transactions on Parallel
and Distributed Systems, vol. 7, pp. 506–521, May 1996.
84. H. Chen, B. Shirazi, K. Kavi, and A. Hurson, “Static scheduling using linear clustering
and task duplication,” in Parallel and Distributed Computing and Systems,
pp. 285–290, 1993.
85. J. Colin and P. Chretienne, “C.P.M. Scheduling with Small Computation Delays
and Task Duplication,” Operations Research, pp. 680–684, 1991.
86. R. Bajaj and D. Agrawal, “Improving scheduling of tasks in a heterogeneous
environment,” IEEE Transactions on Parallel and Distributed Systems, vol. 15,
pp. 107–118, Feb 2004.
87. C.-H. Yang, P. Lee, and Y.-C. Chung, “Improving static task scheduling in heterogeneous
and homogeneous computing systems,” in ICPP, pp. 45–45, Sept 2007.
88. H. Topcuouglu, S. Hariri, and M.-y. Wu, “Performance-effective and lowcomplexity
task scheduling for heterogeneous computing,” IEEE Transactions on
Parallel and Distributed Systems, vol. 13, pp. 260–274, Mar. 2002.
89. H. Arabnejad and J. Barbosa, “List scheduling algorithm for heterogeneous systems
by an optimistic cost table,” IEEE Transactions on Parallel and Distributed
Systems, vol. 25, pp. 682–694, March 2014.
90. R. Raju, J. Amudhavel, M. Pavithra, S. Anuja, and B. Abinaya, “A heuristic fault
tolerant ”MapReduce framework for minimizing makespan in hybrid cloud environment,”
in International Conference on Green Computing Communication and
Electrical Engineering (ICGCCEE), pp. 1–4, March 2014.
91. L. Bittencourt and E. Madeira, “HCOC: a cost optimization algorithm for workflow
scheduling in hybrid clouds,” Journal of Internet Services and Applications,
vol. 2, no. 3, pp. 207–227, 2011.
92. N. Fujimoto and K. Hagihara, “Near-optimal dynamic task scheduling of precedence
constrained coarse-grained tasks onto a computational grid,” in Proceedings
of International Conference on Parallel and Distributed Computing, pp. 80–87,
2003.
93. G. Juve, A. Chervenak, E. Deelman, S. Bharathi, G. Mehta, and K. Vahi, “Characterizing
and profiling scientific workflows,” Future Gener. Comput. Syst., vol. 29,
pp. 682–692, Mar. 2013.
94. M. Mao and M. Humphrey, “Auto-scaling to minimize cost and meet application
deadlines in cloud workflows,” in ACM/IEEE conference on Supercomputing,
pp. 49:1–49:12, 2011.
95. D. Cordeiro, G. Mounié, S. Perarnau, D. Trystram, J.-M. Vincent, and F. Wagner,
“Random graph generation for scheduling simulations,” in Proceedings of the 3rd
International ICST Conference on Simulation Tools and Techniques, pp. 60:1–
60:10, 2010.
96. L. Canon and E. Jeannot, “Evaluation and optimization of the robustness of dag
schedules in heterogeneous environments,” IEEE Transactions on Parallel and
Distributed Systems, vol. 21, pp. 532–546, April 2010.
97. A. Radulescu and A. van Gemund, “Fast and effective task scheduling in heterogeneous
systems,” in Heterogeneous Computing Workshop, 2000. (HCW 2000)
Proceedings. 9th, pp. 229–238, 2000.
98. M.-Y. Wu and D. Gajski, “Hypertool: a programming aid for message-passing
systems,” IEEE Transactions on Parallel and Distributed Systems, vol. 1, pp. 330–
343, Jul 1990.
99. V. A. F. Almeida, I. M. M. Vasconcelos, J. Arabe, and D. Menasce, “Using random
task graphs to investigate the potential benefits of heterogeneity in parallel
systems,” in ACM/IEEE conference on Supercomputing, pp. 683–691, Nov 1992.
100. A. Kvas, M. Ojstersek, and V. Zumer, “Evaluation of static program allocation
schemes for macro data-flow computer,” in Proceedings of the 20th EUROMICRO
Conference, pp. 573–580, Sep 1994.
101. A. C. Zhou and B. He, “Transformation-based monetary cost optimizations for
workflows in the cloud,” IEEE Transactions on Cloud Computing, vol. 2, no. 1,
pp. 85–98, 2014.
102. M. A. Rodriguez and R. Buyya, “Deadline based resource provisioningand
scheduling algorithm for scientific workflows on clouds,” IEEE Transactions on
Cloud Computing, vol. 2, no. 2, pp. 222–235, 2014.
103. G. Chen, “Simplified particle swarm optimization algorithm based on particles
classification,” in International Conference on Natural Computation (ICNC),
vol. 5, pp. 2701–2705, Aug 2010.
104. I. Ahmad and K. Yu-Kwong, “On exploiting task duplication in parallel program
scheduling,” IEEE Transactions on Parallel and Distributed Systems, vol. 9,
pp. 872–892, Sept. 1998.
105. S. Darbha and D. P. Agrawal, “Optimal scheduling algorithm for distributedmemory
machines,” IEEE Transactions on Parallel and Distributed Systems,
vol. 9, pp. 87–95, Jan. 1998.
106. J. Dean and S. Ghemawat, “Mapreduce: Simplified data processing on large clusters,”
Commun. ACM, vol. 51, pp. 107–113, Jan. 2008.
107. “Apache hadoop. http://hadoop.apache.org,” 2011.
108. M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker, and I. Stoica, “Spark:
Cluster computing with working sets,” in Proceedings of the 2nd USENIX Conference
on Hot Topics in Cloud Computing, 2010.
109. M. Isard, M. Budiu, Y. Yu, A. Birrell, and D. Fetterly, “Dryad: Distributed dataparallel
programs from sequential building blocks,” in Proceedings of the 2007
Eurosys Conference, (Lisbon, Portugal), Association for Computing Machinery,
Inc., March 2007.
110. A. Vulimiri, C. Curino, P. B. Godfrey, T. Jungblut, K. Karanasos, J. Padhye, and
G. Varghese, “Wanalytics: Geo-distributed analytics for a data intensive world,” in
Proceedings of the 2015 ACM SIGMOD International Conference on Management
of Data, SIGMOD ’15, (New York, NY, USA), pp. 1087–1092, ACM, 2015.
111. A. Vulimiri, C. Curino, P. B. Godfrey, T. Jungblut, J. Padhye, and G. Varghese,
“Global analytics in the face of bandwidth and regulatory constraints,” in 12th
USENIX Symposium on Networked Systems Design and Implementation (NSDI
15), (Oakland, CA), pp. 323–336, USENIX Association, May 2015.
112. W. Lin, Z. Qian, J. Xu, S. Yang, J. Zhou, and L. Zhou, “Streamscope: Continuous
reliable distributed processing of big data streams,” in 13th USENIX Symposium
on Networked Systems Design and Implementation (NSDI 16), (Santa Clara, CA),
pp. 439–453, USENIX Association, Mar. 2016.
113. “Google Compute Engine,” 2011.
114. S. Venugopal and R. Buyya, “An scp-based heuristic approach for scheduling distributed
data-intensive applications on global grids,” J. Parallel Distrib. Comput.,
vol. 68, pp. 471–487, Apr. 2008.
115. C.-C. Hung, L. Golubchik, and M. Yu, “Scheduling jobs across geo-distributed
datacenters,” in Proceedings of the Sixth ACM Symposium on Cloud Computing,
SoCC ’15, (New York, NY, USA), pp. 111–124, ACM, 2015.
116. Z. Hu, B. Li, and J. Luo, “Flutter: Scheduling tasks closer to data across geodistributed
datacenters,” in IEEE INFOCOM 2016 - The 35th Annual IEEE International
Conference on Computer Communications, pp. 1–9, April 2016.
117. K. Kloudas, M. Mamede, N. Preguiça, and R. Rodrigues, “Pixida: Optimizing data
parallel jobs in wide-area data analytics,” Proc. VLDB Endow., vol. 9, pp. 72–83,
Oct. 2015.
118. Q. Pu, G. Ananthanarayanan, P. Bodik, S. Kandula, A. Akella, P. Bahl, and I. Stoica,
“Low latency geo-distributed data analytics,” SIGCOMM Comput. Commun.
Rev., vol. 45, pp. 421–434, Aug. 2015.
119. M. Cardosa, C. Wang, A. Nangia, A. Chandra, and J. Weissman, “Exploring
mapreduce efficiency with highly-distributed data,” in Proceedings of the Second
International Workshop on MapReduce and Its Applications, MapReduce ’11,
(New York, NY, USA), pp. 27–34, ACM, 2011.
120. N. Garg, A. Kumar, and V. Pandit, “Order scheduling models: Hardness and algorithms,”
in Proceedings of the 27th International Conference on Foundations
of Software Technology and Theoretical Computer Science, FSTTCS’07, (Berlin,
Heidelberg), pp. 96–107, Springer-Verlag, 2007.
121. R. Tripathi, V. S, V. Tamarapalli, and D. Medhi, “Cost efficient design of fault
tolerant geo-distributed data centers,” IEEE Transactions on Network and Service
Management, vol. PP, no. 99, pp. 1–1, 2017.
122. H. Herodotou, F. Dong, and S. Babu, “No one (cluster) size fits all: Automatic
cluster sizing for data-intensive analytics,” in Proceedings of the 2Nd ACM Symposium
on Cloud Computing, SOCC ’11, (New York, NY, USA), pp. 18:1–18:14,
ACM, 2011.
123. V. Jalaparti, H. Ballani, P. Costa, T. Karagiannis, and A. Rowstron, “Bridging the
tenant-provider gap in cloud services,” in Proceedings of the Third ACM Symposium
on Cloud Computing, SoCC ’12, (New York, NY, USA), pp. 10:1–10:14,
ACM, 2012.
124. V. Jalaparti, P. Bodik, I. Menache, S. Rao, K. Makarychev, and M. Caesar,
“Network-aware scheduling for data-parallel jobs: Plan when you can,” SIGCOMM
Comput. Commun. Rev., vol. 45, pp. 407–420, Aug. 2015.
125. Y. Jin, Y. Gao, Z. Qian, M. Zhai, H. Peng, and S. Lu, “Workload-aware scheduling
across geo-distributed data centers,” in 2016 IEEE Trustcom/BigDataSE/ISPA,
pp. 1455–1462, Aug 2016.
126. A. N. Toosi and R. Buyya, “A fuzzy logic-based controller for cost and energy
efficient load balancing in geo-distributed data centers,” in 2015 IEEE/ACM 8th
International Conference on Utility and Cloud Computing (UCC), pp. 186–194,
Dec 2015.
127. A. Makhorin, “Gnu linear programming kit, version 4.52„” 2012.
128. V. H. Nguyen, N. H. Tuong, V. H. Tran, and N. Thoai, “An milp-based makespan
minimization model for single-machine scheduling problem with splitable jobs
and availability constraints,” in Computing, Management and Telecommunications
(ComManTel), 2013 International Conference on, pp. 397–400, Jan 2013.
129. J. H. Abawajy and M. M. Deris, “Data replication approach with consistency guarantee
for data grid,” IEEE Transactions on Computers, vol. 63, pp. 2975–2987,
Dec 2014.
130. G. Ananthanarayanan, A. Ghodsi, S. Shenker, and I. Stoica, “Effective straggler
mitigation: Attack of the clones,” in Presented as part of the 10th USENIX Symposium
on Networked Systems Design and Implementation (NSDI 13), (Lombard,
IL), pp. 185–198, USENIX, 2013.
131. G. Ananthanarayanan, S. Kandula, A. Greenberg, I. Stoica, Y. Lu, B. Saha, and
E. Harris, “Reining in the outliers in map-reduce clusters using mantri,” in Proceedings
of the 9th USENIX Conference on Operating Systems Design and Implementation,
OSDI’10, (Berkeley, CA, USA), pp. 265–278, USENIX Association,
2010.
132. Y. Chen, A. Ganapathi, R. Griffith, and R. Katz, “The case for evaluating mapreduce
performance using workload suites,” in 2011 IEEE 19th Annual International
Symposium on Modelling, Analysis, and Simulation of Computer and Telecommunication
Systems, pp. 390–399, July 2011.
133. L. Breslau, P. Cao, L. Fan, G. Phillips, and S. Shenker, “Web caching and zipf-like
distributions: evidence and implications,” in INFOCOM ’99. Eighteenth Annual
Joint Conference of the IEEE Computer and Communications Societies. Proceedings.
IEEE, vol. 1, pp. 126–134 vol.1, Mar 1999.
134. D. G. Cameron, R. Carvajal-Schiaffino, A. P. Millar, C. Nicholson, K. Stockinger,
and F. Zini, “Evaluating scheduling and replica optimisation strategies in optorsim,”
in Proceedings. First Latin American Web Congress, pp. 52–59, Nov 2003.
135. L. R. Anikode and B. Tang, “Integrating scheduling and replication in data
grids with performance guarantee,” in Global Telecommunications Conference
(GLOBECOM 2011), 2011 IEEE, pp. 1–6, Dec 2011.
136. F. Jolfaei and A. T. Haghighat, “The impact of bandwidth and storage space on
job scheduling and data replication strategies in data grids,” in Computing Technology
and Information Management (ICCM), 2012 8th International Conference
on, vol. 1, pp. 283–288, April 2012.
137. Y. C. Lee and A. Y. Zomaya, “Practical scheduling of bag-of-tasks applications
on grids with dynamic resilience,” IEEE Transactions on Computers, vol. 56,
pp. 815–825, June 2007.
138. M. Zarina, F. Ahmad, A. N. bin Mohd Rose, M. Nordin, and M. M. Deris, “Job
scheduling for dynamic data replication strategy in heterogeneous federation data
grid systems,” in Informatics and Applications (ICIA),2013 Second International
Conference on, pp. 203–206, Sept 2013.
139. W. Li, Y. Yang, and D. Yuan, “A novel cost-effective dynamic data replication
strategy for reliability in cloud data centres,” in Dependable, Autonomic and Secure
Computing (DASC), 2011 IEEE Ninth International Conference on, pp. 496–
502, Dec 2011.
140. C. L. Abad, Y. Lu, and R. H. Campbell, “Dare: Adaptive data replication for
efficient cluster scheduling,” in 2011 IEEE International Conference on Cluster
Computing, pp. 159–168, Sept 2011.
141. R. Tudoran, A. Costan, and G. Antoniu, “Overflow: Multi-site aware big data
management for scientific workflows on clouds,” IEEE Transactions on Cloud
Computing, vol. 4, pp. 76–89, Jan 2016.
142. L. Wang, J. Tao, R. Ranjan, H. Marten, A. Streit, J. Chen, and D. Chen, “G-hadoop:
Mapreduce across distributed data centers for data-intensive computing,” Future
Generation Computer Systems, vol. 29, no. 3, pp. 739 – 750, 2013. Special Section:
Recent Developments in High Performance Computing and Security.
143. B. Heintz, A. Chandra, R. K. Sitaraman, and J. Weissman, “End-to-end optimization
for geo-distributed mapreduce,” IEEE Transactions on Cloud Computing,
vol. 4, pp. 293–306, July 2016.
144. S. Li, Q. Lu, W. Zhang, and L. Zhu, “A mapreduce cluster deployment optimization
framework with geo-distributed data,” in 2015 IEEE 12th Intl Conf on Ubiquitous
Intelligence and Computing and 2015 IEEE 12th Intl Conf on Autonomic and
Trusted Computing and 2015 IEEE 15th Intl Conf on Scalable Computing and
Communications and Its Associated Workshops (UIC-ATC-ScalCom), pp. 943–
949, Aug 2015.
145. M. Cavallo, G. D. Modica, C. Polito, and O. Tomarchio, “Application profiling in
hierarchical hadoop for geo-distributed computing environments,” in 2016 IEEE
Symposium on Computers and Communication (ISCC), pp. 555–560, June 2016.
146. W. Chen, I. Paik, and Z. Li, “Cost-aware streaming workflow allocation on geodistributed
data centers,” IEEE Transactions on Computers, vol. PP, no. 99, pp. 1–
1, 2016.
147. Y. Koshiba, W. Chen, Y. Yamada, T. Tanaka, and I. Paik, “Investigation of network
traffic in geo-distributed data centers,” in 2015 IEEE 7th International Conference
on Awareness Science and Technology (iCAST), pp. 174–179, Sept 2015.
148. P. Li, S. Guo, S. Yu, and W. Zhuang, “Cross-cloud mapreduce for big data,” IEEE
Transactions on Cloud Computing, vol. PP, no. 99, pp. 1–1, 2015.
149. D. Cheng, J. Rao, Y. Guo, C. Jiang, and X. Zhou, “Improving performance of
heterogeneous mapreduce clusters with adaptive task tuning,” IEEE Transactions
on Parallel and Distributed Systems, vol. 28, pp. 774–786, March 2017.
150. P. Li, S. Guo, T. Miyazaki, X. Liao, H. Jin, A. Zomaya, and K. Wang, “Trafficaware
geo-distributed big data analytics with predictable job completion time,”
IEEE Transactions on Parallel and Distributed Systems, vol. PP, no. 99, pp. 1–1,
2016.
151. A. Mandal, Y. Xin, I. Baldine, P. Ruth, C. Heerman, J. Chase, V. Orlikowski, and
A. Yumerefendi, “Provisioning and evaluating multi-domain networked clouds
for hadoop-based applications,” in 2011 IEEE Third International Conference on
Cloud Computing Technology and Science, pp. 690–697, Nov 2011.
152. Q. Pu, G. Ananthanarayanan, P. Bodik, S. Kandula, A. Akella, P. Bahl, and I. Stoica,
“Low latency geo-distributed data analytics,” in Proceedings of the 2015 ACM
Conference on Special Interest Group on Data Communication, SIGCOMM ’15,
(New York, NY, USA), pp. 421–434, ACM, 2015.
153. L. Schrage, “A proof of the optimality of the shortest remaining processing time
discipline,” Operations Research, vol. 16, no. 3, pp. 687–690, 1968.
154. Y.-K. Kwok and I. Ahmad, “Fastest: a practical low-complexity algorithm for
compile-time assignment of parallel programs to multiprocessors,” IEEE Transactions
on Parallel and Distributed Systems, vol. 10, pp. 147–159, Feb 1999.
155. G. C. Sih and E. A. Lee, “A compile-time scheduling heuristic for interconnectionconstrained
heterogeneous processor architectures,” IEEE Transactions on Parallel
and Distributed Systems, vol. 4, pp. 175–187, Feb 1993.
156. A. EC2, “Amazon ec2,” 2009.
157. M. Mazzucco and M. Dumas, “Achieving performance and availability guarantees
with spot instances,” in High Performance Computing and Communications
(HPCC), 2011 IEEE 13th International Conference on, pp. 296–303, Sept 2011.
158. A. Andrzejak, D. Kondo, and S. Yi, “Decision model for cloud computing under
sla constraints,” in Modeling, Analysis Simulation of Computer and Telecommunication
Systems (MASCOTS), 2010 IEEE International Symposium on, pp. 257–
266, Aug 2010.
159. Y. Song, M. Zafer, and K.-W. Lee, “Optimal bidding in spot instance market,” in
INFOCOM, 2012 Proceedings IEEE, pp. 190–198, March 2012.
160. S. Tang, J. Yuan, and X. Y. Li, “Towards optimal bidding strategy for amazon
ec2 cloud spot instance,” in 2012 IEEE Fifth International Conference on Cloud
Computing, pp. 91–98, June 2012.
161. A. Andrzejak, D. Kondo, and D. Anderson, “Exploiting non-dedicated resources
for cloud computing,” in Network Operations and Management Symposium
(NOMS), 2010 IEEE, pp. 341–348, April 2010.
162. M. Stokely, J. Winget, E. Keyes, C. Grimes, and B. Yolken, “Using a market
economy to provision compute resources across planet-wide clusters,” in Parallel
Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on,
pp. 1–8, May 2009.
163. S. Yi, A. Andrzejak, and D. Kondo, “Monetary cost-aware checkpointing and migration
on amazon cloud spot instances,” Services Computing, IEEE Transactions
on, vol. 5, pp. 512–524, Fourth 2012.
164. I. Jangjaimon and N.-F. Tzeng, “Effective cost reduction for elastic clouds under
spot instance pricing through adaptive checkpointing,” Computers, IEEE Transactions
on, vol. 64, pp. 396–409, Feb 2015.
165. Amazon, “Amazon console,” 2004.
166. N. Chohan, C. Castillo, M. Spreitzer, M. Steinder, A. Tantawi, and C. Krintz,
“See spot run: Using spot instances for mapreduce workflows,” in Proceedings of
the 2Nd USENIX Conference on Hot Topics in Cloud Computing, HotCloud’10,
(Berkeley, CA, USA), pp. 7–7, USENIX Association, 2010.
167. B. Javadi, R. Thulasiramy, and R. Buyya, “Statistical modeling of spot instance
prices in public cloud environments,” in Utility and Cloud Computing (UCC), 2011
Fourth IEEE International Conference on, pp. 219–228, Dec 2011.
168. O. Regev and N. Nisan, “The popcorn market—an online market for computational
resources,” in Proceedings of the First International Conference on Information
and Computation Economies, ICE ’98, (New York, NY, USA), pp. 148–
157, ACM, 1998.
169. M. Stokely, J. Winget, E. Keyes, C. Grimes, and B. Yolken, “Using a market economy
to provision compute resources across planet-wide clusters,” in Proceedings
for the International Parallel and Distributed Processing Symposium 2009, pp. 1–
8, 2009.
170. S. Chaisiri, R. Kaewpuang, B.-S. Lee, and D. Niyato, “Cost minimization for provisioning
virtual servers in amazon elastic compute cloud,” in Modeling, Analysis
Simulation of Computer and Telecommunication Systems (MASCOTS), 2011 IEEE
19th International Symposium on, pp. 85–95, July 2011.
171. S. Genaud and J. Gossa, “Cost-wait trade-offs in client-side resource provisioning
with elastic clouds,” in Cloud Computing (CLOUD), 2011 IEEE International
Conference on, pp. 1–8, July 2011.
172. S. Lu, X. Li, L. Wang, H. Kasim, H. Palit, T. Hung, E. Legara, and G. Lee, “A
dynamic hybrid resource provisioning approach for running large-scale computational
applications on cloud spot and on-demand instances,” in Parallel and Distributed
Systems (ICPADS), 2013 International Conference on, pp. 657–662, Dec
2013.
173. H. Huang, L. Wang, B. C. Tak, L. Wang, and C. Tang, “Cap3: A cloud autoprovisioning
framework for parallel processing using on-demand and spot instances,”
in Cloud Computing (CLOUD), 2013 IEEE Sixth International Conference
on, pp. 228–235, June 2013.
174. C. Binnig, A. Salama, E. Zamanian, M. El-Hindi, S. Feil, and T. Ziegler, “Spotgres
- parallel data analytics on spot instances,” in 2015 31st IEEE International
Conference on Data Engineering Workshops, pp. 14–21, April 2015.
175. H. Miao and L. Li, “Cost-effective provisioning of spot instances in clouds,” in
2016 9th International Symposium on Computational Intelligence and Design (ISCID),
vol. 2, pp. 194–197, Dec 2016.
176. K. Veena, C. Anand, and G. C. Prakash, “Temporal and spatial trend analysis of
cloud spot instance pricing in amazon ec2,” in 2016 IEEE 14th Intl Conf on Dependable,
Autonomic and Secure Computing, 14th Intl Conf on Pervasive Intelligence
and Computing, 2nd Intl Conf on Big Data Intelligence and Computing and
Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech),
pp. 909–912, Aug 2016.
177. A. C. Zhou, B. He, and C. Liu, “Monetary cost optimizations for hosting
workflow-as-a-service in iaas clouds,” IEEE Transactions on Cloud Computing,
vol. 4, pp. 34–48, Jan 2016.
178. W. Voorsluys, S. K. Garg, and R. Buyya, “Provisioning spot market cloud resources
to create cost-effective virtual clusters,” in Proceedings of the 11th International
Conference on Algorithms and Architectures for Parallel Processing
- Volume Part I, ICA3PP’11, (Berlin, Heidelberg), pp. 395–408, Springer-Verlag,
2011.
179. Q. Zhang, E. Gürses, R. Boutaba, and J. Xiao, “Dynamic resource allocation for
spot markets in clouds,” in Proceedings of the 11th USENIX Conference on Hot
Topics in Management of Internet, Cloud, and Enterprise Networks and Services,
Hot-ICE’11, (Berkeley, CA, USA), pp. 1–1, USENIX Association, 2011.
180. M. Mattess, C. Vecchiola, and R. Buyya, “Managing peak loads by leasing cloud
infrastructure services from a spot market,” in 2010 IEEE 12th International Conference
on High Performance Computing and Communications (HPCC), pp. 180–
188, Sept 2010.
181. Q. Zhang, Q. Zhu, and R. Boutaba, “Dynamic resource allocation for spot markets
in cloud computing environments,” in 2011 Fourth IEEE International Conference
on Utility and Cloud Computing, pp. 178–185, Dec 2011.
182. N. Sadashiv, S. Kumar, and R. Goudar, “Hybrid spot instance based resource provisioning
strategy in dynamic cloud environment,” in High Performance Computing
and Applications (ICHPCA), 2014 International Conference on, pp. 1–6, Dec
2014.
183. S. Khatua and N. Mukherjee, “A novel checkpointing scheme for amazon ec2 spot
instances,” in 2013 13th IEEE/ACM International Symposium on Cluster, Cloud,
and Grid Computing, pp. 180–181, May 2013.

簡易檢索 / 詳目顯示

相關論文