COMP 6611B: Topics on Cloud Computing and Data Analytics Systems [Fall 2016]
This is a tentative reading list subject to changes over weeks.
General Guideline
Paper Reading
Giving a Talk
Overview of Cloud Computing and Datacenter Architecture
Data Analytics Frameworks
J. Dean, S. Ghemawat, ‘‘MapReduce: Simplified Data Processing on Large Clusters,’’ USENIX OSDI 2004.
M. Zaharia et al., ‘‘Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing,’’ USENIX NSDI 2012.
Armbrust et al., ‘‘Spark SQL: Relational Data Processing in Spark,’’ ACM SIGMOD 2015.
B. Sahah et al., ‘‘Apache Tez: A Unifying Framework for Modeling and Building Data Processing Applications,’’ ACM SIGMOD 2015.
M. Zaharia et al., ‘‘Discretized Streams: Fault-Tolerant Streaming Computation at Scale,’’ ACM SOSP 2013.
Akidau et al., ‘‘The dataflow model: a practical approach to balancing correctness, latency, and cost in massive-scale, unbounded, out-of-order data processing,’’ Proc. VLDB Endowment, 8(12):1792–1803, 2015.
Malewicz et al., ‘‘Pregel: A System for Large-Scale Graph Processing,’’ ACM SIGMOD 2010.
Gonzalez et al., ‘‘GraphX: Graph processing in a distributed dataflow framework,’’ USENIX OSDI 2014.
Storage Systems
Workload Characteristics
Cluster Management Systems
B. Hindman, et al., ‘‘Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center,’’ USENIX NSDI 2011.
V.K. Vavilapallih et al., ‘‘Apache Hadoop YARN: Yet Another Resource Negotiator,’’ ACM SoCC 2013.
A. Verma et al., ‘‘Large-scale cluster management at Google with Borg,’’ ACM EuroSys 2015.
Burns et al., ‘‘Borg, Omega, and Kubernetes,’’ ACM Queue, vol. 14 (2016), pp. 70-93.
Resource Management Policies
A. Ghodsi et al., ‘‘Dominant Resource Fairness: Fair Allocation of Multiple Resource Types,’’ USENIX NSDI 2011.
W. Wang et al., ‘‘Multi-Resource Fair Sharing for Datacenter Jobs with Placement Constraints,’’ IEEE/ACM SC 2016.
R. Grandl et al., ‘‘Multi-Resource Packing for Cluster Schedulers,’’ ACM SIGCOMM 2014.
Tumanov et al., ‘‘TetriSched: Global Rescheduling with Adaptive Plan-ahead in Dynamic Heterogeneous Clusters,’’ ACM EuroSys 2016.
R. Grandl et al., ‘‘Altruistic Scheduling in Multi-Resource Clusters,’’ USENIX OSDI 2016.
Ananthanarayanan et al., ‘‘PACMan: Coordinated memory caching for parallel jobs,’’ USENIX NSDI 2012.
Q. Pu et al., ‘‘FairRide: Near-Optimal, Fair Cache Sharing,’’ USENIX NSDI 2016.
Cluster Scheduler Design
K. Ousterhout et al., ‘‘Sparrow: Distributed, Low Latency Scheduling,’’ ACM SOSP 2013.
M. Schwarzkopf et al., ‘‘Omega: flexible, scalable schedulers for large computer clusters,’’ ACM EuroSys 2013.
Rasley et al., ‘‘Efficient Queue Management for Cluster Scheduling,’’ ACM EuroSys 2016.
G. Ananthanarayanan et al., ‘‘Effective Straggler Mitigation: Attack of the Clones,’’ USENIX NSDI 2013.
X. Ren et al., ‘‘Hopper: Decentralized Speculation-aware Cluster Scheduling at Scale,’’ ACM SIGCOMM 2015.
Datacenter Networking
|