COMP 6611B: Topics on Cloud Computing and Data Analytics Systems [Fall 2015]
This reading list is tentative and subject to change over the coming weeks.
General Guideline
Paper Reading
Giving a Talk
Overview of Cloud Computing and Datacenter Architecture
Data Analytics Frameworks
J. Dean, S. Ghemawat, ‘‘MapReduce: Simplified Data Processing on Large Clusters,’’ Commun. ACM, 2008.
M. Zaharia et al., ‘‘Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing,’’ USENIX NSDI 2012.
M. Zaharia et al., ‘‘Discretized Streams: Fault-Tolerant Streaming Computation at Scale,’’ ACM SOSP 2013.
B. Saha et al., ‘‘Apache Tez: A Unifying Framework for Modeling and Building Data Processing Applications,’’ ACM SIGMOD 2015.
Storage Systems
Workload Characteristics
Cluster Management Systems
Resource Management and Scheduling Policies
M. Zaharia et al., ‘‘Delay Scheduling: A Simple Technique for Achieving Locality and Fairness in Cluster Scheduling,’’ ACM EuroSys 2010.
A. Ghodsi et al., ‘‘Dominant Resource Fairness: Fair Allocation of Multiple Resource Types,’’ USENIX NSDI 2011.
A. Ghodsi et al., ‘‘Choosy: Max-Min Fair Sharing for Datacenter Jobs with Constraints,’’ ACM EuroSys 2013.
R. Grandl et al., ‘‘Multi-Resource Packing for Cluster Schedulers,’’ ACM SIGCOMM 2014.
Q. Pu et al., ‘‘Low Latency Geo-distributed Data Analytics,’’ ACM SIGCOMM 2015.
Cluster Scheduler Design
K. Ousterhout et al., ‘‘Sparrow: Distributed, Low Latency Scheduling,’’ ACM SOSP 2013.
M. Schwarzkopf et al., ‘‘Omega: Flexible, Scalable Schedulers for Large Compute Clusters,’’ ACM EuroSys 2013.
J. Dean, L.A. Barroso, ‘‘The Tail at Scale,’’ Commun. ACM, 2013.
G. Ananthanarayanan et al., ‘‘Effective Straggler Mitigation: Attack of the Clones,’’ USENIX NSDI 2013.
F.R. Dogar et al., ‘‘Decentralized Task-Aware Scheduling for Data Center Networks,’’ ACM SIGCOMM 2014.
X. Ren et al., ‘‘Hopper: Decentralized Speculation-aware Cluster Scheduling at Scale,’’ ACM SIGCOMM 2015.
Datacenter Networking: Architecture, Traffic Characteristics, and Flow Management
R.N. Mysore et al., ‘‘PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric,’’ ACM SIGCOMM 2009.
A. Singh et al., ‘‘Jupiter Rising: A Decade of Clos Topologies and Centralized Control in Google's Datacenter Network,’’ ACM SIGCOMM 2015.
T. Benson et al., ‘‘Network Traffic Characteristics of Data Centers in the Wild,’’ ACM IMC 2010.
Y. Chen et al., ‘‘A First Look at Inter-Data Center Traffic Characteristics via Yahoo! Datasets,’’ IEEE INFOCOM 2011.
L. Popa et al., ‘‘FairCloud: Sharing the Network in Cloud Computing,’’ ACM SIGCOMM 2012.
A. Ghodsi et al., ‘‘Multi-Resource Fair Queueing for Packet Processing,’’ ACM SIGCOMM 2012.
J. Mogul, L. Popa, ‘‘What We Talk About When We Talk About Cloud Network Performance,’’ ACM SIGCOMM Comput. Commun. Rev., 2012.
M. Chowdhury et al., ‘‘Efficient Coflow Scheduling with Varys,’’ ACM SIGCOMM 2014.