SE2 Winter Semester 2015

Dates
  10.11.2015  No exercise! Work on the MapReduce assignment.
  16.11.2015  Due date for MapReduce assignment!
  04.12.2015  Midterm exam
  12.01.2016  No exercise!
  01.03.2016  Final exam

SE2 Schedule
Topic, Lecture, Exercise

Google Search
  Lecture slides (12.10.2015)
  Barroso, Clidaras, Hölzle: The Datacenter as a Computer.
  Barroso, Dean, Hölzle: Web search for a planet: The Google cluster architecture.
  Exercise sheet
  Exercise slides (20.10.2015)
  Elasticity in Cloud Computing: What It Is, and What It Is Not
  A rare look inside Facebook’s Oregon data center
  Microsoft GFS Datacenter Tour
  Operations at Twitter: Scaling Beyond 100 Million Users
  How We've Scaled Dropbox
  From the Monolith to Microservices: Lessons from Google and eBay

MapReduce, PIG
  Lecture slides (19.10.2015)
  Dean, Ghemawat: MapReduce: Simplified Data Processing on Large Clusters.
  Olston et al.: Pig latin: a not-so-foreign language for data processing.
  Exercise sheet
  Exercise slides (27.10.2015)
  MapReduce assignment slides
  MapReduce assignment: task description
  Assignment archive
  Bu et al.: HaLoop: Efficient Iterative Data Processing on Large Clusters.
  Page, Brin: The PageRank Citation Ranking: Bringing Order to the Web.
  Lämmel: Google's MapReduce Programming Model.
  Stonebraker et al.: MapReduce and parallel DBMSs: friends or foes?
  https://hadoop.apache.org/

Google File System
  Lecture slides (26.10.2015)
  Ghemawat et al.: The Google File System.
  Exercise sheet
  Exercise slides (03.11.2015)
  GFS: Evolution on Fast-forward

Bigtable
  Lecture slides (02.11.2015)
  Chang et al.: Bigtable: A Distributed Storage System for Structured Data.
  Exercise sheet
  Exercise slides (01.12.2015)
  Apache HBase Reference Guide
  HBase: The Definitive Guide (accessible with TU IP)
  HBase Schema Design

Chubby
  Lecture slides (09.11.2015)
  Burrows: The Chubby Lock Service for Loosely-Coupled Distributed Systems.
  Exercise sheet (local)
  Exercise slides (17.11.2015)

Peer-to-Peer (P2P)
  Lecture slides (16/23.11.2015)
  Stoica et al.: Chord: A scalable peer-to-peer lookup service for internet applications.
  Ratnasamy et al.: A scalable content-addressable network.
  Rowstron, Druschel: Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems.
  Exercise sheet (local)
  Exercise slides (24.11.2015)

BitTorrent
  Lecture slides (30.11.2015)
  Izal et al.: Dissecting BitTorrent: Five Months in a Torrent’s Lifetime
  Exercise sheet (local)
  Exercise slides (08.12.2015)

Dynamo
  Lecture slides (07.12.2015)
  DeCandia et al.: Dynamo: amazon's highly available key-value store
  Amazon DynamoDB Overview, a fully managed NoSQL database service
  Exercise sheet
  Exercise slides (15.12.2015)

Cloud Computing
  Lecture slides (14.12.2015/04.01.2016/11.01.2016)
  Exercise slides (05.01.2016)
  Exercise slides (19.01.2016)

Pregel
  Lecture slides (18.01.2016)
  Malewicz et al.: Pregel: a system for large-scale graph processing
  Exercise sheet
  Giraph (Introduction, Quick Start)

Spark
  Lecture slides (25.01.2016)
  Zaharia et al.: Spark: Cluster Computing with Working Sets
  Apache Spark

Spanner
  Lecture slides (01.02.2016)
  Corbett et al.: Spanner: Google's Globally-Distributed Database
  Exercise sheet
  Exercise slides (02.02.2016)