- Ceph: Reliable, Scalable, and High-performance Distributed Storage
- CRUSH: Controlled, Scalable, Decentralized Placement of Replicated Data
-
Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing
-
Tachyon: Reliable, Memory Speed Storage for Cluster Computing Frameworks
- Large-scale cluster management at Google with Borg
- Dapper, a Large-Scale Distributed Systems Tracing Infrastructure
- Borg, Omega, and Kubernetes(This article describes some of the knowledge gained and lessons learned during Google’s journey from Borg to Kubernetes. )
- The Google File System
- Bigtable: A Distributed Storage System for Structured Data
- MapReduce: Simplified Data Processing on Large Clusters
- Dremel: Interactive Analysis of Web-Scale Datasets
- Pregel: A System for Large-Scale Graph Processing
- Large-scale Incremental Processing Using Distributed Transactions and Notifications(One of the backend systems that subtend Caffeine)
- Similarity Estimation Techniques from Rounding Algorithms
- Similarity Estimation Techniques from Rounding Algorithms(Simhash)
- CRUSH: Controlled, Scalable, Decentralized Placement of Replicated Data
- Consistent hashing and random trees: distributed caching protocols for relieving hot spots on the World Wide Web(Consistent hashing)
- The Part-Time Parliament(Paxos)
- Paxos Made Simple