Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleSeptember 2017
PBSE: a robust path-based speculative execution for degraded-network tail tolerance in data-parallel frameworks
- Riza O. Suminto,
- Cesar A. Stuardo,
- Alexandra Clark,
- Huan Ke,
- Tanakorn Leesatapornwongsa,
- Bo Fu,
- Daniar H. Kurniawan,
- Vincentius Martin,
- Maheswara Rao G. Uma,
- Haryadi S. Gunawi
SoCC '17: Proceedings of the 2017 Symposium on Cloud ComputingPages 295–308https://doi.org/10.1145/3127479.3131622We reveal loopholes of Speculative Execution (SE) implementations under a unique fault model: node-level network throughput degradation. This problem appears in many data-parallel frameworks such as Hadoop MapReduce and Spark. To address this, we ...
- research-articleMay 2017
Scalability Bugs: When 100-Node Testing is Not Enough
- Tanakorn Leesatapornwongsa,
- Cesar A. Stuardo,
- Riza O. Suminto,
- Huan Ke,
- Jeffrey F. Lukman,
- Haryadi S. Gunawi
HotOS '17: Proceedings of the 16th Workshop on Hot Topics in Operating SystemsPages 24–29https://doi.org/10.1145/3102980.3102985We highlight the problem of scalability bugs, a new class of bugs that appear in "cloud-scale" distributed systems. Scalability bugs are latent bugs that are cluster-scale dependent, whose symptoms typically surface in large-scale deployments, but not ...