skip to main content
10.5555/3291168.3291175acmotherconferencesArticle/Chapter ViewAbstractPublication PagesosdiConference Proceedingsconference-collections
Article

LegoOS: a disseminated, distributed OS for hardware resource disaggregation

Published: 08 October 2018 Publication History

Abstract

The monolithic server model where a server is the unit of deployment, operation, and failure is meeting its limits in the face of several recent hardware and application trends. To improve resource utilization, elasticity, heterogeneity, and failure handling in datacenters, we believe that datacenters should break monolithic servers into disaggregated, network-attached hardware components. Despite the promising benefits of hardware resource disaggregation, no existing OSes or software systems can properly manage it.
We propose a new OS model called the splitkernel to manage disaggregated systems. Splitkernel disseminates traditional OS functionalities into loosely-coupled monitors, each of which runs on and manages a hardware component. A splitkernel also performs resource allocation and failure handling of a distributed set of hardware components. Using the splitkernel model, we built LegoOS, a new OS designed for hardware resource disaggregation. LegoOS appears to users as a set of distributed servers. Internally, a user application can span multiple processor, memory, and storage hardware components. We implemented LegoOS on x86-64 and evaluated it by emulating hardware components using commodity servers. Our evaluation results show that LegoOS' performance is comparable to monolithic Linux servers, while largely improving resource packing and reducing failure rate over monolithic clusters.

References

[1]
Open Compute Project (OCP). http://www.opencompute.org/.
[2]
The CIFAR-10 dataset. https://www.cs.toronto.edu/~kriz/cifar.html.
[3]
Wikipedia dump. https://dumps.wikimedia.org/.
[4]
M. Abadi, P. Barham, J. Chen, Z. Chen, A. Davis, J. Dean, M. Devin, S. Ghemawat, G. Irving, M. Isard, M. Kudlur, J. Levenberg, R. Monga, S. Moore, D. G. Murray, B. Steiner, P. Tucker, V. Vasudevan, P. Warden, M. Wicke, Y. Yu, and X. Zheng. Tensorflow. A system for large-scale machine learning. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI '16).
[5]
S. R. Agrawal, S. Idicula, A. Raghavan, E. Vlachos, V. Govindaraju, V. Varadarajan, C. Balkesen, G. Giannikis, C. Roth, N. Agarwal, and E. Sedlar. A Many-core Architecture for In-memory Data Processing. In Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO '17).
[6]
M. K. Aguilera, N. Amit, I. Calciu, X. Deguillard, J. Gandhi, S. Novakoviû, A. Ramanathan, P. Subrahmanyam, L. Suresh, K. Tati, R. Venkatasubramanian, and M. Wei. Remote regions. a simple abstraction for remote memory. In 2018 USENIX Annual Technical Conference (ATC '18).
[7]
M. K. Aguilera, N. Amit, I. Calciu, X. Deguillard, J. Gandhi, P. Subrahmanyam, L. Suresh, K. Tati, R. Venkatasubramanian, and M. Wei. Remote memory in the age of fast networks. In Proceedings of the 2017 Symposium on Cloud Computing (SoCC '17).
[8]
J. Ahn, S. Hong, S. Yoo, O. Mutlu, and K. Choi. A scalable processing-in-memory accelerator for parallel graph processing. In Proceedings of the 42nd Annual International Symposium on Computer Architecture (ISCA '15).
[9]
J. Ahn, S. Yoo, O. Mutlu, and K. Choi. PIM-enabled Instructions. A Low-overhead, Locality-aware Processing-in-memory Architecture. In Proceedings of the 42nd Annual International Symposium on Computer Architecture (ISCA '15).
[10]
Alibaba. Alibaba Production Cluster Trace Data. https://github.com/alibaba/clusterdata.
[11]
Amazon. Amazon EC2 Elastic GPUs. https://aws.amazon.com/ec2/elastic-gpus/.
[12]
Amazon. Amazon EC2 F1 Instances. https://aws.amazon.com/ec2/instance-types/f1/.
[13]
Amazon. Amazon EC2 Root Device Volume. https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/RootDeviceStorage.html#RootDeviceStorageConcepts.
[14]
Y. Artsy, H.-Y. Chang, and R. Finkel. Interprocess communication in charlotte. IEEE Software, Jan 1987.
[15]
K. Asanovi. FireBox. A Hardware Building Block for 2020 Warehouse-Scale Computers, February 2014. Keynote talk at the 12th USENIX Conference on File and Storage Technologies (FAST '14).
[16]
A. Barak and R. Wheeler. MOSIX. An integrated unix for multiprocessor workstations. International Computer Science Institute, 1988.
[17]
A. Barbalace, M. Sadini, S. Ansary, C. Jelesnianski, A. Ravichandran, C. Kendir, A. Murray, and B. Ravindran. Popcorn. Bridging the programmability gap in heterogeneous-isa platforms. In Proceedings of the Tenth European Conference on Computer Systems (EuroSys '15).
[18]
L. A. Barroso and U. Hölzle. The case for energy-proportional computing. Computer, 40(12), December 2007.
[19]
F. Baskett, J. H. Howard, and J. T. Montague. Task Communication in DEMOS. In Proceedings of the Sixth ACM Symposium on Operating Systems Principles (SOSP '77).
[20]
A. Basu, M. D. Hill, and M. M. Swift. Reducing Memory Reference Energy with Opportunistic Virtual Caching. In Proceedings of the 39th Annual International Symposium on Computer Architecture (ISCA '12).
[21]
A. Baumann, P. Barham, P.-E. Dagand, T. Harris, R. Isaacs, S. Peter, T. Roscoe, A. Schüpbach, and A. Singhania. The Multikernel. A New OS Architecture for Scalable Multicore Systems. In Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles (SOSP '09).
[22]
C. Bienia, S. Kumar, J. P. Singh, and K. Li. The PARSEC Benchmark Suite. Characterization and Architectural Implications. In Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques (PACT '08).
[23]
M. N. Bojnordi and E. Ipek. PARDIS. A Programmable Memory Controller for the DDRx Interfacing Standards. In Proceedings of the 39th Annual International Symposium on Computer Architecture (ISCA '12).
[24]
Cache Coherent Interconnect for Accelerators, 2018. https://www.ccixconsortium.com/.
[25]
W. Cao, Z. Liu, P. Wang, S. Chen, C. Zhu, S. Zheng, Y. Wang, and G. Ma. PolarFS. an ultra-low latency and failure resilient distributed file system for shared storage cloud database. Proceedings of the VLDB Endowment (VLDB '18).
[26]
J. Chapin, M. Rosenblum, S. Devine, T. Lahiri, D. Teodosiu, and A. Gupta. Hive. Fault Containment for Shared-memory Multiprocessors. In Proceedings of the Fifteenth ACM Symposium on Operating Systems Principles (SOSP '95).
[27]
D. Cheriton. The V Distributed System. Commun. ACM, 31(3), March 1988.
[28]
I.-H. Chung, B. Abali, and P. Crumley. Towards a Composable Computer System. In Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region (HPC Asia '18).
[29]
Cisco, EMC, and Intel. The Performance Impact of NVMe and NVMe over Fabrics. http://www.snia.org/sites/default/files/NVMe_Webcast_Slides_Final.1.pdf.
[30]
P. Costa. Towards rack-scale computing. Challenges and opportunities. In First International Workshop on Rackscale Computing (WRSC '14).
[31]
P. Costa, H. Ballani, K. Razavi, and I. Kash. R2C2. A Network Stack for Rack-scale Computers. In Proceedings of the ACM SIGCOMM 2015 Conference on SIGCOMM (SIGCOMM '15).
[32]
J. Dean and S. Ghemawat. MapReduce. Simplified Data Processing on Large Clusters. In Proceedings of the 6th Conference on Symposium on Opearting Systems Design and Implementation (OSDI '04).
[33]
C. Delimitrou and C. Kozyrakis. Quasar. Resource-efficient and QoS-aware Cluster Management. In Proceedings of the 19th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '14).
[34]
Dragojeviû, Aleksandar and Narayanan, Dushyanth and Hodson, Orion and Castro, Miguel. FaRM. Fast Remote Memory. In Proceedings of the 11th USENIX Conference on Networked Systems Design and Implementation (NSDI '14).
[35]
A. Dunkels. Design and Implementation of the lwIP TCP/IP Stack. Swedish Institute of Computer Science, 2001.
[36]
K. Elphinstone and G. Heiser. From l3 to sel4 what have we learnt in 20 years of l4 microkernels? In Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles (SOSP '13).
[37]
D. R. Engler, M. F. Kaashoek, and J. O'Toole, Jr. Exokernel: An operating system architecture for application-level resource management. In Proceedings of the Fifteenth ACM Symposium on Operating Systems Principles (SOSP '95).
[38]
Facebook. Introducing Lightning. A flexible NVMe JBOF. https://code.fb.com/data-center-engineering/introducing-lightning-a-flexible-nvme-jbof/.
[39]
Facebook. Wedge 100. More open and versatile than ever. https://code.fb.com/networking-traffic/wedge-100-more-open-and-versatile-than-ever/.
[40]
P. Faraboschi, K. Keeton, T. Marsland, and D. Milojicic. Beyond Processor-centric Operating Systems. In 15th Workshop on Hot Topics in Operating Systems (HotOS '15).
[41]
P. X. Gao, A. Narayan, S. Karandikar, J. Carreira, S. Han, R. Agarwal, S. Ratnasamy, and S. Shenker. Network Requirements for Resource Disaggregation. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI '16).
[42]
GenZ Consortium. http://genzconsortium.org/.
[43]
J. E. Gonzalez, Y. Low, H. Gu, D. Bickson, and C. Guestrin. PowerGraph. Distributed Graph-Parallel Computation on Natural Graphs. In Proceedings of the 10th USENIX conference on Operating Systems Design and Implementation (OSDI '12).
[44]
J. R. Goodman. Coherency for Multiprocessor Virtual Address Caches. In Proceedings of the Second International Conference on Architectual Support for Programming Languages and Operating Systems (ASPLOS '87).
[45]
Google. Google Production Cluster Trace Data. https://github.com/google/cluster-data.
[46]
Google. GPUs are now available for Google Compute Engine and Cloud Machine Learning. https://cloudplatform.googleblog.com/2017/02/GPUs-are-now-available-for-Google-Compute-Engine-and-Cloud-Machine-Learning.html.
[47]
J. Gu, Y. Lee, Y. Zhang, M. Chowdhury, and K. Shin. Efficient Memory Disaggregation with Infiniswap. In Proceedings of the 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI '17).
[48]
Hewlett-Packard. The Machine. A New Kind of Computer. http://www.hpl.hp.com/research/systemsresearch/themachine/.
[49]
Intel. Intel Non-Volatile Memory 3D XPoint. http://www.intel.com/content/www/us/en/architecture-and-technology/non-volatile-memory.html?wapkw=3d+xpoint.
[50]
Intel. Intel Omni-Path Architecture. https://tinyurl.com/ya3x4ktd.
[51]
Intel. Intel Rack Scale Architecture. Faster Service Delivery and Lower TCO. http://www.intel.com/content/www/us/en/architecture-and-technology/intel-rack-scale-architecture.html.
[52]
Intel, 2018. https://www.intel.com/content/www/us/en/high-performance-computing-fabrics/.
[53]
JEDEC. HIGH BANDWIDTH MEMORY (HBM) DRAM JESD235A. https://www.jedec.org/standardsdocuments/docs/jesd235a.
[54]
W. John, J. Halen, X. Cai, C. Fu, T. Holmberg, V. Katardjiev, M. Sedaghat, P. Sköldström, D. Turull, V. Yadhav, and J. Kempf. Making Cloud Easy. Design Considerations and First Components of a Distributed Operating System for Cloud. In 10th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud '18).
[55]
N. P. Jouppi, C. Young, N. Patil, D. Patterson, G. Agrawal, R. Bajwa, S. Bates, S. Bhatia, N. Boden, A. Borchers, R. Boyle, P. luc Cantin, C. Chao, C. Clark, J. Coriell, M. D. Mike Daley, J. Dean, B. Gelb, T. V. Ghaemmaghami, R. Gottipati, W. Gulland, R. Hagmann, C. R. Ho, D. Hogberg, J. Hu, R. Hundt, D. Hurt, J. Ibarz, A. Jaffey, A. Jaworski, H. K. Alexander Kaplan, A. Koch, N. Kumar, S. Lacy, J. Laudon, J. Law, D. Le, C. Leary, Z. Liu, K. Lucke, A. Lundin, G. MacKean, A. Maggiore, M. Mahony, K. Miller, R. Nagarajan, R. Narayanaswami, R. Ni, K. Nix, T. Norrie, M. Omernick, N. Penukonda, A. Phelps, M. R. Jonathan Ross, A. Salek, E. Samadiani, C. Severn, G. Sizikov, M. Snelham, J. Souter, A. S. Dan Steinberg, M. Tan, G. Thorson, B. Tian, H. Toma, E. Tuttle, V. Vasudevan, R. Walter, W. Wang, E. Wilcox, and D. H. Yoon. In-Datacenter Performance Analysis of a Tensor Processing Unit. In Proceedings of the 44th Annual International Symposium on Computer Architecture (ISCA '17).
[56]
K. Katrinis, D. Syrivelis, D. Pnevmatikatos, G. Zervas, D. Theodoropoulos, I. Koutsopoulos, K. Hasharoni, D. Raho, C. Pinto, F. Espina, S. Lopez-Buedo, Q. Chen, M. Nemirovsky, D. Roca, H. Klos, and T. Berends. Rack-scale disaggregated cloud data centers. The dReDBox project vision. In 2016 Design, Automation Test in Europe Conference Exhibition (DATE '16).
[57]
A. Kaufmann, S. Peter, N. K. Sharma, T. Anderson, and A. Krishnamurthy. High Performance Packet Processing with FlexNIC. In Proceedings of the Twenty-First International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'16).
[58]
S. Kaxiras and A. Ros. A New Perspective for Efficient Virtual-cache Coherence. In Proceedings of the 40th Annual International Symposium on Computer Architecture (ISCA '13).
[59]
A. Klimovic, C. Kozyrakis, E. Thereska, B. John, and S. Kumar. Flash Storage Disaggregation. In Proceedings of the Eleventh European Conference on Computer Systems (EuroSys '16).
[60]
A. Klimovic, H. Litz, and C. Kozyrakis. ReFlex. Remote Flash ≈ Local Flash. In Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '17).
[61]
E. K. Lee and C. A. Thekkath. Petal. Distributed Virtual Disks. In Proceedings of the Seventh International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '96).
[62]
S. Legtchenko, H. Williams, K. Razavi, A. Donnelly, R. Black, A. Douglas, N. Cheriere, D. Fryer, K. Mast, A. D. Brown, A. Klimovic, A. Slowey, and A. Rowstron. Understanding Rack-Scale Disaggregated Storage. In 9th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage '17).
[63]
K. Lim, J. Chang, T. Mudge, P. Ranganathan, S. K. Reinhardt, and T. F. Wenisch. Disaggregated Memory for Expansion and Sharing in Blade Servers. In Proceedings of the 36th Annual International Symposium on Computer Architecture (ISCA '09).
[64]
K. Lim, Y. Turner, J. R. Santos, A. AuYoung, J. Chang, P. Ranganathan, and T. F. Wenisch. System-level Implications of Disaggregated Memory. In Proceedings of the 2012 IEEE 18th International Symposium on High-Performance Computer Architecture (HPCA '12).
[65]
D. Meisner, B. T. Gold, and T. F. Wenisch. The powernap server architecture. ACM Trans. Comput. Syst., February 2011.
[66]
Mellanox. ConnectX-6 Single/Dual-Port Adapter supporting 200Gb/s with VPI. http://www.mellanox.com/page/products_dyn?product_family=265&mtag=connectx_6_vpi_card.
[67]
Mellanox. Mellanox BuleField SmartNIC. http://www.mellanox.com/page/products_dyn?product_family=275&mtag=bluefield_smart_nic.
[68]
Mellanox. Mellanox Innova Adapters. http://www.mellanox.com/page/programmable_network_adapters?mtag=&mtag=programmable_adapter_cards.
[69]
Mellanox. Rdma aware networks programming user manual. http://www.mellanox.com/related-docs/prod_software/RDMA_Aware_Programming_user_manual.pdf.
[70]
J. Mickens, E. B. Nightingale, J. Elson, D. Gehring, B. Fan, A. Kadav, V. Chidambaram, O. Khan, and K. Nareddy. Blizzard. Fast, Cloud-scale Block Storage for Cloud-oblivious Applications. In 11th USENIX Symposium on Networked Systems Design and Implementation (NSDI '14).
[71]
D. Minturn. NVM Express Over Fabrics. 11th Annual OpenFabrics International OFS Developers' Workshop, March 2015.
[72]
T. P. Morgan. Enterprises Get On The Xeon Phi Roadmap. https://www.enterprisetech.com/2014/11/17/enterprises-get-xeon-phi-roadmap/.
[73]
M. Nanavati, J. Wires, and A. Warfield. Decibel. Isolation and Sharing in Disaggregated Rack-Scale Storage. In 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI '17).
[74]
J. Nelson, B. Holt, B. Myers, P. Briggs, L. Ceze, S. Kahan, and M. Oskin. Latency-Tolerant Software Distributed Shared Memory. In Proceedings of the 2015 USENIX Annual Technical Conference (ATC '15).
[75]
Netronome. Agilio SmartNICs. https://www.netronome.com/products/smartnic/overview/.
[76]
E. B. Nightingale, O. Hodson, R. McIlroy, C. Hawblitzel, and G. Hunt. Helios. Heterogeneous Multiprocessing with Satellite Kernels. In Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles (SOSP '09).
[77]
V. Nitu, B. Teabe, A. Tchana, C. Isci, and D. Hagimont. Welcome to Zombieland. Practical and Energy-efficient Memory Disaggregation in a Datacenter. In Proceedings of the Thirteenth EuroSys Conference (EuroSys '18).
[78]
S. Novakovic, A. Daglis, E. Bugnion, B. Falsafi, and B. Grot. Scale-out NUMA. In Proceedings of the 19th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'14).
[79]
S. Novakovic, A. Daglis, E. Bugnion, B. Falsafi, and B. Grot. The Case for RackOut. Scalable Data Serving Using Rack-Scale Systems. In Proceedings of the Seventh ACM Symposium on Cloud Computing (SoCC'16).
[80]
D. Ongaro, S. M. Rumble, R. Stutsman, J. Ousterhout, and M. Rosenblum. Fast Crash Recovery in RAMCloud. In Proceedings of the 23rd ACM Symposium on Operating Systems Principles (SOSP '11).
[81]
Open Coherent Accelerator Processor Interface, 2018. https://opencapi.org/.
[82]
J. K. Ousterhout, A. R. Cherenson, F. Douglis, M. N. Nelson, and B. B. Welch. The Sprite Network Operating System. Computer, 21(2), February 1988.
[83]
J. Ouyang, B. Kocoloski, J. R. Lange, and K. Pedretti. Achieving Performance Isolation with Lightweight Co-Kernels. In Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing (HPDC '15).
[84]
A. Putnam, A. M. Caulfield, E. S. Chung, D. Chiou, K. Constantinides, J. Demme, H. Esmaeilzadeh, J. Fowers, G. P. Gopal, J. Gray, M. Haselman, S. Hauck, S. Heil, A. Hormati, J.-Y. Kim, S. Lanka, J. Larus, E. Peterson, S. Pope, A. Smith, J. Thong, P. Y. Xiao, and D. Burger. A Reconfigurable Fabric for Accelerating Large-scale Datacenter Services. In Proceeding of the 41st Annual International Symposium on Computer Architecuture (ISCA '14).
[85]
C. Ranger, R. Raghuraman, A. Penmetsa, G. Bradski, and C. Kozyrakis. Evaluating MapReduce for Multicore and Multiprocessor Systems. In Proceedings of the 13th International Symposium on High Performance Computer Architecture (HPCA '07).
[86]
R. F. Rashid and G. G. Robertson. Accent. A Communication Oriented Network Operating System Kernel. In Proceedings of the Eighth ACM Symposium on Operating Systems Principles (SOSP '81).
[87]
B. M. Rogers, A. Krishna, G. B. Bell, K. Vu, X. Jiang, and Y. Solihin. Scaling the Bandwidth Wall. Challenges in and Avenues for CMP Scaling. In Proceedings of the 36th Annual International Symposium on Computer Architecture (ISCA '09).
[88]
S. M. Rumble. Infiniband Verbs Performance. https://ramcloud.atlassian.net/wiki/display/RAM/Infiniband+Verbs+Performance.
[89]
R. Sandberg. The Design and Implementation of the Sun Network File System. In Proceedings of the 1985 USENIX Summer Technical Conference, 1985.
[90]
B. Schroeder and G. A. Gibson. Disk Failures in the Real World: What Does an MTTF of 1,000,000 Hours Mean to You? In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST '07).
[91]
M. Schwarzkopf, M. P. Grosvenor, and S. Hand. New Wine in Old Skins. The Case for Distributed Operating Systems in the Data Center. In Proceedings of the 4th Asia-Pacific Workshop on Systems (APSys '13).
[92]
S. Seshadri, M. Gahagan, S. Bhaskaran, T. Bunker, A. De, Y. Jin, Y. Liu, and S. Swanson. Willow: A User-Programmable SSD. In Proceedings of the 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI '14).
[93]
Y. Shan, S.-Y. Tsai, and Y. Zhang. Distributed Shared Persistent Memory. In Proceedings of the 2017 Symposium on Cloud Computing (SoCC '17).
[94]
M. Silberstein. Accelerators in data centers: the systems perspective. https://www.sigarch.org/accelerators-in-data-centers-the-systems-perspective/.
[95]
A. J. Smith. Cache Memories. ACM Comput. Surv., 14(3), September 1982.
[96]
A. S. Tanenbaum, M. F. Kaashoek, R. Van Renesse, and H. E. Bal. The Amoeba Distributed Operating System &Mdash;a Status Report. Comput. Commun., 14(6), August 1991.
[97]
A. S. Tanenbaum, R. van Renesse, H. van Staveren, G. J. Sharp, and S. J. Mullender. Experiences with the Amoeba Distributed Operating System. Commun. ACM, 33(12), December 1990.
[98]
E. Technologies. NVMe Storage Accelerator Series. https://www.everspin.com/nvme-storage-accelerator-series.
[99]
S. Thomas, G. M. Voelker, and G. Porter. CacheCloud. Towards Speed-of-light Datacenter Communication. In 10th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud '18).
[100]
C.-C. Tsai, B. Jain, N. A. Abdul, and D. E. Porter. A Study of Modern Linux API Usage and Compatibility. What to Support when You'Re Supporting. In Proceedings of the Eleventh European Conference on Computer Systems (EuroSys '16).
[101]
S.-Y. Tsai and Y. Zhang. LITE Kernel RDMA Support for Datacenter Applications. In Proceedings of the 26th Symposium on Operating Systems Principles (SOSP '17).
[102]
A. Verma, L. Pedrosa, M. R. Korupolu, D. Oppenheimer, E. Tune, and J. Wilkes. Large-scale cluster management at Google with Borg. In Proceedings of the European Conference on Computer Systems (EuroSys '15).
[103]
VMware. Virtual SAN. https://www.vmware.com/products/vsan.html.
[104]
W. H. Wang, J.-L. Baer, and H. M. Levy. Organization and Performance of a Two-level Virtual-real Cache Hierarchy. In Proceedings of the 16th Annual International Symposium on Computer Architecture (ISCA '89).
[105]
A. Warfield, R. Ross, K. Fraser, C. Limpach, and S. Hand. Parallax. Managing Storage for a Million Machines. In Proceedings of the 10th Conference on Hot Topics in Operating Systems (HotOS '05).
[106]
D. Wentzlaff, C. Gruenwald, III, N. Beckmann, K. Modzelewski, A. Belay, L. Youseff, J. Miller, and A. Agarwal. An Operating System for Multicore and Clouds. Mechanisms and Implementation. In Proceedings of the 1st ACM Symposium on Cloud Computing (SoCC '10).
[107]
G. Zellweger, S. Gerber, K. Kourtis, and T. Roscoe. Decoupling cores, kernels, and operating systems. In Proceedings of the 11th USENIX Conference on Operating Systems Design and Implementation (OSDI '14).
[108]
Q. Zhang, G. Yu, C. Guo, Y. Dang, N. Swanson, X. Yang, R. Yao, M. Chintalapati, A. Krishnamurthy, and T. Anderson. Deepview. Virtual Disk Failure Diagnosis and Pattern Detection for Azure. In 15th USENIX Symposium on Networked Systems Design and Implementation (NSDI '18).

Cited By

View all
  1. LegoOS: a disseminated, distributed OS for hardware resource disaggregation

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    OSDI'18: Proceedings of the 13th USENIX conference on Operating Systems Design and Implementation
    October 2018
    815 pages
    ISBN:9781931971478

    Sponsors

    • NetApp
    • Google Inc.
    • NSF
    • Microsoft: Microsoft
    • Facebook: Facebook

    In-Cooperation

    Publisher

    USENIX Association

    United States

    Publication History

    Published: 08 October 2018

    Check for updates

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 16 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Towards Buffer Management with Tiered Main MemoryProceedings of the ACM on Management of Data10.1145/36392862:1(1-26)Online publication date: 26-Mar-2024
    • (2023)W-Kernel: An OS Kernel Architecture Designed With Isolation and CustomizabilityProceedings of the 2023 5th International Conference on Software Engineering and Development10.1145/3637792.3637796(42-50)Online publication date: 20-Oct-2023
    • (2023)RISA: Round-Robin Intra-Rack Friendly Scheduling Algorithm for Disaggregated DatacentersProceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis10.1145/3624062.3624228(1512-1520)Online publication date: 12-Nov-2023
    • (2023)Using Local Cache Coherence for Disaggregated Memory SystemsACM SIGOPS Operating Systems Review10.1145/3606557.360656157:1(21-28)Online publication date: 28-Jun-2023
    • (2023)Poster: Designing a Memory Disaggregation System for CloudProceedings of the ACM SIGCOMM 2023 Conference10.1145/3603269.3610854(1099-1101)Online publication date: 10-Sep-2023
    • (2023)Partial Failure Resilient Memory Management System for (CXL-based) Distributed Shared MemoryProceedings of the 29th Symposium on Operating Systems Principles10.1145/3600006.3613135(658-674)Online publication date: 23-Oct-2023
    • (2023)CPU-free Computing: A Vision with a BlueprintProceedings of the 19th Workshop on Hot Topics in Operating Systems10.1145/3593856.3595906(1-14)Online publication date: 22-Jun-2023
    • (2023)Disaggregating RocksDB: A Production ExperienceProceedings of the ACM on Management of Data10.1145/35897721:2(1-24)Online publication date: 20-Jun-2023
    • (2023)Persistent Memory Disaggregation for Cloud-Native Relational DatabasesProceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 310.1145/3582016.3582055(498-512)Online publication date: 25-Mar-2023
    • (2023)Towards an Adaptable Systems Architecture for Memory Tiering at Warehouse-ScaleProceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 310.1145/3582016.3582031(727-741)Online publication date: 25-Mar-2023
    • Show More Cited By

    View Options

    View options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media