research-article

Boosting Inter-process Communication with Architectural Support

Authors:
Yubin Xia

Shanghai Key Laboratory of Scalable Computing and Systems, Shanghai Jiao TongUniversity, China and Institute of Parallel and Distributed Systems, Shanghai Jiao Tong University, China and Shanghai AI Laboratory, Shanghai, China

Shanghai Key Laboratory of Scalable Computing and Systems, Shanghai Jiao TongUniversity, China and Institute of Parallel and Distributed Systems, Shanghai Jiao Tong University, China and Shanghai AI Laboratory, Shanghai, China

0000-0001-6558-5298
View Profile

,
Dong Du

Shanghai Key Laboratory of Scalable Computing and Systems, Shanghai Jiao TongUniversity, China and Institute of Parallel and Distributed Systems, Shanghai Jiao Tong University, China and Shanghai AI Laboratory, Shanghai, China

Shanghai Key Laboratory of Scalable Computing and Systems, Shanghai Jiao TongUniversity, China and Institute of Parallel and Distributed Systems, Shanghai Jiao Tong University, China and Shanghai AI Laboratory, Shanghai, China

0000-0002-7945-8430
View Profile

,
Zhichao Hua

Shanghai Key Laboratory of Scalable Computing and Systems, Shanghai Jiao TongUniversity, China and Institute of Parallel and Distributed Systems, Shanghai Jiao Tong University, China and Shanghai AI Laboratory, Shanghai, China

Shanghai Key Laboratory of Scalable Computing and Systems, Shanghai Jiao TongUniversity, China and Institute of Parallel and Distributed Systems, Shanghai Jiao Tong University, China and Shanghai AI Laboratory, Shanghai, China

0000-0002-2211-9120
View Profile

,
Binyu Zang

Shanghai Key Laboratory of Scalable Computing and Systems, Shanghai Jiao TongUniversity, China and Institute of Parallel and Distributed Systems, Shanghai Jiao Tong University, China and Shanghai AI Laboratory, Shanghai, China

Shanghai Key Laboratory of Scalable Computing and Systems, Shanghai Jiao TongUniversity, China and Institute of Parallel and Distributed Systems, Shanghai Jiao Tong University, China and Shanghai AI Laboratory, Shanghai, China

0000-0002-1968-7645
View Profile

,
Haibo Chen

Shanghai Key Laboratory of Scalable Computing and Systems, Shanghai Jiao TongUniversity, China and Institute of Parallel and Distributed Systems, Shanghai Jiao Tong University, China and Shanghai AI Laboratory, Shanghai, China

Shanghai Key Laboratory of Scalable Computing and Systems, Shanghai Jiao TongUniversity, China and Institute of Parallel and Distributed Systems, Shanghai Jiao Tong University, China and Shanghai AI Laboratory, Shanghai, China

0000-0002-9720-0361
View Profile

,
Haibing Guan

Shanghai Key Laboratory of Scalable Computing and Systems, Shanghai Jiao TongUniversity, China and Institute of Parallel and Distributed Systems, Shanghai Jiao Tong University, China and Shanghai AI Laboratory, Shanghai, China

Shanghai Key Laboratory of Scalable Computing and Systems, Shanghai Jiao TongUniversity, China and Institute of Parallel and Distributed Systems, Shanghai Jiao Tong University, China and Shanghai AI Laboratory, Shanghai, China

0000-0002-4714-7400
View Profile

Authors Info & Claims

ACM Transactions on Computer Systems Volume 39 Issue 1-4Article No.: 6pp 1–35https://doi.org/10.1145/3532861

Published:05 July 2022Publication History

ACM Transactions on Computer Systems

Abstract

IPC (inter-process communication) is a critical mechanism for modern OSes, including not only microkernels such as seL4, QNX, and Fuchsia where system functionalities are deployed in user-level processes, but also monolithic kernels like Android where apps frequently communicate with plenty of user-level services. However, existing IPC mechanisms still suffer from long latency. Previous software optimizations of IPC usually cannot bypass the kernel that is responsible for domain switching and message copying/remapping across different address spaces; hardware solutions such as tagged memory or capability replace page tables for isolation, but usually require non-trivial modification to existing software stack to adapt to the new hardware primitives. In this article, we propose a hardware-assisted OS primitive, XPC (Cross Process Call), for efficient and secure synchronous IPC. XPC enables direct switch between IPC caller and callee without trapping into the kernel and supports secure message passing across multiple processes without copying. We have implemented a prototype of XPC based on the ARM AArch64 with Gem5 simulator and RISC-V architecture with FPGA boards. The evaluation shows that XPC can reduce IPC call latency from 664 to 21 cycles, 14×–123× improvement on Android Binder (ARM), and improve the performance of real-world applications on microkernels by 1.6× on Sqlite3.

REFERENCES

[1] Arm Inc. 2018. Arm System Modeling Research Enablement Kit. Retrieved from https://developer.arm.com/research/research-enablement/system-modeling.Google Scholar
[2] Google Inc. 2018. Fuchsia. Retrieved from https://fuchsia.googlesource.com/zircon.Google Scholar
[3] Intel Inc. 2018. An Introduction to the Intel QuickPath Interconnect. Retrieved from https://www.intel.de/content/dam/doc/white-paper/quick-path-interconnect-introduction-paper.pdf.Google Scholar
[4] lowRISC Project. 2018. lowRISC. Retrieved from https://www.lowrisc.org/.Google Scholar
[5] lwIP Project. 2018. lwIP. Retrieved from https://savannah.nongnu.org/projects/lwip/. Referenced May 2018.Google Scholar
[6] seL4 Project. 2018. seL4 Benchmark. Retrieved from https://sel4.systems/About/Performance.Google Scholar
[7] SQLite Project. 2018. SQLite. Retrieved from https://www.sqlite.org/index.html. Referenced May 2018.Google Scholar
[8] Xilinx Inc. 2018. Vivado Design Suite. Retrieved from https://www.xilinx.com/products/design-tools/vivado.html. Referenced August 2018.Google Scholar
[9] John Stultz. 2019. Anonymous shared memory (ashmem) subsystem [LWN.net]. Retrieved from https://lwn.net/Articles/452035/.Google Scholar
[10] Dianne Hackborn. 2019. LKML: Dianne Hackborn: Re: [PATCH 1/6] staging: android: binder: Remove some funny usage. Retrieved from https://lkml.org/lkml/2009/6/25/3.Google Scholar
[11] Common Weakness Enumeration. 2021. CWE-367: Time-of-check Time-of-use (TOCTOU) Race Condition (4.5). Retrieved from https://cwe.mitre.org/data/definitions/367.html.Google Scholar
[12] Fiasco.OC Project. 2021. The Fiasco microkernel - Overview. Retrieved from https://os.inf.tu-dresden.de/fiasco/. Referenced Oct. 2021.Google Scholar
[13] Barrelfish Project. 2021. Message Notifications, Barrelfish Technical Note 9. http://www.barrelfish.org/publications/TN-009-Notifications.pdf.Google Scholar
[14] SiFive Inc. 2021. SiFive. Retrieved from https://www.sifive.com/.Google Scholar
[15] Sohil Mehta. 2021. User Interrupts: A faster way to signal. Retrieved from https://linuxplumbersconf.org/event/11/contributions/985/attachments/756/1417/User_Interrupts_LPC_2021.pdf.Google Scholar
[16] seL4 Project. 2022. seL4 Dynamic Libraries: IPC. Retrieved from https://docs.sel4.systems/Tutorials/dynamic-2.html.Google Scholar
[17] Amit Nadav, Tai Amy, and Wei Michael. 2020. Don’t shoot down TLB shootdowns! In EuroSys’20). Association for Computing Machinery. DOI:Google ScholarDigital Library
[18] Asanovic Krste, Avizienis Rimas, Bachrach Jonathan, Beamer Scott, Biancolin David, Celio Christopher, Cook Henry, Dabbelt Daniel, Hauser John, Izraelevitz Adam, et al. 2016. The Rocket Chip Generator. Technical Report. EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2016-17.Google Scholar
[19] Asmussen Nils, Völp Marcus, Nöthen Benedikt, Härtig Hermann, and Fettweis Gerhard. 2016. M3: A hardware/operating-system co-design to tame heterogeneous manycores. In ASPLOS. ACM, New York, NY.Google Scholar
[20] Baumann Andrew, Barham Paul, Dagand Pierre-Evariste, Harris Tim, Isaacs Rebecca, Peter Simon, Roscoe Timothy, Schüpbach Adrian, and Singhania Akhilesh. 2009. The multikernel: A new OS architecture for scalable multicore systems. In ACM SIGOPS.Google Scholar
[21] Bershad Brian N., Anderson Thomas E., Lazowska Edward D., and Levy Henry M.. 1990. Lightweight remote procedure call. ACM Trans. Comput. Syst. 8, 1 (1990), 37–55.Google ScholarDigital Library
[22] Bershad Brian N., Anderson Thomas E., Lazowska Edward D., and Levy Henry M.. 1991. User-level interprocess communication for shared memory multiprocessors. ACM Trans. Comput. Syst. 9, 2 (1991), 175–198.Google ScholarDigital Library
[23] Binkert Nathan, Beckmann Bradford, Black Gabriel, Reinhardt Steven K., Saidi Ali, Basu Arkaprava, Hestness Joel, Hower Derek R., Krishna Tushar, Sardashti Somayeh, Sen Rathijit, Sewell Korey, Shoaib Muhammad, Vaish Nilay, Hill Mark D., and Wood David A.. 2011. The Gem5 simulator. SIGARCH Comput. Archit. News 39, 2 (Aug. 2011), 1–7.Google ScholarDigital Library
[24] Carter Nicholas P., Keckler Stephen W., and Dally William J.. 1994. Hardware support for fast capability-based addressing. In ACM SIGPLAN Notices. ACM.Google Scholar
[25] Chase Jeffrey S., Levy Henry M., Feeley Michael J., and Lazowska Edward D.. 1994. Sharing and protection in a single-address-space operating system. ACM Trans. Comput. Syst. 12, 4 (1994).Google ScholarDigital Library
[26] Chen Haogang, Ziegler Daniel, Chajed Tej, Chlipala Adam, Kaashoek M. Frans, and Zeldovich Nickolai. 2015. Using crash Hoare logic for certifying the FSCQ file system. In SOSP.Google Scholar
[27] Clark Raymond K., Jensen E. Douglas, and Reynolds Franklin D.. 1992. An architectural overview of the Alpha real-time distributed kernel. In USENIX Workshop on Microkernels and other Kernel Architectures.Google Scholar
[28] David Francis M., Chan Ellick, Carlyle Jeffrey C., and Campbell Roy H.. 2008. CuriOS: Improving reliability through operating system structure. In OSDI.Google Scholar
[29] Elphinstone Kevin and Heiser Gernot. 2013. From L3 to seL4 what have we learnt in 20 years of L4 microkernels? In SOSP.Google Scholar
[30] Engler D. R., Kaashoek M. F., and Jr. J. O’Toole,1995. Exokernel: An operating system architecture for application-level resource management. In SOSP’95. ACM, New York, NY.Google Scholar
[31] Ford Bryan and Lepreau Jay. 1994. Evolving Mach 3.0 to a migrating thread model. In USENIX Winter.Google Scholar
[32] Gamsa Benjamin, Krieger Orran, Appavoo Jonathan, and Stumm Michael. 1999. Tornado: Maximizing locality and concurrency in a shared memory multiprocessor operating system. In OSDI, Vol. 99. 87–100.Google Scholar
[33] Härtig Hermann, Hohmuth Michael, Liedtke Jochen, Wolter Jean, and Schönberg Sebastian. 1997. The performance of \( \mu \)-kernel-based systems. In ACM SIGOPS Operating Systems Review, Vol. 31. ACM.Google Scholar
[34] Klein Gerwin, Elphinstone Kevin, Heiser Gernot, Andronick June, Cock David, Derrin Philip, Elkaduwe Dhammika, Engelhardt Kai, Kolanski Rafal, Norrish Michael, et al. 2009. seL4: Formal verification of an OS kernel. In ACM SIGOPS.Google Scholar
[35] Koldinger Eric J., Chase Jeffrey S., and Eggers Susan J.. 1992. Architecture Support for Single Address Space Operating Systems. Vol. 27. ACM.Google ScholarDigital Library
[36] Lee Sanghoon, Tiwari Devesh, Solihin Yan, and Tuck James. 2011. HAQu: Hardware-accelerated queueing for fine-grained threading on a chip multiprocessor. In HPCA.Google Scholar
[37] Levy Henry M.. 1984. Capability-based Computer Systems. Digital Press.Google ScholarDigital Library
[38] Li Wenhao, Xia Yubin, Chen Haibo, Zang Binyu, and Guan Haibing. 2015. Reducing world switches in virtualized environment with flexible cross-world calls. In ISCA.Google Scholar
[39] Liedtke Jochen. 1993. Improving IPC by kernel design. ACM SIGOPS Oper. Syst. Rev. 27, 5 (Dec. 1993), 175–188.Google ScholarDigital Library
[40] Liedtke Jochen. 1993. A persistent system in real use-experiences of the first 13 years. In 3rd International Workshop on Object Orientation in Operating Systems. IEEE.Google Scholar
[41] Liedtke Jochen. 1995. On Micro-kernel Construction. Vol. 29. ACM.Google ScholarDigital Library
[42] Liedtke Jochen, Elphinstone Kevin, Schonberg Sebastian, Hartig Hermann, Heiser Gernot, Islam Nayeem, and Jaeger Trent. 1997. Achieved IPC performance (still the foundation for extensibility). In 6th Workshop on Hot Topics in Operating Systems. IEEE.Google Scholar
[43] Lyons Anna, McLeod Kent, Almatary Hesham, and Heiser Gernot. 2018. Scheduling-context capabilities: A principled, light-weight operating-system mechanism for managing time. In 13th EuroSys Conference. ACM.Google ScholarDigital Library
[44] Markuze Alex, Smolyar Igor, Morrison Adam, and Tsafrir Dan. 2018. DAMN: Overhead-free IOMMU protection for networking. In 23rd International Conference on Architectural Support for Programming Languages and Operating Systems. ACM.Google ScholarDigital Library
[45] McVoy Larry W., Staelin Carl, et al. 1996. LMbench: Portable tools for performance analysis. In USENIX. 279–294.Google Scholar
[46] Mi Zeyu, Li Dingji, Yang Zihan, Wang Xinran, and Chen Haibo. 2019. SkyBridge: Fast and secure inter-process communication for microkernels. In EuroSys. ACM.Google Scholar
[47] Min Changwoo, Kang Woonhak, Kumar Mohan, Kashyap Sanidhya, Maass Steffen, Jo Heeseung, and Kim Taesoo. 2018. Solros: A data-centric operating system architecture for heterogeneous computing. In EuroSys. ACM.Google Scholar
[48] Narayanan Vikram, Balasubramanian Abhiram, Jacobsen Charlie, Spall Sarah, Bauer Scott, Quigley Michael, Hussain Aftab, Younis Abdullah, Shen Junjie, Bhattacharyya Moinak, and Burtsev Anton. 2019. LXDs: Towards isolation of kernel subsystems. In USENIX. USENIX Association, 269–284. Retrieved from https://www.usenix.org/conference/atc19/presentation/narayanan.Google Scholar
[49] Narayanan Vikram, Huang Yongzhe, Tan Gang, Jaeger Trent, and Burtsev Anton. 2020. Lightweight kernel isolation with virtualization and VM functions. In VEE’20. Association for Computing Machinery, New York, NY, 157–171. DOI:Google ScholarDigital Library
[50] Park Soyeon, Lee Sangho, Xu Wen, Moon HyunGon, and Kim Taesoo. 2019. libmpk: Software abstraction for Intel memory protection keys (Intel MPK). In USENIX. USENIX Association, 241–254. Retrieved from https://www.usenix.org/conference/atc19/presentation/park-soyeon.Google Scholar
[51] Saltzer Jerome H.. 1974. Protection and the control of information sharing in multics. Commun. ACM 17, 7 (1974), 388–402.Google ScholarDigital Library
[52] Shapiro Jonathan S., Smith Jonathan M., and Farber David J.. 1999. EROS: A fast capability system. Vol. 33. ACM.Google Scholar
[53] Steinberg Udo and Kauer Bernhard. 2010. NOVA: A microhypervisor-based secure virtualization architecture. In EUROSYS.Google Scholar
[54] Tsafrir Dan. 2007. The context-switch overhead inflicted by hardware interrupts (and the enigma of do-nothing loops). In ExpCS. ACM.Google Scholar
[55] Vilanova Lluïs, Ben-Yehuda Muli, Navarro Nacho, Etsion Yoav, and Valero Mateo. 2014. CODOMs: Protecting software with code-centric memory domains. In ACM SIGARCH Computer Architecture News. IEEE Press.Google Scholar
[56] Vilanova Lluís, Jordà Marc, Navarro Nacho, Etsion Yoav, and Valero Mateo. 2017. Direct inter-process communication (dIPC): Repurposing the CODOMs architecture to accelerate IPC. In EuroSys. ACM.Google Scholar
[57] Waterman Andrew, Lee Yunsup, Patterson David A., and Asanovi Krste. 2014. The RISC-V Instruction Set Manual. Volume 1: User-Level ISA, Version 2.0. Technical Report. California University Berkeley Department of Electrical Engineering and Computer Sciences.Google ScholarCross Ref
[58] Watson Robert N. M., Laurie Ben, et al. 2015. Cheri: A hybrid capability-system architecture for scalable software compartmentalization. In SP. IEEE.Google Scholar
[59] Watson Robert N. M., Norton Robert M., Woodruff Jonathan, Moore Simon W., Neumann Peter G., Anderson Jonathan, Chisnall David, Davis Brooks, Laurie Ben, Roe Michael, et al. 2016. Fast protection-domain crossing in the CHERI capability-system architecture. IEEE Micro 36, 5 (2016), 38–49.Google ScholarDigital Library
[60] Witchel Emmett, Cates Josh, and Asanović Krste. 2002. Mondrian memory protection. In ASPLOS. ACM, New York, NY.Google Scholar
[61] Witchel Emmett, Rhee Junghwan, and Asanović Krste. 2005. Mondrix: Memory isolation for Linux using Mondriaan memory protection. In SOSP. ACM.Google Scholar

Index Terms

Boosting Inter-process Communication with Architectural Support
1. Computer systems organization
  1. Architectures
2. Software and its engineering
  1. Software organization and properties
    1. Contextual software domains
      1. Operating systems

Recommendations

A Design to Adapt Microkernel Inter-process Communication Mechanism
ICIIP '18: Proceedings of the 3rd International Conference on Intelligent Information Processing

In order to improve the efficiency of inter-process communication among microkernel operating systems, this paper proposes an inter-process communication mechanism for microkernels. The mechanism runs IPC (Inter-Process Communication) as a set of ...
Read More
XPC: architectural support for secure and efficient cross process call
ISCA '19: Proceedings of the 46th International Symposium on Computer Architecture

Microkernel has many intriguing features like security, fault-tolerance, modularity and customizability, which recently stimulate a resurgent interest in both academia and industry (including seL4, QNX and Google's Fuchsia OS). However, IPC (inter-...
Read More
Micro-CLK: returning to the asynchronicity with communication-less microkernel
APSys '21: Proceedings of the 12th ACM SIGOPS Asia-Pacific Workshop on Systems

Inter-process communication (IPC) has always been the "Achilles heel" of microkernels, determining their overall performance. The entire history of microkernel development is tightly coupled to the debates about IPC, its efficiency, and the bottleneck ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM Transactions on Computer Systems Volume 39, Issue 1-4
November 2021
216 pages
ISSN:0734-2071
EISSN:1557-7333
DOI:10.1145/3543986
Editor:
Michael Swift
University of Wisconsin, USA
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 5 July 2022
- Online AM: 9 May 2022
- Revised: 1 April 2022
- Accepted: 1 April 2022
- Received: 1 February 2021
Published in tocs Volume 39, Issue 1-4

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Operating system
microkernel
inter-process communication
hardware-software co-design
Qualifiers
- research-article
- Refereed
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 1,962
  Total Downloads
- Downloads (Last 12 months)606
- Downloads (Last 6 weeks)48
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

View Full Text

HTML Format

View this article in HTML Format .

View HTML Format

Boosting Inter-process Communication with Architectural Support

ACM Transactions on Computer Systems

Abstract

REFERENCES

Cited By

Index Terms

Recommendations

A Design to Adapt Microkernel Inter-process Communication Mechanism

XPC: architectural support for secure and efficient cross process call

Micro-CLK: returning to the asynchronicity with communication-less microkernel

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Full Text

HTML Format

Caption

Boosting Inter-process Communication with Architectural Support

ACM Transactions on Computer Systems

Abstract

REFERENCES

Cited By

Index Terms

Recommendations

A Design to Adapt Microkernel Inter-process Communication Mechanism

XPC: architectural support for secure and efficient cross process call

Micro-CLK: returning to the asynchronicity with communication-less microkernel

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Full Text

HTML Format

Share this Publication link

Share on Social Media