Orca: GC and type system co-design for actor languages

ORCA is a concurrent and parallel garbage collector for actor programs, which does not require any STW steps, or synchronization mechanisms, and that has been designed to support zero-copy message passing and sharing of mutable data. ORCA is part of a runtime for actor-based languages, which was co-...

Full description

Bibliographic Details
Published in:Proceedings of the ACM on Programming Languages
Main Authors: Clebsch, Sylvan, Franco, Juliana, Drossopoulou, Sophia, Yang, Albert Mingkun, Wrigstad, Tobias, Vitek, Jan
Other Authors: NSF, Swedish Research Council, FP7-ICT, Svenska Forskningsrådet Formas, EPSRC, Horizon 2020, ONR
Format: Article in Journal/Newspaper
Language:English
Published: Association for Computing Machinery (ACM) 2017
Subjects:
Online Access:http://dx.doi.org/10.1145/3133896
https://dl.acm.org/doi/pdf/10.1145/3133896
Description
Summary:ORCA is a concurrent and parallel garbage collector for actor programs, which does not require any STW steps, or synchronization mechanisms, and that has been designed to support zero-copy message passing and sharing of mutable data. ORCA is part of a runtime for actor-based languages, which was co-designed with the Pony programming language, and in particular, with its data race free type system. By co-designing an actor language with its runtime, it was possible to exploit certain language properties in order to optimize performance of garbage collection. Namely, ORCA relies on the guarantees of absence of race conditions in order to avoid read/write barriers, and it leverages the actor message passing, for synchronization among actors. In this paper we briefly describe Pony and its type system. We use pseudo-code in order to introduce how ORCA allocates and deallocates objects, how it shares mutable data without requiring barriers upon data mutation, and how can immutability be used to further optimize garbage collection. Moreover, we discuss the advantages of co-designing an actor language with its runtime, and we demonstrate that ORCA can be implemented in a performant and scalable way through a set of micro-benchmarks, including a comparison with other well-known collectors.