RumbleML, a declarative machine learning framework
Can Berker Cikis,
17 June 2020
This piece introduces the need for and the benefits of RumbleML framework. Particularly, this framework addresses the shortcomings of contemporary ML frameworks of iterative nature by utilizing declarative paradigms instead.
Continue reading
Building an inverted index on a large text collection with JSONiq
Ghislain Fourny,
21 August 2019
We show how JSONiq can be used not only to manipulate JSON input, but also to build a standard inverted index on and query a text collection.
Continue reading
Rumble, an engine to run JSONiq on top of Spark
Ghislain Fourny,
06 June 2019
This is an introduction and motivation to the Rumble engine, in particular, how it addresses the limitations of dataframes and Spark SQL when the dataset is heterogeneous and nested.
Continue reading
The design and implementation of a lock-free ring-buffer with contiguous reservations
Andrea Lattuada (@utaal) and James Munns (@bitshiftmask),
03 June 2019
This is the story of how James Munns and Andrea Lattuada designed and implemented (two versions!) of an high-perf lock-free ring-buffer for cross-thread communication. If any of those words look scary to you, don't fret, we'll explain everything from the basics.
Continue reading
Academics Should Build Their Own Computers to Advance Systems Research
Timothy Roscoe,
13 May 2019
Mothy was invited to write a post
for the ACM SIGARCH blog, and decided to talk about building hardware
designed specifically for system software research (as opposed to
run commercial workloads). You can check it out here:
https://www.sigarch.org/academics-should-build-their-own-computers-to-advance-systems-research/
Continue reading
A fork() in the road
Timothy Roscoe,
20 April 2019
Andrew Baumann,
Jonathan Appavoo,
Orran Krieger, and
I
have written a paper for
Hot Topics in Operating Systems next month about the Unix fork()
system call.
Continue reading
String interning and beyond, in differential dataflow
Frank McSherry (@frankmcsherry),
10 December 2018
Differential dataflow does a great number of interesting bits of data processing, but what about when you want to use complicated types, like strings? In this post we’ll check out how to use differential dataflow to intern strings, replacing them with integer identifiers that will allow the rest of our computation to execute more efficiently. From there, we’ll see how this generalizes to automatically assigning distinct record identifiers to collection elements, much like a database does!
Continue reading
Physical Adressing on Real Hardware in Isabelle/HOL
Lukas Humbel (home),
09 November 2018
Modern memory systems are much more complicated than the traditionally assumed
virtual and physical address space separation. We explain in this post which
effects can not solely expressed by the basic model and are important for
correct function of operating systems. We summarize our recent paper.
In this work we present a theory for addressing in such modern memory
subsystems. We formalize the theory in Isabelle/HOL.
Continue reading
A hammer you can only hold by the handle
Andrea Lattuada (@utaal),
05 November 2018
Today we’re looking at the rust borrow checker from a different perspective. As you may know, the borrow checker is designed to safely handle memory allocation and ownership, preventing accessess to invalid memory and ensuring data-race freedom. This is a form of resource management: the borrow checker is tracking who’s in charge of a chunk of memory, and who is currently allowed to read or write to it. In this post, we’ll see how these facilities can be used to enforce higher-level API constraints in your libraries and software. Once you’re familiar with these techniques, we’ll cover how the same principles apply to advanced memory management and handling of other more abstract resources.
Continue reading