Welcome to the Virtual Data Lake documentation!#
The Virtual Data Lake (VDL) is a software that is based on Cranmera, a proprietary privacy engine developed by our cryptography team. Cranmera was named after the Cranmer abacus for blind people, and it powers the VDL. To interact with the VDL, we have developed a Python package called crandas. This package allows users to encrypt and upload data to the VDL, perform privacy-preserving data analyses over the encrypted data, and obtain the computational results. It offers a seamless way to perform secure computations on private data while maintaining a familiar pandas-like syntax. Users can leverage the power of multi-party computation (MPC) tools for data analysis without having to worry about the cryptography.
If you are new to crandas, the Getting Started guide will walk you through the process of installing the crandas library, creating and manipulating tables. This guide also provides information on MPC and some design choices in crandas. For more in-depth information about functionalities, consult the User Guide. Additionally, you can go through some Tutorials to familiarize yourself with the different features present in crandas.
Getting started
User Guide
Reference
Tutorials
Download the zip file for tutorial data and notebooks (Jupyter).