About BOSS

The BOSS'19 workshop will be held in conjunction with the

45th International Conference on
Very Large Data Bases
Los Angeles, California - August 26 - 30, 2019

Workshop Date
  August 30th, 2019

Following the success of the previous BOSS workshops co-located with VLDB since 2015, the fifth Workshop on Big Data Open Source Systems (BOSS'19) will again give a deep-dive introduction to several active, publicly available, open-source systems.

  • The systems will be presented in tutorials by experts on each system.
  • The tutorials will cover installation and non-trivial usage examples of the presented system.

The workshop will consist of tutorials. We will publish the tutorial proposals on the website and encourage the presenters to publish their tutorial resources. In previous editions, we published slides and project websites. We encourage proposers to include a short hands-on exercise that lets participants get started with the system quickly.

There will be an open call for tutorials.

Workshop Format

Program Outline

09:00 - 09:20 Intro, Flash Talks (3 min for each parallel tutorial)
09:20 - 10:30 TensorFlow Extended (TFX): A platform for end-to-end ML in production
10:30 - 11:00 Coffee Break
11:00 - 13:00 Parallel Tutorials I (1.5-2 h depending on the tutorial; all but TFX)
13:00 - 14:00 Lunch Break
14:00 - 15:30 Parallel Tutorials II (1.5-2 h depending on the tutorial; repetitions of the morning tutorials)
15:30 - 16:00 Coffee Break
16:00 - 16:30 Parallel Tutorials II Continued

Parallel Tutorials

Cloudberry for Big Data Visualization
Presenters: Sadeem Alsudais, Qiushi Bai, and Chen Li (UC Irvine, USA) slides and instructions
Related Links:  http://cloudberry.ics.uci.edu/
Massive Virtual Data Warehouses with Apache Drill
Presenters: Pattrick Holl (TU Munich, Germany) slides coming soon
Related Links: https://drill.apache.org/ http://demo.midas.science/vldb
Tuning and Programming for Data-Intensive Systems with OX and Open-Channel SSDs
Presenters: Philippe Bonnet, Ivan Luiz Picoli (IT University of Copenhagen, Denmark) Tutorial
Related Links: http://lightnvm.io/ https://github.com/DFC-OpenSource/ox-ctrl
From Zero to Hero with Apache Kudu
Presenters: Andrew Wong (Cloudera, USA) slides
Related Links: https://kudu.apache.org
Presenters: Wes McKinney (Ursa Labs & RStudio, USA) link to github
Related Links: https://arrow.apache.org/
TensorFlow Extended (TFX): A platform for end-to-end ML in production
Presenters: Neoklis Polyzotis (Google, USA)
Related Links: https://github.com/tensorflow/tfx

Workshop Organization

Workshop Chairs:

  • Pinar Tözün, IT University of Copenhagen, pito@itu.dk
  • Emanuel Zgraggen, MIT, emzg@mit.edu

Advisory Committee:
  • Tilmann Rabl, TU Berlin
  • Michael Carey, UC Irvine
  • Volker Markl, TU Berlin

Call for tutorials
There will be an open call for tutorials, announced through DBWorld and other channels.
  • Important Dates:

    Proposals for tutorials are accepted until May 15

    Accepted presenters will be notified by June 15

    To propose a tutorial, please email
    • a short abstract with a brief description of the system,
    • an outline of the planned tutorial,
    • the technology used for the hands-on tutorial,
    • a list of presenters involved,
    • and a link to the website of your system

    to bossvldb19@gmail.com
Selection Process for Tutorials
The chairs and the advisory committee will evaluate the proposals on system readiness, relevance, timeliness, and expected interest among conference participants.

Previous Editions

BOSS'15 on September 4, 2015, in conjunction with VLDB 2015

BOSS'16 on September 9, 2016, in conjunction with VLDB 2016

BOSS'17 on September 1, 2017, in conjunction with VLDB 2017

BOSS'18 on August 27, 2018, in conjunction with VLDB 2018