Data Science with Unix Power Tools

Speaker: Jeroen Janssens

Abstract

The unix command line, although invented decades ago, is an amazing environment for efficiently performing tedious but essential data science tasks. By combining small, powerful, command-line tools (like parallel, jq, and csvkit), you can quickly clean and explore your data and hack together prototypes.

This hands-on workshop is based on the book Data Science at the Command Line, written by instructor Jeroen Janssens. You'll learn how to build fast data pipelines, how to leverage R and Python at the command line, and how to quickly visualise data.

Biography

Jeroen Janssens is the founder and CEO of Data Science Workshops, which provides on-the-job training and coaching in data visualisation, machine learning, and programming.

Previously, he was an assistant professor at Jheronimus Academy of Data Science and a data scientist at Elsevier in Amsterdam and startups YPlan and Outbrain in New York City. He is the author of Data Science at the Command Line, published by O’Reilly Media.

Jeroen holds a PhD in machine learning from Tilburg University and an MSc in artificial intelligence from Maastricht University.

Twitter: @jeroenhjanssens

voorjaar 2018

2020-08-24
 
Vereniging NLUUG
info@nluug.nl
           postbus 8189
6710 AD Ede