Start learning now:
OpenDS4All/tree/master/opends4all-resources/opends4all-data-wrangling-and-integration
ODPI has officially announced this recently and it looks great.
There are a ton of amazing materials, including slides, notes, documentation, homework, exercises and Jupyter notebooks covering Data Wrangling, Data Science, the Basics and Apache Spark.
This "starter set" of training materials can help you build a Data Science program for yourself, your company, your university or your non-profit. I am going to bring some of these to my meetups and hope to give back with new materials, updates and suggestions.
These are college-level materials developed by the University of Pennsylvania and open sourced through the ODPI, with IBM leading. The code and slides look great. I can see these helping the world add another million desperately needed Data Scientists, Data Engineers and Data Science Enabled professionals.
I have been running some of this in Cloudera Machine Learning on my CDP cluster in AWS, and it works great. This is really well made. I am hoping to create a module on Streaming Data Science to contribute.