Data is not always useful, no matter how much of it you have.
There’s no mathematical tool to tell you if your hypothesis is true; you can only see whether it is consistent with the data, and if the data is sparse or unclear, your conclusions are uncertain.
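As a rough illustration of that point, the sketch below estimates a mean from a sparse sample and from a large one and compares how wide the resulting interval is. Everything in it is an assumption made for the example: the helper name `mean_confidence_interval`, the simulated data, the "true" effect of 0.3, and the normal approximation (for a sample of 10, a t-based interval would be more appropriate).

```python
import math
import random
import statistics


def mean_confidence_interval(sample, z=1.96):
    """Approximate 95% interval for the mean (normal approximation)."""
    mean = statistics.mean(sample)
    # Standard error shrinks with the square root of the sample size.
    std_err = statistics.stdev(sample) / math.sqrt(len(sample))
    return mean - z * std_err, mean + z * std_err


rng = random.Random(0)   # fixed seed so the run is repeatable
true_effect = 0.3        # hypothetical value, used only to simulate data

sparse = [rng.gauss(true_effect, 1.0) for _ in range(10)]
plentiful = [rng.gauss(true_effect, 1.0) for _ in range(10_000)]

for label, sample in [("sparse", sparse), ("plentiful", plentiful)]:
    low, high = mean_confidence_interval(sample)
    print(f"{label:>9}: n={len(sample):>5}  interval=({low:.3f}, {high:.3f})")
```

Neither interval proves the hypothesis "the effect is positive"; each only shows which values are consistent with the data, and the sparse sample leaves a much wider range open.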
Cookiecutter Data Science - Logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Pachyderm - Reproducible Data Science at Scale.
Reflow - Language and runtime for distributed, incremental data processing in the cloud.
Virgilio - Mentor for Data Science E-Learning.
Awesome Data Science with Python - Curated list of Python resources for data science.
nteract - Interactive computing suite for you.
Pandas - Powerful Python data analysis toolkit.
Datasette - Tool for exploring and publishing data.
Weld - High-performance runtime for data analytics applications.
Vaex - Out-of-core DataFrames for Python, to visualize and explore big tabular data at a billion rows per second.
Ibis - Python data analysis framework for Hadoop and SQL engines.
Kyso - Data analytics knowledge hub.
Feather - Fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow.
ROOT system - Provides a set of OO frameworks with all the functionality needed to handle and analyze large amounts of data in a very efficient way.