Data is not always useful and it doesn't matter how much of it you have.
There’s no mathematical tool to tell you if your hypothesis is true; you can only see whether it is consistent with the data, and if the data is sparse or unclear, your conclusions are uncertain.
Cookiecutter Data Science - Logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
Pachyderm - Reproducible Data Science at Scale.
Reflow - Language and runtime for distributed, incremental data processing in the cloud.
Virgilio - Mentor for Data Science E-Learning.
Awesome Data Science with Python - Curated list of Python resources for data science.
nteract - Interactive computing suite for you.
Pandas - Powerful Python data analysis toolkit.
Datasette - Tool for exploring and publishing data.
Vaex - Out-of-Core DataFrames for Python, visualize and explore big tabular data at a billion rows per second.