The amount of information data scientists need to curate, organize and process can often seem insurmountable, especially given the increasing volume of data sets being generated by sensors, devices and users.