DATA SCIENTIST VERSUS
BIG DATA COMPARING THE 2 TYPES OF SOFTWARE COURSES
Ensure end to end the flow of data lake architecture, starting from data loading till presentation to enduser.
We should have knowledge of the entire flow, including business rules, current organization business track and user-friendly presentation for an end-user. Data Scientist normally has an idea of all the technologies or processing tools like Hive, Map Reduce, R, Spark or the related technologies or tools.
Less costly since MapReduce model
Ensure huge data loading smoothly and fetching those data for preparing big data dictionary which can be easily used for presenting end-use by applying business rules. Should have knowledge of huge data loading smoothly from various sources, and fetching data as quickly as possible without any mistake.
Those guys have clear ideas on data loading and data fetching related technologies or tools. There normally experts on Hive, Spark, MapReduce, Pig, Cassandra, etc.
Costlier than Hadoop since it has an in-