This session covers the need for NoSQL and its evolution, its fundamentals, and how NoSQL systems compare with RDBMS systems. We will examine the practical relevance of NoSQL through a real problem statement and its NoSQL-based solution. The session will also briefly survey the main types of NoSQL solutions and their implementations.
Hadoop and the MapReduce paradigm make it easy to write parallel data-processing programs. However, many applications require a number of MapReduce jobs that join, clean, aggregate, and analyze large volumes of data. Such a set of connected jobs forms a pipeline. Programming and managing these pipelines can be tricky, and the effort involved can become a major impediment to developer productivity.
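
To make the idea of a pipeline concrete, here is a minimal in-memory sketch in Python, not Hadoop code. It assumes a toy input of `user,action` log lines, and every name in it (`run_job`, `clean_mapper`, `sum_reducer`, `user_mapper`, `run_pipeline`) is hypothetical and chosen only for illustration. The first job cleans the input and counts events per user and action; the second job consumes the first job's output and aggregates per user.

```python
from collections import defaultdict


def run_job(records, mapper, reducer):
    """Run one toy map-reduce job in memory: map, group by key, reduce."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)
    output = []
    for key in sorted(groups):  # sorted keys stand in for the shuffle/sort phase
        output.extend(reducer(key, groups[key]))
    return output


def clean_mapper(line):
    """Job 1 map: normalize a 'user,action' line and drop malformed ones."""
    parts = line.strip().split(",")
    if len(parts) != 2 or not parts[0]:
        return  # malformed line: emit nothing
    user, action = parts
    yield (user.lower(), action.lower()), 1


def sum_reducer(key, values):
    """Shared reduce step: sum all values seen for a key."""
    yield key, sum(values)


def user_mapper(record):
    """Job 2 map: re-key job 1 output by user only."""
    (user, _action), count = record
    yield user, count


def run_pipeline(lines):
    """Chain the two jobs: the output of job 1 is the input of job 2."""
    per_user_action = run_job(lines, clean_mapper, sum_reducer)
    return run_job(per_user_action, user_mapper, sum_reducer)


raw = ["Alice,click", "alice,view", "bob,click", "bad line", "ALICE,click"]
totals = run_pipeline(raw)
```

Even in this small sketch, the second job depends on the exact record shape the first job emits. That kind of coupling between stages is part of what makes larger pipelines hard to write and maintain.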