Though Apache Hadoop and related projects are driving big data innovation, a significant opportunity remains to realize the full potential business impact of open source big data solutions. There are significant challenges to delivering Hadoop applications:
• Specialized skills are required for using Hadoop preventing most developers from effectively building solutions • Moving from proof-of-concept to production is difficult and can take months or quarters • No clear separation between business logic and infrastructure APIs increases application complexity and total cost of ownership • Data ingestion in realtime and batch currently involves piecing together multiple different technologies • Traditional concepts and tools for application life cycle management have not been available
The Cask Data Application Platform (CDAP) (http://cdap.io) is an open source platform for quickly building, deploying, and managing Hadoop solutions (such as ETL, IoT, analytics applications, and closed loop applications). Its built-in capabilities for data ingestion, building complex data access patterns on various storage systems and the mixing of different processing paradigms, lets developers and data scientists be productive building solutions in minutes. It supports high-level concepts and abstractions that hide infrastructure complexity and enable reusability. CDAP provides greater reliability and predictability to Hadoop solutions with testing frameworks and developer/devops tools.
In this presentation, Nitin Motgi, co-founder and CTO of Cask Data, will share ideas and use-cases behind the designing and building of CDAP, its architecture, core concepts, and building blocks, and demonstrate the platform. He will also talk about specific customer use-cases and how they have benefited by adopting this open source technology.
About the Speaker
Nitin Motgi is Founder and CTO of Cask, where he is responsible for developing the company's long-term technology, driving company engineering initiatives and collaboration.
Prior to Cask, Nitin was at Yahoo! working on a large-scale content optimization system externally known as C.O.R.E.
Prior to Yahoo!, Nitin led the development of large-scale fabrication analysis system at Altera, and he previously held senior engineering roles at FedEx. Nitin holds a Master's degree in computer science from University of Central Florida (UCF).