Crunching Big Data with Google BigQuery + Introducing Apache Drill





The interactive transcript could not be loaded.



Rating is available when the video has been rented.
This feature is not available right now. Please try again later.
Streamed live on Sep 5, 2012


Crunching Big Data with Google BigQuery

Applications which grow to web-scale generate massive amounts of data. Many developers end up throwing this data away because they can't extract value from it without the necessary expertise or infrastructure.

Google knows Big Data -- our infrastructure processes 60 hours of YouTube video uploads every minute and many of our products have hundreds of millions of users. Google has developed custom technologies to analyze this data and make intelligent product decisions. We've started to open up some of these technologies as APIs which allow developers to concentrate on their business problems, while Google handles the underlying infrastructure.

Google BigQuery is a Big Data analysis tool born from an internal technology known as Dremel. BigQuery enables developers to analyze terabyte data sets in seconds using a RESTful API and a SQL-like query language. We'll demonstrate how you can incorporate Google BigQuery into your own applications, and how queries are processed underneath the covers. We'll also show examples using public data sets and demos built by other developers like you.

Ryan Boyd is a Developer Advocate at Google, focused on cloud data services such as Google BigQuery. He's been in the Developer Relations group for 5 years and previously helped build out the Google Apps ISV ecosystem. Ryan recently published his first book on "Getting Started with OAuth 2.0" with O'Reilly.

Introducing Apache Drill

Apache Drill is a new open source Apache Incubator project for interactive analysis of large-scale datasets, inspired by Google's Dremel. It enables users to query terabytes of data in seconds. Apache Drill supports a broad range of data formats, including Protocol Buffers, Avro and JSON, and leverages Hadoop and HBase as data sources. Drill's primary query language, DrQL, is compatible with Google BigQuery. In this talk we provide an overview of the Drill project, including its design goals and architecture.

Tomer Shiran heads the product management team at MapR and is responsible for product strategy, roadmap and requirements. Prior to MapR, Tomer held numerous product management and engineering roles at Microsoft, most recently as the product manager for Microsoft Internet Security & Acceleration Server (now Microsoft Forefront). He is the founder of two websites that have served tens of millions of users, and received coverage in prestigious publications such as The New York Times, USA Today and The Times of London. Tomer is also the author of a 900-page programming book. He holds an MS in Computer Engineering from Carnegie Mellon University and a BS in Computer Science from Technion - Israel Institute of Technology.


When autoplay is enabled, a suggested video will automatically play next.

Up next

to add this to Watch Later

Add to

Loading playlists...