Upload

Loading icon Loading...

This video is unavailable.

Bay Area Spark Meetup: Stores, Monoids and Dependency Injection - Abstractions for Spark

Sign in to YouTube

Sign in with your Google Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to like Apache Spark's video.

Sign in to YouTube

Sign in with your Google Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to dislike Apache Spark's video.

Sign in to YouTube

Sign in with your Google Account (YouTube, Google+, Gmail, Orkut, Picasa, or Chrome) to add Apache Spark's video to your playlist.

Published on Jan 17, 2014

At this Apache Spark community meetup, Ryan Weald presents an extended version of the talk he gave at Spark Summit 2013. The meetup was hosted by Sharethrough in downtown San Francisco on January 16, 2014.

Find the meetup.com event details at http://www.meetup.com/spark-users/eve... and join the Bay Area meetup group at http://www.meetup.com/spark-users/

In addition to Ryan's talk, the meetup began with Patrick Wendell giving a brief talk on some of the updates about upcoming Spark releases (Spark 0.9 and Shark 0.8.1).

== Speaker ==
Ryan Weald

== Talk Title ==
Stores, Monoids and Dependency Injection - Abstractions for Spark Streaming Jobs - Ryan Weald

== Slides == http://files.meetup.com/3138542/Abstr...

== Abstract ==
One of the most difficult aspects of deploying spark streaming as part of your technology stack is maintaining all the job associated with stream processing jobs. In this talk I will discuss the tools and techniques that Sharethrough has found most useful for maintaining a large number of spark streaming jobs. We will look in detail at the way Monoids and Twitter's Algebrid library can be used to create generic aggregations. As well as the way we can create generic interfaces for writing the results of streaming jobs to multiple data stores. Finally, we will look at the way dependency injection can be used to tie all the pieces together, enabling raping development of new streaming jobs.

== Ryan's Bio ==
Ryan Weald is a data scientist at Sharethrough where he works on data infrastructure and services for real time ad targeting and reporting. Ryan is passionate about machine learning, distributed systems, and building data driven products. You can find him on Twitter @rweald

Loading icon Loading...

Loading icon Loading...

Loading icon Loading...

Loading icon Loading...

Ratings have been disabled for this video.
Rating is available when the video has been rented.
This feature is not available right now. Please try again later.

Loading icon Loading...

Advertisement
Loading...
Working...
to add this to Watch Later

Add to