Strata SC 2012
-
1
15:29
Strata 2012: Avinash Kaushik, "A Big Data Imperative: Driving Big Action"
by OreillyMedia 18,129 views
So you've hoarded the world's data within your enterprise. Now what? Author and digital marketing evangelist Avinash Kaushik shares lessons from the nascent world of Web Analytics on how multiplicity, scale and outsourcing powers a data democracy, and how that in turn drives business action.
Avinash Kaushik
Market Motive
Avinash Kaushik is the co-Founder of Market Motive Inc and the Digital Marketing Evangelist for Google. His prior professional experience includes key roles at Intuit, DirecTV, Silicon Graphics in the US & DHL in Saudi Arabia.
Through his blog, Occam's Razor, and his best selling books, Web Analytics: An Hour A Day and Web Analytics 2.0, Avinash has become recognized as an authoritative voice on how marketers, executives teams and industry leaders can leverage data to fundamentally reinvent their digital existence.
Avinash puts a common sense framework around the often frenetic world of web analytics and combines that with the philosophy that investing in talented analysts is the key to long-term success. He passionately advocates customer centricity and leveraging bleeding edge competitive intelligence techniques.
Avinash has received rave reviews for bringing his energetic, inspiring, and practical insights to companies like Unilever, Dell, Time Warner, Vanguard, Porsche, and IBM. He has delivered keynotes at a variety of global conferences, including Ad-Tech, Monaco Media Forum, Search Engine Strategies, JMP Innovators' Summit, The Art of Marketing and Web 2.0.
Acting on his passion for teaching Avinash has lectured at major universities such as Stanford University, University of Virginia, University of California -- Los Angeles and University of Utah.
Avinash received the 2009 Statistical Advocate of the Year award from the American Statistical Association, and the 2011 Most Influential Industry Contributor award from the Web Analytics Association. -
2
8:26
Strata 2012: Coco Krumme, "The Trouble with Taste"
by OreillyMedia 1,151 views
Why data can tell us only so much about food, flavor, and our preferences.
Coco Krumme
MIT Media Lab
Coco Krumme is a PhD student at MIT, where she's partnered with a major financial institution to study transaction data and human behavior. -
3
47:34
Tim O'Reilly and Dave Campbell Explore How to Accelerate Insights from Data
by OreillyMedia 3,195 views
Tim O'Reilly, founder and CEO of O'Reilly Media, talks with Microsoft Technical Fellow Dave Campbell about new tools for data. While Microsoft creates its own tools for data, it also works within the larger data community and ecosystem. Thinking of data as a platform, how can a new agenda for tools speed up the process to insight?
-
4
2:40
Luke Lonergan interviewed at Strata 2012
by OreillyMedia 568 views
Luke Lonergan
CTO, VP and Co-Founder, Greenplum, a division of EMC
A co-founder of Greenplum, Luke served as CTO of the organization and continues in this role for the Greenplum Division. Prior to Greenplum, Luke founded Didera, a database clustering company, in 2000 and served as CEO and Chairman. Luke's background includes 16 years of management experience in computing technology ranging from innovations in supercomputing to advances in medical imaging systems. Most recently, he directed data center integration at High Performance Technologies Inc (HPTi), scaling the business to $30M, and setting industry firsts in parallel computing subsequently adopted by IBM and Compaq. Previously he held management positions at Northrop Grumman Corporation. He holds an M.S. in Aeronautics and Astronautics from Stanford University and a B.E. in Mathematics from Vanderbilt University. -
6
5:21
Gary Lang interviewed at Strata 2012
by OreillyMedia 289 views
Gary Lang
Senior Vice President, Engineering, MarkLogic
Gary Lang is the senior vice president of engineering for MarkLogic. Lang is a proven leader with more than two decades experience delivering large, complex products and systems, architectural design and direction setting for high-revenue software projects. Lang is responsible for all of MarkLogic product development.
Lang comes to MarkLogic from Microsoft, where he was a leader in the development of the next version of Visual Studio. Prior to Microsoft, Gary was vice president of platforms and global engineering at Autodesk, where he led an organization of 1,200 employees worldwide providing platform and product engineering for Autodesk's core products as well as new software and services for emerging businesses. His organization was responsible for developing code for almost all of Autodesk's desktop and SaaS products, such as AutoCAD, Inventor, and Seek, which generated up to $2.5 billion in revenue. During this period, Lang also served as the vice president of global engineering for the company and managed its offices in China and Singapore. Earlier in his career, Lang was vice president of engineering for Autodesk's infrastructure division. He led the engineering team for all geospatial products, including Map and MapGuide Server. Lang was also a co-creator of the OSGeo Foundation, the first industry-wide open source foundation for geospatial software and services.
Lang has a bachelor's degree in computer and information science from the University of California, Santa Cruz. -
8
2:19
Tomer Shiran interviewed at Strata 2012
by OreillyMedia 247 views
Tomer Shiran
Director of Product Management, MapR Technologies
Tomer Shiran is MapR's Director of Product Management. Prior to MapR, Tomer held product management and engineering positions at Microsoft, as well as a software engineering position at IBM Research. He is also the author of a 900-page JavaScript programming book and the founder of two websites that have served over 10M users. Tomer holds an MS in Computer Engineering from Carnegie Mellon University and a BS in Computer Science from the Technion -- Israel Institute of Technology. -
10
3:03
Pete Warden interviewed at Strata 2012
by OreillyMedia 117 views
Pete Warden
CTO, Jetpac
A former Apple engineer, Pete Warden is the CTO of Jetpac, and writes on large-scale data processing and visualization -
11
4:56
DJ Patil interviewed at Strata 2012
by OreillyMedia 348 views
DJ Patil
Data Scientist in Residence, Greylock Partners
DJ is the "Data Scientist in Residence" at Greylock Partners.
Previously he was the Chief Product Officer for Color and the Chief Scientist at the LinkedIn Corporation, leading the Analytics and Data Teams. Some of the products shipped include, People You May Know, Who's Viewed My Profile, Talent Match, Skills, and Career Explorer.
He has held roles at Skype, PayPal, and eBay. As was a member of the faculty at the University of Maryland, he helped start a major research initiative on numerical weather prediction. As an AAAS Science & Technology Policy Fellow for the Department of Defense, Dr. Patil directed new efforts to leverage social network analysis and the melding of computational and social sciences to anticipate emerging threats to the US. He has also co-chaired a major review of US efforts to prevent bioweapons proliferation in Central Asia and co-founded the Iraqi Virtual Science Library (IVSL).
More details can be found on his LinkedIn profile. -
12
6:43
Strata 2012: Flavio Villanustre, "Machine Learning and Big Data: Sustainable Value or Hype?"
by OreillyMedia 939 views
Back in the late 80s artificial intelligence was set to take over the world; it didn't happen. In 2012; AI has been stripped down, dressed up and reborn as machine learning. Will it take over the world this time? What makes a Big Data -- Machine Learning solution 'better'? Can machine learning happen with legacy tools? What exactly does it mean to be fully parallel? Do I care? Will I be any better if I get it right?
Flavio Villanustre
LexisNexis Risk Solutions and HPCC Systems
Flavio Villanustre is the Vice President of Infrastructure and Products. In this position, Flavio is responsible for Information and Physical Security, overall infrastructure strategy and new product development for LexisNexis Risk Solutions and HPCC Systems. Prior to 2001, Flavio served in a variety of roles at different companies including Infrastructure, Information Security and Information Technology. In addition to this, Villanustre has been involved with the Opensource community for over 15 years through multiple initiatives. Some of these include founding the first Linux User Group in Buenos Aires (BALUG) in 1994, releasing several pieces of software under different Opensource licenses, and evangelizing Opensource to different audiences through conferences, training and education. Before working in technology, Flavio was a neurosurgeon. -
13
15:17
Strata 2012: Hal Varian, "Using Google Data for Short-term Economic Forecasting"
by OreillyMedia 3,722 views
Google Insights for Search provides an index of search activity for millions of queries. These queries can sometimes help understand consumer behavior. Hal describes some of the issues that arise in trying to use this data for short-term economic forecasts and provide examples.
Hal Varian
Google
Hal R. Varian is the Chief Economist at Google. He started in May 2002 as a consultant and has been involved in many aspects of the company, including auction design, econometric analysis, finance, corporate strategy and public policy.
He also holds academic appointments at the University of California, Berkeley in three departments: business, economics, and information management.
He received his SB degree from MIT in 1969 and his MA in mathematics and Ph.D. in economics from UC Berkeley in 1973. He has also taught at MIT, Stanford, Oxford, Michigan and other universities around the world.
Dr. Varian is a fellow of the Guggenheim Foundation, the Econometric Society, and the American Academy of Arts and Sciences. He was Co-Editor of the American Economic Review from 1987-1990 and holds honorary doctorates from the University of Oulu, Finland and the University of Karlsruhe, Germany.
Professor Varian has published numerous papers in economic theory, industrial organization, financial economics, econometrics and information economics. He is the author of two major economics textbooks which have been translated into 22 languages. He is the co-author of a bestselling book on business strategy, Information Rules: A Strategic Guide to the Network Economy and wrote a monthly column for the New York Times from 2000 to 2007. -
14
4:30
-
15
5:49
Strata 2012: Gary Lang, "Big Data's Next Step: Applications"
by OreillyMedia 1,089 views
Big Data is about extracting value from fast, huge, varied, complex data sets. But simply crunching data is only the first step. As adoption of MapReduce and data analytic technologies increases, forward thinking companies are starting to build applications on their core data assets. In this keynote, MarkLogic's Gary Lang will explore what these Big Data Applications look like, offering some tantalizing real-world glimpses at what data wrapped in applications makes possible.
Gary Lang
MarkLogic
Gary Lang is the senior vice president of engineering for MarkLogic. Lang is a proven leader with more than two decades experience delivering large, complex products and systems, architectural design and direction setting for high-revenue software projects. Lang is responsible for all of MarkLogic product development.
Lang comes to MarkLogic from Microsoft, where he was a leader in the development of the next version of Visual Studio. Prior to Microsoft, Gary was vice president of platforms and global engineering at Autodesk, where he led an organization of 1,200 employees worldwide providing platform and product engineering for Autodesk's core products as well as new software and services for emerging businesses. His organization was responsible for developing code for almost all of Autodesk's desktop and SaaS products, such as AutoCAD, Inventor, and Seek, which generated up to $2.5 billion in revenue. During this period, Lang also served as the vice president of global engineering for the company and managed its offices in China and Singapore. Earlier in his career, Lang was vice president of engineering for Autodesk's infrastructure division. He led the engineering team for all geospatial products, including Map and MapGuide Server. Lang was also a co-creator of the OSGeo Foundation, the first industry-wide open source foundation for geospatial software and services.
Lang has a bachelor's degree in computer and information science from the University of California, Santa Cruz. -
16
4:10
Strata 2012: Jonathan Gluck, "Winner of the Second Heritage Health Progress Prize"
by OreillyMedia 671 views
Strata 2012: Jonathan Gluck, "Winner of the Second Heritage Health Progress Prize"
-
17
10:35
Strata 2012: Usman Haque, "Open Data and the Internet of Things"
by OreillyMedia 1,542 views
The expected massive growth of connected device, appliance and sensor markets in the coming years -- often called 'The Internet of Things' -- will need a more rich concept of 'open data' than is currently common. When data is generated through activities of people doing things inside their homes and outside in public in their cities, the question of who owns the data becomes almost irrelevant next to the questions of who has access to the data, what do they do with it, and how do citizens manage and make sense of their data while retaining the 'openness' that we've seen drive creativity and business on the web over the last few years.
Usman Haque
Pachube.com
Usman Haque is the founder of Pachube.com, a real-time data infrastructure for the Internet of Things used by tens of thousands of people around the world (acquired by LogMeIn Inc in 2011). Trained as an architect, he has created responsive environments, interactive installations, digital interface devices and dozens of mass-participation initiatives. His skills include the design and engineering of both physical spaces and the software and systems that bring them to life. He received the 2008 Design of the Year Award (interactive) from the Design Museum, UK, a 2009 World Technology Award (art), a Wellcome Trust Sciart Award, a grant from the Daniel Langlois Foundation for Art, Science and Technology, the Swiss Creation Prize, Belluard Bollwerk International, the Japan Media Arts Festival Excellence prize and the Asia Digital Art Award Grand Prize. -
18
9:57
Strata 2012: Pete Warden, "Embrace The Chaos"
by OreillyMedia 424 views
Why unstructured data beats structured.
Pete Warden
Jetpac
A former Apple engineer, Pete Warden is the CTO of Jetpac, and writes on large-scale data processing and visualization -
19
9:32
Strata 2012: Luke Lonergan, "5 Big Questions about Big Data"
by OreillyMedia 1,489 views
How are businesses using big data to connect with their customers, deliver new products or services faster and create a competitive advantage? Luke Lonergan, co-founder & CTO, Greenplum, a division of EMC, gives insight into the changing nature of customer intimacy and how the technologies and techniques around big data analysis provide business advantage in today's social, mobile environment -- and why it is imperative to adopt a big data analytics strategy.
Luke Lonergan
Greenplum, a division of EMC
A co-founder of Greenplum, Luke served as CTO of the organization and continues in this role for the Greenplum Division. Prior to Greenplum, Luke founded Didera, a database clustering company, in 2000 and served as CEO and Chairman. Luke's background includes 16 years of management experience in computing technology ranging from innovations in supercomputing to advances in medical imaging systems. Most recently, he directed data center integration at High Performance Technologies Inc (HPTi), scaling the business to $30M, and setting industry firsts in parallel computing subsequently adopted by IBM and Compaq. Previously he held management positions at Northrop Grumman Corporation. He holds an M.S. in Aeronautics and Astronautics from Stanford University and a B.E. in Mathematics from Vanderbilt University. -
20
7:31
Strata 2012: Jonathan Gosier, "Democratization of Data Platforms"
by OreillyMedia 748 views
Big data isn't just an abstract problem for corporations, financial firms, and tech companies. To your mother, a 'big data' problem might simply be too much email, or a lost file on her computer.
We need to democratize access to the tools used for understanding information by taking the hard-work out of drawing insight from excessive quantities of information. To help humans process content more efficiently and to help them capture more of their world.
Tools to effectively do this need to be visual, intuitive, and quick. This talk looks at some of the data visualization platforms that are helping to solve big data problems for normal people.
Jonathan Gosier
metaLayer Inc.
Jonathan Gosier is a designer, software developer, lover of data science and the co-founder of metaLayer.com which aims to change how you analyze content by offering products for atomizing and visualizing data.
From 2009 to 2011 he served as Director of Product for SwiftRiver at Ushahidi working on an open-source platform for drawing insight from real-time communication during crisis events. The SwiftRiver project was awarded the 2011 Knight News Challenge award for its potential to improve the data journalism and news gathering process.
In 2009 Jon spoke at TED in Oxford, UK about his company Appfrica and one of their projects which connected rural African villages with the internet through a call center and light infrastructure. The service, in collaboration with non-profit OpenMind, was called QuestionBox and allowed people with no access to the internet to ask questions and get timely, vetted answers.
Jon is also the organizer of the annual Apps4Africa competition which encourages African software developers to develop solutions to local problems.
In addition to TED, Jon has been invited to present at the Economist's Ideas Economy, Google Zeitgeist and Personal Democracy Forum. Links to articles about his work, presentation slides and video of passed events Jon has spoken at can be found at his blog GosDot.com -
23
1:51
Sanjay Mehta interviewed at Strata 2012
by OreillyMedia 139 views
Sanjay Mehta
Director of Product Marketing, Splunk
Sanjay Mehta, Senior Director of Product Marketing at Splunk, is responsible for developing and executing a market-driven product strategy for Splunk's core product. In addition, Sanjay spearheads the Company's focus on Big Data, helping customers understand how they can use their big machine data to gain operational intelligence through unprecedented insights in the areas of application management, IT operations, web analytics. Sanjay's role at Splunk leverages his 19 years of experience building, marketing and advising on enterprise software and information management solutions for the retail, communications and media industries. Prior to joining Splunk, Sanjay held key positions at Oracle, Portal Software and Sybase. -
25
5:12
Hjalmar Gislason interviewed at Strata 2012
by OreillyMedia 205 views
Hjalmar Gislason
Founder & CEO, DataMarket
Hjalmar is a serial entrepreneur, founder of three startups in the gaming, mobile and web sectors since 1996. Prior to launching DataMarket, Hjalmar worked on new media and business development for companies in the Skipti Group (owners of Iceland Telecom) after their acquisition of his search startup -- Spurl. DataMarket is based largely on his vision of the need for a global exchange for structured data. -
26
4:00
Flavio Villanustre interviewed at Strata 2012
by OreillyMedia 168 views
Flavio Villanustre
Vice President Infrastructure and Products , LexisNexis Risk Solutions and HPCC Systems
Flavio Villanustre is the Vice President of Infrastructure and Products. In this position, Flavio is responsible for Information and Physical Security, overall infrastructure strategy and new product development for LexisNexis Risk Solutions and HPCC Systems. Prior to 2001, Flavio served in a variety of roles at different companies including Infrastructure, Information Security and Information Technology. In addition to this, Villanustre has been involved with the Opensource community for over 15 years through multiple initiatives. Some of these include founding the first Linux User Group in Buenos Aires (BALUG) in 1994, releasing several pieces of software under different Opensource licenses, and evangelizing Opensource to different audiences through conferences, training and education. Before working in technology, Flavio was a neurosurgeon. -
27
2:48
Cheryl Phillips interviewed at Strata 2012
by OreillyMedia 185 views
Cheryl Phillips
Data Enterprise Editor, The Seattle Times
Cheryl Phillips is the Data Enterprise Editor for The Seattle Times and a former board president with Investigative Reporters and Editors, a national journalism training organization, where she served on the board for a decade. Phillips coordinates data-related enterprise journalism across the Seattle Times newsroom. She has edited a number of award-winning stories that made compelling use of data visualizations. One of the most recent was an investigation into the myth-busting reasons behind the foreclosure crisis, "Rescue From Foreclosure? Frustration, Anger Grow." The joint project with The Seattle Times and ProPublica received The Gannett Award for Innovation in Watchdog Journalism and a first place award in the National Association of Real Estate Editors' 61st annual journalism contest. She also was the sole journalist in the newsroom in 2009 when a gunman shot and killed four area police officers in a coffee shop. She was integrally involved with the subsequent breaking news coverage, which received a Pulitzer Prize. Phillips also worked as deputy investigations editor and a reporter on the investigations team, writing or editing several stories which received national awards and has twice been a member of reporting teams that were finalists for a Pulitzer Prize. Previously, she has worked at USA Today and newspapers in Michigan, Montana and Texas. -
28
5:58
Terence Craig interviewed at Strata 2012
by OreillyMedia 138 views
Terence Craig
Founder, CEO & CTO, PatternBuilders
Terence Craig is CEO and CTO of PatternBuilders, a big data analytics companies that produces advanced applications for financial services, retail and other data intensive industries.
Terence has an extensive background in building, implementing, and selling analytically-driven enterprise applications across such diverse domains as enterprise resource planning (ERP), retail sales channel optimization, professional services automation (PSA), and semi-conductor process control and analytics in both public and private companies. He has been part of the ERP/SCM industry as it has evolved, from the VAX and HP 3000 to its current heyday of client-server, GUIs, and relational databases and is looking forward to exploring what the next generation of solutions, powered by the Internet of Things and big data analytics, will look like.
With over 20 years of experience in both executive and technical management roles with leading-edge private and public technology companies, Terence brings a unique and innovative view of what is needed—from both an operational and technology perspective—to build a world class analytics platform that is focused on the innovative development of analytic applications designed to improve companies' and organizations' profitability and efficiencies. He is also a speaker, blogger (on all things big data and analytics plus lots of other stuff), and author of Privacy and Big Data. -
29
6:47
Doug Cutting interviewed at Strata 2012
by OreillyMedia 307 views
Doug Cutting
Architect, Cloudera
Doug is a founder of several Apache open source projects, including Lucene, Nutch, Hadoop and Avro. Doug currently works for Cloudera, and previously worked at Yahoo!, Excite, Apple and Xerox PARC. Doug holds a Bachelor's degree from Stanford University and presently chairs the Board of the Apache Software Foundation. -
30
5:20
Chris Moody interviewed at Strata 2012
by OreillyMedia 296 views
Chris Moody
President and COO, Gnip
Chris Moody currently serves as the President and COO of Gnip, the leading provider of social media data for enterprise applications. In this role, Moody is responsible for the day-to-day execution of Gnip's operations with direct responsibility for sales, marketing, finance, and business development.
Prior to joining Gnip, Moody served as Founder and President of Aquent On Demand, a leading provider of technology solutions for creative and marketing organizations. Prior to his responsibilities with Aquent On Demand, Moody served as Aquent's Chief Operating Officer with responsibility for the day-to-day management of more than 700 employees across 70 offices in 17 countries. Before joining Aquent, Moody served in senior management and technology consulting roles with IBM, Oracle, and EDS where he led engagements with more than 25 Fortune 500 companies.
Moody serves on the National Technical Advisory Board of Year Up, is an advisor to several technology startups, and is an active TechStars mentor. He also facilitates a monthly CEO Lunch for startups in Boulder. Moody has a Bachelor of Science degree in Electrical Engineering from Auburn University. -
31
6:40
Max Gadney interviewed at Strata 2012
by OreillyMedia 270 views
Max Gadney
Design Director & Founder, After The Flood
Max Gadney founded After the Flood to help companies communicate data better. Current clients include the BBC, Edelman and Manchester City Football Club. A passion for information design has been a consistent theme throughout his life and career. At the BBC, Max was the Head of Design and Audience Insight at BBC News Online from 2000-2007. The team won 11 Webbys and the Society of News Design President's award for election night data visualisation. After that he joined the BBC TV Digital Commissioning team. His most recent commission there was BBC Dimensions, part of the NYC MOMA 'Talk To Me' show in 2011. After a brief stint in market research, he set up After the Flood. He curates the The Design of Understanding conference in London. -
35
8:56
Alasdair Allan interviewed at Strata 2012
by OreillyMedia 268 views
Alasdair Allan
Founder, Babilim Light Industries
Alasdair Allan is the author of Learning iOS Programming, Programming iOS Sensors, Basic Sensors in iOS, Geolocation in iOS, iOS Sensor Apps and Arduino and Augmented Reality in iOS. Last year he and Pete Warden caused a privacy scandal by uncovering that your iPhone was recording your location, all the time. This caused several class action lawsuits and a U.S. Senate hearing. He isn't sure what to think about that. From time to time he stands in front of cameras, and you can often find him at conferences run by O'Reilly Media.
He runs a small technology consulting business writing bespoke software, building open hardware and providing training, including a series of workshops on sensors. He sporadically writes blog posts about things that interest him, or more frequently provides commentary about them in 140 characters or less.
Alasdair is also a senior research fellow at the University of Exeter. As part of his work there he built a distributed peer-to-peer network of telescopes which, acting autonomously, reactively scheduled observations of time-critical events. Notable successes included contributing to the detection of the most distant object yet discovered, a gamma-ray burster at a redshift of 8.2. -
38
3:53
Ryan Ismert interviewed at Strata 2012
by OreillyMedia 140 views
Ryan Ismert
General Manager - Augmented Reality, Sportvision, Inc
Ryan Ismert is Sportvision's General Manager for Augmented Reality. Prior to assuming his latest role, he spent eight years helping to lead the Sportvision engineering team as Director of Engineering. He has an extensive background in computer graphics and computer vision, and graduated from Cornell University with an MS in Architectural Science. Ryan is a frequent speaker at Silicon Valley augmented reality events. -
40
2:44
Fabien Girardin interviewed at Strata 2012
by OreillyMedia 114 views
Fabien Girardin
Partner, Near Future Laboratory
Fabien Girardin (PhD) is the co-founder of Lift Lab, a research agency that helps companies and institutions understand, foresee and prepare for changes triggered by technological and social evolutions. He is particularly active in the domains of user experience, data science and urban informatics. His research mixes qualitative observations with quantitative data analysis to gain insights from the integration and appropriation of technologies in urban environments. Subsequently, he exploits the gained knowledge with engineering techniques to prototype and evaluate concepts and solutions for mobile network operators, urban and location-based services providers, city planners and decision makers. -
41
3:00
Dave Campbell interviewed at Strata 2012
by OreillyMedia 233 views
Dave Campbell
Technical Fellow, Microsoft
David Campbell is a Microsoft Technical Fellow whose present role is Vice President of Product Development for the SQL Server product suite.
David graduated with a Master's Degree in Mechanical Engineering (Robotics) from Clarkson University in 1984 and began working on robotic workcells for Sanders Associates -- later a division of Lockheed Corporation. In 1990 he joined Digital Equipment Corporation where he worked on their Codasyl database product DEC DBMS as well as their relational database product; Rdb.
Upon joining Microsoft in 1994, David was a developer and architect on the SQL Server Storage Engine team that was principally responsible for rewriting the core engine of SQL Server for SQL Server Version 7.0.
At Microsoft, he has held numerous positions and driven a number of major initiatives such as overseeing the initial product development of several of Microsoft's Azure (public cloud) services; defining and implementing SQL Server's global development processes; and, more recently, defining and overseeing the initiation of Microsoft's commercial Big Data product strategy.
David holds several patents in the data management, schema and software quality realms. He is a frequent speaker at industry and research conferences on a wide variety of data management and software development topics.
David lives in Sammamish WA with his wife Marcia and two teenage sons. He enjoys traveling with the family, photography, and occasionally making dust in the woodshop. -
42
15:23
Strata 2012: Ben Goldacre "The Information Architecture of Medicine is Broken"
by OreillyMedia 5,659 views
I am a doctor and a data geek. I worry that data geeks are too easily seduced by the glamour of laboratory science and forget about clinics. Randomised controlled trials are the best tool we have in medicine for finding out if a treatment works or not. Lots of trials are done. Unfortunately, the results of these trials can go missing in action after they are completed.
Missing data is always a challenge: but we also know that "negative results" are more likely to go missing. This means we have a biased sample, overestimating the benefits of treatments. To prevent all this happening, people have set up registers of trial protocols, to be completed before trials begin. These have not been correctly used, and they are not matched to published trials, which show up what data has been left unpublished.
I will describe a small project to fix this, illustrate how that can lead on to fixing other similar problems in medicine, and make a cry for help.
Ben Goldacre
Bad Science
Ben is a best-selling author, broadcaster, medical doctor and academic who specialises in unpicking dodgy scientific claims from drug companies, newspapers, government reports, PR people and quacks. Unpicking bad science is the best way to explain good science.
Bad Science (4th Estate) has sold over 400,000 copies, is published in 18 countries, and reached #1 in the UK paperback non-fiction charts. His book exposing bad behaviour in the pharmaceutical industry will be published in 2012 by 4th Estate.
Ben has written the weekly Bad Science Column in the Guardian since 2003. It's archived on this site along with blogposts, columns for the British Medical Journal, and other writing.
There are lots of clips of Ben on telly here, and a talk at TEDGlobal here. The Placebo Effect is a two-part documentary series he made for BBC Radio 4. The Rise of the Lifestyle Nutritionists is another. He's appeared on the Today programme lots of times, Any Questions, Newsnight, Start The Week, The Now Show, Loose Ends, PM, Quote Unquote, Watchdog, and various other things. You can find plenty of it if you dig around on the site, along with lectures, podcast interviews, maybe start Here.
He has given over 250 talks in the past 5 years, from comedy clubs and music festivals to universities and schools, government departments, and more. You can book him for after dinner speaking by emailing sballard@unitedagents.co.uk.
He's received lots of awards for writing, and a few honorary doctorates.
This is what Google thinks about him, this is what the blogs say about Bad Science. He was trained in medicine in Oxford and London.
Ben is 36 and currently works full time as an academic in epidemiology. He does not see private patients. -
43
6:07
Strata 2012: Steve Schoettler, "Learning Analytics"
by OreillyMedia 4,615 views
Our education system is not preparing students for college. There is an urgent need to improve academic outcomes and equip students with critical 21st century skills. Evidence from top-performing schools shows that use of data, analysis, and feedback are our best tools for improvement. The increasing use of online software and digital devices in classrooms presents an opportunity to collect high-frequency data for mining. Today's analytics techniques could be used to develop a deeper understanding of how students learn, recommend personalized learning plans, and identify early warning flags. Rich data, analytics, and feedback enable a process of iteration and continuous improvement, where educators become learners, and we figure out how to improve education. We are at the beginning of a wave of data-driven change in education, with important social consequences and fantastic opportunities.
Steve Schoettler
Junyo
Steve Schoettler is Founder and CEO of Junyo, a learning analytics company creating tools to help teachers and students understand and improve academic success. As co-founder of Zynga, Steve helped introduce social gaming, virtual currencies, and real-time analytics on a massive scale. Prior to Zynga, Steve worked on innovative and scalable technologies in mobile, entertainment, distributed computing, and security. Steve holds a B.S. in Electrical Engineering and Computer Science from UC Berkeley. -
44
9:22
Strata 2012: Mike Olson, "Guns, Drugs and Oil: Attacking Big Problems with Big Data"
by OreillyMedia 997 views
Tools for attacking big data problems originated at consumer internet companies, but the number and variety of big data problems have spread across industries and around the world. I'll present a brief summary of some of the critical social and business problems that we're attacking with the open source Apache Hadoop platform.
Mike Olson
Cloudera
Michael Olson is currently CEO of Cloudera, the company delivering an enterprise-ready data management platform based on Apache Hadoop. He was formerly CEO of Sleepycat Software, makers of Berkeley DB, the open source embedded database engine. Mike spent two years at Oracle Corporation as Vice President for Embedded Technologies after Oracle's acquisition of Sleepycat in 2006. Prior to joining Sleepycat, Mike held technical and business roles at database vendors Britton Lee, Illustra Information Technologies and Informix Software. Mike has Bachelor's and Master's degrees in Computer Science from the University of California at Berkeley. -
45
10:30
Strata 2012: Abhishek Mehta, "Decoding the Great American ZIP myth"
by OreillyMedia 974 views
How big data tools and technologies give us back our individual identity ... because if you didn't know you were unique and special, well, you are. Big data can be applied to solving socio-economic problems that rival the scale and importance of building ad optimization models.
Abhishek Mehta
Tresata
Abhishek is an expert in the areas big data and consumer payments.
He is the co-founder of Tresata, a big data startup that helps companies identify their core data assets, manage, maintain and enhance the intrinsic value in them and build data factories and products to monetize that value.
Abhishek has over a decade of experience in various strategic and operational leadership roles in banking, technology and consulting. Abhishek is also a Member of the Faculty at one of the premier Retail Banking Management Programs in the US.
A featured speaker on these topics, Abhishek is a die-hard supporter of all things open source and is recognized in the industry as a visionary on how to create value by building, transforming (or disrupting) business eco-systems.
Abhishek is also the Founder and President of Foundation Ten10, a one-of-a-kind network driven non-profit focused on training, educating and nurturing children with learning disabilities. -
46
13:04
Strata 2012: Dave Campbell, "Do We Have The Tools We Need To Navigate The New World Of Data?"
by OreillyMedia 1,155 views
In a world where data increasing 10x every 5 years and 85% of that information is coming from new data sources, how do our existing technologies to manage and analyze data stack up? This talk discusses some of the key implications that Big Data will have on our existing technology infrastructure and where do we need to go as a community and ecosystem to make the most of the opportunity that lies ahead.
Dave Campbell
Microsoft
David Campbell is a Microsoft Technical Fellow whose present role is Vice President of Product Development for the SQL Server product suite.
David graduated with a Master's Degree in Mechanical Engineering (Robotics) from Clarkson University in 1984 and began working on robotic workcells for Sanders Associates -- later a division of Lockheed Corporation. In 1990 he joined Digital Equipment Corporation where he worked on their Codasyl database product DEC DBMS as well as their relational database product; Rdb.
Upon joining Microsoft in 1994, David was a developer and architect on the SQL Server Storage Engine team that was principally responsible for rewriting the core engine of SQL Server for SQL Server Version 7.0.
At Microsoft, he has held numerous positions and driven a number of major initiatives such as overseeing the initial product development of several of Microsoft's Azure (public cloud) services; defining and implementing SQL Server's global development processes; and, more recently, defining and overseeing the initiation of Microsoft's commercial Big Data product strategy.
David holds several patents in the data management, schema and software quality realms. He is a frequent speaker at industry and research conferences on a wide variety of data management and software development topics.
David lives in Sammamish WA with his wife Marcia and two teenage sons. He enjoys traveling with the family, photography, and occasionally making dust in the woodshop. -
47
10:39
Strata 2012: Doug Cutting, "The Apache Hadoop Ecosystem"
by OreillyMedia 2,385 views
Apache Hadoop forms the kernel of an operating system for Big Data. This ecosystem of interdependent projects enables institutions to affordably explore ever vaster quantities of data. The platform is young, but it is strong and vibrant, built to evolve.
Doug Cutting
Cloudera
Doug is a founder of several Apache open source projects, including Lucene, Nutch, Hadoop and Avro. Doug currently works for Cloudera, and previously worked at Yahoo!, Excite, Apple and Xerox PARC. Doug holds a Bachelor's degree from Stanford University and presently chairs the Board of the Apache Software Foundation. -
48
4:58
Jonathan Bruner interviewed at Strata 2012
by OreillyMedia 193 views
Jon Bruner
Editor-at-Large, O'Reilly Media
Jon Bruner is a quantitative journalist at O'Reilly, where he writes about anything numbers-related and develops interactive visualizations. He was previously data editor at Forbes Magazine. He earned a B.S. in mathematics and economics at the University of Chicago. -
49
7:17
Virginia Carlson interviewed at Strata 2012
by OreillyMedia 218 views
Virginia Carlson
Principal, Urban Rubrics
A data and information expert, Virginia has more than 25 years of experience leading fast-paced, creative environments where data are used to make decisions, tell stories and illuminate trends. Before taking the helm at MCIC in January 2009, Virginia was a professor of Urban Planning at the University of Wisconsin-- Milwaukee. She's also been Deputy Director for Data Policy at the Brookings Institution and was the founding Research Director at World Business Chicago. She holds a Doctorate in Political Science from Northwestern University. As a Board member of the Association of Public Data Users, Virginia believes that even the smallest non-profit organizations should have access to the best data available. -
50
5:09
Jeremy Howard interviewed at Strata 2012
by OreillyMedia 2,606 views
Jeremy Howard
President and Chief Scientist, Kaggle
Jeremy Howard is President and Chief Scientist at Kaggle. Previously, he founded FastMail (sold to Opera Software) and Optimal Decisions sold to ChoicePoint -- now called LexisNexis Risk Solutions). Prior to that he worked in management consulting, at McKinsey & Company and A.T. Kearney. Jeremy's passion is applying algorithms to data. At FastMail he used algorithms to automate nearly every part of the business -- as a result the company only needed a total of 3 full time staff, and got over a million signups. Optimal Decisions was a business entirely built to commercialise a new algorithm he designed for the optimal pricing of insurance. Jeremy competes regularly in data mining competitions, which he uses to test himself and stay on the leading edge of machine learning and predictive modelling technology. He is currently ranked #1 on Kaggle's overall competitor rankings, out of over 16,000 data scientists. -
51
2:54
Strata Conference 2012
by OreillyMedia 1,100 views
Strata Conference is the leading event for the people and technology driving the data revolution. The home of data science, Strata brings together practitioners, researchers, IT leaders and entrepreneurs to discuss big data, Hadoop, analytics, visualization and data markets.
-
52
4:33
James Dixon interviewed at Strata 2012
by OreillyMedia 106 views
James Dixon
Founder and Lord of the 1s and 0s / CTO, Pentaho
As " Lord of the 1s and 0s" (CTO) at Pentaho, James Dixon is responsible for Pentaho's architecture and technology roadmap. James has over 20 years of professional experience in software architecture, development and systems consulting. Prior to Pentaho, James held key technical roles at AppSource Corporation (acquired by Arbor Software which later merged into Hyperion Solutions) and Keyola (acquired by Lawson Software). Earlier in his career, James was a technology consultant working with large and small firms to deliver the benefits of innovative technology in real-world environments. -
57
7:58
Diego Saenz interviewed at Strata 2012
by OreillyMedia 103 views
Diego Saenz
President, Data Driven CEO
I am currently working on a couple of startups -- DataDrivenCEO.com and Petplace.com. I have worked as a Fortune 500 executive, as a management consultant and as a successful Internet entrepreneur.
I started my career with Accenture where I consulted for a number of Fortune 500 clients including Burger King, PepsiCo and Great Western Financial Services.
After leaving Accenture I served as the CIO for Pepsi-Latin America where I oversaw multi-million dollar technology projects throughout South America, Central America and the Caribbean. I also served as the General Manager for Pepsi's International Bottling Systems Group.
I left the corporate world in April of 2000 and joined Petplace.com as an early stage start-up. I lead Petplace.com from the early stages to profitability and recognition by Inc Magazine as one of the 500 fastest growing private companies in America.
In addition to Petplace.com I lead the development and launch of Vetsuite.com which we sold to Novartis Animal Health in 2005.
On a personal level in am a fan of: My Wife and Kids, Cool Startups, Data Science, Creative Thinking, Beautiful Design, & Photography
I short, I love people, building companies, leading teams that take on big challenges, getting my hands dirty and creating things with lasting value. -
58
4:23
Jesper Andersen interviewed at Strata 2012
by OreillyMedia 141 views
Jesper Andersen
Founder, Bloom Studios
Jesper develops experimental online services designed to introduce emotional contexts into online relationships, creating more authentic experiences. He is the co-founder of Bloom Studios, developing novel data interface applications for web and tablet platforms. He is also an accomplished data scientist, working on problems including home valuations for Trulia, credit card fraud for Visa, and social network analysis for Visible Path. Jesper speaks frequently at international technology and design conferences and has appeared in print and broadcast media for projects like Avoidr, Freerisk, and his Foursquare privacy hack. He holds a B.Sc. in Physics from Haverford College and an M.B.A. in Econometrics from University of Chicago. -
59
5:13
Nathan Marz interviewed at Strata 2012
by OreillyMedia 288 views
Nathan Marz
Lead engineer, Twitter
Nathan Marz is the lead engineer on Twitter's Publisher Analytics team. He was previously the lead engineer at BackType before being acquired by Twitter in July of 2011.
Nathan is the author of numerous open-source projects relied upon by companies all around the world. These include Cascalog, ElephantDB, and Storm.
He has spoken about his work at conferences such as the Hadoop Summit, Strange Loop, Gluecon, Clojure/conj, and POSSCON. He writes a blog at http://nathanmarz.com. -
62
1:17
Eric Baldeschwieler interviewed at Strata 2012
by OreillyMedia 113 views
Eric Baldeschwieler
CEO, Hortonworks
Prior to co-founding Hortonworks, Eric served as VP Hadoop Software Engineering for Yahoo!, where he led the evolution of Apache Hadoop from a 20 node prototype to a 42,000 node service that is behind every click at Yahoo!. Eric also served as a technology leader for Inktomi's web service engine, which Yahoo! acquired in 2003. Prior to Inktomi, Eric developed software for video games, video post production systems and 3D modeling systems. Eric has a Master's degree in Computer Science from the University of California, Berkeley and a Bachelor's degree in Mathematics and Computer Science from Carnegie Mellon University. -
66
9:42
Nick Halstead interviewed at Strata 2012
by OreillyMedia 167 views
Nick Halstead
Founder / CTO, DataSift
Nick Halstead is the Founder of DataSift Inc., the real-time social media data-filtering platform. During the past five years, Nick has been a foremost technical visionary on the power of social data to revolutionize information delivery. Nick founded TweetMeme, the leading platform delivering social news, which quickly built an audience of millions in 30 countries. TweetMeme also invented the highly successful Retweet button, which serves more than 30 billion clicks per month and drives high volumes of traffic for Twitter. Nick is a regular speaker at events such as TechCrunch Disrupt, Le Web, Future of Web Apps, The Next Web and Strata and has spoken at SXSW and FOWA. -
67
2:31
Strata 2012: Live from the Exhibition Hall
by OreillyMedia 231 views
Strata Conference offers the nuts-and-bolts of building a data-driven business—the latest on the skills, tools, and technologies you need to make data work.
-
68
3:03
Strata 2012: All Access Video Compilation
by OreillyMedia 589 views
Strata Conference offers the nuts-and-bolts of building a data-driven business—the latest on the skills, tools, and technologies you need to make data work.
-
69
40:52
If Data Wants to Be Free, is Privacy a Prison?
by OreillyMedia 212 views
http://oreilly.com/go/strata2012-video
Moderated by:
Alexander Howard (O'Reilly Media)
Panelists:
Jim Adler (Intelius), Solon Barocas (New York University)
So much of the privacy discussion is about data collection and access, fears of a future dystopia, and the complexities of law. There seems to be a real vacuum around how societal norms should be mapped to rapidly growing capabilities of big data. What's difficult about some of these big data use-cases is that even the intended and approved uses of data can lead to decisions or actions that negatively affect specific individuals or groups. These can range from effects on safety (by making a person more easily identifiable or locatable), to fairness (because the purpose of the application is some form of discrimination), to autonomy (by limiting individual choice or through subtle manipulation).
Regrettably, data professionals (e.g., scientists, engineers, designers, analysts) are left in a "don't ask don't tell" privacy conundrum where no framework exists to assess the societal impact of their work. Such a framework would need to go beyond default "procedural protections" (e.g., the Fair Information Practice Principles) to "substantive protections" that evaluate possible product impact at design-time and track actual impact as the product moves into the market.
This conversation will address, from academic and industrial perspectives, specific use-cases within people search, background checks, online advertising, and voter targeting. Through these use-cases, we'll explore the feasibility of a "responsible innovation" framework that might guide data professionals.