by OreillyMedia 518 views
Mike Hendrickson of O'Reilly Media interviews Mikael Bisgaard-Bohr of Teradata Corporation.
As the VP Business Development for Teradata in EMEA Mikael Bisgaard-Bohr is responsible for identifying new trends and directions in the market for BI, Analytics and Big Data. He interacts with the largest and most sophisticated users of Teradata's technology as well as leading minds in the industry to gain a better understanding for how technology is changing how organisations are run, products consumed and how organisations and consumers interact in the future. He frequently shares that insight with Teradata customers and prospects as well as at leading conferences across the EMEA region. Mikael has been working with BI for the last 18 years and prior to his current role he was a thought leader and business consultant focused on the retail industry. Mikael holds an MBA from SDA Bocconi and lives in Copenhagen with his wife and 2 children.
by OreillyMedia 419 views
In this presentation Mikael Bisgaard-Bohr will discuss why the Big Data Wave is arriving now, what spawned it and how it will impact how we manage organisations whether they be private or public, and how organisations interact with their customers and stakeholders. As Mikael will show in his talk, Big Data will enable profound changes that will impact each one of us as consumers, workers and manager as well as the organisations that we function in today. He will also explain why "Big Data" is not really about the data but more about how the data itself and the new technologies enabling Big Data will be used.
by OreillyMedia 411 views
Liam Maxwell is the Executive Director of the IT Reform Group in the Cabinet Office and was appointed the Deputy Government Chief Information Officer (CIO) in April 2012.
His team is responsible for the delivery of the Government IT Strategy and Technical Leadership across government. He leads the spend control process for IT which is part of the Efficiency and Reform Group programme to save multiple billions across government.
Previously, Liam was Lead Member for Policy at the Royal Borough of Windsor and Maidenhead where his brief covered IT, sustainability and the council's role as a "government lab" where he introduced innovative policies at the heart of the Big Society agenda and delivered the lowest council tax in Britain outside London.
As well as academic roles he has a background as an IT Director in FTSE and Fortune 500 business service companies.
by OreillyMedia 1,020 views
Kenneth Cukier (The Economist) "The Manifest Destiny of Big Data"
Everyone uses the term big data but no on can agree on what it means or even if it's novel. However the label is useful to describe the radically new ways that the world interacts with information—for which the public, policymakers and even data geeks, are unprepared. Where Strata conferences tend to have a technical bent, this keynote considers big data's mainstream economic, political and social consequences.
The implications are far-reaching. Big data will replace human judgement in many instances just as steam power replaced muscle in the industrial revolution. It will mean greater efficiency and new winners and losers—not just among companies, but countries too. Yet it presents new challenges: not only for privacy but for "propensity"— algorithms predicting our behavior before we've acted, be it to repay a loan or commit a crime. Ultimately, we need new institutions to shape how we introduce big data into society, since it will become the undergirding our world whether we like it or not.
Kenneth Neil Cukier the data editor of the The Economist, and co-author of "Big Data: A Revolution that Will Transform How We Work, Live and Think" to appear in early 2013. Previously, he was the paper's Tokyo correspondent and before that, its technology correspondent in London. From 2002 to 2004 Mr. Cukier was a research fellow at Harvard's Kennedy School of Government, where he worked on the Internet and international relations. Additionally, Mr. Cukier serves on the board of directors of International Bridges to Justice, a Geneva-based NGO promoting legal rights in developing countries.
by OreillyMedia 434 views
Mark spent the past two decades working on analysis and decision support projects in many industries. He is the founder of Third Nature, a research and consulting firm focused on emerging technology and practices in analytics, BI and information management. Mark is also an award-winning former CTO and consultant who frequently speaks at US and European conferences.
I focus on two types of work: using data to make decisions and manage organizations, and building data technology infrastructure. I've spent the past two decades working on analysis, performance management and decision support projects. A big part of making decisions and using data in a corporate setting is ensuring that the right data capture and data delivery infrastructure is in place to manage the business. As a result, I do as much information strategy and IT architecture work as I do performance management and decision support.
by OreillyMedia 1,171 views
Kim Rees is a founding partner of Periscopic: http://www.periscopic.com, an award-winning information visualization firm. Their work has been featured in the MoMA as well as several online and print publications, including CommArts' Interactive Annual, The Information Design Sourcebook, Print magazine, and numerous websites, blogs, and regional media outlets. Periscopic's body of work was nominated for the Cooper-Hewitt National Design Awards.
Kim is a prominent individual in the information visualization community. She has published papers in Parsons Journal of Information Mapping, was an award winner in the VAST 2010 Challenge, and is an advisor to the Congressional Budget Office. Kim has presented at several industry events including Strata, Wolfram Data Summit, Eyeo, VisWeek, and various data visualization groups among others. Recently she has also been an adviser on an upcoming documentary film and was the Technical Editor for Visualize This by Nathan Yau. Kim received her BA in Computer Science from New York University.
by OreillyMedia 756 views
Governments around the world are opening up their data vaults to release a "tsunami of information". This big data provides critical insight into the way government works, but thousands of datasets are published every day and are often ignored. How can we make this data accessible for everyone to analyze?
When the UK government published every item of spending over £25,000, it was big data published in a complex way. UK Guardian Datablog made that data publicly available and asked its users to get involved in poring through the numbers. The result was readers and journalists working together to make real stories out of raw numbers.
In the past, journalists kept the raw information close to their chests; now by opening it up to the world, everyone becomes a potential data journalist. Every day the Guardian publishes the raw data behind the news using Google tools, opening up and making government statistics accessible in a form their readers can use.
Publicly sharing and visualizing data can help gain insight and answers to questions that aren't immediately obvious. The public can combine the public data with their own. Using the tools available, anyone can now create their own stories and visualizations with the data, which leads to increased knowledge and action.
In this keynote, we explore how the UK Guardian Datablog is at the forefront of data journalism. We'll cover several of the tools and technologies they have used to collect, analyze and visualize, and share their data with the world.
Kathryn Hurley recently joined Google as a Developer Programs Engineer for Fusion Tables. In this role, she helps spread the word about Fusion Tables by presenting at conferences and developer events. She recently worked on Google's 2010 U.S. Election Ratings gadget. She received an MS in Web Science from the University of San Francisco and a BS in Genetics from the University of California, Davis. Prior work experience includes research in mobile and peer-to-peer computing.
Simon Rogers is editor of the Guardian's Datablog and Datastore, an online data resource which publishes hundreds of raw datasets and encourages its users to visualise and analyse them. He is the author of Facts are sacred: the power of data available now on Kindle. Simon is also a news editor on the Guardian, working with the graphics team to visualise and interpret huge datasets. He was closely involved in the Guardian's exercise to crowdsource 450,000 MP expenses records and the organisation's coverage of the Afghanistan Wikileaks war logs. Previously he was the launch editor of the Guardian's online news service and has edited the paper's science section. He has edited two Guardian books: How Slow Can You Waterski and The Hutton Inquiry and its impact. Simon has just been awarded the Oxford University Internet Institute's award of 'Best Internet Journalist' and was recently honoured at the Knight Batten awards for journalistic innovation. The Datablog and Datastore have won awards in 2011 for innovation from the UK's Online Media Awards and the Newspaper Awards. In 2010, Simon received a special commendation from the Royal Statistical Society in its awards for journalistic excellence.
by OreillyMedia 1,964 views
Evolution in the digital universe has been driven, since the beginning, partly by improvements in code and partly by improvements in machines. Alan Turing's one-dimensional model of universal computation of 1936 led directly to John von Neumann's 5-kilobyte, two-dimensional implementation of 1946. The resulting address matrix, still in place after sixty years, is how the machines know where to find the code, and how the code knows where to find the machines. Mapping real-world correspondence to data structures populating a storage matrix currently expanding by some 5 trillion bits per second is the challenge that brings us here.
George Dyson is a historian of technology whose interests have included the development (and redevelopment) of the Aleut kayak (Baidarka, 1986), the evolution of digital computing and telecommunications (Darwin Among the Machines, 1997), and a path not taken into space (Project Orion, 2002). His latest book, Turing's Cathedral: The origins of the digital universe, illuminates the transition from numbers that mean things to numbers that do things in the aftermath of World War II.
by OreillyMedia 559 views
We've all heard the opening line before: We're in the midst of a data revolution, a time in which those with the right skills can harness streams of information to yield insights and create value. Yet for all of our machine learning algorithms and big data tools, so many of the problems we solve day-to-day are decidedly "first world": figuring out how to get the biggest ROI on ad dollars or crafting personalized movie recommendations. Can we use our skills as data scientists to solve social problems as well, helping people find clean water as easily as they can find good restaurants? How do we draw insight from big (and not-so-big) data to give perspective to social changemakers who don't have the resources to hire a full-time data science team? How will the influx of publicly available government, foundation, and third party data shape the kind of problems we tackle and the communities we connect with? If we truly are in the midst of the data revolution, let's make sure we're playing the part of the heroes.
Jake Porway is a machine learning and technology enthusiast who loves nothing more than seeing good values in data. He is the founder and executive director of DataKind, an organization that brings together leading data scientists with high impact social organizations to better collect, analyze, and visualize data in the service of humanity. Jake was most recently the data scientist in the New York Times R&D lab and remains an active member of the data science community, bringing his technical experience from his past work with groups like NASA, DARPA, Google, and Bell Labs to bear on the social sector. Jake's work has been featured in leading academic journals and conferences (PAMI, ICCV), the Guardian, the Stanford Social Innovation Review, and he has been honored as a 2011 PopTech Social Innovation Fellow and a 2012 National Geographic Emerging Explorer. He holds a B.S. in Computer Science from Columbia University and his M.S. and Ph.D. in Statistics from UCLA.
by OreillyMedia 272 views
There are a lot of dreams centred around the idea that opening up data will bring economic and social benefits: economic growth, more efficient public services, better democratic engagement and so on. Open data is a powerful tool, and can revolutionise how organisations achieve their goals, but we have learned that connecting the two can be difficult. Jeni Tennison, Technical Director of the newly formed Open Data Institute, will describe the ODI's twin aims of helping data owners achieve their organisational objectives through publishing open data, and helping those who reuse that data to add value responsibly and effectively, thereby turning open data dreams into reality.
Open Data Institute
Jeni Tennison is the Technical Director of the Open Data Institute. As a developer, she specialises in open data publishing and consumption, including XML, JSON and linked data APIs. She trained as a knowledge engineer, gaining a PhD in collaborative ontology development. Jeni was the technical architect and lead developer for legislation.gov.uk and worked on the linked data aspects of data.gov.uk. She is author of several technical books and was appointed to the W3C's Technical Architecture Group in 2011. She is also a member of the Open Data User Group and of the UK Government Linked Data Group.
by OreillyMedia 1,275 views
The Great Railway Caper: Big Data in 1955
It's 1951 and you've got the world's first business computer and you've just been handed a Big Data problem. Go! With 2K of memory it was powerful enough to run the then massive Lyons business. But it wasn't long, in 1955, before Big Data came calling in the form of a request from British Rail to calculate the shortest distance between every one of their 5,000 railway stations.
The British Rail network was, of course, a graph with edges labelled with the distance between the stations, but graph algorithms hadn't been invented yet. Dijkstra's Shortest Path algorithm which is widely used today wouldn't be described until 1959. And with 5,000 stations there were 12.5 million distance pairs to calculate exceeding the memory capacity of LEO by four orders of magnitude (equivalent today of a single large machine being faced with a petabyte to calculate).
What could be more modern? Data that exceeds available memory by orders of magnitude; a graph structure; the need for new algorithms.
In this talk I'll tell the story of Roger Coleman who worked out how to partition the problem into manageable parts, discovered Dijkstra's algorithm, and hand coded the solution in assembly language.
by OreillyMedia 433 views
"The Quiet Comfort of the Internet of Things"
Founder of Good Night Lamp / Designswarm Founder
Alexandra Deschamps-Sonsino is an interaction designer & entrepreneur. She is the founder of Good Night Lamp, a family of internet-connected lamps. She also leads Designswarm an "internet of things" design studio & consultancy and works with clients who want to design next generation connected products. She also uses her expertise to help shape early business ideas around smart products. Her work has been exhibited at The Victoria & Albert Museum and the Museum of Modern Art in New York. From 2007 to 2010, she co-founded and ran Tinker London, a smart product design studio. Focused on creating connected product experiences that linked the digital to the physical, Tinker was the first distributor of the Arduino platform in the UK, ran workshops around the world and offered design and consultancy services. She is also involved in organising technology & design community events in London such as the Internet of Things meetup and This Happened London . Alexandra has been focused on the "internet of things" and its implications in the design of everyday products since 2005. Find her on twitter or see her linkedin profile.
by OreillyMedia 903 views
Ever searched for loose change down the back of the sofa? Ever wondered where all those odd socks go? Or the missing biros? How about the tea spoons from the work canteen? No? Well nor have I, much more important things to do after all. But I have searched for data in the cracks between software architectures, trailed people down the street watching them leak data, data that can tell me and others about their everyday lives, and mined unstructured data for their secrets.
Big data isn't just multi-terabyte datasets hidden inside eventually-concurrent distributed databases in the cloud. It's also about the hidden data you carry with you all the time, data that is generated for you and about you, but not necessarily by you. Hidden data, your data, carrying on its secret life without your knowledge, but with your implicit and implied consent.
Alasdair Allan is the author of Learning iOS Programming, Programming iOS Sensors, Basic Sensors in iOS, Geolocation in iOS, iOS Sensor Apps and Arduino and Augmented Reality in iOS. Last year he and Pete Warden caused a privacy scandal by uncovering that your iPhone was recording your location all of the time without you knowing. This caused several class action lawsuits and a U.S. Senate hearing. He isn't sure what to think about that. From time to time he stands in front of cameras, and you can often find him at conferences run by O'Reilly Media.
He runs a small technology consulting business writing bespoke software, building open hardware and providing training, including a series of workshops on sensors. He sporadically writes blog posts about things that interest him, or more frequently provides commentary about them in 140 characters or less.
Alasdair is also a senior research fellow at the University of Exeter. As part of his work there he built a distributed peer-to-peer network of telescopes which, acting autonomously, reactively scheduled observations of time-critical events. Notable successes included contributing to the detection of the most distant object yet discovered, a gamma-ray burster at a redshift of 8.2.
by OreillyMedia 1,660 views
Data is great. Data is powerful. But when some data is missing, bias can be introduced, distorting the overall picture. Randomised controlled trials are the best tool we have in medicine for finding out if a treatment works or not, and lots of trials are done. But unfortunately, the results of these trials can go missing in action after they are completed, and trials with "negative results" are more likely to go missing. This means we have a biased sample, overestimating the benefits of treatments. To prevent all this happening, various regulations have been passed around the world. They have not been enforced, and the problem has persisted. I'll describe a small project trying to document and prevent this problem.
Ben is a best-selling author, broadcaster, medical doctor and academic who specialises in unpicking dodgy scientific claims from drug companies, newspapers, government reports, PR people and quacks. Unpicking bad science is the best way to explain good science.
Bad Science (4th Estate) has sold over 400,000 copies, is published in 18 countries, and reached #1 in the UK paperback non-fiction charts. His book exposing bad behaviour in the pharmaceutical industry will be published in 2012 by 4th Estate.
Ben has written the weekly Bad Science Column in the Guardian since 2003. It's archived on this site along with blogposts, columns for the British Medical Journal, and other writing.
There are lots of clips of Ben on telly here, and a talk at TEDGlobal here. The Placebo Effect is a two-part documentary series he made for BBC Radio 4. The Rise of the Lifestyle Nutritionists is another. He's appeared on the Today programme lots of times, Any Questions, Newsnight, Start The Week, The Now Show, Loose Ends, PM, Quote Unquote, Watchdog, and various other things. You can find plenty of it if you dig around on the site, along with lectures, podcast interviews, maybe start Here.
He has given over 250 talks in the past 5 years, from comedy clubs and music festivals to universities and schools, government departments, and more. You can book him for after dinner speaking by emailing email@example.com.
He's received lots of awards for writing, and a few honorary doctorates.
This is what Google thinks about him, this is what the blogs say about Bad Science. He was trained in medicine in Oxford and London.
Ben is 36 and currently works full time as an academic in epidemiology. He does not see private patients.
by OreillyMedia 103 views
Jason Phelps, EMEA Sales Director, Splunk.
Splunk® Inc. provides the engine for machine data™. Splunk software collects, indexes and harnesses the massive machine data continuously generated by the websites, applications, servers, networks and mobile devices that power business. Splunk software enables organizations to monitor, search, analyze, visualize and act on massive streams of real-time and historical machine data. More than 4,000 enterprises, universities, government agencies and service providers in more than 80 countries use Splunk Enterprise to gain operational intelligence that deepens business understanding, improves service and uptime, reduces cost and mitigates cyber-security risk. To learn more please visit www.splunk.com/company.
by OreillyMedia 93 views
John Kreisa, VP Marketing, Hortonworks.
Hortonworks is a leading commercial vendor of Apache Hadoop, the preeminent open source platform for storing, managing and analyzing big data. Our distribution, Hortonworks Data Platform powered by Apache Hadoop, provides an open and stable foundation for enterprises and a growing ecosystem to build and deploy big data solutions. Hortonworks is the trusted source for information on Hadoop, and together with the Apache community, Hortonworks is making Hadoop more robust and easier to install, manage and use. Hortonworks provides unmatched technical support, training and certification programs for enterprises, systems integrators, and technology vendors. For more information, visit www.hortonworks.com.
by OreillyMedia 144 views
Russell Kennedy, VP, Product Strategy, Cleversafe.
Cleversafe has created a breakthrough technology that solves petabyte and beyond big data storage problems. This solution drives up to 90% of the storage costs out of the business while enabling secure and reliable global access and collaboration. Cleversafe is combining the power of Cleversafe's Dispersed Storage® System with Hadoop MapReduce on the same platform. This solution replaces the Hadoop Distributed File System (HDFS) which relies on 3 copies to protect data thereby significantly improving reliability and allowing analytics at a scale previously unattainable. Many of the world's largest data repositories rely on Cleversafe for limitless data storage.
by OreillyMedia 178 views
Andrew Grinalds, Strategic Marketing, MAPR Technologies
MapR delivers on the promise of Hadoop, making managing and analyzing Big Data a reality for more business users. The award-winning MapR Distribution brings unprecedented dependability, speed and ease-of-use to Hadoop. Combined with data protection and business continuity, MapR enables customers to harness the power of Big Data analytics. Leading companies including Amazon, Cisco, EMC and Google partner with MapR to deliver an enterprise-grade Hadoop solution. MapR's open, enterprise-grade distribution for Hadoop, is also easy to start using. Visit www.mapr.com to try the VM demo version, install on a cluster or use in the cloud.
by OreillyMedia 67 views
Gavin is the founder and CEO of AMEE, which provides trusted, comprehensive and timely answers to millions of environmental questions.
Gavin is a serial entrepreneur with over a decade of experience in building data businesses, 20 years in internet and software development, and a background in astrophysics.
His experience spans business, technology, science and media. Leading projects with organisations as diverse as the UK Government, SAS, Google, BBC, Rolls Royce, LogicaCMG, Shell and Christian Aid, he has broad and deep knowledge of how data can change the world.
AMEE aggregates and automates access to the world's environmental and energy information. Its unique partnerships with world-leading standards providers including the WRI, DECC and CDP, make AMEE the most comprehensive and advanced web-based solution for greenhouse gas emissions, energy efficiency, and other environmental data.
by OreillyMedia 298 views
Kaitlin comes from Digital Science, a new technology company started out of Macmillan Publishers, providing tools for researchers. She's a technologist, open science advocate, and data nerd who works in her day job to make scientific research more efficient through better use of technology. Prior to moving to the UK to work for Digital Science, she managed the science division of Creative Commons where she worked to enable better knowledge sharing and research. She is also the external co-chair for Strata Conference in London, taking place October 1-2. For more about Digital Science, visit http://digital-science.com. You can follow her at @kaythaney.
by OreillyMedia 145 views
Nick Halstead, CEO and Founder, Datasift
You're thinking - not yet another big data provider, right? We're all about a cloud-platform for social data and social insights. Our platform gulps down the twitter firehose and other social data sources to help you filter, augment and create data feeds and insights for your apps or business. We enrich social data so you know the sentiment, influence, geolocation, etc of conversations. And we're about to launch historical twitter data so you can do trend analysis. Best of all we have super-fast developer APIs and transparent pricing so you can get started today.
by OreillyMedia 166 views
Couchbase is the NoSQL database leader, with production deployments at AOL, Deutsche Post, NTT Docomo, Orbitz, Salesforce.com, Turner Broadcasting Systems, Vimeo, Zynga and hundreds of other household names worldwide. Couchbase Server is the simple, fast, elastic NoSQL database that delivers a more scalable, high-performance, and cost-effective approach to data management than relational database technology. It is particularly well suited for web applications deployed on virtualized or cloud infrastructures, and for applications requiring real-time data synchronization between mobile devices and the cloud. Couchbase is a privately held company funded by Accel Partners, Ignition Partners, Mayfield Fund and North Bridge Venture Partners.
by OreillyMedia 147 views
Nuno Sebastiao, CEO, FeedZai
FeedZai Pulse™: Real-Time Big Data, Made Easy! is a revolutionary next-generation business intelligence product that combines the power of its real-time data processing engine (Pulse™) with the scalability of big data processing solutions such as Cassandra™ or Hadoop™ and the predictive analytics of R in a single unified view of past, present and future. Launched in May 2011, FeedZai Pulse attracted clients from 4 continents and processes data from over 90 countries. Pulse processes events at rates of up to tens of thousands of messages per second, while comparing against months of continuously updated historical data in self-serving business dashboards. FeedZai was named a Cool Vendor by Gartner in 2011.