 News and analysis from Big Data SV 2014 is brought to you by headline sponsors, Actian, accelerating Big Data 2.0. And WAN Disco, we make Hadoop invincible. Hello, everyone, welcome back to our wrap up of the Big Data SV, Big Data Silicon Valley, Innovation Hour event, Silicon Angles the Cubes, Big Data event in Silicon Valley. This is an augmentation to our Big Data New York City event, hashtag Big Data NYC and this is the hashtag Big Data SV. I'm John Furrier, the founder of Silicon Angle and I'm excited to say we had a great event, over 44 interviews here on theCUBE over two and a half days, we talked to a lot of people. We had a party, all of our community, all the CUBE alumni and their friends came last night. Great event, everything's happening here. Great show from a new standpoint at Stratoconference, a lot of buzz sold out the whole show, a lot of activity. I'm John with Jeff Kelly, chief analyst for Big Data at Wikibon and Jeff Frick, GM of our CUBE business. Jeff, CUBE was a success again. CUBE was a success, this is our second show that we've done ourselves to go out and do our own Big Data SV and we're really excited with the response of the community, really excited about the interviews that we had and some of the insight that we saw. It takes a lot of people to put a big show one like this and it's really a nice thing when we pull it off and it works and I think we've got some good info. Yeah, and I think people get to know the CUBE, it's our 50th year, we have a lot of CUBE alumni. Now, it's like, hey, CUBE alumni, this is getting kind of played, right? We've interviewed over 3,300 people almost over the life of the CUBE and we'd like to stream the data. We are about providing the data unfiltered, extracting the signal from the noise. We love talking to smart people, extracting the knowledge and sharing it with you. That's our mission, we're looking forward to continuing to do that. We want to thank our underwriters who got us here. Thank you very much. If you're interested in sponsoring more of the CUBE so it gets bigger and better, we're happy to talk, talk to Jeff about that. But getting back to the data and extracting the signal from the noise, Jeff Kelly gave a big presentation keynote at Strata today presenting his data on the market study, survey sizing, Jeff, your take on the feedback and the data that you're getting from the field. Well, I think a few takeaways I had from this week. Number one, I think it's great that the conversation is moving away from the technology and more to the business value, what the technology enables. So that's a good sign for the market. It's a good sign for all these vendors out here and it's a good sign really for enterprises who can now actually look to some of these providers to actually deliver value and not just a set of technologies and essentially more work. You've got to integrate them and get things running inside your organization. Some other things that I thought were really interesting was again the business focus and talking more to business people rather than just IT. We had Cloudera on talking about their enterprise data hub strategy and how it really enables them to shorten that conversation about the tech and the guts of the technology. So they're not talking so much about Hadoop and Flume and Scoop, they're talking about business value to a business person. And that's really the kind of up level conversation that's gonna move this market forward. A few other things that I think the partnerships that were announced, we're gonna see more of that this year, I think that's critical again to getting big data into the enterprise. We saw MapR and EP Vertica with a big announcement. Of course, another big one was Hortonworks and Red Hat. So you're gonna continue to see that this year, I think, rather than some of those big product announcements because now a lot of the technology is kind of starting to solidify around Hadoop and some of the other things that you can do with it thanks to Yarn and some of the other innovations we saw last year. And now it's about implementing it, making it real in the enterprise. So we're starting to see some settlement of people finding their swim lanes, as we say. Amongst the players. And obviously, this industry would not be possible without all the great work at Cloudera when they first started, they were the first company in. Amarawa Dalla, Jeff Hammerbacher, Michael Olson, Kristoff, Basile that we be data. Those guys, they started the movement and the folks at Yahoo then spun out for Hortonworks. That was a seminal moment. I think when history is written, I think they're gonna look back and forget about all the little mudfights going on and open source and they're gonna look at those guys and saying that's really what it's all about. And I talked to Michael Olson about this and they have a mission. From day one they've had a mission and then the Yahoo guys became Hortonworks. Again, the mission of Hadoop is the real deal. That's the big story here, that it's a done deal. And the open source community continues to evolve and storage, compute, algorithms, the tools, the decision tools are all becoming open source and open source is now the core lever for the massive innovation and change. So that's a big story. And I think at the end of the day, competition will still happen and it's good to see that but they're set a limit into their groove. I mean, you see Cloudera, what's your take on those swim lanes? Well, yeah, I mean, if you're talking about Hadoop specifically, I agree with you. I think all three of the kind of the pure play Hadoop vendors, Cloudera, Hortonworks, and MAPR are kind of finding their identities. Cloudera specifically really, I think has had a really interesting last six months or so with their introduction of the enterprise data hub and really focusing on, or not worrying so much about whether they're proprietary or open source, focusing on what they do well, the strengths that they have and really simplifying the message, simplifying the pricing, making it easier for their customers to understand what they do and how best to engage with them. So great job by Cloudera, they're set for a really big year if they can execute and by all accounts, they're ready to do that. From Hortonworks, they have in waivers from day one. They're open source, they're gonna leverage the community, they're gonna leverage partnerships to get out into the enterprise and really get to scale. So they've done that last year with partnerships with Teradata, with Microsoft, SAP and others. This year is about, again, for them it's also about execution, executing those reseller arrangements, getting the Teradata Salesforce out there, reselling Hortonworks, same with the SAP Salesforce, et cetera. So they're also primed for a big year if that pans out the way they expect and MAPR, MAPR give them a lot of credit. They're not ideologues, they focus on solving real problems for real customers today, whether that's open source, proprietary, some combination. They support open source projects, but they're not shy of injecting their own IP when they think that's gonna help customers. They had that great partnership announcement with HB Vertica. So I think all three of them are finding their way. There's still gonna be plenty of squabbles this year. There's competition, there's gonna be plenty of back and forward banter, but I think you're right. They're finding their own groove and there's a lot of, this is a big market as we saw, there's room for everybody. Okay, well I took some notes down throughout the interviews and I wanna share with you my summary of kind of what happened here in the moment in time of big data Silicon Valley or big data SV. Obviously I mentioned storage, compute, outroads and tools are all the key. Hadoop as a platform is viable, analytics continues to be the holy grail application and everything else around is kind of filling in and you're starting to see the swim lanes from the vendors, as Jeff was pointed out. I think the real focus I heard here was the next step is the ecosystem. You start people talking about their ecosystems and that's a normal conversation around evolution around platforms. Outside of that, one of the key highlights from a topic standpoint is the business focus is number one and it's greater than a POC and we're seeing no real discussions around POCs. They're talking about budget. So the business conversation is not a bad thing. In Silicon Valley, people don't have those conversations normally but right now, if you're not talking about business outcomes and business model value, then you are irrelevant and that is clearly coming through this show and entrepreneurs and companies out there who have high valuations or high hopes, if you don't have that business conversation dialogue, you will be left out in the cold. So that's very clear from the show here. Two, data science continues to evolve and be a very hot topic. The notion of scale has been checked. That's kind of behind us. Hadoop checked behind us. The key conversations around data science is discovery, adaptive data, insights for the data, decision tooling, making it easier and to extend to that as a whole new category topic, I'm gonna put down as the interface problems around data, whether it's a mobile app or graphs or making it simplifying, the interface problem is really a big deal. Joe Hellerstein kind of highlighted that in our conversation with that and that's key. And then fourth, open source continues to dominate and will not stop in my opinion. It's really changing the computer science equation both from how computer science is being done and then the effects of the disruption. And then finally, the past, the yarn, the in memory, these are topics that are gaining big steam. What is a platform as a service for big data? Yarn has got a lot of traction. In memory of Spark is clearing the runway and that's gonna continue to change the data workflows and how this integrates into the business platforms which converts into IT, et cetera. And finally, my honorable mention topic is always gonna be emerging out and that's internet of things. Internet of things is front and center. That's gonna be a big part of the conversation this year on top of those topics. So in summary, I've had to kind of boil up the topics, business, data science, interface problem, open source dominating, past, yarn, in memory, data workflows and internet of things. So to me, Jeff, that is my take on what's happening and it's great news for everyone. The growth is there. Couldn't agree more. I mean, you said it well. It's up level to conversation, talking about open source, data science, really moving towards that value conversation which is really where, that's when you know the market's starting to really become mature. Yeah, and I think looking down the road a little bit further I think Joe also said, hello, senior, it was one of our last guests. What happens to the way you think about problems when storage and compute power is basically infinite and free? Which it's not there yet but it's the old economics curve, it's absolutely approaching that line. And so how do you readdress your problems? How do you think of the way you attack your problems in this kind of new world? And we're very quickly headed that way. We are at the open compute forum with now open source is getting into the hardware side of the business, which is a whole another disruption that we're gonna soon see. All the armchips going into that piece kind of disrupting the x86 dominance which has been locked in for such a long time. I think that's the other one. And what he also said, he was a great guest professor at Berkeley so he's a little bit further thinking down the road is there's a whole new rash of opportunities that are coming because of this way to look at data. Jeff, I want to ask you, you were out managing the front and making sure everyone's coming in. Our team here did a great job, props to the whole CUBE team. I want to ask you a question. You know, with the party last night, we saw the community of the CUBE kind of coming together for the second time. What were some of the things that you've heard from folks? You had a lot of chances to prep the guests. You were scheduling, you were interfacing with all the party folks. What's the vibe of the CUBE? We were sitting behind the desk all day. What was happening out there? I think the vibe is positive. I just talked with Sarkin and Pauline before she came in and she said, how do you like doing your own events? Is that a good thing or a bad thing? And I told her, it's really a good thing. It's a new challenge for us, but it's an opportunity to bring our community together in a unique way. And we're seeing, as you said, I think at the parties, a lot of biz dev going on, people know each other and it is a small community of people that are changing the world in big data and cloud. And I think we're really fortunate to be able to participate and get the tech athletes, as we love to call them, on to tell the stories because we had big companies here this week and we had small companies, we had new partnerships and we had emerging people coming out of nowhere. So it's a really exciting time to be in the tech space. We're really excited to be able to go out to these events and bring the folks to you and the audience to hear from them and see what's going on. I've been through a bunch of booms and busts in the valley since back in the early 90s and this one feels like a great boom but what's different is it's based on real business value and we hear repeatedly time and time again that in these really large scale operations, if you can eke out small percentages of deltas, the impact is enormous because you're talking about a big numerator. So it's exciting. I think people are excited to be here and I think big data NYC in the fall is gonna be another great show. Well, we love talking to the tech athletes and I think that's the key thing about the party highlights for us is that we've interviewed a lot of people, a lot of people watch, we had over 100,000 live views on just on the live stream. So again, the interest is there. We're happy to bring it to you and you know what? We love doing it and the social relationships that come out of the data that we're streaming, the raw materials unfilter and I think the beautiful thing about this new social media world that we're living in is that the value of the data in context to being unfiltered, a lot of people make up their own mind and we're proud to do that. We would not be able to do it without the sponsors. So Jeff, share with the folks out there, some of the folks who helped underwrite our independent data streaming operation. Yeah, again, thank you very much. So it's Acti and Wendisco as our featured sponsors. Thanks a lot for coming in and supporting Big Data SV. We had Alterics, Cloudera, InfoObjects, MapR, who's a great sponsor of theCUBE, Syncsort, HP Vertica, we did the HP Vertica data show last year, Trasada and Squirrel. So again, thank you so much to all those companies for underwriting us. If you stop by theCUBE, if you're able to make it by the Yosemite Room where you've seen us on the road, you see we've got a lot of people, a lot of gear. We can't do this without underwriters helping us out and so big shout out to them. Also why I've got a minute, just big thanks to the guys that you don't see behind the lights here in the studio. Mark and Matt, Alex, Andrew, Greg and Mick who are working hard. They're up early, they're up late. They're working over at the board. And of course a big shout out to our Dave who aren't with us here. Dave Vellante had to go back early. And then Dave Butler who keeps us all on track back in Boston. Yeah, and Chris and Nicole, and then Chris and Nicole, the blog, getting all the blog posts out there. We want to put as much content as possible at siliconangle.com. Look, go to wikibon.org for all the free research, all the free content. We're glad to bring it to you. And you know what? I had people come to me last night and said, I want to have hashtag Big Data Chicago. And so we'll definitely be in New York. So Big Data NYC and Silicon Valley will be continuing to do this. But we might bring it on the road. There's a lot of interest. If you want to help us there, just let us know, contact Jeff. Or if you want to talk to theCUBE, we'd be happy to do your events. Again, we want to go where the action is and we will go wherever it takes to find the signal from the noise. That's our mission. This is theCUBE. Thanks for watching. This is a wrap for Big Data SV. We'll see you on our next CUBE show.