 Hi, this is Stu Miniman at Wikibon headquarters and joining me is Jeff Kelly and Talking about Cloudera today and their new Partner program that they've launched and Jeff I was wondering if you can walk us through a little bit about what Cloudera is talking About they've been in the news quite a bit lately. They have they announced partnership with Dell last week, of course And they followed that up this week with an announcement about a larger partner program Kind of contains two of my perspective two big Points that we want to talk about first is their the first ever Hadoop certification program Essentially it will allow software hardware vendors to certify that their technology is compatible with Cloudera's Apache Hadoop distribution The second part of course is the partner solution spotlight essentially is going to allow vendors to partners and other vendors to Basically show off on Cloudera site how they work with Hadoop Hadoop distributions And kind of basically educate the market a little bit about what you can do with some of the more traditional technologies You might have in your existing infrastructure along with Hadoop. Okay, great So let's let's dig in a little bit to the certification program because we've been talking about for probably the last year that really Cloudera really is to Hadoop what Red Hat is to the open-source Linux community And when I look at typically open-source, it's about making sure things get into the open-source stream and that we put together You know really the package, which is what how Cloudera has had for a while. So first of all, what's what's different about this certification? well, I mean essentially this is going to Basically Cloudera is trying to put their stamp on in this case literally a certification stamp On Hadoop they want to be the go-to Hadoop provider and one way to do that is to With the certification program essentially they want to get to the point where an organization considering implementing Hadoop It's only going to do so with Cloudera's distribution along with technologies that are certified So this is beyond their distribution This is complementary products and solutions that are going to get as you said their logo certified So we actually look they've got on their website here. It's the certified technology program to make things simpler So What I found this really interesting is if you look at kind of Linux Traditionally you had things that go into open-source, but if you talk about a certification program It really sounds more like building an ecosystem and Cloudera is looking to put themselves At the center of this ecosystem So remind me much more of what Microsoft did with Microsoft certification program and more recently VMware has done So this isn't just you know some open-source, you know Servicing and packaging but you know trying to build you know lots of companies get lots of contributors and build bigger solutions That that can be more robust in giving companies lots of options and lots of things that they can build on Absolutely, and as you said with Cloudera at the center of that of that ecosystem I think it's a pretty smart move on their part. They are really the leading distribution Hadoop distribution vendor at this point They they comply completely with the Apache open-source Project in fact they are pretty much the leading contributor to the Apache open-source Hadoop project in a way you could say Cloudera is the Apache Hadoop So I guess if I tease that out a little bit if I if I think about Linux Linux tends to be subservient to the hardware that it goes on So what I mean by that is Red Hat has a distribution But they really rely on the servers that it's it's going on or plugging underneath the hypervisor So it sounds like really Hadoop is much more of a software ecosystem and a much more robust and large Software ecosystem than even Linux is yeah, I think I would agree with that. Absolutely. Okay, so I guess two questions on that first is Does Cloudera really have the chops to be the leader in this community and you know Who's going to listen to them as the leader? I mean is it just a bunch of little startups? Who are you know, how how are they working with the broader ecosystem? Well, I you know, it's not just startups that they're partnering with we mentioned Dell of course last week But there were some other big players on their partner list if you go to that partner spotlight You can see IBM's on there IBM Natesa terror data And you've got some players on the application layer the analytics layer like a Tivio And then of course you've got some of the more Hadoop pure plays I should as I call them the karma spheres on their data mirror. So they really they're really embracing You know all manner of partners at this point You know, I think it'll be interesting. I think that the spotlight page is going to be a really good resource for End users considering getting getting involved with Hadoop as they begin to explore it Cloudera's website, you see, you know, maybe they're thinking that they hear about Cloudera's leading distribution They're poking around they're doing some research and they say oh, I see terror data here on this list I've got a terror data data warehouse. Let's see. They might give you some ideas How you can start? Utilizing Hadoop and that's you know, really what Cloudera is trying to do here is is spur more adoption Yeah, no, that's really interesting. I I like the point that you're saying is it's really that to a street on the Ecosystem people that want to learn about Hadoop probably are going to go to Cloudera They're gonna learn about other technologies and as we were talking about earlier. They're gonna say, oh, you know I'm already working with terror data, you know on some of my other projects Here's how it ties into what we're doing and then they'll learn about some of the new players like Tivio or Karmus here In this space. So pretty interesting. So Now we're talking about the spotlights and the partners. What else? What is Cloudera doing to kind of help those partners in the ecosystem? Well, as part of the program, they do have quite a quite a few resources available to them to to actually get to the point Where they're ready for certification. I don't know if you want to go to that page, but essentially kind of resources around training services How to integrate your technology with Cloudera's Apache Hadoop distribution? So the supporting that way, of course, they've got The spotlight page gives them a real a good platform, I think To show off what they can do and and where they can take Hadoop together Okay, so I guess we're at the point talking about, you know, education and customers learning about this Where are we with Hadoop adoption and you know, what's what's holding customers back? Where are they along this this path? Right? Well, you know, Hadoop is a relatively young and developing technology And as such, it's you know in terms of the adoption curve It's still at the low end. We're just kind of getting to the point where Maybe maybe just starting to cross the threshold from early adopters to creeping into the mainstream But but there are a few things holding it back and one of course is the lack of skilled Developers and database administrators with Hadoop experience and that's also the data scientists I would assume right and then of course yes And that's kind of the infrastructure level and then kind of the next level up would be the data scientist actually doing some of the analytics work and both those Sets of workers there are in high demand and unfortunately short supply at this point Which kind of leads to another, you know, another offering is the whole services angle here I think services gonna play a key role in the adoption of Hadoop, especially in the short term You know first two three five years Because there's just aren't enough trained Hadoop professionals for Every organization to have a full, you know Hadoop staff Sounds like we need a stimulus program for the government to start training people up. Well being big data scientists You know what that might not be a bad idea because there's definitely a demand for it It is a pretty complex Skill set you need to to learn. Yeah, so I don't know how easy it would be Long-term investment. It's a long-term investment But in the meantime there are services vendors out there a lot of the cloud area has some of their own services, of course But then you've got some other Services vendors like think big analytics And I think those types of vendors are gonna play an increasing role I know cloud area in particular mentioned to to me during a recent interview They don't see services to their business. They they want to see a Thriving kind of third-party services ecosystem develop This is so that that's interesting to see where you know Cloudera looks that they can make money on the distribution and where they're partnering and that's why I think it makes sense on that's Ecosystem in building out the partners. So let's spend a minute or two talking about the services angle on this sure Because you know really customers if we look at they need to be educated And how there is doing that, you know you and Wikibon and we're trying or definitely trying to help educate where we can But really we see services as a critical component for rolling out any new technology and obviously, you know Hadoop fits into there. So what can you give us your thoughts as to how that fits? Right? Well, I mean as I said, it's a very complex technology to Deploy and ministry administer manage And that's really what's holding back a lot of organizations. They just don't know how to do it They just don't have the skills. So that's where the services players come in You know the technology itself you could go to Cloudera and you don't necessarily have to purchase their enterprise edition You could purchase the or I should say use the community edition or go straight to Apache Hadoop and download for free The they're a free open-source distribution But then you've got to actually get it up and running integrate it with the rest of your IT infrastructure That's where the services come in and as I said I think you're gonna see more third-party vendors who are focused on big data and Hadoop in particular come come to the fore and then of your traditional services vendors are I think starting to understand the Market potential here there could be a really really good opportunity for services vendors Because this has a potential to be a billion dollar market and a big chunk of that is gonna be services. Yeah, okay So Jeff I guess looking forward, you know, what are you keeping an eye on what's coming in the near future? Where should people be watching for what's coming next with Hadoop? Well, you know, it's it's the continued evolution of the technology to you know, the goal eventually is you know an enterprise ready stable easy to deploy and manage Technology we're a ways from that But you know every little step we're seeing like this from Cloudera with these partnerships We're getting ever closer to that point still have has a ways to go But I think we're gonna see some more distribution vendors probably crop up We've you know, there are other vendors out there IBM is working with Hadoop You've got MapR and EMC Green Plum working together on a more of a closed proprietary version Which is interesting kind of tangent and we'll see where that goes We've got you know, there are other options like Lexus nexus has their HPCC Big data processing technology, which is kind of developed in parallel to Hadoop Within Lexus nexus, so there's a lot of there's still a lot of the market still really needs to work itself out And it'll be interesting to see where it goes, but I think Cloudera right now is in a really good position They've really established themselves I think as the leading distribution vendor at this point and this and being the first vendor with a certification program I think certainly helps them Establish themselves as the go-to The ego to Hadoop distribution vendor. Yeah, so it's definitely a nascent highly fragmented market Hadoop's really trying to take the reins here So event-wise we've actually we're gonna be having the cube at VM world and looking at a big data panel while we're there Seeing the intersection of virtualization in cloud with big data. Yeah, that's a whole nother topic. We could spend Hours talking about that look forward to that and then next month. We've also got the Big Stratoconference in New York City Yep, that's coming up. We'll be there as well. So definitely. I know you will you will be there and Looking forward to seeing there any other final pieces or notes you want to leave our watchers with well Certainly, you know, we've got lots of resources up on wikibon org. I encourage you to check out We've also got silicon angle comm is covering the day-to-day news around the big data space So that's definitely a the go-to place to kind of get stay up to date on the latest happening So we'll throw up some links on our site to all these different resources we have so you can check them out Okay, thank you Jeff for joining us appreciate the the update on cloud era in Hadoop and thank you for joining us. Thank you