 Live from San Jose, it's theCUBE. Presenting Big Data Silicon Valley, brought to you by SiliconANGLE Media and its ecosystem partners. Welcome back to theCUBE, our continuing coverage of our event, Big Data SV. We are on day two of this event. I'm Lisa Martin with George Gilbert. We've had a great day yesterday, learning a lot and really peeling back the layers of Big Data, looking at it from different perspectives, from challenges to opportunities. Joining us next is one of our CUBE alumni, Chris Selland, the VP of Strategic Alliances from Unify Software. Chris, great to meet you, welcome back. Thank you, Lisa, it's great to be here. I have to say, as an alumni and a many-time speaker, this venue is spectacular. Congratulations on the growth of theCUBE and this is an awesome venue. I've been on theCUBE a bunch of times and this is as nice as I've ever seen it, so onward and upward. This place is great. Isn't it cool? This is our 10th Big Data event. We've been, I think, five now in San Jose, do our fifth one in New York City in the fall and it's always interesting because we get the chance, George and I and the other guests host to really look at what is going on from different perspectives in the industry of Big Data. So before we kind of dig into that, tell us a little bit about Unify Software. What do you guys do? What is unique and differentiating about Unify? Sure, yeah, so I joined Unify a little over a year ago. I was attracted to the company because it really, I think, is aligned with where the market is going and Peter talked this morning, Peter Burris was talking this morning about networks of data. Unify is fundamentally a data catalog and data preparation platform kind of combined or unified together. So, you know, some people say, what do you do? We're a data catalog with integrated data preparation and the idea behind that, to go to Peter's mention of networks of data, is that data is becoming more and more distributed in terms of where it is, where it lives, where it sits. This idea of we're going to put everything in the data warehouse and then we're going to put everything in the data lake. Well, in reality, some of the data is in the warehouse, some of the data is in the lake, some of the data is in SaaS applications, some of the data is in blob storage and where is all of that data? What is it and what can I do with it? That's really the fundamental problem that we solve. And by the way, solve it for business people because it's not just data scientists anymore, it's really going out into the entire business community now, you know, marketing people, operations people, finance people, they need data to do their jobs. Their jobs are becoming more data-driven but they're not necessarily data people. They don't know what schemas are, joins are, but they know I need better data to be able to do my job more effectively. So that's really what we're helping with. So Chris, it's kind of interesting as you distill, you know, the capability down to the catalog and the prep, so that it's ready for a catalog. But that sort of thing is, it's like investment in infrastructure in terms of like building the highway system. But they're going to be, you know, for those early highways, there's going to be roots that you, there were a reason to build them out. What were some, what are some of those early use cases that justifies the investment in data infrastructure? There absolutely are. I mean, and by the way, those roots don't go away. They're those roots, you know, just like cities, right? New roots get built on top of them. So we are very much, you know, about there's still data sitting in mainframes and legacy systems and, you know, that data is absolutely critical. For many large organizations, we do a lot of working in banking and financial services and healthcare. Are there common use cases that they start with? A lot of times. Either by industry or just cross-section. Well, it's interesting because, you know, analysts like yourselves have tended to put data catalog, which is a relatively new term, although some other big analyst firm that's having another conference this week, they were telling us recently that starts with a G, right? They were telling us that data catalog is now the number one search term they're getting, but it's been, by many analysts, also kind of lumped in, lumped in is the wrong word, but incorporated with data governance. So traditionally governance, another word that starts with G has been the term. So we often, we're not a traditional data governance platform per se, but cataloging data has to have a foundation of security and governance. You know, think about what's going on in the world right now, both in the court of law and the court of public opinion, things like GDPR, right? So GDPR sort of says any customer data you have needs to be managed a certain way with a certain level of sensitivity, and then there's other capabilities you need to open up to customers, like the right to be forgotten. So that means I need to have really good control. First of all, knowledge of control over and governance over my customer data. I talked about all those business people before. Certainly marketers are a great example. Marketers want all the customer data they can get, right? But there's social security numbers, PII. Who should be able to see and use what? Because if this data is used inappropriately, then it can cause a lot of problems. So, so IT kind of sits in a, they want to enable the business, but at the same time there's a lot of risk there. So anyway, going back to your question, you know, the catalog market has kind of evolved out of the governance market with more of a focus on kind of, you know, enabling the business, but making sure that it's done in a secure and well-governed way. Guardrails. Yes, guardrails, exactly. Good way to say it, so yeah, that's good. I said about 500 words and you distilled it to about two, right, perfect, yeah. So in terms of your role in strategic alliances, tell us a little bit about some of the partnerships that Unify is forging to help customers understand where all this data is, to your point earlier, the different lines of business that need it to drive, identify where's their value and drive the business forward can actually get it. Absolutely, well, certainly to your point, our customers are our partners and we can talk about some of them. But also strategic alliances, we work very closely with a number of, you know, larger technology companies. Microsoft is a good example. We were actually part of the Microsoft Accelerator Program which I think they've now rebranded Microsoft for startups, but we've really been given tremendous support by that team and we're doing a lot of work to kind of, we're to some degree cloud agnostic. We support AWS, we support Azure, we support Google Cloud, but we're doing a lot of our development also on the Azure Cloud platform. But, you know, customers use all of the above so we need to support all the above. So Microsoft's a very close partner of ours. Another, I'll be in two weeks and we've got some interesting news pending which unfortunately I can't get into today, but maybe in a couple of weeks with Adobe, we're working very closely with them on their marketing cloud. Their experience cloud, which is what they call their enterprise marketing cloud, which obviously big, big focus on customer data. And then we've been working with a number of organizations and the sort of professional services system integration. We've had a lot of success with a firm called Access Group. We announced a partnership with them about two weeks ago. They've been a great partner for us as well. So, you know, it's all about an ecosystem. Making customers successful is about getting an ecosystem together so it's a really exciting place to be. So, Chris, it's actually interesting. It sounds like there's sort of the two classic routes to market. One is essentially people building your solution into theirs, whether it's an application or, you know, or enabling layer. Yes, yes, even a higher layer. And, but with corporate developers, you know, it's almost like we spent years trying experimenting with these data lakes, but they were a little too opaque. And, you know, it's not just that you provide the guardrails, but you also provide sort of some transparency into that. Have you seen a greater success rate within organizations who curate their data lakes as opposed to those who don't? Yes, absolutely. I think Peter said it very well in his presentation this morning as well, that, you know, generally when you say data lake, we associate with Hadoop, there are use cases that Hadoop is very good for, but there are others where it might not be the best fit, which, to the early point, about networks of data and distributed data. So, companies that have, or organizations that have approached Hadoop with a, let's use it what it's good for, as opposed to let's just dump everything in there and figure it out later, and there have been a lot of the latter, but the former have done, generally speaking, a lot better, and that's what you're seeing. We actually use Hadoop as a part of our platform, at least for the data preparation and transformation side of what we do. We use it as an enabling technology as well. You know, it's funny actually when you talk about, as Peter talked about, networks of data versus centralized repositories. Scott now, CTO of Hortonworks was on yesterday, and he was talking about how he had originally come from Teradata, and that they had tried to do work, that he had tried to push them in the direction of recognizing that not all the analytic data was going to be in Teradata, but they had to look more broadly with HADAPT, and I forgot what the rest of, you know, old Aster, and yes, exactly. But what was interesting is that Hortonworks was moving towards, we believe everything's going to be in the data lake, but now with their data plane service, they're talking about, you know, we have to give you visibility and access, you mediate access to data everywhere. So maybe help, so for folks who aren't like, all bought into Hortonworks, for example, how might you explain how you work relative to data plane service? Well, you know, maybe I could step back and give a more general answer because I agree with that philosophically, right? That as I think we've been talking about here with the networks of data that goes back to my prior statement that there's, you know, there's different types of data platforms that have different use cases and different types of solutions should be built on top of them. So things are getting more distributed. I think that, you know, Hortonworks, like every company has to make the investments that are as we are making their customers successful. So using Hadoop and Hortonworks is one of our supported Hadoop platforms. We do work with them on engagements, but you know, it's all about making customers successful ultimately. It's not about a particular product. It's about, you know, which data belongs in which location and for what use case and what purpose. And then at the same time, when we're taking all of these different data sets and data sources and cataloging them and preparing them and creating them out, but where should we put that and catalog that so we can create kind of a continuous improvement cycle as well? And for those types, a flywheel, exactly. Continuous improvement flywheel. And for those types of purposes, you know, that's actually a great use case for, you know, Hortonworks, the Hadoop. And that's a lot of what we typically use it for. We can actually put the data in any place our customers define, but that's very often what we do with it. And then, but doing it in a very structured organized way. As opposed to, you know, a lot of the early Hadoop, and not specific to any particular distro that went bad, where it was just like, let's just dump it all into Hadoop because it's cheaper. You know, it's cheaper than the warehouse. And so let's just put it all in there and we'll figure out what to do with it later. That's bad, but if you're using it in a structured way, it can be extremely useful at the same point, and at the same time, not everything is going to go there or belongs there if you're being thoughtful about it. So you're seeing a lot more thoughtfulness these days, which is good, which is good for customers, and it's good for us on the vendor side. Us, Hortonworks, everybody, so. So is there, maybe you can tell us of the different approaches to, like, the advantage of integrating the data prep with the catalog as soon as, because as soon as you're done with data prep, it's visible within the catalog. Absolutely. That's one, yep. When, let's say when people do derive additional views into the data, how are they doing that in a way that then gets also registered back in the catalog for further discovery? Well, having the integrated data preparation, which is a huge differentiator for us, there are a lot of data catalog products out there, but our huge differentiator, one of them, is the fact that we have integrated data preparation. We don't have to hand off to another product. So that, as you said, gives us the ability to then catalog our output and build that flywheel by continuing some improvement flywheel, and it also just basically simplifies things for customers, hence our name, so. Okay. So, you know, it really kind of starts there. I think the second part of your question, I didn't really rewind back on that for me, it was. Go ahead. Well, I'm not sure I remember it right now. Okay. Right now, either, but. We all need more coffee. Exactly. We all need more coffee. So I'll ask you this last question then. Yes, please. What are, so here we are in March, 2018. What are you looking forward to in terms of momentum and evolution of Unify this year? Well, a lot of it, and tying into my role I mentioned, I and we will be at Adobe Summit in two weeks. So if you're going to be at Adobe Summit, come see us there. Some of the work that we're doing with our partners, some of the events we're doing with people like Microsoft and Access, but really it's also just customer success. I mean, we're seeing tremendous momentum on the customer side, working with our customers, working with our partners. And again, as I mentioned, we're seeing so much more thoughtfulness in the market these days, and less talk about the speeds and feeds and more around business solutions. That's really also where our professional services, system integration partners, many of whom I've been with this week, really help because they're building out solutions. GDPR is coming in May, right? And you're starting to really see a groundswell of, okay, and that's not about speeds and feeds, that's ultimately about making sure that I'm compliant with this huge regulatory environment. At the same time, the court of public opinion is just as important. We want to make sure that we're doing the right thing with data, spread it throughout our organization, make ourselves successful, make our customers successful. So it's a lot of fun. That's fun, it's good. It is, exactly, fun is good. Well, we thank you so much, Chris, for stopping back by theCUBE and sharing your insights, what you're hearing in the big data industry and some of the momentum that you're looking forward to carrying throughout the year. It's always a pleasure and you too, so love the venue. So, thank you, Lisa, thank you George. Absolutely. We want to thank you for watching theCUBE. You're watching our coverage of our event, Big Data SV, hashtag Big Data SV for George, I almost said George Martin for George Gilbert. I wish. Hey, George RR, yeah. You might not be here if you were George RR. No, I wouldn't. That was a really long way to say thank you for watching. I'm Lisa Martin for this George. Stick around, we'll be right back with our next guest.