 from Berlin, Germany. It's theCUBE, covering DataWorks Summit Europe 2018. Brought to you by Hortonworks. Hello and welcome to theCUBE on day two of DataWorks Summit 2018 from Berlin. It's been a great show so far. We have just completed the day two keynote and in just a moment I'll bring you up to speed on the major points and presentations from that. It's been a great conference, fairly well attended here. It's a hallway chatter and discussion's been great. The breakouts have been stimulating. For me, the takeaway is the fact that Hortonworks, the show host, has announced yesterday at the keynote. It's gotten out of the CTO of Hortonworks and now it's Data Steward Studio, DSS, they call it. Part of the data plane, Hortonworks data plane services portfolio. And it could not be more timely Data Steward Studio because we are now five weeks away from GDPR, that's the General Data Protection Regulation, becoming the law of the land, and when I say the land, the EU. But really any company that operates in the EU, and that includes many US based and APAC based and other companies, will need to comply with the GDPR as of May 25th and ongoing in terms of protecting the personal data of EU citizens. And that means a lot of different things. Data Steward Studio announced yesterday was demoed today by Hortonworks and it was a really excellent demo and showed that it's a powerful solution for a number of things that are at the core of GDPR compliance. The demo covered the capability of the solution to discover and inventory personal data within a distributed data lake or enterprise data environment, number one. Number two, the ability of the solution to centralize consent, provide a consent portal essentially, that data subjects can use then to review the data that's kept on them to make fine grained consents or withdraw consents for use and profiling of their data they own. And then number three, they show, they demonstrate the capability of the solution then to execute the data subject, the people's requests in terms of the handling of their personal data. The three main points in terms of enabling, you know, giving, adding the teeth to enforce GDPR in an operational setting in any company that needs to comply with GDPR. So what we're going to see, I believe going forward in the, really in the whole global economy and in the big data space is that Hortonworks and others in the data lake industry, and there's many others, are going to need to roll out similar capabilities in their portfolios because their customers are absolutely going to demand it. In fact, the deadline is fast approaching, it's only five weeks away. One of the interesting takeaways from the keynote this morning was the fact that John Kreis of the VP for marketing at Hortonworks did a quick survey of those in the audience, a poll, asking how ready they are to comply with GDPR as of May 25th. And it was a bit eye-opening, I wasn't surprised, but I think it was 19 or 20%, I don't have the numbers in front of me, said that they won't be ready to comply. I believe it was something where between 20 and 30% said they will be able to comply. About 40%, don't quote me on that, but a fair plurality said that they're preparing. So that indicates that they're not entirely 100% sure that they will be able to comply 100% to the letter of the law as of May 25th. And I think that's probably accurate in terms of ballpark figures. I think there is a lot of, I know there's a lot of companies, users racing for compliance by that date. And so really GDPR is definitely the headline banner umbrella story around this event and really around the big data community worldwide right now in terms of enterprise, investments in the needed compliance software and services and capabilities needed to comply with GDPR. That was important. That wasn't the only thing that was covered in not only the keynotes, but in the sessions here so far. AI, clearly AI and machine learning are hot themes in terms of the innovation side of big data. There's compliance, there's GDPR, but really innovation in terms of what enterprises are doing with their data, with their analytics, they're building more and more AI and embedding that in conversational UIs and chat bots and they're embedding AI in all manner of e-commerce applications, internal applications in terms of search, as well as things like face recognition and voice recognition and so forth and so on. So what we've seen here at the show is what I've been seeing for quite some time is that more of the actual developers who are working with big data are the data scientists of the world and more of the traditional coders are getting up to speed very rapidly on the new state of the art for building machine learning and deep learning AI, natural language processing into their applications. That said, so Hortonworks has become a fairly substantial player in the machine learning space. In fact, really across their portfolio and many of the discussions here I've seen shows that everybody's buzzing about getting up to speed on frameworks for building and deploying and iterating and refining machine learning models in operational environments. So that's definitely a hot theme and so there was an AI presentation this morning from the first gentleman that came on that laid out the broad parameters of what the developers are doing and looking to do with data that they maintain in their lakes, training data to both build the models and train them and deploy them. So that was also something I expected and it's good to see at DataWorks Summit that there's a substantial focus on that in addition of course to GDPR and compliance. It's been about seven years now since Hortonworks was essentially spun off of Yahoo. It's been, I think it's about three years or so since they went IPO and what I can see is that they're making great progress in terms of their growth, in terms of not just the finances but their customer acquisition and their deal size and also customer satisfaction. I get a sense from talking to many of the attendees at this event is that Hortonworks has become a fairly blue chip vendor that they're really in many ways continuing to grow their footprint of Hortonworks products and services and those are their partners, such as IBM. And from what I can see, everybody was wrapped with attention around DataSteward Studio and I sensed sort of a sigh of relief that it looks like a fairly good solution and so I have no doubt that a fair number of those in this hall right now are probably, as we say in the US, probably kicking the tires of DSS and probably gonna expedite their adoption of it. So with that said, we have day two here so what we're gonna have is Alan Gates, one of the founders of Hortonworks coming on in just a few minutes and I'll be interviewing him asking him about the vibrancy and the health of the community, the Hortonworks ecosystem, the developers, partners and so forth as well as of course the open source communities for Hadoop and Ranger and Atlas and so forth, the growing stack of open source code upon which Hortonworks has built their substantial portfolio of solutions. Following him we'll have John Chrysa, the VP for marketing and I'm gonna ask John to give us an update on really the sort of the health of Hortonworks as a business in terms of the reach out to the community in terms of their messaging obviously and have him really position Hortonworks in the community in terms of who do they see them competing with? What segments is Hortonworks in now? The whole Hadoop segment, increasingly, Hadoop is there, it's the foundation. The word is not invoked in the context of discussions of Hortonworks as much now as it was in the past. And the same thing for, say, Cloudera, one of their closest to traditional rivals, closest in the sense that people associate them. I was at the Cloudera Annals to event the other week in Santa Monica, California. It was the same thing, I think both of these vendors are on a similar path to become fairly substantial data warehousing and data governance suppliers to the enterprises of the world that have traditionally gone with the likes of IBM and Oracle and SAP and so forth. So I think their Hortonworks is definitely evolved into a far more diversified solution provider than people realize and that's really one of the takeaways from DataWorks Summit. With that said, this is Jim Kobielus, I'm the lead analyst, I should have said that at the outset, I'm the lead analyst at SiliconANGLES Media's Wikibon team focused on big data analytics. I'm your host this week on theCUBE at DataWorks Summit Berlin and I'll close off this segment and we'll get ready to talk to the Hortonworks and IBM personnel and there's a gentleman from Accenture on as well. Today on theCUBE here at DataWorks Summit Berlin.