 Live from the Moscone Center in San Francisco, California, it's The Cube at AWS Summit 2015. Okay, welcome back everyone. We are live here in San Francisco for Amazon Web Services Summit. This is The Cube, our flagship program. We go out to the events, extract the civil noise. I'm John Furrier, founder of SiliconANGLE. I'm my co-host, Mark Farley. And our next guest is Scott Hedrick, who's the director of strategic marketing at Informatica. Welcome to The Cube. Thank you, thank you for having me. Great to see you. So Informatica Cloud has been involved with Amazon a bit. You guys have some stuff to talk about here. Absolutely. The cloud is the movement from on-prem to cloud, but not exclusively. It's not binary, as Andy Jassy said. No, exactly. It's hybrid and there's the explosion of different hybrid patterns, as Andy talked about today. And Informatica has been working with AWS for a few years now on our Informatica Cloud, which is a SaaS service for data integration, quality, master data management. And it has connectivity to a bunch of the key Amazon services, like Redshift and S3 and EC2. And today what we announced is a new dimension of our kind of hybrid strategy, which is to bring some of our key on-premise traditional software products onto AWS. And so what that means is that customers that are using PowerCenter, our big data addition on Hadoop, data quality or our data exchange can now install those products running on AWS just as they would in an on-premise server. And they get the same support, they have the same licensing, and it adds a lot of flexibility and agility. So what is the number one concern from the roadmap that customers want to move from the enterprise? Because we're seeing that there's a pretty much a no-brainer that there's going to be a transition. That's pretty much a done deal. It's a reality. But it's not like tomorrow I do it. So there's a roadmap and there's a path. What is that path? There's a lot of moving parts. There's obviously a network, there's a full stack of movement and then you got a workload on top of it that could have a variety of different workloads within a workload. It sounds complicated, but it is. What's your take on that? And with customers who are taking that journey to have that hybrid, what's your view on that? Well, I mean, there's some companies that are really jumping in and saying cloud first. But even the cloud first companies aren't 100% on the cloud. In this there are startups, right? That there are traditional companies that are putting all their new systems or trying to migrate as much as they can to the cloud. And of course we can support them in that. So if they're running some of our software to do data integration, to do data movement, and Informatica has connectors and parsers for hundreds of data types. Basically anything from mainframes and SAP apps, relational, unstructured, specialized structure, they can bring all those in and process them through data integration and quality and mastering to prepare that data for analytics. And so many of these systems are now running on premise in some of these data center. And they're pointing at it, say a traditional data warehouse. And what people can do is the first step, they can redirect some of those workloads and that cleansed great data and pointed up to AWS to say do analysis in Redshift. Then as they want, they can move more and more of that data pipeline into the AWS cloud. So they can start by targeting things, maybe they can do storage and they have a mix of data that's up in the cloud. Some of the data will be on premise. They can either combine it on premise and move it to the cloud or they can move it to the cloud and then do the processing there. We had Matt Wood on earlier, Chief Data Scientist for Amazon. Yes. And do you see the machine learning as being a driver, if you will, for your application? Absolutely. And we have a lot of, one of the fastest growing businesses for Informatica is our big data business. So around, we have a whole suite of products that run on top of Hadoop to do data quality, data integration, do mastering, to do profiling, all of this running natively on each node in a Hadoop cluster. And so we see a lot of these customers, once they bring the data in there, they want to do some machine learning to get some initial insights to prepare for, say, their end analytics results. Yeah, and one of the points he brought up is that, typically within the data center, there are constrained resources for doing machine learning. It's hard to, you know, it's hard to really get it done if you don't have enough CPU power available to do it, right? Absolutely. So the availability of on-demand CPU, the on-demand processing in AWS would allow somebody to use your product with Hadoop or whatever to get the resources to bear to get the job done. Absolutely. I would assume that would open doors for you. Oh yeah, absolutely. I mean, to get a thousand node Hadoop cluster is non-trivial on premise, right? To order all the systems, set it all up, configure it, but if you can spin it up in Amazon, it's a much more flexible approach. Either to do prototyping or to actually run into production. It's a great way to expand your existing customer base with new capability that, yeah, it was available, but might not have been necessarily practical. Yeah, well, what we find too is, you know, if you use some of the, you can use scripting and hand-coding to move data, do some transformation, but once you get into very sophisticated data management, it can become very brittle and complex to hand-code it all. And that's really where Informatica comes in, is we've got the greatest breadth of support of the different sources of data, the types, and everything you do with Informatica is all visual. So you don't have to write any code. That means it's very easy to repurpose a data pipeline that you're running on premise and repurpose that and run that in the AWS Cloud. Yes, so you get the best of both worlds. You've got the agility of moving in the cloud fast. Yes. And you've got the pipelining, which is like, it doesn't sound crude, but it's the plumbing. You get some work involved, engineering, some big-time engineering involved in data pipelining, as well as other subsystems that go on. Is that right? Did I get that right? Is that how you see your customers using that? Absolutely. And so, you know, we have more than 5,500 customers around the world that rely on Informatica to give them great data, to take the data from wherever it sits and bring it together, cleanse it and stage it for operations and analytics. And we also have the ability to show them that full data pipeline, show that end-to-end lineage. As say, data comes from a source systems in Hadoop or on the web or in Salesforce and loading into being transformed, cleansed, subsetted and then prepped for some analytics, say on Redshift or Tableau or another analytics system. They can go look at that data and trace back the lineage field by field where it came from, which is critical for compliance. So, talk about Informatica world coming up. What do you guys expect there? I want to kind of get your thoughts on that to end as we get running out of time. You get, last time we were at Amazon, we pulled the plug. Literally, they pulled the plug on us. We stayed on until they teed us off. But I want to get your take before we close out. Absolutely. Informatica world's coming up. There's been some news and there's been rumors. About a private equity thing's been announced. What's going on? Give us a state of what's going on inside the company. Are people excited, confused, happy, sad, not knowing what's going to happen? I mean, Dell went private. So it's not bad. I mean, it's all good. Where are we? I think we're business as usual. And so for us, nothing has actually happened. The transaction hasn't closed. And as you said, everything is trade-offs. And I think that the investors that are investing in Informatica, they share our vision, and this vision that we'll be sharing in Informatica world about the areas of growth that we're focused on. We're focused on big data, on the cloud, on mastering data, and how to power data pipelines to provide great quality data for analytics. All of those things, none of that changes at all. And Informatica world is a great way to bring the Informatica ecosystem together and talk about where we're going and get us all along. So we've got a little crowd chat commentary coming in, so I got to ask it from the crowd. Thanks for going to crowdchat.net, Amazon, AWS Summit. Question for you. This is a good question. I love this, because if you just last comment, kind of stoked the fire. How does Informatica tackle real-time ETL? How does Informatica tackle real-time ETL? We just brought up data pipelining, moving to cloud for agile. Does this fit into that? No, absolutely. So Informatica has really pioneered what people used to call ETL. Now we tend to call data integration. It's become a lot more than just ETL. That's one pattern of moving and transforming data. But what is this? We really can bring this together, I think. So what specifically was the question about? How does Informatica tackle real-time ETL? So the real-time, yeah, sorry, there are a few things that we have, PowerCenter is our kind of flagship data integration product that we've been developing for more than 20 years. And PowerCenter does have real-time capabilities, so it's not just batch. So you can run real-time data integration there. We also have Vibe Datastream, which is built on the fastest messaging system in the world called Ultramessaging, which powers a lot of hedge funds, currency trading, and this very high-speed messaging can deliver the data from a variety of sources and write it directly into HDFS or traditional file systems. It can also deliver it right into a CEP engine, our complex event processing. That way you can have this Lambda architecture, which is kind of all the rage. So you can write data directly for longer-term storage, and then in parallel, you can make some quick decisions and kick off workflows or alerts based on anomalies or certain conditions you've identified in the data. And back to the Informatica world question, what's going to be the theme? You mentioned you kind of address the whole business as usual at the whole, hasn't been closed, nothing's been closed, nothing's done till the deal is signed, as I always say. Informatica world, what are we expecting to see there? What kind of themes, what's the big overarching pillars? Well, there are certain areas we're focused on. One is analytics. And although we're not an analytics company, we provide that full data pipeline to prepare the data for analytics. So that's one area. It's a big part of it. It's a huge part of it, right? And another area we're focused on is cloud modernization and how people want to integrate all the SaaS apps they've built up, Salesforce and Concur, and NetSuite, and bring those into their architecture, right? Bring that data in, use it for analysis, so that whole cloud modernization. Also, app consolidation and migrations, companies that are moving from some system to another. Maybe they want to move from SAP to Oracle or vice versa, or they want to move from some of their data warehouse to HANA from a traditional system. All of these can be enabled by Informatica. And those are some of the key areas that we're focused on. It's how we can help customers to do this in a very agile and manageable way. Well, we'll be there at theCUBE, so we're going to be breaking it down. It's going to be fantastic. And we've got the Big Data event going on in New York City, so I want to ask you, we do two events now, Silicon Valley and New York City around capital markets. New York's kind of like the Wall Street, you know, public companies. Silicon Valley's more of the startups. What's your take on the innovation strategies of the two kinds of profile companies? The bigger, growing, public companies are almost public, pre-IPO, and the startups. You're seeing a lot of action on both fronts. What's your take on your observations in those two markets? Where's the innovation coming from and what are they doing? I think it's coming from all over the place in different ways. And so, the startups are kind of starting from scratch. They're trying to really disrupt, but we've got a lot of customers that are trying to kind of disrupt themselves. They're trying to do things themselves. Companies like Intuit and Western Union, they're using Informatica to kind of put big data to work in new ways to be able to manage and look at much larger quantities of data than they've traditionally been able to do, as well as bring in new data types and be able to take those new data types, whether they're social or machine data or web logs, and combine them together with their traditional, like relational data or transactional data to get new insights. We'll put new operational systems in place. And so, we're very focused on helping these large companies. And I think one of the differences and the reason that so many big companies come to Informatica is the startups really have patience for hand-coding people, hand-coding everything, and hiring developers to write everything from scratch. And for big companies, we had a really interesting comment from one of our customers who's in the automotive business, let's say, in the web. And they said that their dev team first wanted to hand-code everything. They said, oh, we're using Hadoop, we can just write Pig and Hive, and they're development, the management team said, one guy quits, it's all done on a hand-basket. That's exactly what I said. So, their management team turned them and said, okay, so that means you will be on call 24-7, 365, forever on these systems. Because if anything goes wrong, you're the only ones that know how to fix it and how it was built. And they said, oh, let's buy a tool, let's go with your chromatica. That's not too cool. Being on vacation, handling that stuff every day. No, seriously, it's a big issue. Scalability of continuous operations. And if you're Twitter or Facebook, you can probably get somebody who is young and has a very free life that doesn't mind being on call all the time. Well, that's where Amazon has their little issues, and this has come up on our previous Cube gigs, where every startup, like you talked about Twitter, LinkedIn, LinkedIn, well, Twitter, Zynga, ones that I know for sure, Pinterest, they all grew up, they had to re-architect everything. So, you hit a point of like, we can't do this anymore. We got to build a scalable platform. But that's what they're trying to solve. It looks like. So, I would kind of give them a ding on that three years ago, that you couldn't do a full stack on the Amazon, you get some good ransom, I call it the junkyard cloud, if you will. But they're certainly professionalized the hell out of it the past three years. Absolutely, you have all the richness of the AWS services, and now you can bring all the traditional enterprise software, things like Informatica, to run right on AWS. It's well beyond make, roll your own cloud. It's just like real scalable software. Absolutely, and you can pick and choose where you want to run it, how quickly you want to embrace the cloud. Well Scott, looking forward to seeing that Informatica world, it's going to be fantastic in Las Vegas, the Cube will be there. Scott Hedrick with Informatica. I'm John Furrier with Mark Farley. We'll be right back after this short break. More action here at Amazon Web Services Summit live in San Francisco, we'll be right back. Thank you.