 Hi everybody, this is Dave Vellante. We're here at the HackReduce event. This is the cube where we extract the signal from the noise We bring you the smartest people that we can find and put guests in front of you our audience And we're here with Ming Shang Hong who's the chief data scientist that adapt a good friend of the cubes Ming Shang great to see you here. The pleasure is mine. Great to see you again Dave. Yeah So this HackReduce event is really there's a lot of buzz here Talk about what your feeling is about HackReduce. How are you involved and you know, what's your role? Absolutely. So here's my understanding of HackReduce. This is about Solving big data problems. Here are three key elements that you need to solve big data problems. First of all infrastructure Software and hardware right? Secondly, data. And the last one is people. People with expertise, of course And that's what HADAP has put together. They have a thousand nodes machine cluster They have various interesting data sets from the city of Boston for other private entities across a wide domain of You know verticals financial service health care, you know public, you know infrastructure kind of data So these are things that you know people don't have easy access to before and finally talents People who already have the kind of existing expertise and also people who are excellent Technologists but are looking to get into the big big data domain and these are the people that we can train We can motivate very quickly and get them involved in solving real problems in the public and private sectors So Miksha, you're an alpha geek and I mean that as a compliment, right? I mean you you're really in the heart of it your data scientist your real technologist What does it mean to someone like yourself or young people coming out of college? What does it mean to have a resource like this? How important is that? I think this is tremendously valuable I wish heck reduce who are there when I was a grad student Oh, you know why would when I was even in college? I think the value added is really if you look at the university courses and also, you know training provided by I would say commercial entities. There's a huge gap in really providing both practical training But also out in neutral technology oriented no vendor agenda that kind of training and providing the Collaboration space where people can find good patterns have interesting problems and data sets to work out It's tremendous because we all know that that you know if you should just by yourself going and reading different tutorials on tools You're not going to get very far. Let's talk a little bit about Hedapt We just had you in on the Cube in Marlboro and you did this great demo Hedapt just won the the best startup award at the Strata plus a dupe World Conference So congratulations on that. Thank you And now so so how's it going post strata? What's the reaction been? How you feeling? It's been really exciting for us there So for strata we want no demo showcase. We got incredible boost traffic, right? And you know people are really excited about the interactive performance We're bringing to Hadoop the BI tools integration and also the advanced edit packages and a fingertip of the end users All of those three elements together or addressing the needs of the business analysts people who have the domain expertise But no they don't have the coding skill You know the skills to write map reduce or sequel jobs and now they can leverage the power of Hadoop and the friends of Hadoop So you know the power of Hedapt in the integration with all those visualization tools and you're not alone in this space We saw cloud error made an announcement in a similar space. So that to me. That's a good thing It's confirmation that the market needs it's very clear to me ming shang And I'm sure you agree that the world wants to bring sequel and no sequel together They want to bring structure and unstructured together They want they don't want to have to do these complex and massive and expensive ETL's I was at IBM today and Steve Mills showed a case study where a big bank in Europe One of their big clients essentially was using 28% of their cores to do massive ETL's total total waste Are we seeing the the the end of ETL's as we know them? I think ETL is definitely getting much simpler now. No people love the convenience and the simplicity of dumping data into let's say had HDFS right and worrying about the schema later That's one of the main advantages of Hadoop and that's something that if you look at Hadoop and had that users They can really take advantage of dump the data there and where they start playing with the data They find the structure That's the time where they can optimize the storage of the data to make future queries faster But you don't want to spend days or weeks of upfront cost There's only a good schema and running the ETL jobs to load the data. It just you know It's less agile these days make sure I want you to talk about nesina Which is your other little pet project tell us what that's all about and what your involvement is there? Well, so nesina it stands for New England Chinese Information Networking Association It was started 17 years ago by a bunch of very successful industry leaders Entrepreneurs one of them is a gentleman named Chen Wu. He founded Aeropoint and a series of other companies Aeropoint was acquired at a peak of the dot-com bomb by by Cisco for almost six billion dollars in cash. No hugely successful I am honored. I have been volunteering at nesina for a few years and and I'm taking on the volunteer President role for this annual term and its missions are about promoting technology trends such as big data Cloud mobile and so on Advancing the career sort of development through entrepreneurship and leadership Education and finally, you know given a cultural identity of the organization Promoting bridging us and Asia especially China, you know facilitating business exchange But also extending market from both ends to the other side and the brilliant talent these around so you have an event coming up The end of this month, correct? Yeah, so we actually this is something that's really exciting November the 18th. It's a Sunday morning We're gonna have a big data conference where we have put together over 10 top notch leaders from the Boston investment Industry and academic communities. So we have keynote speakers including Chris Lynch Partner of endless ventures Steve Papa as many of people know founder of in DECA and Sam Madden Who's the thought leader and the professor of MIT computer science? You know, he did a lot of big data projects and we got two panels focusing respectively on product and business use cases Involving almost all of the hottest startups in the Boston area So make sure thanks very much for coming on the cube. Good luck with things that adapt There's always pleasure to see you and good luck with hacker design. Keep it right there. We'll be right back after this