Rating is available when the video has been rented.
This feature is not available right now. Please try again later.
Published on Oct 26, 2011
Ronen Schwartz, VP of Product Management at Informatica, explains HParser, Informatica's new parsing technology for Hadoop. This unique technology runs inside MapReduce to take advantage of massive parallelism. It supports the parsing of web logs, flat files, binary files, documents, such as Word, PDF and Excel, Hierarchical data: JSON and XML, and a variety of industry standards, including: ASN.1 (Telco), HL7, HIPAA (Telco), Bloomberg, FIX, SWIFT, NACHA (financial services), EDI-X12, EDIFACT (Retail and manufacturing) and much more.