Some exciting recent news, if you haven’t seen it yet…announced a few days ago at the DataWorks Summit/Hadoop Summit in San Jose, a new relationship between IBM and Hortonworks! Read about it here to learn how IBM and Hortonworks are partnering to further the efforts of our customers to expand their big data solutions.
More important for this blogger is the increased attention this brings to Apache Atlas. Apache Atlas, if you aren’t already familiar, is an evolving open source approach to enterprise information governance, metadata management, and lineage […go here for a general overview: https://hortonworks.com/apache/atlas/ ]. One highlight from news above draws particular attention to the contributions IBM and Hortonworks are making to this effort:
“Partnering On Apache
As part of their wide-ranging partnership, the companies will also team to advance the development of Unified Governance (IBM BigIntegrate, IBM BigQuality and IBM Information Governance Catalog) on the Apache Atlas open platform. Information Governance Catalog) on the Apache Atlas open platform. …”
It’s all a work-in-progress, but this is significant news that will hopefully accelerate the initiative. Have any of you started working heavily with Atlas? Which release? Are you using it exclusively with Hadoop, or externally? Have you interchanged metadata with Atlas and IGC? Considering it? Share your experiences!
June 16, 2017 at 8:06 am
We are considering interchanging metadata with IGC to extend lineage into Hadoop
August 21, 2017 at 10:07 am
So sorry I missed this Gino….have you moved forward with this solution? This is an excellent use of the IGC API or other extraction methods, and/or, in the reverse direction, for bringing hadoop based metadata “into” IGC, usually via OpenIGC.