Check out this “can’t miss” recording of metadata mgmt and governance in action!

One of our IBM partners, Compact (www.compactbi.eu and www.compactbi.com ), has published a video recording on their web site that illustrates their MetaDex solution. MetaDex, in combination with Information Server Business Glossary and Metadata Workbench, enables metadata management for additional technologies that are outside of Information Server. Parsing for independent ETL tools and complex SQL scripting are just part of what MetaDex offers.

This is an excellent video that is worth eight minutes of your time. It highlights the functionality of Business Glossary and Metadata Workbench and how they work together along with MetaDex to provide a strong solution in support of your governance objectives. Enjoy!

Ernie

The recording is specifically at: http://compactbi.eu/solutions/metadex

Actional Diagnostics…great to use an “old friend” again…

Hi All…

Just a quick note. I had the pleasure today of finally getting around to installing Actional Diagonostics. This is the latest release of what used to be called “SOAPScope”…it’s been awhile since I’ve had a need for it…but it was perfect, still providing all the great things that it did in its earlier implementations.

This is something I’ve been meaning to do for a long time, but haven’t had the chance. Mindreef was acquired by Progress Software several years ago, and what was originally “SOAPScope” has been rebranded. I don’t have any specifics as to the other Actional offerings that are connected, but I can say that the experience was excellent. The download and install went very smoothly, and I was invoking a service within minutes. The screens appear to be the same, although they have probably included new functionality that I have yet to explore.

If you need an easy to use testing tool for your services, put this one on your list. There are many of them out there….all good tools….. I happen to have always liked this one because it offers a good compromise — it will appeal to users like myself who are comfortable with xml and http protocols, yet also be easily adopted by users who don’t want to be exposed to xml and simply want an easy-to-use GUI. Especially nice in Actional Diagnostics is the ability to perform load testing, where you can easily create multiple threads (thus simulating multiple users) invoking your service in concurrent fashion.

Bravo to the team who is still supporting this.

You can find the download details at http://web.progress.com/en/actional/

Ernie

New developerWorks article on DataStage and new XML Stage!

Hi all…

My esteemed colleagues on the xml development team have published a great article on the new XML Stage in 8.5….enjoy!

devWorks article on the New XML Stage!

Ernie

Reviewing the Advanced Tab in the Metadata Workbench

Hi all…

Just thought I’d throw in a quick review of the important (imho) links at the Advanced tab…..some of these factoids are buried in my other posts, but I needed to have a cheat-sheet for myself and others. Here is is:

Automated Services. This option brings up the dialog that runs the parsing or “stitching” process for the detailed metadata you have in your DataStage Jobs or Connector-imported rdbms views. It does a lot of stuff, takes time the first time you run it (if you have a ton of metadata), and should be scheduled during off-hours. After the first run against a particular project, it uses a change recognition mechanism to only pick up Jobs that have been updated. Note the “checked” DS Projects carefully. Only select those that are really critical, and once checked, don’t “uncheck” — as you will see from the warnings, this will “remove” all parsing history. Ultimately, this step is the one that reviews the Jobs, connects them via common information found in Stages, etc. See my other posts for how the connection of Jobs to each other is determined.

Stage Binding. When all else fails, you can connect two stages to each other. Use this when, for some reason, two Jobs won’t connect, or when the rules for connecting them can’t be met. I’ve needed this with some custom Stage or Operator implementations, and when I am using a technique that prevents automatic connection. Imagine having a Sequential Stage at the end of a Job that is writing out some xml content — and then I’m using the XML Stage in the next Job to read that content. There isn’t much in common between those Jobs, but I still want lineage to run directly thru them…

Data Item Binding. This provides a “manual” binding of particular Stages to Database Tables and Data Files (see other posts for what those are, how they are created, and how they are different from “DataStage Table Definitions”). Use this when you are unable to get Database Alias to work as you expect and you simply want to “bolt” a particular Database Table or Data File to a Stage in one of your Jobs to complete the lineage picture.

Data Source Identity. Use this when, for whatever reason, you want to link two identical tables for lineage purposes. Reasons? Two people might have imported the same metadata accidentally and you don’t want to delete it….or you might have the “design” information from an ERwin model and also have the “actual” table information from the rdbms catalog. There are many valid reasons. This link let’s you relate tables together. They must have the same name — the option here lets you relate the “Schemas” of two different databases. Identical tables within those schemas will become linked for lineage reporting — and therefore, also linked to whatever those individual tables connect to for lineage.

Database Alias. This option establishes the connection between an abstract string in a DataStage Stage (Server name, DSN name, etc., as defined by the relational stage) and the “Host/Database” combination that was actually imported. Database Tables in Metadata Workbench are typically “actual” tables — but in DataStage, like any well designed application, the “name” is a placeholder. This assigns the “placeholder” to the host and database. The schema.tablename used in the Stage will then be matched against the Host/Database set of Tables to create a lineage connection. The list presented at this option will be entirely empty until you perform Automated Services. Then it will be populated with each StageType and “server string” combination that it finds in your Jobs.

Hope this helps understand these options.

Ernie

2010 in review

The stats helper monkeys at WordPress.com mulled over how this blog did in 2010, and here’s a high level summary of its overall blog health:

Healthy blog!

The Blog-Health-o-Meter™ reads This blog is on fire!.

Crunchy numbers

Featured image

A helper monkey made this abstract painting, inspired by your stats.

The average container ship can carry about 4,500 containers. This blog was viewed about 21,000 times in 2010. If each view were a shipping container, your blog would have filled about 5 fully loaded ships.

In 2010, there were 11 new posts, growing the total archive of this blog to 52 posts. There were 8 pictures uploaded, taking up a total of 3mb.

The busiest day of the year was January 5th with 152 views. The most popular post that day was Incorporating Java classes into your DataStage Jobs.

Where did they come from?

The top referring sites in 2010 were dsxchange.com, it.toolbox.com, google.co.in, google.com, and en.wordpress.com.

Some visitors came searching, mostly for cobol etl, datastage web services, real time etl, datastage real time scenarios, and complex web services.

Attractions in 2010

These are the posts and pages that got the most views in 2010.

1

Incorporating Java classes into your DataStage Jobs March 2008
19 comments

2

Reading XML Content as a Source December 2007
10 comments

3

What is Real Time ETL anyway? November 2007
4 comments

4

Tips for using Web Services Pack with DataStage Part I: Getting Started November 2007
4 comments

5

Tips for Using Web Services Pack with DataStage: Part II – Invoking your first Service December 2007
8 comments

The new XMLPack in 8.5….generating xsd’s….

As noted in an earlier post and outlined nicely in Vincent’s blog (new xml!), the new XML Pack is here… It is very powerful, providing new features for reading and transforming hierarchical data, performs faster (and smarter) than earlier xml technologies within DataStage, and much more. It also requires that you have an XML Schema Definition (xsd) for the import of xml metadata. Most of the time this is not an issue. The xml documents you are reading and/or writing are well defined, complying with a formal xsd developed within your organization, or perhaps by a partner, yourself or by a standards body. But sometimes, there is no xsd. You may not have access to one, it might have been lost, or it never existed. The XML might be simple enough that it was just generated by another tool without the use of an xsd (or you are asked to generate it), or the xml might be old enough to pre-date the arrival of xsd’s.

There are many ways to generate an xsd. Popular tools such as Altova XMLSpy support this capability, as do many others, including xml Max, whose link I have over on the link list to the right. A quick search on the web will invite you to try a lengthy list of possibilities. One that I’ve been very successful with is called “trang”.

http://www.thaiopensource.com/relaxng/trang.html

This little tool does more than just xsd generation, although that is the functionality that I have found most useful. I’ve tried it on Windows and on Linux. It is easy to use, well documented, has references from other bloggers across the web, and does the Job. It is command line based, and requires that you have a java run time locally installed. There may be more sophisticated tools out there, but this is sufficient for what I need to be productive with the new XML Stage.

Let me know if you find any others!

Ernie

The new XML Stage is here!

Just announced yesterday…the new XML Stage is available for 8.5! This introduces a whole new level of XML Transformation to the Information Server platform! Among its new capabilities are the ability to read single huge documents using a new streaming methodology that avoids the need to load the document into memory, support for any type of xsd, or collection of xsd’s, to define your xml metadata, and perhaps most important, a whole new hierarchical editing mode called an “Assembly”, which provides support for the creation of complex multi-node hierarchical structures! There’s much more, such as very explicit control of xml validation, a built-in test facility to ease transformation development, and support for both EE and Server Jobs. I’ve had a chance to play with the Stage over the last few months, and will share my experiences and techniques in upcoming posts.

In the meantime, I’d like to congratulate my IBM teammates in engineering for this accomplishment! This new capability will change how we approach many transformation solutions!

You can find the new XML Transformation capability at fix pack central for application to your 8.5 installation.

Ernie

Follow

Get every new post delivered to your Inbox.