solicitation no. fa5215-04-r-0007 gpo transformation with smart data michael c. daconta vice...

19
Solicitation No. FA5215-04-R-0007 GPO Transformation with Smart Dat Michael C. Daconta Vice President, Enterprise Data Management [email protected] May 11, 2006

Upload: bridget-warner

Post on 25-Dec-2015

214 views

Category:

Documents


0 download

TRANSCRIPT

Solicitation No. FA5215-04-R-0007

GPO Transformation with Smart Data

Michael C. DacontaVice President, Enterprise Data [email protected]

May 11, 2006

2© 2006 Oberon Associates, Inc.

ExperienceExperience

Former Program Manager, DHS Metadata Center of Excellence– EDM Strategy and Maturity Model– Metadata Registry/Repository– FEA DRM– NIEM

Current Experience– TSA Data Management Support– Axiom Pub/Sub Info Sharing Network– Biometrics Data Warehouse and

Multi-modal Biometric Watchlist

Upcoming GCN Op/Ed on Information Sharing – “What is the Face of Information Sharing?”

New Blog: practical-metadata.blogspot.com

3© 2006 Oberon Associates, Inc.

AgendaAgenda

The Problem: Info Lifecycle Reinvention Revolution The Objective: Real-Time Relevance The Solution: Smart Data

– A Copernican Shift Smart Data Continuum – In Depth

– XML– Taxonomies & Mixed Vocabularies– Ontologies and Rules

Where do you Start?– Semantic Bootstrapping– Interrogatives (Focus)– Data Reference Model

FEA DRM– Metadata and Benefits

Conclusion

4© 2006 Oberon Associates, Inc.

The ProblemThe Problem

The World is changing Rapidly … But How Rapidly???

More than just the paper-to-digital transformation … aka “digitization”– Think a Complete Information

Lifecycle Reinvention

Beware – Strategic Vision may not be viable beyond 5 years without understanding the ramifications of this Information Lifecycle Reinvention– Precedent: Task Process Exploit Disseminate (TPED)

versus Task Post Process Use (TPPU)– Highly Collaborative and Centralized Authoring (examine

your kid’s tech habits)– Raw Production (voice-to-text-to-publish)– Cradle-to-Grave Real-Time-Relevance– Hyper Granularity and Fidelity Don’t think Documents,

think Media-Rich, Connected Statements

5© 2006 Oberon Associates, Inc.

The Objective…The Objective…

Strategic Goal– Real-time Relevance: “The Right Information to the Right Person at the

Right Time”

User Context&

Requirements

RelevanceContent

& Services

What the User Needs RIGHT

NOW!

What we mean by “Right”

What you have available?

Do you know your users?

Can you calculate this in real-time? How much of what you

have do you actively manage?

6© 2006 Oberon Associates, Inc.

The Solution …The Solution …

The trend is to put the “smarts” in the data, not in the applications.

XML Ontology andAutomated Reasoning

XML Taxonomies andDocuments with Mixed Vocabularies

XML Documents UsingSingle Vocabularies

Text Documents andDatabase Records

“Point to Point Interfaces must DIE”

“The Smart Data Continuum”-Daconta, Obrst, Smith

7© 2006 Oberon Associates, Inc.

Apps

Data

We had it all wrong…

In the beginning…

This shift (of Copernican proportion) will create an “Information Network Effect”

Apps

Apps

Apps

Paradigm Shift…

8© 2006 Oberon Associates, Inc.

SDC Level 1: XMLSDC Level 1: XML

Simple Value Proposition!– Unify documents and data– Application independence– Model-View-Controller on the Web

(AJAX)

More than just a container!– Strategic technology– Fueling Web Services– Opportunity for Rules on Content

Family of Supporting Standards…– Addressing, Query– Transforms, XForms, – Encryption, Signature, etc.

Recommendation:– XSD and RDF Compliance (i.e. DDMS)

<Immigrant><FullName> John Doe </FullName> <Height> 5 &apos; 10 &quot; </Height><Weight unit=“lbs”> 170 </Weight>

</Immigrant>

9© 2006 Oberon Associates, Inc.

SDC Level 2: Formal TaxonomiesSDC Level 2: Formal Taxonomies

Formal Node Definition– Collection, Class, Instance

Formal Link Definition– partOf, subclassOf, instanceOf

FEA DRM and Pub/Sub Networks– Semandex and the Spawar Axiom Network

Car

Sports Car SedanSUV

Corvette Mustang

Car

Engine Wheel

Transmission Carburetor

* Article at http://www.xml.com/pub/a/2005/01/26/formtax.html

10© 2006 Oberon Associates, Inc.

SDC Level 2: Mixed VocabulariesSDC Level 2: Mixed Vocabularies

Mixed Vocabularies– Namespaces – URNs versus URIs– Recommendation: URIs– Why? Semantic Chains

Even Fragments Require Identifiers– Globally Unique– Semantic versus Opaque– Retrievable– Authoritative– Class versus Instance

samePerson

xmlns=“uri”

RDF(RDDL)

Catalog or MetaCards(RDF, DublinCore)

Ontology/Multi-domain

Catalog-level

Single domain

Content-level

RDF(RSS)

Exportas

Context-level

11© 2006 Oberon Associates, Inc.

SDC Level 3: Ontologies and RulesSDC Level 3: Ontologies and Rules

Data and logic are the yin and yang of information processing

Two given relations and one inferred relation (uncleOf)

Person A

Person B

Person C

siblingOf

uncleOf

Rules

if (C.gender == “male” AND C == childOf(A)) then C = sonOf(A);

if (B.gender == “male” AND B == siblingOf(A)) then B == brotherOf(A);

if (C == sonOf(A) AND B == brotherOf(A)) then B = uncleOf(C);

childOf

Killer App Watch: The W3C Rule Interchange Format

12© 2006 Oberon Associates, Inc.

Where do you Start? (1)Where do you Start? (1)

Shared Identity(Naming & Addressing)

Shared Metamodels(Domain & System)

Shared Business Logic(Services & Rules)

Shared Transactions(Containers & Context)

Targeted Semantics(Definitions & Scope)

“Semantic Bootstrapping”

13© 2006 Oberon Associates, Inc.

Where do you Start? (2)Where do you Start? (2)

Interrogative search and Focus!– WHO SUCCESS (narrow)

– Biometrics/watchlists – BAT/BIR and TWPDES

– Person-centric KM – WHERE SUCCESS (narrow)

– Google maps and mashups– WHEN Moderate Success (broad)

– Opportunity - calendars– Opportunity – event

– WHAT Limited Success (very broad)– Taxonomy, folksonomy, RSS

– WHY NEW– Activity-based search (TAP)– Opportunity: pragmatics and

unobtrusive assistance Opportunity: Integration of the above…

14© 2006 Oberon Associates, Inc.

Where do you start? (3) - FEA DRMWhere do you start? (3) - FEA DRM

What is the FEA DRM?– One of Five Inter-related

Reference Models– A reference model that sets

the requirements for federal agency data architectures to promote interagency information sharing.

DRM 2.0 Released by OMB– http://www.whitehouse.gov/om

b/egov/documents/DRM_2_0_Final.pdf

Policy mandates its use across the federal government.

15© 2006 Oberon Associates, Inc.

Metadata and the DRMMetadata and the DRM

What is Metadata?– Needs to be Redefined– “Data about Data” Considered Harmful– Strawman Definition:

● Metadata – an external description of a distinct data resource to provide context, metrics or amplification.

● Discuss at practical-metadata.blogspot.com

The DRM represents Metadata on an Organization’s Data Architecture

FEA DRMFEA DRM

16© 2006 Oberon Associates, Inc.

DRM Architectural Pattern (Simplified)DRM Architectural Pattern (Simplified)

From Expanding Egovernment – Report to Congress

17© 2006 Oberon Associates, Inc.

DRM Architectural Pattern (1)DRM Architectural Pattern (1)

18© 2006 Oberon Associates, Inc.

FEA DRM BenefitsFEA DRM Benefits

FEA DRMFEA DRM

Cross-Sections Perspectives,Time-savings,Insights

Drill-Downs Interoperability,Authority,Summarization

Coverages Exposure,Quality,Metadata

19© 2006 Oberon Associates, Inc.

ConclusionConclusion

As We May Think1…– “Our ineptitude in getting at the record is largely caused by the

artificiality of systems of indexing. …The human mind does not work that way. It operates by association. …Selection by association, rather than indexing, may yet be mechanized.” - Vannevar Bush, 1945

The GPO has an unprecedented opportunity– Understanding the forthcoming Information Lifecycle Reinvention and

the Smart Data Solution will enable a successful transformation!

1 © Vannevar Bush

© Ocean Cruise Guides

A journey of a thousand miles begins with a single step. - Lao-Tzu

All Aboard!!