Solicitation No. FA5215-04-R-0007
GPO Transformation with Smart Data
Michael C. Daconta
Vice President, Enterprise Data
May 11, 2006
© 2006 Oberon Associates, Inc.
Experience
Former Program Manager, DHS Metadata Center of Excellence
– EDM Strategy and Maturity Model
– Metadata Registry/Repository
– FEA DRM
– NIEM
Current Experience
– TSA Data Management Support
– Axiom Pub/Sub Info Sharing Network
– Biometrics Data Warehouse and Multi-modal Biometric Watchlist
Upcoming GCN Op/Ed on Information Sharing – “What is the Face of Information Sharing?”
New Blog: practical-metadata.blogspot.com
Agenda
The Problem: Info Lifecycle Reinvention Revolution
The Objective: Real-Time Relevance
The Solution: Smart Data
– A Copernican Shift
Smart Data Continuum: In Depth
– XML
– Taxonomies & Mixed Vocabularies
– Ontologies and Rules
Where do you Start?
– Semantic Bootstrapping
– Interrogatives (Focus)
– Data Reference Model
FEA DRM
– Metadata and Benefits
Conclusion
The Problem
The World is changing Rapidly … But How Rapidly???
More than just the paper-to-digital transformation … aka “digitization”
– Think a Complete Information Lifecycle Reinvention
Beware: Strategic Vision may not be viable beyond 5 years without understanding the ramifications of this Information Lifecycle Reinvention
– Precedent: Task Process Exploit Disseminate (TPED) versus Task Post Process Use (TPPU)
– Highly Collaborative and Centralized Authoring (examine your kid’s tech habits)
– Raw Production (voice-to-text-to-publish)
– Cradle-to-Grave Real-Time-Relevance
– Hyper Granularity and Fidelity
Don’t think Documents, think Media-Rich, Connected Statements
The Objective…
Strategic Goal
– Real-time Relevance: “The Right Information to the Right Person at the Right Time”
[Diagram labels: User Context & Requirements; Relevance; Content & Services]
– What the User Needs RIGHT NOW!
– What we mean by “Right”
– What you have available?
– Do you know your users?
– Can you calculate this in real-time?
– How much of what you have do you actively manage?
The Solution …
The trend is to put the “smarts” in the data, not in the applications.
“The Smart Data Continuum” - Daconta, Obrst, Smith
– XML Ontology and Automated Reasoning
– XML Taxonomies and Documents with Mixed Vocabularies
– XML Documents Using Single Vocabularies
– Text Documents and Database Records
“Point to Point Interfaces must DIE”
We had it all wrong…
[Diagram panels: “In the beginning…” and “Paradigm Shift…”; labels: Apps, Data]
This shift (of Copernican proportion) will create an “Information Network Effect”
SDC Level 1: XML
Simple Value Proposition!
– Unify documents and data
– Application independence
– Model-View-Controller on the Web (AJAX)
More than just a container!
– Strategic technology
– Fueling Web Services
– Opportunity for Rules on Content
Family of Supporting Standards…
– Addressing, Query
– Transforms, XForms
– Encryption, Signature, etc.
Recommendation:
– XSD and RDF Compliance (i.e. DDMS)
<Immigrant>
  <FullName>John Doe</FullName>
  <Height>5' 10"</Height>
  <Weight unit="lbs">170</Weight>
</Immigrant>
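As a rough illustration of the application-independence point, any XML-aware program can read the Immigrant record above without knowing which application produced it. The sketch below uses Python's standard `xml.etree.ElementTree`; the `parse_immigrant` helper and the layout of the resulting dict are illustrative assumptions, not part of the original slides.

```python
import xml.etree.ElementTree as ET

# The Immigrant record from the slide, written with straight quotes
# so that it is well-formed XML.
RECORD = """
<Immigrant>
  <FullName> John Doe </FullName>
  <Height> 5 ' 10 " </Height>
  <Weight unit="lbs"> 170 </Weight>
</Immigrant>
"""

def parse_immigrant(xml_text):
    """Return the record as a plain dict (the layout is an assumption)."""
    root = ET.fromstring(xml_text.strip())
    weight = root.find("Weight")
    return {
        "full_name": root.findtext("FullName").strip(),
        "height": root.findtext("Height").strip(),
        "weight": float(weight.text),        # float() tolerates padding spaces
        "weight_unit": weight.get("unit"),   # the unit travels with the data
    }

record = parse_immigrant(RECORD)
print(record)
```

The unit attribute is the small piece of "smarts" here: a consumer does not have to guess whether 170 means pounds or kilograms.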
SDC Level 2: Formal Taxonomies
Formal Node Definition
– Collection, Class, Instance
Formal Link Definition
– partOf, subclassOf, instanceOf
FEA DRM and Pub/Sub Networks
– Semandex and the SPAWAR Axiom Network
[Diagram 1 labels, by row: Car; Sports Car, Sedan, SUV; Corvette, Mustang]
[Diagram 2 labels, by row: Car; Engine, Wheel; Transmission, Carburetor]
* Article at http://www.xml.com/pub/a/2005/01/26/formtax.html
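The node and link definitions above can be sketched as typed edges in a small graph. This is a minimal illustration in Python, not the method from the cited article: the `LINKS` table, the `ancestors` helper, and the specific edges (for example, treating Corvette as a subclass of Sports Car and Carburetor as part of Engine) are assumptions read off the two Car diagrams.

```python
# Each link is (source, link_type, target). The edges are assumptions
# based on the two Car diagrams on the slide.
LINKS = [
    ("Sports Car", "subclassOf", "Car"),
    ("Sedan", "subclassOf", "Car"),
    ("SUV", "subclassOf", "Car"),
    ("Corvette", "subclassOf", "Sports Car"),
    ("Mustang", "subclassOf", "Sports Car"),
    ("Engine", "partOf", "Car"),
    ("Wheel", "partOf", "Car"),
    ("Transmission", "partOf", "Car"),
    ("Carburetor", "partOf", "Engine"),
]

def ancestors(node, link_type, links=LINKS):
    """Follow one link type transitively and return every node reached."""
    found = []
    frontier = [node]
    while frontier:
        current = frontier.pop()
        for source, kind, target in links:
            if source == current and kind == link_type and target not in found:
                found.append(target)
                frontier.append(target)
    return found

# The two link types are queried separately: "kind of" questions follow
# subclassOf edges only, and "part of" questions follow partOf edges only.
print(ancestors("Corvette", "subclassOf"))
print(ancestors("Carburetor", "partOf"))
print(ancestors("Carburetor", "subclassOf"))
```

Because every link carries an explicit type, a query never confuses "a Corvette is a kind of Car" with "a Carburetor is a part of a Car".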
SDC Level 2: Mixed Vocabularies
Mixed Vocabularies
– Namespaces
– URNs versus URIs
– Recommendation: URIs
– Why? Semantic Chains
Even Fragments Require Identifiers
– Globally Unique
– Semantic versus Opaque
– Retrievable
– Authoritative
– Class versus Instance
[Diagram labels: samePerson; xmlns="uri"; RDF (RDDL); Catalog or MetaCards (RDF, Dublin Core); Ontology/Multi-domain; Catalog-level; Single domain; Content-level; RDF (RSS); Export as; Context-level]
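To make the namespace recommendation concrete, here is a small sketch of one document that mixes two vocabularies, each identified by a URI. The `example.org` namespace URI and the element names under it are hypothetical; the Dublin Core namespace URI is the published one. Python's standard `xml.etree.ElementTree` reports each element name qualified by its namespace URI.

```python
import xml.etree.ElementTree as ET

# Hypothetical vocabulary URI (an assumption for illustration).
PERSON_NS = "http://example.org/vocab/person"
# Dublin Core element set namespace.
DC_NS = "http://purl.org/dc/elements/1.1/"

DOC = f"""
<p:Person xmlns:p="{PERSON_NS}" xmlns:dc="{DC_NS}">
  <p:FullName>John Doe</p:FullName>
  <dc:creator>Records Office</dc:creator>
</p:Person>
"""

root = ET.fromstring(DOC.strip())

# ElementTree exposes every tag as {namespace-uri}local-name, so two
# vocabularies can share one document without their names colliding.
vocabularies = sorted({child.tag.split("}")[0].lstrip("{") for child in root})
print(vocabularies)

# Look up elements by vocabulary using a prefix-to-URI map.
ns = {"p": PERSON_NS, "dc": DC_NS}
full_name = root.find("p:FullName", ns).text
creator = root.find("dc:creator", ns).text
print(full_name, "/", creator)
```

The prefixes `p` and `dc` are local shorthand only; the URIs are what identify each vocabulary, which is why the slide asks for them to be globally unique and authoritative.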
SDC Level 3: Ontologies and Rules
Data and logic are the yin and yang of information processing
Two given relations and one inferred relation (uncleOf)
[Diagram: Person A, Person B, Person C, linked by siblingOf, childOf, and uncleOf]
Rules
if (C.gender == "male" AND C == childOf(A)) then C = sonOf(A);
if (B.gender == "male" AND B == siblingOf(A)) then B = brotherOf(A);
if (C == sonOf(A) AND B == brotherOf(A)) then B = uncleOf(C);
Killer App Watch: The W3C Rule Interchange Format
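The three rules above can be sketched as forward inference over a small set of facts. This is an illustrative Python sketch rather than the Rule Interchange Format or any particular rule engine; the `infer` function, the fact tuples, and the names Alice, Bob, and Carl are assumptions. It follows the slide's rules literally, so uncleOf is only inferred for sons.

```python
# Facts are (relation, subject, object) tuples; the names are hypothetical.
GENDER = {"Alice": "female", "Bob": "male", "Carl": "male"}
GIVEN = {
    ("siblingOf", "Bob", "Alice"),   # B siblingOf A
    ("childOf", "Carl", "Alice"),    # C childOf A
}

def infer(facts, gender):
    """Apply the slide's three rules repeatedly until no new facts appear."""
    facts = set(facts)
    while True:
        new = set()
        for rel, x, y in facts:
            # Rule 1: a male child is a son.
            if rel == "childOf" and gender.get(x) == "male":
                new.add(("sonOf", x, y))
            # Rule 2: a male sibling is a brother.
            if rel == "siblingOf" and gender.get(x) == "male":
                new.add(("brotherOf", x, y))
        # Rule 3: the brother of a son's parent is that son's uncle.
        for rel1, c, a in facts:
            for rel2, b, a2 in facts:
                if rel1 == "sonOf" and rel2 == "brotherOf" and a == a2:
                    new.add(("uncleOf", b, c))
        if new <= facts:
            return facts
        facts |= new

derived = infer(GIVEN, GENDER) - GIVEN
print(sorted(derived))
```

Only the two given relations are stored; sonOf, brotherOf, and uncleOf exist because the rules derive them, which is the slide's point about data and logic working together.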
Where do you Start? (1)
“Semantic Bootstrapping”
– Shared Identity (Naming & Addressing)
– Shared Metamodels (Domain & System)
– Shared Business Logic (Services & Rules)
– Shared Transactions (Containers & Context)
– Targeted Semantics (Definitions & Scope)
Where do you Start? (2)
Interrogative search and Focus!
– WHO: SUCCESS (narrow)
  – Biometrics/watchlists
  – BAT/BIR and TWPDES
  – Person-centric KM
– WHERE: SUCCESS (narrow)
  – Google maps and mashups
– WHEN: Moderate Success (broad)
  – Opportunity: calendars
  – Opportunity: event
– WHAT: Limited Success (very broad)
  – Taxonomy, folksonomy, RSS
– WHY: NEW
  – Activity-based search (TAP)
  – Opportunity: pragmatics and unobtrusive assistance
Opportunity: Integration of the above…
Where do you start? (3) - FEA DRM
What is the FEA DRM?
– One of Five Inter-related Reference Models
– A reference model that sets the requirements for federal agency data architectures to promote interagency information sharing.
DRM 2.0 Released by OMB
– http://www.whitehouse.gov/omb/egov/documents/DRM_2_0_Final.pdf
Policy mandates its use across the federal government.
Metadata and the DRM
What is Metadata?
– Needs to be Redefined
– “Data about Data” Considered Harmful
– Strawman Definition:
  ● Metadata: an external description of a distinct data resource to provide context, metrics or amplification.
  ● Discuss at practical-metadata.blogspot.com
The DRM represents Metadata on an Organization’s Data Architecture
DRM Architectural Pattern (Simplified)
[Figure: from Expanding E-Government, Report to Congress]
FEA DRM Benefits
[Diagram label: FEA DRM]
– Cross-Sections: Perspectives, Time-savings, Insights
– Drill-Downs: Interoperability, Authority, Summarization
– Coverages: Exposure, Quality, Metadata
Conclusion
As We May Think¹…
– “Our ineptitude in getting at the record is largely caused by the artificiality of systems of indexing. … The human mind does not work that way. It operates by association. … Selection by association, rather than indexing, may yet be mechanized.” - Vannevar Bush, 1945
The GPO has an unprecedented opportunity
– Understanding the forthcoming Information Lifecycle Reinvention and the Smart Data Solution will enable a successful transformation!
¹ © Vannevar Bush
© Ocean Cruise Guides
A journey of a thousand miles begins with a single step. - Lao-Tzu
All Aboard!!