topological data analysis and stochastic topology - … · topological data analysis and stochastic...

34
topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Upload: dangdieu

Post on 25-Apr-2018

223 views

Category:

Documents


4 download

TRANSCRIPT

Page 1: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

topological data analysisand stochastic topology

yuliy baryshnikovwaikiki, march 2013

Page 2: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Promise of topological data analysis: extract the structure from the data.

In: point clouds

Out: hidden structure

Page 3: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Underlying structure: manifolds?stratified sets?

Two ways to look at them: Poincare's and Riemann:

Correspondingly, there are two approaches in dimensionality reduction:

• Poincare view - use ambient coordinates - PCA, Kernel PCA,...

• Riemann view - only intrinsic proximity makes sense - IsoMap, EigenMap,...

Page 4: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Topological Data Analysis aims at recovery of the most stable invariants of the data.

In particular, complete scale invariance is sought

This is both a curse and a blessing:

● very susceptible to noise● only vague notion of proximity matters

Page 5: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Invariants of topological spaces: two spaces are the same if they are

homeomorphic?

diffeomorphic?

homotopic?

Page 6: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Invariants of topological spaces:

Dimension?

Homotopy groups?

Homology and cohomology

Page 7: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Homology and cohomology

come in different flavors and colors

singular, de Rham, simplicial

Simplicial - easiest to deal with. Works well with meshes...

Page 8: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Algebraic formalism includes chains, cycles and boundaries, and the idiosyncratic relation

In essence, linear algebra encoding combinatorics of adjacencies.

Page 9: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Homology groups depend on the coefficient ring: what are the entries of the boundary matrix.

Sometimes there is torsion:

Working over a field – ranks of the homology groups are called Betti numbers: numeric invariants of a topological space

Page 10: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Morse theory – an important tool to generate the cellular partition of the underlying manifold, and estimate the Betti numbers

Page 11: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

What happens when one adds a simplex of dimension d: either a cycle in dimension d-1 disappears, or a cycle of dimension d appears.

This implies existence of an aggregate invariant, the Euler characteristic.

Page 12: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Back to data: given a point cloud, how to arrive at a topological space? Taking into account desirable scale?

Old idea: alpha-shapes (Robbins, Edelsbrunner...)

Place a ball around each point in the dataset and construct the Čech complex: glue in a simplex for any non-empty intersection of the balls.

Nerve Theorem: If all intersections are contractible, the union of balls is homotopy equivalent to the Čech complex.

Page 13: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Noise is a problem, however: how to get rid of the small features?Answer: increase the scale.and againand againUntil the features disappear altogether

Persistence is a way to address the onslaught of irrelevant features

Page 14: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Tidal traces to be ignored: only the islands survive.

Page 15: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Persistence homology

Start with a filtration, and record when a cycle appears, and when it dies.

Page 16: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Encoding: either barcodes, or persistence diagram

Page 17: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Important result: stability theorem – small perturbations of the filtration change the persistence diagram little

Page 18: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Important tool: Morse theory

Page 19: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Some examples

Mumford dataset and the Klein bottle

Page 20: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Krioukov et al. Hyperbolic mappingof complex networks

Page 21: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Simple test: what is the dimension of the internet?

How we find the dimension knowing a sample from a manifold?

Page 22: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Testing on a special (stratified) set

Page 23: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Testing on a special (stratified) set

Page 24: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Autonomous Systems (“Internet”)

Page 25: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Stochastic Topology: dealing with the baseline case

Niyogi, Smale, Weinberger: Cech complex for small enough r and dense enough sample with has the homotopy type of the underlying manifold

Page 26: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Big Picture: from dust to foam to monolith

Page 27: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Big Picture: timescale

Page 28: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Big Picture: foam phase

Page 29: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Foam phase: oscillation of Euler characteristic

Euler characteristic in dim=7

Page 30: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Foam phase: concentrations of Betti numbers

Page 31: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Life in the ambient space: crackle phenomenon

If the noise is Gaussian, the balls form a solid ball. In exponential and subexponential cases, the homologies form in layers.

Page 32: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Plenty of further models and questions in stochastic topology

Persistence diagrams for Brownian bridge

Page 33: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013

Plenty of further models and questions in stochastic topology

Topology of the level sets of random functions

Random simplicial complexes

Random configuration spaces

...

Page 34: topological data analysis and stochastic topology - … · topological data analysis and stochastic topology yuliy baryshnikov waikiki, march 2013