geometry on positive definite matrices induced from v...

42
Geometry on Positive Definite Matrices Induced from V-Potentials and Its Application Atsumi Ohara University of Fukui Joint work with Shinto Eguchi (Institute of Statistical Mathematics) 量子系の統計的推測とその幾何学的構造, RIMS 2014 Nov. 12 1

Upload: others

Post on 15-Jul-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Geometry on

Positive Definite Matrices

Induced from V-Potentials and

Its Application

Atsumi Ohara

University of Fukui

Joint work with Shinto Eguchi (Institute of Statistical Mathematics)

量子系の統計的推測とその幾何学的構造, RIMS 2014 Nov. 12 1

Page 2: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

1. Introduction

: the set of Positive Definite

real symmetric matrices

● logarithmic characteristic func. on [Vinberg 63], [Faraut & Koranyi 94]

R);(,detlog)( nPDPPP

2 2

Page 3: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

φ(P) = -log det P appears in

Semidefinite Programming (SDP)

self-concordant barrier function

Multivariate Analysis (Gaussian dist.)

log-likelihood function

(structured covariance matrix estimation)

Symmetric cone: log characteristic function

Information geometry on

a potential function in standard case

3

Page 4: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

standard IG on

[O,Suda & Amari LAA96]

- : Riemannian metric is the Hesse matrix

of (Fisher for Gaussian) )(P

- , :related to the third derivatives of )(P

Nice properties: -invariant (unique),

KL-divergence, Pythagorean theorem, etc

plays a role of potential function )(P

4

Page 5: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Purpose of this presentation

The other convex potentials

V-potential functions

Their different and/or common geometric

structures

Structure of submanifolds

Decomposition properties of divergences

Application to non-Gaussian pdf’s 5

Page 6: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Outline

Preliminaries

Dualistic geometry induced by V-potentials

Riemannian metric, dual affine connections

Foliated structures

Submanifold of constant determinant

Decomposition of divergence

Applications to multivariate non-Gaussian pdf’s

GL(n)-invariance of geometry of q-Gaussian family

induced by beta-divergence (Quasi-Newton update formulas [Kanamori & O13])

6

Page 7: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

7

Preliminaries and Notation

the set of n by n real symmetric matrix

:arbitrary set of basis matrices

(primal) affine coordinate system

Identification

Page 8: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

8

PD(n)の情報幾何 [O Suda Amari 96]

PD(n)上に

Riemann計量・・・微少な二点間の距離や角度

接続・・・空間の曲がり方

を定める.

定め方はいろいろあるが,様々な双対性を反映した幾何構造(情報幾何)を考える.

以下,微分幾何速習コース(4枚)

Page 9: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

9

接空間

Page 10: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

10

Riemann計量 ー接空間の内積ー

Page 11: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

11

接続(または平行移動) (1) ー接空間同士の関係ー

Page 12: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

12

接続(または平行移動) (2) ー接空間同士の関係ー

Page 13: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

13

PD(n)の情報幾何(2)

対称行列集合(ベクトル空間)の基底行列

座標

ポテンシャル関数

Riemann計量

α接続

Page 14: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Information geometry on

Dualistic geometrical structure

: arbitrary vector fields on

:Riemannian metric

a pair of dually flat affine connections 14

Page 15: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

V-potential function

-The standard case:

PPssV detlog)(log)(

, RR :)(sV

15

Def.

2. Geometry induced by V-potentials

Page 16: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Def.

Rem. The standard case V= -log:

2,0)(,1)(1 kss k

Prop.1 (convexity conditions)

The Hessian matrix of the V-potential is positive

definite on if and only if

16

Page 17: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Assumption: the convexity conditions hold.

- Riemannian metric is

=

Here,

X, Y in sym(n,R) ~ tangent vectors at P

Rem. The standard case V= -log:

=

17

Page 18: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Prop. (affine connections)

Let be the canonical flat connection on .

Then the V-potential defines the following dual

connection with respect to :

18

Page 19: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

divergence function

- a variant of relative entropy,

- Pythagorean type decomposition

19

19

: Dually flat structure on

induced by the V-potential

Page 20: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

group invariance of

the structure on

Linear transformation on

congruent transformation:

the differential:

TG

TG

GXGX

nGLGGPGP

*)(

),,(,

R

20 20

Page 21: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

group invariance of

the structure on

Linear transformation on

congruent transformation:

the differential:

Invariance

metric:

connections:

and the same for

where

TG

TG

GXGX

nGLGGPGP

*)(

),,(,

R

21 21

Page 22: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Prop.

The largest group that preserves the dualistic

structure invariant is

except in the standard case.

),( RnSLGG

),( RnGLGG Rem. the standard case:

Rem. The power potential of the form:

has a special property.

22

with

with

Page 23: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

3. Foliated structures

The following foliated structure features the dualistic geometry induced

from every V-potential.

V-potential.

23

leaf

ray

Page 24: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Prop.

Each leaf and ray are orthogonal

to each other with respect to .

Prop.

Every is simultaneously a - and -

geodesic for an arbitrary V-potential.

24

Page 25: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Prop. For any V, the followings hold:

i) Riemannian metric:

ii) Divergence:

iii) Dual connection:

25

Submanifolds of const det

Induced geometry on from

Page 26: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Prop.

Each leaf is a Riemannian sym. sp.

26

Submanifolds of const det

: Involutive isometry of

Page 27: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Submanifolds of const det

Level surface of both and ADG

Normal vector field N

Centro-affine immersion

27

Page 28: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Submanifolds of const det

Lemma [UOF 00]

The submfd is 1-conformally flat.

Assume

.

Then

where

28

Page 29: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Submanifolds of const det

+ Illustration: decomposition of divergence (1)

If and with ,

then

where

29

Page 30: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Prop.

Each leaf is a homogeneous space of

constant negative curvature

Shown by Gauss eq. and

Lemma (modified Pythagorean) [Kurose94]

30

Submanifolds of const det

Decomposition of divergences (2)

Page 31: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Submanifolds of const det

31

+ Combining the two decomposition results

Prop.

If and are orthogonal at ,

then

>0

Page 32: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

4. Application to

multivariate statistics

Non Gaussian distribution

(generalized exponential family)

Robust statistics

beta-divergence,

Machine learning, and so on

Nonextensive statistical physics

Power distribution,

generalized (Tsallis) entropy, and so on

32

Page 33: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

U-model and U-divergence

U-model

Def.

Given a convex function U on R and set

u=U’, U-model is a family of elliptic pdf’s

specified by P:

:normalizing const. 33

Page 34: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Rem. When U=exp, the U-model is the

family of Gaussian distributions.

U-divergence:

Natural closeness measure on the U-model

,

Rem. When U=exp, the U-divergence is the

Kullback-Leibler divergence (relative entropy). 34

Page 35: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Example: beta-model and beta-

divergence (1)

Beta-model

For and

q-exponential and q-logarithmic functions 35

Page 36: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Example: beta-model and beta-

divergence (2)

Beta-divergence

36

Page 37: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

IG induced from divergences

Divergence induces stat mfd structure.

where

37

Page 38: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Prop.

IG on induced from coincides with

derived from the following V-

potential function:

38

Relation between the U- and V-

geometries

Page 39: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Group invariance for the

power potentials

Prop.

V is of the power form

1) Orthogonality is GL(n)-invariant.

2) The dual affine connections derived from the

power potentials are GL(n)-invariant.

Hence,

Both - and -projections

are GL(n) -invariant. 39

Page 40: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Thm [O & Eguchi 13]

IG on induced from coincides with

on induced from

Implication: statistical inference on using

is GL(n)-invariant.

40

Page 41: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Conclusions

Derived dualistic geometry is invariant under the

SL(n,R)-group actions

For power funcion, dual connections and orthogonality are

GL(n,R)-invariant

Each leaf is a homogeneous manifold with a negative

constant curvature

Decomposition of the divergence function

Relation with the U-model with the U-divergence

41

Page 42: Geometry on Positive Definite Matrices Induced from V ...bayes.sigmath.es.osaka-u.ac.jp/ftanaka/workshop/r14s/...Positive Definite Matrices Induced from V-Potentials and Its Application

Main References

A. Ohara, N. Suda and S. Amari, Dualistic Differential Geometry of Positive Definite

Matrices and Its Applications to Related Problems, Linear Algebra and its

Applications, Vol.247, 31-53 (1996).

A. Ohara, Geodesics for Dual Connections and Means on Symmetric Cones,

Integral Equations and Operator Theory, Vol.50, 537-548 (2004).

A. Ohara and S. Eguchi, Geometry on Positive Definite Matrices Deformed by V-

Potential and its Submanifold Structure, ISM Research Memorandom No. 950

(2005) and F. Nielsen eds. Geometric Theory of Information, Chap. 2, 31-55,

Springer (2014).

T. Kanamori and A. Ohara,

A Bregman Extension of quasi-Newton updates I: An Information Geometrical

Framework, Optimization Methods and Software, Vol. 28, No. 1, 96-123 (2013).

A. Ohara and S. Eguchi, Group Invariance of Information Geometry on q-Gaussian

Distributions Induced by Beta-Divergence, Entropy, Vol. 15, 4732-4747 (2013).

42