in nite-dimensional svd for revealing microphone …...in nite-dimensional svd for revealing...

2
Infinite-dimensional SVD for revealing microphone array’s characteristics マイクロホンアレイの特性の解析のための無限次元特異値分解 5115E007-1 小谷野 雄史 指導教員 及川 靖広 教授 KOYANO Yuji Prof. OIKAWA Yasuhiro 概要: 近年,様々な構成のマイクロホンアレイが提案・利用されている.しかし,異なる座標系で構成されたアレイの 特性の比較は容易ではない.アレイの特性を評価する指標はこれまでも提案されているが,その多くはアプリケーショ ン指向の指標であり,ある指標において最良の構成は他の指標においては良い構成とは限らない.したがって,構成や 用途に依存しないアレイの解析手法が必要である.本論文では,無限次元 SVD によるアレイのサンプリング特性の解 析手法を提案する.物理モデルによって定式化されたサンプリング作用素の特異値分解によって得られた特異値,関 数およびベクトルを用いて,マイクロホンアレイが観測可能な音の方向性の情報の可視化などの解析手法を実現した. キーワードHelmholtz 方程式,平面波近似,Herglotz 波動関数,固有値分解 Keywords: Helmholtz equation, plane wave approximation, Herglotz wave function, eigendecomposition. 1. Introduction Nowadays, various types of microphone arrays are used in many applications. However, it is not easy to compare arrays of different types because each array has been treated by a specific theory depending on the type of the array. Although some criteria have been proposed for microphone arrays for evaluating and/or designing the arrays, most of them are application- oriented criteria, and the best configuration for some criterion may not be a better one for the other crite- rion. Therefore, an analysis method for microphone arrays which does not depend on the array config- uration and application is necessary. In this paper, infinite-dimensional SVD is proposed for analyzing and comparing the properties of arrays. The singular values, functions and vectors obtained by the proposed SVD provide the sampling property of an array, such as the measurable directional information of sound. 2. Infinite-Dimensional SVD In linear acoustics, a sound filed can be repre- sented the Helmholtz equation [1]: ( + k 2 ) u(x) = 0. Observation of the sound field by M microphones is a mapping from the sound to measured signal, which is defined as the sampling operator SM as SM : u(x) 7-→ {u(xm)} M m=1 , where {u(xm)} M m=1 is the sound field u sampled by M microphones placed at {xm} M m=1 . The aim of this paper is to analyze the sampling property of SM based on the above physical model. However, SM cannot be analyzed directly because it is the mapping from the infinite-dimensional function to the measured finite-dimensional signal as illustrated in Fig. 2. Since the measurable information depends on the configuration of the array, every function satisfying the Helmholtz equation must be considered. In other words, some convenient representation of the solutions is necessary in order to make the problem tractable. Since every solution u can be represented by the Her- glotz wave function [2,3], u can be written as u = H α by defining an operator H : α(ν ) 7-→ u(x), (H α)(x)= S 2 α(ν ) exp(jkx, ν ) dS(ν ), (1) where α is a weight function defined on the unit sphere S 2 . Therefore α can be considered as a representative of the sound field. Sampled version of the Herglotz wave function can be defined as u(xm)= S 2 α(ν ) exp(jkxm, ν ) dS(ν ), (2) which is also denoted shortly as u = HMα, where HM(= SMH ) is the corresponding integral opera- tor HM : α(ν ) 7-→ {u(xm)} M m=1 . The relationship among these operators is illustrated in Fig. 1. For analyzing a matrix, one of the most popular methodologies is SVD which decomposes an M × N matrix A into the three matrices as A = U ΣV * , (3) where U is an M × M unitary matrix, Σ is an M × M diagonal matrix, V is an N × M column orthonor- mal matrix, and V * is conjugate transpose of V . This decomposition reveals the mapping properties of the matrix. There are several routes for computing SVD. Since AA * = U ΣV * V ΣU * = U Σ 2 U * , (4) U and Σ can be obtained through the eigendecompo- sition of AA * . Then, V is calculated from U and Σ as twice differentiable functions Herglotz integral Herglotz sampling : : : sampling projection : sound (solution of the Helmholtz equation) measurable information measured signal measurable information weight functions Fig. 1 Diagram of sampling of sound field. SM is the sampling operator to be analyzed. HM = SMH is the alternative representation of SM.

Upload: others

Post on 23-Apr-2020

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: In nite-dimensional SVD for revealing microphone …...In nite-dimensional SVD for revealing microphone array’s characteristics マイクロホンアレイの特性の解析のための無限次元特異値分解

Infinite-dimensional SVD for revealing

microphone array’s characteristics

マイクロホンアレイの特性の解析のための無限次元特異値分解

5115E007-1 小谷野 雄史 指導教員 及川 靖広 教授KOYANO Yuji Prof. OIKAWA Yasuhiro

概要:近年,様々な構成のマイクロホンアレイが提案・利用されている.しかし,異なる座標系で構成されたアレイの特性の比較は容易ではない.アレイの特性を評価する指標はこれまでも提案されているが,その多くはアプリケーション指向の指標であり,ある指標において最良の構成は他の指標においては良い構成とは限らない.したがって,構成や用途に依存しないアレイの解析手法が必要である.本論文では,無限次元 SVDによるアレイのサンプリング特性の解析手法を提案する.物理モデルによって定式化されたサンプリング作用素の特異値分解によって得られた特異値,関数およびベクトルを用いて,マイクロホンアレイが観測可能な音の方向性の情報の可視化などの解析手法を実現した.キーワード:Helmholtz方程式,平面波近似,Herglotz波動関数,固有値分解Keywords: Helmholtz equation, plane wave approximation, Herglotz wave function, eigendecomposition.

1. Introduction

Nowadays, various types of microphone arrays areused in many applications. However, it is not easy tocompare arrays of different types because each arrayhas been treated by a specific theory depending on thetype of the array. Although some criteria have beenproposed for microphone arrays for evaluating and/ordesigning the arrays, most of them are application-oriented criteria, and the best configuration for somecriterion may not be a better one for the other crite-rion. Therefore, an analysis method for microphonearrays which does not depend on the array config-uration and application is necessary. In this paper,infinite-dimensional SVD is proposed for analyzingand comparing the properties of arrays. The singularvalues, functions and vectors obtained by the proposedSVD provide the sampling property of an array, suchas the measurable directional information of sound.

2. Infinite-Dimensional SVD

In linear acoustics, a sound filed can be repre-sented the Helmholtz equation [1]:

(△+ k2

)u(x) = 0.

Observation of the sound field by M microphonesis a mapping from the sound to measured signal,which is defined as the sampling operator SM asSM : u(x) 7−→ {u(xm)}Mm=1, where {u(xm)}Mm=1 isthe sound field u sampled by M microphones placedat {xm}Mm=1.The aim of this paper is to analyze the sampling

property of SM based on the above physical model.However, SM cannot be analyzed directly because it isthe mapping from the infinite-dimensional function tothe measured finite-dimensional signal as illustrated inFig. 2. Since the measurable information depends onthe configuration of the array, every function satisfyingthe Helmholtz equation must be considered. In otherwords, some convenient representation of the solutionsis necessary in order to make the problem tractable.Since every solution u can be represented by the Her-

glotz wave function [2,3], u can be written as u = H αby defining an operator H : α(ν) 7−→ u(x),

(H α)(x) =

∫S2

α(ν) exp(jk⟨x,ν⟩) dS(ν), (1)

where α is a weight function defined on the unit sphere

S2. Therefore α can be considered as a representativeof the sound field.Sampled version of the Herglotz wave function can

be defined as

u(xm) =

∫S2

α(ν) exp(jk⟨xm,ν⟩) dS(ν), (2)

which is also denoted shortly as u = HMα, whereHM (= SMH ) is the corresponding integral opera-

tor HM : α(ν) 7−→ {u(xm)}Mm=1. The relationshipamong these operators is illustrated in Fig. 1.For analyzing a matrix, one of the most popular

methodologies is SVD which decomposes an M × Nmatrix A into the three matrices as

A = UΣV ∗, (3)

where U is an M ×M unitary matrix, Σ is an M ×Mdiagonal matrix, V is an N × M column orthonor-mal matrix, and V ∗ is conjugate transpose of V . Thisdecomposition reveals the mapping properties of thematrix. There are several routes for computing SVD.Since

AA∗ = UΣV ∗V ΣU∗ = UΣ2U∗, (4)

U and Σ can be obtained through the eigendecompo-sition of AA∗. Then, V is calculated from U and Σ as

twicedifferentiable

functions

Herglotz integralHerglotz sampling :

:: sampling

projection :

sound(solution of the

Helmholtz equation)

measurableinformation

measuredsignal

measurableinformation

weight functions

Fig. 1 Diagram of sampling of sound field. SM is thesampling operator to be analyzed. HM = SMH isthe alternative representation of SM .

Page 2: In nite-dimensional SVD for revealing microphone …...In nite-dimensional SVD for revealing microphone array’s characteristics マイクロホンアレイの特性の解析のための無限次元特異値分解

Fig. 2 Visual examples of right singular functions.

1 kHz 1.7 kHz 2 kHz 1 kHz 1.7 kHz 2 kHz

Fig. 3 Visual examples of spatial impulse response.

Σ−1U∗A = Σ−1U∗UΣV ∗ = Σ−1ΣV ∗ = V ∗. (5)

Although HM is infinite-dimensional, HMH ∗M be-

comes a finite-dimensional matrix which allows the or-dinary matrix decomposition. Therefore, SVD of HM

can be calculated by decomposing the M ×M matrixHMH ∗

M and using Eq. (5). Then, one has to evaluateHMH ∗

M whose (m,n)-th entry is

(HMH ∗M )mn=

∫S2

exp(jk⟨xm,ν⟩)exp(jk⟨xn,ν⟩) dS(ν),

(6)

where z is complex conjugate of z. Since the integrandof Eq. (6) is a zonal function, (HMH ∗

M )mn can be cal-culated analytically as follows

(HMH ∗M )mn= 4π sinc(k∥xm − xn∥), (7)

where sinc(x) = sin(x)/x if x ̸= 0, and sinc(0) = 1.Since the matrix is real and symmetric, it can be

diagonalized by a real-valued orthonormal matrix Uas

HMH ∗M = UΣ2U∗. (8)

Then, m-th column of V , which is a continuous func-tion on the unit sphere, is obtained by Eq. (5) as

vm(ν) =1

σm

M∑ℓ=1

Uℓm exp(jk⟨xℓ,ν⟩), (9)

where V =[v1, v2, . . . , vM ], and Σ = diag(σ1, σ2, · · · , σM ).Note that Eq. (7) does not involve any approximation,and therefore the obtained decomposition should beaccurate up to the machine precision.By the proposed SVD, u = HMα is rewritten as

u = UΣV ∗α, where V ∗ transforms a weight functionα into an M -dimensional vector, Σ multiplies the sin-gular values {σm}Mm=1 to the M -dimensional vector,and U transforms the basis within the M -dimensionalspace to obtain the measured signal u.

3. Numerical Examples

Since V ∗ can be regarded as an M × ∞ matrix,whose rows are the continuous functions, only theM -dimensional components of the weight function ismeasured by the microphone array. That is, themeasurable part of the weight function is the M -dimensional subspace spanned by the right singularfunctions {vm}Mm=1. Furthermore, a projection V V ∗

onto the subspace spanned by {vm}Mm=1 is an inter-mediate representation of the measurable directionalinformation of sound. Therefore, “Spatial impulse re-sponse” of V V ∗ defined as V V ∗δ(ν − ν′) is proposed,

0 1 2 3 4 5 6 7 8Frequency [kHz]

00.10.20.30.40.50.60.70.80.91

Nor

mal

ized

sing

ular

val

ue

Nyquistfrequency

Fig. 4 Normalized singular values of linear array.

which represents the measurable information of a sin-gle plane wave arriving from the direction −ν′.The proposed SVD was applied to a linear array, and

the proposed impulse responses were calculated to seeits property. The microphones of the linear array areplaced in an alignment of interval 0.1 m whose lengthis 1.1 m and Nyquist frequency is 1.7 kHz. The soundspeed was assumed to be 340 m/s.In Fig. 2, each right singular function vm(ν)

weighted by the corresponding singular value σm isillustrated for 1 kHz. As the right singular function isa continuous function on the sphere, its magnitude isrepresented by the radius. The shapes of them indicatea rotational symmetric property of the linear array asin Fig. 2. In Fig. 3, the proposed spatial impulse re-sponse V V ∗δ(ν − ν′) for several ν′ and frequenciesare illustrated. Since the Herglotz wave function withα(ν) = δ(ν−ν′) corresponds to the single plane wave,this figure indicates the uncertainty of the directionalinformation of measured signal. For example, righthalf of Fig. 3 illustrates the rotational ambiguity ofthe linear array as the directional information of thehorizontal plane was disappeared. That is, the hori-zontal direction of the arrival sound cannot be knownfrom the measured signal because the information wasdeleted by V ∗. For the higher frequencies, the impulseresponse has larger sidelobes than the low frequencies.Those can be regarded as the spatial aliasing whichdepends on the array configuration. At 1.7 kHz, thespatial Nyquist frequency of the linear array, it canbe seen that the impulse response becomes symmetricwith respect to the aperture direction of the array.In Fig. 4, the normalized singular values {σm/σ1}12m=1

of the sampling operator HM (M = 12) are illustrated.The Nyquist frequency of the linear array is also de-picted in Fig. 4 as a vertical line. Interestingly, thevertical line agrees with the Nyquist frequency whereall singular values have extrema.

4. Conclusions

In this paper, the infinite-dimensional SVD and thespatial impulse response as the intermediate represen-tation of the measurable information for analyzing mi-crophone arrays is proposed. The right singular func-tions obtained by proposed SVD can visualize the di-rectional information of microphone array. Therefore,applying the proposed method to many practical prob-lems of a microphone array is definitely the next stepwhich should be investigated in the future.

Reference

[ 1 ] Y. Koyano, K. Yatabe, Y. Ikeda, and Y. Oikawa,“Physical-model based efficient data representation formany-channel microphone array,” in IEEE Proc. ICASSP,Mar. 2016, pp. 370–374.

[ 2 ] Y. Koyano, K. Yatabe, and Y. Oikawa, “Infinite-dimensional SVD for analyzing microphone array,” in IEEEProc. ICASSP, Mar. 2017.

[ 3 ] Y. Koyano, K. Yatabe, and Y. Oikawa, “Infinite-dimensional SVD for revealing microphone array’s charac-teristics,” (submitted).