
Discriminative Syntactic Language Modeling for Speech Recognition

Michael Collins, Brian Roark, Murat Saraclar

MIT CSAIL, OGI/OHSU, Bogazici University

Presenter: 郝柏翰, 2013/02/26

43rd Annual Meeting of the Association for Computational Linguistics


Outline

• Introduction

• Parse Tree Features

• Experiments

• Conclusion


Introduction

1) SLM (syntactic language modeling) as I understand it

2) p-values (significance testing)


Introduction

• Word n-gram models have been extremely successful as language models (LMs) for speech recognition.

• However, n-grams capture only local context; modeling long-span dependencies can help the language model better predict words.

• This paper describes a method for incorporating syntactic features into the language model in a reranking approach, using discriminative parameter estimation techniques.

$$w^* = \arg\max_{w} \Big( \beta \log P_l(w) + \langle \alpha, \Phi(a, w) \rangle + \log P_a(a \mid w) \Big)$$
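Here $P_l$ is the baseline language model, $P_a(a \mid w)$ the acoustic model score for acoustics $a$, $\Phi(a, w)$ the feature vector, and $\alpha$, $\beta$ the feature weights and LM scale. Below is a minimal sketch of how this decoding rule could be applied to an n-best list; the hypothesis keys (lm_logprob, ac_logprob, features) and the sparse-dict representation are illustrative assumptions, not the paper's implementation.

```python
def rerank(nbest, alpha, beta):
    """Pick the hypothesis maximizing
    beta * log P_l(w) + <alpha, Phi(a, w)> + log P_a(a | w).

    nbest: list of dicts with hypothetical keys 'lm_logprob' (log P_l(w)),
    'ac_logprob' (log P_a(a|w)), and 'features' (sparse Phi(a, w) as
    {feature_name: count}); alpha is a dict of feature weights.
    """
    def score(hyp):
        feat_score = sum(alpha.get(f, 0.0) * v
                         for f, v in hyp["features"].items())
        return beta * hyp["lm_logprob"] + feat_score + hyp["ac_logprob"]

    return max(nbest, key=score)
```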


Introduction

• Our approach differs from previous work in two important respects.

1) Through the feature-vector representation, we can incorporate essentially arbitrary sources of information from the string or parse tree into the model. We would argue that our method allows considerably more flexibility in the choice of features.

2) The second contrast with previous work is the use of discriminative parameter estimation techniques.


Parameter Estimation

1) Perceptron

2) Global conditional log-linear models (GCLMs)


• We report results using just the perceptron algorithm, which has allowed us to explore more of the potential feature space than would have been possible with the more costly GCLM estimation techniques. A sketch of the reranking perceptron follows.
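A minimal sketch of a standard reranking perceptron in the style of Collins (2002), assuming each hypothesis carries a sparse feature dict and a WER used to pick the oracle; these field names and the exact update schedule are illustrative assumptions, not the paper's recipe.

```python
from collections import defaultdict

def perceptron_train(train, epochs=5):
    """For each n-best list, if the current model picks a hypothesis
    other than the oracle (lowest-WER) one, update the weights toward
    the oracle's features and away from the prediction's.

    train: list of n-best lists; each hypothesis is a dict with
    hypothetical keys 'features' ({name: count}) and 'wer'.
    """
    alpha = defaultdict(float)
    for _ in range(epochs):
        for nbest in train:
            oracle = min(nbest, key=lambda h: h["wer"])
            pred = max(nbest, key=lambda h: sum(
                alpha[f] * v for f, v in h["features"].items()))
            if pred is not oracle:
                for f, v in oracle["features"].items():
                    alpha[f] += v
                for f, v in pred["features"].items():
                    alpha[f] -= v
    return alpha
```

In practice the baseline recognizer score is usually folded in as an additional feature with a fixed weight, and averaged weights (rather than the final ones) are used at test time.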


Parse Tree Features

• Figure 2 shows a Penn Treebank-style parse tree of the sort produced by the parser.


Parse Tree Features

• Sequences derived from a parse tree (feature extraction is sketched after this list):

1) POS-tag sequence: we/PRP helped/VBD her/PRP paint/VB the/DT house/NN

2) Shallow parse tag sequence: we/NPb helped/VPb her/NPb paint/VPb the/NPb house/NPc

3) Shallow parse tag plus POS tag: we/PRP-NPb helped/VBD-VPb her/PRP-NPb paint/VB-VPb the/DT-NPb house/NN-NPc

4) Shallow category with lexical head sequence: we/NP helped/VP her/NP paint/VP house/NP
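Any of the four sequences above can be reduced to sparse n-gram features for the linear model. A minimal sketch under assumed representations (the function name, feature-name scheme, and (word, tag) input format are illustrative, not the paper's):

```python
def ngram_features(tagged, n=3, prefix="pos"):
    """Turn a tag sequence such as the POS-tag sequence above into
    sparse n-gram counts.

    tagged: list of (word, tag) pairs, e.g. [("we", "PRP"), ...].
    """
    feats = {}
    tags = [t for _, t in tagged]
    for k in range(1, n + 1):
        for i in range(len(tags) - k + 1):
            name = prefix + ":" + "_".join(tags[i:i + k])
            feats[name] = feats.get(name, 0) + 1
    return feats

# e.g. ngram_features([("we", "PRP"), ("helped", "VBD"), ("her", "PRP"),
#                      ("paint", "VB"), ("the", "DT"), ("house", "NN")])
```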


Parse Tree Features: Head-to-Head (H2H)

• P: parent node
• HC: head child
• C: non-head child

• +: right
• -: left
• 1: adjacent
• 2: non-adjacent
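Using this legend, each head-to-head dependency can be encoded as a single feature string. A minimal sketch (the encoding scheme shown is an illustrative assumption, not the paper's exact template):

```python
def h2h_feature(parent, head_child, other_child, direction, adjacent):
    """Encode one head-to-head dependency using the legend above:
    direction '+' (non-head child to the right of the head) or '-'
    (left), adjacency 1 (adjacent) or 2 (non-adjacent).
    """
    adj = "1" if adjacent else "2"
    return f"h2h:{parent}:{head_child}:{other_child}:{direction}{adj}"

# e.g. for VP -> VBD NP with the NP immediately right of the head VBD:
# h2h_feature("VP", "VBD", "NP", "+", adjacent=True) -> 'h2h:VP:VBD:NP:+1'
```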


Experiments

• The training set consists of 297,580 transcribed utterances (3,297,579 words). The oracle score for the 1000-best lists was 16.7% WER.

• For the n-gram perceptron, the training set was partitioned into 28 sets (a sketch of such a split follows).
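In this line of work, partitioning the training set typically serves a cross-validation-style setup: n-best lists for each fold are generated with models trained on the remaining folds, so the training lists resemble test conditions. A minimal sketch of such a split (the function and round-robin scheme are illustrative assumptions, not the paper's exact procedure):

```python
def partition(utterances, k=28):
    """Round-robin split of the training utterances into k folds;
    n-best lists for each fold can then be produced with models
    trained on the other k-1 folds.
    """
    folds = [[] for _ in range(k)]
    for i, utt in enumerate(utterances):
        folds[i % k].append(utt)
    return folds
```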


Experiments

• The first additional features we experimented with were derived from the POS-tag sequence.



Experiments

• This yielded 35.2% WER, a 0.3% absolute reduction over n-gram features alone (significant at p < 0.001), for a total reduction of 1.2% over the baseline recognizer.


Conclusion

• The results presented in this paper are a first step in examining the potential utility of syntactic features for discriminative language modeling for speech recognition.

• The best of these gave a small but significant improvement beyond what was provided by the n-gram features.

• Future work will include further investigation of parser-derived features. In addition, we plan to explore alternative parameter estimation methods.