Abstract

Language models are used in many natural language processing applications. In recent years, recurrent neural network based language models have outperformed conventional n-gram based techniques. However, it is difficult for neural network architectures to make use of linguistic annotations. We incorporate part-of-speech features into a recurrent neural network language model and use them to predict the next word. Specifically, we propose a parallel structure containing two recurrent neural networks, one for word sequence modeling and the other for part-of-speech sequence modeling. The state of the part-of-speech network helps improve the prediction of the next word. Experiments show that the proposed method achieves lower perplexity than a traditional recurrent network language model and performs better at reranking machine translation outputs.
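As a rough illustration of the parallel structure described in the abstract (not the authors' implementation, which is given in the paper itself), a minimal PyTorch sketch might look like the following. The class name, dimensions, and the choice to fuse the two hidden states by concatenation before the softmax layer are all assumptions for illustration.

```python
import torch
import torch.nn as nn

class ParallelPOSRNNLM(nn.Module):
    """Sketch of a parallel RNN language model: one RNN runs over the
    word sequence, a second RNN runs over the POS-tag sequence, and the
    POS hidden state augments the word hidden state when predicting the
    next word. Fusion by concatenation is an assumption; the paper may
    combine the states differently."""

    def __init__(self, vocab_size, pos_size, emb_dim=128, hidden_dim=256):
        super().__init__()
        self.word_emb = nn.Embedding(vocab_size, emb_dim)
        self.pos_emb = nn.Embedding(pos_size, emb_dim)
        self.word_rnn = nn.RNN(emb_dim, hidden_dim, batch_first=True)
        self.pos_rnn = nn.RNN(emb_dim, hidden_dim, batch_first=True)
        # Next-word softmax logits over the concatenated hidden states.
        self.out = nn.Linear(2 * hidden_dim, vocab_size)

    def forward(self, words, tags):
        # words, tags: (batch, seq_len) index tensors, aligned token-wise.
        h_word, _ = self.word_rnn(self.word_emb(words))
        h_pos, _ = self.pos_rnn(self.pos_emb(tags))
        return self.out(torch.cat([h_word, h_pos], dim=-1))
```

At each time step the word network and the POS network advance in parallel over aligned inputs, so the POS state is available to condition the word prediction without being folded into the word embedding itself.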

Original language: English
Pages: 140-147
Number of pages: 8
Publication status: Published - 2019
Event: 31st Pacific Asia Conference on Language, Information and Computation, PACLIC 2017 - Cebu City, Philippines
Duration: 16 Nov 2017 - 18 Nov 2017

Conference

Conference: 31st Pacific Asia Conference on Language, Information and Computation, PACLIC 2017
Country/Territory: Philippines
City: Cebu City
Period: 16/11/17 - 18/11/17