The Natural Language Decathlon: Multitask Learning as Question Answering McCann, Bryan; Keskar, Nitish Shirish; Xiong, Caiming; Socher, Richard (2018).
The Natural Language Decathlon: Multitask Learning as Question Answering McCann, Bryan; Keskar, Nitish Shirish; Xiong, Caiming; Socher, Richard (2018).
Improved Training of Wasserstein GANs Gulrajani, Ishaan; Ahmed, Faruk; Arjovsky, Mart{\’{\i}}n; Dumoulin, Vincent; Courville, Aaron C. in CoRR (2017). abs/1704.00028
Non-parametric estimation of Jensen-Shannon Divergence in Generative Adversarial Network training Sinn, Mathieu; Rawat, Ambrish (2017).
Attention is all you need Vaswani, Ashish; Shazeer, Noam; Parmar, Niki; Uszkoreit, Jakob; Jones, Llion; Gomez, Aidan N; Kaiser, {\L}ukasz; Polosukhin, Illia in Advances in Neural Information Processing Systems (2017). 5998–6008.
DRAGNN: A Transition-based Framework for Dynamically Connected Neural Networks Kong, Lingpeng; Alberti, Chris; Andor, Daniel; Bogatyy, Ivan; Weiss, David (2017).
Wasserstein GAN Arjovsky, Martin; Chintala, Soumith; Bottou, Léon (2017).
Derivation of Backpropagation in Convolutional Neural Network (CNN) Zhang, Zhifei (2016). 7.
An Ensemble Method to Produce High-Quality Word Embeddings Speer, Robert; Chin, Joshua (2016).
Enriching Word Vectors with Subword Information Bojanowski, Piotr; Grave, Edouard; Joulin, Armand; Mikolov, Tomas (2016).
ConceptRDF: An RDF presentation of ConceptNet knowledge base Najmi, Erfan; Malik, Zaki; Hashmi, Khayyam; Rezgui, Abdelmounaam in Information and Communication Systems (ICICS), 2016 7th International Conference on (2016). 145–150.
Globally Normalized Transition-Based Neural Networks Andor, Daniel; Alberti, Chris; Weiss, David; Severyn, Aliaksei; Presta, Alessandro; Ganchev, Kuzman; Petrov, Slav; Collins, Michael (2016).
Neural Architectures for Named Entity Recognition Lample, Guillaume; Ballesteros, Miguel; Subramanian, Sandeep; Kawakami, Kazuya; Dyer, Chris in CoRR (2016). abs/1603.01360
An Empirical Exploration of Recurrent Network Architectures. Józefowicz, Rafal; Zaremba, Wojciech; Sutskever, Ilya in ICML, JMLR Workshop and Conference Proceedings, F. R. Bach, D. M. Blei (reds.) (2015). (Vol. 37) 2342–2350.
LSTM: A Search Space Odyssey. Greff, Klaus; Srivastava, Rupesh Kumar; Koutník, Jan; Steunebrink, Bas R.; Schmidhuber, Jürgen in CoRR (2015). abs/1503.04069
Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Recurrent Neural Network Wang, Peilu; Qian, Yao; Soong, Frank K.; He, Lei; Zhao, Hai in CoRR (2015). abs/1510.06168
Neural Machine Translation by Jointly Learning to Align and Translate Bahdanau, Dzmitry; Cho, Kyunghyun; Bengio, Yoshua (2014).
Retrofitting Word Vectors to Semantic Lexicons Faruqui, Manaal; Dodge, Jesse; Jauhar, Sujay K.; Dyer, Chris; Hovy, Eduard; Smith, Noah A. (2014).
Glove: Global Vectors for Word Representation. Pennington, Jeffrey; Socher, Richard; Manning, Christopher D in EMNLP (2014). (Vol. 14) 1532–1543.
Generative Adversarial Networks Goodfellow, Ian J.; Pouget-Abadie, Jean; Mirza, Mehdi; Xu, Bing; Warde-Farley, David; Ozair, Sherjil; Courville, Aaron; Bengio, Yoshua (2014).
Convolutional Neural Networks for Sentence Classification Kim, Yoon in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, {EMNLP} 2014, October 25-29, 2014, Doha, Qatar, {A} meeting of SIGDAT, a Special Interest Group of the {ACL} (2014). 1746–1751.
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation Cho, Kyunghyun; van Merrienboer, Bart; Gulcehre, Caglar; Bahdanau, Dzmitry; Bougares, Fethi; Schwenk, Holger; Bengio, Yoshua (2014).
Training recurrent neural networks Sutskever, Ilya in University of Toronto, Toronto, Ont., Canada (2013).
On the difficulty of training Recurrent Neural Networks Pascanu, Razvan; Mikolov, Tomas; Bengio, Yoshua (2012).
Integration of world knowledge for natural language understanding Ovchinnikova, Ekaterina (2012). (Vol. 3) Springer Science \& Business Media.
Natural Language Understanding and World Knowledge Ovchinnikova, Ekaterina in Integration of World Knowledge for Natural Language Understanding (2012). 15–37.
BLEU: a method for automatic evaluation of machine translation Papineni, Kishore; Roukos, Salim; Ward, Todd; Zhu, Wei-Jing in Proceedings of the 40th annual meeting on association for computational linguistics (2002). 311–318.