    Data Science Chair


    Related Literature

    • The Natural Language Decathlon: Multitask Learning as Question Answering McCann, Bryan; Keskar, Nitish Shirish; Xiong, Caiming; Socher, Richard (2018).
      cite arxiv:1806.08730
    • Improved Training of Wasserstein GANs Gulrajani, Ishaan; Ahmed, Faruk; Arjovsky, Martín; Dumoulin, Vincent; Courville, Aaron C. in CoRR (2017). abs/1704.00028
    • Non-parametric estimation of Jensen-Shannon Divergence in Generative Adversarial Network training Sinn, Mathieu; Rawat, Ambrish (2017).
      cite arxiv:1705.09199
    • Attention is all you need Vaswani, Ashish; Shazeer, Noam; Parmar, Niki; Uszkoreit, Jakob; Jones, Llion; Gomez, Aidan N; Kaiser, Łukasz; Polosukhin, Illia in Advances in Neural Information Processing Systems (2017). 5998–6008.
    • DRAGNN: A Transition-based Framework for Dynamically Connected Neural Networks Kong, Lingpeng; Alberti, Chris; Andor, Daniel; Bogatyy, Ivan; Weiss, David (2017).
      cite arxiv:1703.04474. Comment: 10 pages; submitted for review to ACL 2017
    • Wasserstein GAN Arjovsky, Martin; Chintala, Soumith; Bottou, Léon (2017).
      cite arxiv:1701.07875
    • Derivation of Backpropagation in Convolutional Neural Network (CNN) Zhang, Zhifei (2016). 7.
    • An Ensemble Method to Produce High-Quality Word Embeddings Speer, Robert; Chin, Joshua (2016).
      cite arxiv:1604.01692. Comment: 12 pages, 3 figures
    • Enriching Word Vectors with Subword Information Bojanowski, Piotr; Grave, Edouard; Joulin, Armand; Mikolov, Tomas (2016).
      cite arxiv:1607.04606. Comment: Accepted to TACL. The first two authors contributed equally
    • ConceptRDF: An RDF presentation of ConceptNet knowledge base Najmi, Erfan; Malik, Zaki; Hashmi, Khayyam; Rezgui, Abdelmounaam in Information and Communication Systems (ICICS), 2016 7th International Conference on (2016). 145–150.
    • Globally Normalized Transition-Based Neural Networks Andor, Daniel; Alberti, Chris; Weiss, David; Severyn, Aliaksei; Presta, Alessandro; Ganchev, Kuzman; Petrov, Slav; Collins, Michael (2016).
      cite arxiv:1603.06042
    • Neural Architectures for Named Entity Recognition Lample, Guillaume; Ballesteros, Miguel; Subramanian, Sandeep; Kawakami, Kazuya; Dyer, Chris in CoRR (2016). abs/1603.01360
    • An Empirical Exploration of Recurrent Network Architectures. Józefowicz, Rafal; Zaremba, Wojciech; Sutskever, Ilya in ICML, JMLR Workshop and Conference Proceedings, F. R. Bach, D. M. Blei (eds.) (2015). (Vol. 37) 2342–2350.
    • LSTM: A Search Space Odyssey. Greff, Klaus; Srivastava, Rupesh Kumar; Koutník, Jan; Steunebrink, Bas R.; Schmidhuber, Jürgen in CoRR (2015). abs/1503.04069
    • Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Recurrent Neural Network Wang, Peilu; Qian, Yao; Soong, Frank K.; He, Lei; Zhao, Hai in CoRR (2015). abs/1510.06168
    • Neural Machine Translation by Jointly Learning to Align and Translate Bahdanau, Dzmitry; Cho, Kyunghyun; Bengio, Yoshua (2014).
      cite arxiv:1409.0473. Comment: Accepted at ICLR 2015 as oral presentation
    • Retrofitting Word Vectors to Semantic Lexicons Faruqui, Manaal; Dodge, Jesse; Jauhar, Sujay K.; Dyer, Chris; Hovy, Eduard; Smith, Noah A. (2014).
      cite arxiv:1411.4166. Comment: Proceedings of NAACL 2015
    • Glove: Global Vectors for Word Representation. Pennington, Jeffrey; Socher, Richard; Manning, Christopher D in EMNLP (2014). (Vol. 14) 1532–1543.
    • Generative Adversarial Networks Goodfellow, Ian J.; Pouget-Abadie, Jean; Mirza, Mehdi; Xu, Bing; Warde-Farley, David; Ozair, Sherjil; Courville, Aaron; Bengio, Yoshua (2014).
      cite arxiv:1406.2661
    • Convolutional Neural Networks for Sentence Classification Kim, Yoon in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, EMNLP 2014, October 25-29, 2014, Doha, Qatar, A meeting of SIGDAT, a Special Interest Group of the ACL (2014). 1746–1751.
    • Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation Cho, Kyunghyun; van Merrienboer, Bart; Gulcehre, Caglar; Bahdanau, Dzmitry; Bougares, Fethi; Schwenk, Holger; Bengio, Yoshua (2014).
      cite arxiv:1406.1078. Comment: EMNLP 2014
    • Training recurrent neural networks Sutskever, Ilya in University of Toronto, Toronto, Ont., Canada (2013).
    • On the difficulty of training Recurrent Neural Networks Pascanu, Razvan; Mikolov, Tomas; Bengio, Yoshua (2012).
      cite arxiv:1211.5063. Comment: Improved description of the exploding gradient problem and description and analysis of the vanishing gradient problem
    • Integration of world knowledge for natural language understanding Ovchinnikova, Ekaterina (2012). (Vol. 3) Springer Science & Business Media.
    • Natural Language Understanding and World Knowledge Ovchinnikova, Ekaterina in Integration of World Knowledge for Natural Language Understanding (2012). 15–37.
    • BLEU: a method for automatic evaluation of machine translation Papineni, Kishore; Roukos, Salim; Ward, Todd; Zhu, Wei-Jing in Proceedings of the 40th annual meeting on association for computational linguistics (2002). 311–318.