About

I am a PhD candidate and professional lecturer at The George Washington University. I work with Prof. Mona Diab on natural language processing research, particularly vector-space models for words and sentences, cross-lingual mapping of such models, and their applications in paraphrasing and machine translation.

Projects

Currently, I am working on learning general-purpose cross-lingual vector representation that can be used for semantic textual similarity, document classification, paraphrasing, and machine translation using matrix factorization and neural networks. I am also investigating unsupervised methods for deducing dictionaries from independent vector-space models. Previously, I worked on Arabic part-of-speech tagging as part of the MADAMIRA project.

Teaching

  • Fall 2018: Instructor, 3907/6907 - Intro to Statistical NLP.
  • Spring 2018: Lab instructor, CSCI 1112 - Algorithms and Data Structures.
  • Fall 2017: Teaching assistant, Discrete Structures I.
  • Spring 2017: Teaching assistant, CSCI 3362 / 6362 - Probability for Computer Science.
  • Fall 2016: Teaching assistant, CSCI 4364/6364 - Machine Learning.
  • Fall 2014 to Spring 2016: Lab instructor, CSCI 1112 - Algorithms and Data Structures.
  • Spring 2014: Teaching assistant, CSCI 3907/6907 - Intro to Statistical NLP.
  • Fall 2013: Teaching assistant, CSCI 1112 - Algorithms and Data Structures.
  • NLP Publications

    Hanan Aldarmaki and Mona Diab. "Evaluation of Unsupervised Compositional Representations." In Processings of The 27th International Conference on Computational Linguistics (COLING), 2018. [pdf]

    Hanan Aldarmaki, Mahesh Mohan, and Mona Diab. "Unsupervised Word Mapping Using Structural Similarities in Monolingual Embeddings." Transactions of the Association for Computational Linguistics (TACL), 2018. [pdf]

    Hanan Aldarmaki and Mona Diab. "Learning Cross-lingual Representations with Matrix Factorization." Workshop on Multilingual and Cross-lingual Methods in NLP, 2016. [pdf]

    Hanan Aldarmaki and Mona Diab. "GWU NLP at SemEval-2016 Shared Task 1: Matrix Factorization for Crosslingual STS." Proceedings of SemEval, 2016. [pdf]

    Robust Part-of-Speech Tagging of Arabic Text. Hanan Aldarmaki & Mona Diab. Proceedings of the second workshop on Arabic Natural Language Processing (ANLP 2015). [pdf]

    Learn More

    Email me here if you have questions.