Marine Carpuat
NRC Institute for Information Technology283 Alexandre-Taché
Building CRTL, Room F-1040
Gatineau, Quebec J8X 3X7
Tel: +1 613-993-5038
Fax: +1 819-934-2607
Email: Marine.Carpuatx[at]xcnrc-nrc.gc.ca
I just moved to the National Research Council Canada, where I am a member of the Interactive Language Technology group in Gatineau, Québec.
Until September 2010, I was a postdoctoral researcher at the Columbia University Center for Computational Learning Systems working with Mona Diab. My research interests are in natural language processing with a focus on learning semantic models for NLP applications, in particular statistical machine translation. I received a PhD in Computer Science from the Hong Kong University of Science & Technology, where I worked with Dekai Wu at the Human Language Technology Center. Before that, I earned a MPhil in Electrical Engineering from HKUST under the supervision of Pascale Fung, and an engineering degree from the French Grande Ecole Supelec. I did my undergraduate studies in France, at Sainte Genevieve and Supelec.
Recent activities
Publications
- Marine CARPUAT, Yuval MARTON and Nizar HABASH. "Improving Arabic-to-English Statistical Machine Translation by Reordering Post-verbal Subjects for Alignment". 48th Annual Conference of the Association for Computational Linguistics (ACL 2010). Short Paper. Uppsala, Sweden: July 2010.
- Marine CARPUAT, Yuval MARTON and Nizar HABASH. "Reordering Matrix Post-verbal Subjects for Arabic-to-English SMT". 17th Conference sur le Traitement des Langues Naturelles (TALN 2010). Montreal, Canada: July 2010. Best Paper Award.
- Marine CARPUAT and Mona DIAB, "Task-based Evaluation of Multiword Expressions: a Pilot Study in Statistical Machine Translation", HLT-NAACL 2010.
- Dekai WU, Pascale FUNG, Marine CARPUAT, Chi-kiu LO, Yongsheng YANG, and Zhaojun WU, "Lexical Semantics for Statistical Machine Translation", to appear in the GALE book chapter on "MT from text", 2010.
- Marine CARPUAT. "One Translation Per Discourse". Semantic Evaluations Workshop at NAACL-HLT 2009 (SEW-2009). Boulder, CO: June 2009.
- Marine CARPUAT. "Toward Using Morphology in French-English Phrase-Based SMT". EACL 2009 Fourth Workshop on Statistical Machine Translation. Athens, Greece: March 2009.
- Marine CARPUAT and Dekai WU. "Evaluation of Context-Dependent Phrasal Translation Lexicons for Statistical Machine Translation". Sixth International Conference on Language Resources and Evaluation (LREC-2008). Marrakech, Morocco: May 2008.
- Yihai SHEN, Chi-kiu LO, Marine CARPUAT and Dekai WU. "HKUST Statistical Machine Translation Experiments for IWSLT 2007". Fourth International Workshop on Spoken Language Translation (IWSLT 2007). Trento: Oct 2007. 84-88.
- Marine CARPUAT and Dekai WU. "Context-Dependent Phrasal Translation Lexicons for Statistical Machine Translation". Machine Translation Summit XI. Copenhagen: Sep 2007.
- Marine CARPUAT and Dekai WU. "How Phrase Sense Disambiguation outperforms Word Sense Disambiguation for Statistical Machine Translation". 11th International Conference on Theoretical and Methodological Issues in Machine Translation (TMI 2007). Skovde: Sep 2007.
- Marine CARPUAT and Dekai WU. "Improving Statistical Machine Translation using Word Sense Disambiguation". 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL 2007). Prague: Jun 2007.
- Marine CARPUAT, Pascale FUNG and Grace NGAI. "Aligning Word Senses Using Bilingual Corpora", ACM Transactions on Asian Language and Information Processing, 5(2), pp 89-120, 2006.
- Dekai WU, Marine CARPUAT, and Yihai SHEN. "Inversion Transduction Grammar Coverage of Arabic-English Word Alignment for Tree-Structured Statistical Machine Translation". IEEE/ACL 2006 Workshop on Spoken Language Technology (SLT 2006). Aruba: Dec 2006.
- Marine CARPUAT, Yihai SHEN, Xiaofeng YU and Dekai WU. "Toward Integrating Semantic Processing in Statistical Machine Translation". International Workshop on Spoken Language Translation (IWSLT 2006). Kyoto: Nov 2006.
- Xiaofeng YU, Marine CARPUAT and Dekai WU. "Boosting for Chinese Named Entity Recognition". 5th SIGHAN Workshop on Chinese Language Processing. Sydney, Australia: July 2006.
- Marine CARPUAT and Dekai WU. "Evaluating the Word Sense Disambiguation Performance of Statistical Machine Translation". Second International Joint Conference on Natural Language Processing (IJCNLP-2005). Jeju, Korea: Oct 2005.
- Marine CARPUAT and Dekai WU. "Word Sense Disambiguation vs. Statistical Machine Translation". 43rd Annual Meeting of the Association for Computational Linguistics (ACL-2005). Ann Arbor, MI: Jun 2005.
- Weifeng SU, Marine CARPUAT, and Dekai WU. "Semi-Supervised Training of a Kernel PCA Model for Word Sense Disambiguation". 20th International Conference on Computational Linguistics (COLING-2004). Geneva: Aug 2004.
- Dekai WU, Grace NGAI, and Marine CARPUAT. " Why Nitpicking Works: Evidence for Occam's Razor in Error Correctors". 20th International Conference on Computational Linguistics (COLING-2004). Geneva: Aug 2004.
- Dekai WU, Weifeng SU, and Marine CARPUAT. "A Kernel PCA Method for Superior Word Sense Disambiguation". 42nd Annual Meeting of the Association for Computational Linguistics (ACL-2004). Barcelona: Jul 2004.
- Marine CARPUAT, Weifeng SU, and Dekai WU. "Augmenting Ensemble Classification for Word Sense Disambiguation with a Kernel PCA Model". Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text (Senseval-3). ACL-2004 Workshop. Barcelona: Jul 2004.
- Grace NGAI, Dekai WU, Marine CARPUAT, Chi-Shing WANG, and Chi-Yung WANG. "Semantic Role Labeling with Boosting, SVMs, Maximum Entropy, SNOW, and Decision Lists". Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text (Senseval-3). ACL-2004 Workshop. Barcelona: Jul 2004.
- Richard WICENTOWSKI, Grace NGAI, Dekai WU, Marine CARPUAT, Emily THOMFORDE, and Adrian PACKEL. "Joining forces to resolve lexical ambiguity: East meets West in Barcelona". Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text (Senseval-3). ACL-2004 Workshop. Barcelona: Jul 2004.
- Dekai WU, Grace NGAI, and Marine CARPUAT. "Raising the Bar: Stacked Conservative Error Correction Beyond Boosting". Fourth International Conference on Language Resources and Evaluation (LREC-2004). Lisbon: May 2004.
- Lufeng ZHAI, Pascale FUNG, Richard SCHWARTZ, Marine CARPUAT and Dekai WU. "Using N-best Lists for Named Entity Recognition from Chinese Speech". Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT/NAACL-2004). Boston: May 2004.
- Dekai WU, Grace NGAI, and Marine CARPUAT. "N-fold Templated Piped Correction". First International Joint Conference on Natural Language Processing (IJCNLP-2004). Hainan, China: Mar 2004.
- Dekai WU, Grace NGAI, and Marine CARPUAT. "A stacked, voted, stacked model for named entity recognition". Computational Natural Language Learning (CoNLL-2003), at Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics (HLT/NAACL-2003). Edmonton, Canada: May 2003.
- Dekai WU, Grace NGAI, Marine CARPUAT, Jeppe LARSEN, and Yongsheng YANG. "Boosting for named entity recognition". Computational Natural Language Learning (CoNLL-2002), at 19th International Conference on Computational Linguistics (Coling-2002), 195-198. Taipei: Sep 2002.
- Grace NGAI, Marine CARPUAT and Pascale FUNG. "Identifying Concepts Across Languages: A First Step towards a Corpus-based Approach to Automatic Ontology Alignment". 19th International Conference on Computational Linguistics (COLING-2002). Taipei: Sep 2002.
- Marine CARPUAT, Grace NGAI, Pascale FUNG and Kenneth CHURCH. "Creating a bilingual ontology: a corpus-based approach for aligning WordNet and HowNet". 1st Global WordNet conference. Mysore, India: Jan 2002.
Last updated: Jan 2011