Overview:
Making complex decisions in real
world problems often involves assigning values to sets of interdependent variables where the expressive dependency structure can influence, or even dictate, what assignments
are possible.
Structured learning problems provide one such example, but the setting we study is broader. We are interested in cases where decisions depend on multiple models that cannot be learned simultaneously as well as cases where constraints among models' outcomes are available only at decision time.
We have developed a general framework -- Constrained Conditional Models -- that augments the learning of conditional (probabilistic or discriminative) models with declarative
constraints (written, for example, using a first-order representation) as a way to support decisions in an expressive output space while maintaining modularity and
tractability of training and inference.
While incorporating nonlocal
dependencies in a probabilistic model can lead to intractable training and inference, our framework allows one to learn a rather simple (or multiple simple) model(s), and make decisions with more expressive models that take into account also global declerative (hard or soft) constraints.
We have used this framework successfully in the context of multiple NLP and IE problems, starting with our work on named entities and relations (CoNLL'94) and our SRL work.
Our framework, which suggests to learn conditional models and use them as an objective function for a global
constrained optimization problem, has been followed by a large body of work in NLP.
Following (Roth and Yih,
2004) that has formalized global decision problems in the context of IE
as constrained optimization problems and solved these optimization problems using Integer Linear Programming (ILP) we have seen (Punyakanok et al., 2005; Barzilay and Lapata, 2006;
Clarke and Lapata, ; Marciniak and Strube, 2005) and others.
We have also studied theoretically training paradigms for CCMs and have developed an understanding for the advantages of different training regimes.
Recently we studied unsupervised learning in this framework and have shown that declarative
constraints can be used to take advantage of unlabeled
data when training conditional models.
Relevant Publications:
- D. Roth and Y. Tu, Aspect Guided Text Categorization with Unobserved Labels. Proceedings of ICDM (2009)
- M. Chang, D. Goldwasser, D. Roth, and Y. Tu, Unsupervised Constraint Driven Learning For Transliteration Discovery. NAACL (2009)
- D. Goldwasser and D. Roth, Transliteration as Constrained Optimization. EMNLP (2008) pp. xx--yy
- M. Chang, L. Ratinov, and D. Roth, Constraints as Prior Knowledge. ICML Workshop on Prior Knowledge for Text and Language Processing (2008) pp. 32-39
- M. Chang, L. Ratinov, N. Rizzolo, and D. Roth, Learning and Inference with Constraints. Proceedings of the National Conference on Artificial Intelligence (AAAI) (2008)
- M. Chang, L. Ratinov, and D. Roth, Guiding Semi-Supervision with Constraint-Driven Learning. Proc. of the Annual Meeting of the ACL (2007) pp. 280--287
- N. Rizzolo and D. Roth, Modeling Discriminative Global Inference. Proceedings of the First International Conference on Semantic Computing (ICSC) (2007) pp. 597-604
- D. Roth and W. Yih, Global Inference for Entity and Relation Identification via a Linear Programming Formulation. Introduction to Statistical Relational Learning (2007)
- D. Roth and W. Yih, Integer Linear Programming Inference for Conditional Random Fields. Proc. of the International Conference on Machine Learning (ICML) (2005) pp. 737--744
- V. Punyakanok, D. Roth, W. Yih, and D. Zimak, Learning and Inference over Constrained Output. Proc. of the International Joint Conference on Artificial Intelligence (IJCAI) (2005) pp. 1124--1129
- V. Punyakanok and D. Roth, Inference with Classifiers: The Phrase Identification Problem. Computational Linguistics (2005)
- D. Roth and W. Yih, A Linear Programming Formulation for Global Inference in Natural Language Tasks. Proc. of the Annual Conference on Computational Natural Language Learning (CoNLL) (2004) pp. 1--8
- V. Punyakanok, D. Roth, W. Yih, and D. Zimak, Semantic Role Labeling via Integer Linear Programming Inference. Proc. the International Conference on Computational Linguistics (COLING) (2004) pp. 1346--1352
- V. Punyakanok, D. Roth, W. Yih, D. Zimak, and Y. Tu, Semantic Role Labeling via Generalized Inference over Classifiers Shared Task Paper. Proc. of the Annual Conference on Computational Natural Language Learning (CoNLL) (2004) pp. 130--133
- D. Roth and W. Yih, A Linear Programming Formulation for Global Inference in Natural Language Tasks. Proceedings of AI & Math (2004) pp. 1--8
- V. Punyakanok, D. Roth, W. Yih, and D. Zimak, Learning via Inference over Structurally Constrained Output. NIPS Workshop on Learning Structured Output (2004)
- V. Punyakanok and D. Roth, The Use of Classifiers in Sequential Inference. The Conference on Advances in Neural Information Processing Systems (NIPS) (2001) pp. 995--1001