Ido Dagan
Department of Computer Science, Bar Ilan University

[Contact | Short Bio | Activities | Teaching | Students |Publications]


Book

  1. Quiñonero-Candela, J.; Dagan, I.; Magnini, B.; d'Alché-Buc, F. (Eds.) Machine Learning Challenges. Lecture Notes in Computer Science, Vol. 3944, 462 p. Springer, 2006.

Journal Articles

  1. Dagan, Ido, Martin C. Golumbic and Ron Y. Pinter. Trapezoid graphs and their coloring, Discrete Applied Mathematics, 1988, Vol. 21, pp. 35-46.
  2. Dagan, Ido and Alon Itai. Set expression based inheritance system, Annals of Mathematics and Artificial Intelligence, 1991, Vol. 4(3-4), pp. 269-280.
  3. Dagan, Ido and Alon Itai. Word sense disambiguation using a second language monolingual corpus, Computational Linguistics, 1994, Vol. 20(4), pp. 563-596.  [pdf]
  4. Dagan, Ido, John Justeson, Shalom Lappin, Herbert Leass and Amnon Ribak. Syntax and lexical statistics in anaphora resolution, Applied Artificial Intelligence, 1995, Vol. 9, pp. 633-644.
  5. Dagan, Ido, Shaul Marcus and Shaul Markovitch. Contextual word similarity and estimation from sparse data, Computer, Speech and Language, 1995, Vol. 9, pp. 123-152.  
  6. Dagan, Ido and Kenneth Church. Termight: Coordinating man and machine in bilingual terminology acquisition, Machine Translation, 1997, Vol. 12(1-2), pp. 89-107. [ps]
  7. Feldman, Ronen, Ido Dagan and Haym Hirsh. Mining text using keyword distributions, Journal of Intelligent Information Systems, 1998, Vol. 10(3), pp. 281-300. [pdf]
  8. Dagan, Ido, Lillian Lee and Fernando Pereira. Similarity-based models of cooccurrence probabilities, Machine Learning, 1999, Vol. 34(1-3) special issue on Natural Language Learning, pp. 43-69. [pdf]
  9. Argamon, Shlomo, Ido Dagan and Yuval Krymolowski. A memory based approach to learning shallow natural language patterns, Journal of Experimental and Theoretical AI (JETAI), 1999, Vol. 11, pp. 369-390. [pdf]
  10. Argamon-Engleson, Shlomo and Ido Dagan. Committee-Based Sample Selection for Probabilistic Classifiers, Journal of Artificial Intelligence Research (JAIR), 1999, Vol. 11, pp. 335-360. [ps]
  11. Marx, Zvika and Ido Dagan. Conceptual mapping through keyword coupled clustering. Mind and Society: a Special Issue on Commonsense and Scientific Reasoning, 4(2), pp. 59-85, 2001. [ps]
  12. Marx, Zvika, Ido Dagan, Joachim M. Buhmann and Eli Shamir. Coupled clustering: a method for detecting structural correspondence, Journal of Machine Learning Research, 2002, Vol. 3(Dec), pp. 747-780. [pdf]
  13. M. Koppel, N. Akiva and I. Dagan, (2005), Feature Instability as a Criterion for Selecting Potential Style Markers, Journal of the American Society for Information Science and Technology (JASIST), Volume 57, Number 11, September 2006, pp. 1519-1525. [ps]
  14. A Gliozzo, C Strapparava, I Dagan, (2005), Unsupervised and Supervised Exploitation of Semantic Domains in Lexical Disambiguation, Computer Speech and Language, Vol. 18, Issue 3, July 2004, pp. 275-299. [pdf]
  15. Zhitomirsky-Geffet, Maayan and Ido Dagan. Bootstrapping Distributional Feature Vector Quality, Computational Linguistics, to appear.  [pdf]
  16. Alfio Gliozzo, Carlo Strapparava and Ido Dagan. 2009. Improving text categorization bootstrapping via unsupervised learning. ACM Transactions on Speech and Language Processing (TSLP). Volume 6 Issue 1. [pdf]
  17. Lili Kotlerman, Ido Dagan, Idan Szpektor and Maayan Zhitomirsky-Geffet. Directional Distributional Similarity for Lexical Inference. Special Issue of Natural Language Engineering on Distributional Lexical Semantics. Natural Language Engineering 16 (4): 359–389. Cambridge University Press, 2010. [pdf]

Articles in Books

  1. Engelson, Sean and Ido Dagan. Sample selection in natural language learning, in S. Wermter, E. Riloff and G. Scheler (Eds.), Connectionist, Statistical and Symbolic Approaches to Learning for Natural Language Processing, Springer, 1996, pp. 230-245.
  2. Dagan, Ido, Kenneth Church and William Gale. Robust bilingual word alignment for machine aided translation, in S. Armstrong, K. Church, P. Isabelle, S. Manzi, E. Tzoukermann and D. Yarowsky (Eds.), Natural Language Processing Using Very Large Corpora, Kluwer Academic Publishers, 1999, pp. 209-224.
  3. Dagan, Ido. Contextual Word Similarity, in Rob Dale, Hermann Moisl and Harold Somers (Eds.), Handbook of Natural Language Processing, Marcel Dekker Inc, 2000, Chapter 19, pp. 459-476. [doc]
  4. Choueka, Yaacov, Ehud S. Conley and Ido Dagan. A comprehensive bilingual word alignment system: application to disparate languages - Hebrew and English, in J. Veronis (Ed.), Parallel Text Processing, Kluwer Academic Publishers, 2000, pp. 69–96. [doc]
  5. Dagan, Ido and Yuval Krymolowski. Compositional memory-based partial parsing, in R. Bod, R. Scha and K. Sima'an (Eds.), Data-Oriented Parsing, CSLI Publications, 2002, forthcoming (20 pages). [pdf]
  6. Glickman Oren, Ido Dagan. Acquiring lexical paraphrases from a single corpus, in N. Nicolov, K. Bontcheva, G. Angelova and R. Mitkov (editors). Recent Advances in Natural Language Processing III, John Benjamins Publ. Co., Amsterdam, 2004, pp. 81-90. [pdf]
  7. Ido Dagan, Oren Glickman and Bernardo Magnini. The PASCAL Recognising Textual Entailment Challenge. In Quiñonero-Candela, J.; Dagan, I.; Magnini, B.; d'Alché-Buc, F. (Eds.) Machine Learning Challenges. Lecture Notes in Computer Science , Vol. 3944, pp. 177-190, Springer, 2006. [pdf]
  8. Oren Glickman, Ido Dagan and Moshe Koppel. A lexical alignment model for probabilistic textual entailment. In Quinonero-Candela, J.; Dagan, I.; Magnini, B.; d'Alché-Buc, F. (Eds.) Machine Learning Challenges. Lecture Notes in Computer Science , Vol. 3944, pp. 287-298, Springer, 2006. [pdf]
  9. Ido Dagan, Roy Bar-Haim, Idan Szpektor, Iddo Greental and Eyal Shnarch. Natural Language as the Basis for Meaning Representation and Inference. In: A. Gelbukh (Ed.) Computational Linguistics and Intelligent Text Processing, Lecture Notes in Computer Science 4919: 151-170, Springer, 2008. . [pdf]

Conferences and Workshops

  1. Dagan, Ido and Alon Itai. Automatic Acquisition of Constraints for the Resolution of Anaphora References and Syntactic Ambiguities, in Proceedings of COLING, 1990, pp. 330-332.
  2. Dagan, Ido and Alon Itai. A Statistical Filter for Resolving Pronoun References, in Y. A. Feldman and A. Bruckstein (Eds.), Artificial Intelligence and Computer Vision, Elsevier Science Publishers B.V., 1991, pp. 125-135 (Proceedings of the 7th Israeli Symposium on Artificial Intelligence and Computer Vision, 1990).
  3. Dagan, Ido, Alon Itai and Ulrike Schwall. Two languages are more informative than one, in Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 1991, pp. 130-137. [pdf]
  4. Dagan, Ido. Lexical disambiguation: Information sources and their statistical realization, in Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL) (Student Session), 1991, pp. 341-342.
  5. Rackow, Ulrike, Ido Dagan and Ulrike Schwall. Automatic translation of noun compounds, in Proceedings of COLING, 1992, pp. 1249-1253. [pdf]
  6. Dagan, Ido, Shaul Marcus and Shaul Markovitch. Contextual word similarity and estimation from sparse data, in Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 1993, pp. 164-171. [pdf]
  7. Dagan, Ido, Kenneth Church and William Gale. Robust bilingual word alignment for machine aided translation, in Proceedings of the Workshop on Very Large Corpora (WVLC), 1993, pp. 1-8.
  8. Dagan, Ido, John Justeson, Shalom Lappin Herbert Leass and Amnon Ribak. Syntax and lexical statistics in anaphora resolution, Bar-Ilan Symposium on Foundations of AI, 1993.
  9. Dagan, Ido, Fernando Pereira and Lillian Lee. Similarity-based estimation of word cooccurrence probabilities, in Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 1994, pp. 272-278.
  10. Dagan, Ido and Kenneth Church. Termight: Identifying and translating technical terminology, in Proceedings of the 4th Conference on Applied Natural Language Processing (ANLP), 1994, pp. 34-40. [pdf]
  11. Dagan, Ido and Sean Engelson. Committee-based sampling for training probabilistic classifiers, in Proceedings of the Twelfth International Conference on Machine Learning (ICML), 1995.
  12. Dagan, Ido and Sean Engelson. Selective sampling in natural language learning, in Proceedings of the IJCAI Workshop on New Approaches to Learning for Natural Language Processing, 1995, pp. 41-48.
  13. Feldman, Ronen and Ido Dagan. KDT - Knowledge Discovery in Texts, in Proceedings of the First International Conference on Knowledge Discovery (KDD), 1995, pp. 112-117. [ps]
  14. Feldman, Ronen and Ido Dagan. Knowledge Discovery in Textual Databases, in Proceedings of the ECML Workshop in Knowledge Discovery, 1995.
  15. Engelson, Sean and Ido Dagan. Minimizing Manual Annotation Cost in Supervised Training from Corpora, in Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 1996, pp. 319-326. [pdf]
  16. Dagan, Ido, Ronen Feldman and Haym Hirsh. Keyword-Based Browsing and Analysis of Large Document Sets, in Proceedings of The Fifth Annual Symposium on Document Analysis and Information Retrieval (SDAIR), 1996, pp. 191-208. [pdf]
  17. Feldman, Ronen, Ido Dagan and Willi Kloesgen. Efficient algorithms for mining and manipulating associations in texts, in Proceedings of the Thirteenth European Meeting on Cybernetics and Systems Research (EMCSR), 1996.
  18. Dagan, Ido, Lillian Lee and Fernando Pereira. Similarity-based methods for word sense disambiguation, in Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 1997, pp 56-63. [ps]
  19. Dagan, Ido, Yael Karov and Dan Roth. Mistake-driven learning in text categorization, in Proceedings of Second Conference on Empirical Methods in Natural Language Processing (EMNLP-2), 1997.
  20. Yamazaki, Takefumi and Ido Dagan. Mistake-driven learning with thesaurus for text categorization, in Proceedings of the Natural Language Pacific Rim Symposium (NLPRS-97), 1997.
  21. Argamon, Shlomo, Ido Dagan and Yuval Krymolowsky. Memory-based learning of shallow natural language patterns, in Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 1998. [ps]
  22. Marx, Zvi, Ido Dagan and Eli Shamir. Detecting Sub-Topic Correspondence through Bipartite Term Clustering, in Proceedings of the ACL-1999 Workshop on Unsupervised Learning in Natural Language Processing, 1999, pp. 45-51. [ps]
  23. Krymolowski, Yuval and Ido Dagan. Compositional Memory-Based Partial Parsing, in Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2000, pp. 45-52.
  24. Marx, Zvika, Ido Dagan, Joachim M. Buhmann. Coupled Clustering: a method for detecting structural correspondence, in Proceedings of the Eighteenth International Conference on Machine Learning (ICML), 2001, pp.353–360. [pdf]
  25. Marx, Zvika, Ido Dagan and Eli Shamir. Cross-component clustering for template induction, in Proceedings of the ICML Workshop on Text Learning (TextML), 2002, pp. 66-75. [pdf]
  26. Dagan, Ido, Zvika Marx and Eli Shamir. Cross-dataset clustering: revealing corresponding Themes Across Multiple Corpora, in Proceedings of the Sixth Conference on Natural Language Learning (CoNLL), 2002, pp. 15-21. [doc]
  27. Koppel, Moshe, Navot Akiva and Ido Dagan. A Corpus-Independent Feature Set for Style Based Text Categorization, in Proceedings of IJCAI'03 Workshop on Computational Approaches to Style Analysis and Synthesis, Acapulco, Mexico, 2003. [pdf]
  28. Glickman, Oren and Ido Dagan. Identifying Lexical Paraphrases from a Single Corpus: A Case Study for Verbs, in Proceedings of Recent Advantages in Natural Language Processing (RANLP '03), 2003. [pdf]
  29. Marx Z., Dagan I. and Shamir E. (2004). Identifying structure across pre-partitioned data. In Thrun S., Saul L., and Scho"lkopf B. (eds.), Advances in Neural Information Processing Systems 16 (NIPS 2003), December 8-13, Vancouver, Canada. [pdf]
  30. Ido Dagan and Oren Glickman. 2004. Probabilistic textual entailment: Generic applied modeling of language variability. In PASCAL Workshop on Learning Methods for Text Understanding and Mining, Grenoble. [pdf]
  31. Idan Szpektor, Hristo Tanev, Ido Dagan and Bonaventura Coppola. Scaling Web-based Acquisition of Entailment Relations. Proceedings of Empirical Methods in Natural Language Processing (EMNLP), 2004. [pdf]
  32. Maayan Geffet and Ido Dagan. Feature Vector Quality and Distributional Similarity. Proceedings of The 20th International Conference on Computational Linguistics (COLING), 2004. [pdf]
  33. Maayan Geffet and Ido Dagan. The Distributional Inclusion Hypotheses and Lexical Entailment. Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL), 2005. [pdf]
  34. Oren Glickman and Ido Dagan. A Probabilistic Setting and Lexical Cooccurrence Model for Textual Entailment. Proceedings of ACL Workshop on Empirical Modeling of Semantic Equivalence and Entailment, 2005. [pdf]
  35. Oren Glickman, Ido Dagan and Moshe Koppel. A Probabilistic Classification Approach for Lexical Textual Entailment. Proceedings of the 20th National Conference on Artificial Intelligence (AAAI), 2005. [pdf]
  36. Oren Glickman, Ido Dagan and Moshe Koppel. A Probabilistic Lexical Approach to Textual Entailment. Proceedings of the International Joint Conferences on Artificial Intelligence (IJCAI), 2005. [pdf]
  37. Alfio Gliozzo, Carlo Strapparava, and Ido Dagan. Investigating Unsupervised Learning for Text Categorization Bootstrapping. Proceedings of the joint Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP), 2005. [pdf]
  38. Zvika Marx, Ido Dagan and Eli Shamir. A Generalized Framework for Revealing Analogous Themes across Related Topics. Proceedings of the joint Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP), 2005. [pdf]
  39. Ido Dagan, Oren Glickman, Alfio Gliozzo, Efrat Marmorshtein and Carlo Strapparava. Direct Word Sense Matching for Lexical Substitution. Proceedings of COLING-ACL 2006, 17-21 Jul 2006, Sydney, Australia. [pdf]
  40. Shachar Mirkin, Ido Dagan and Maayan Geffet. Integrating Pattern-based and Distributional Similarity Methods for Lexical Entailment Acquisition. Proceedings of COLING-ACL 2006, 17-21 Jul 2006, Sydney, Australia. [pdf]
  41. Lorenza Romano, Milen Kouylekov, Idan Szpektor, Ido Dagan and Alberto Lavelli. Investigating a Generic Paraphrase-based Approach for Relation Extraction. Proceedings of EACL 2006, 5-7 April 2006, Trento, Italy. [pdf]
  42. Oren Glickman, Ido Dagan, Mikaela Keller, Samy Bengio and Walter Daelemans. Investigating Lexical Substitution Scoring for Subtitle Generation. Proceedings of CoNLL-X, 8-9 Jun 2006, New York City, USA. [pdf]
  43. Oren Glickman, Ido Dagan and Eyal Shnarch. Lexical Reference: a Semantic Matching Subtask. Proceedings of EMNLP 2006, 22-23 Jul 2006, Sydney, Australia. [pdf]
  44. Roy Bar-Haim, Ido Dagan, Iddo Greental and Eyal Shnarch. Semantic Inference at the Lexical-Syntactic Level. Proceedings of the 22nd National Conference on Artificial Intelligence (AAAI), July 2007, Vancouver, Canada. [pdf]
  45. Idan Szpektor, Ido Dagan, Alon Lavie, Danny Shacham and and Shuly Wintner. Cross Lingual and Semantic Retrieval for Cultural Heritage Appreciation. Proceedings of the ACL Workshop on Language Technology for Cultural Heritage Data (LaTeCH), June 2007, Prague, Czech Republic. [pdf]
  46. Idan Szpektor, Eyal Shnarch and Ido Dagan. Instance-based Evaluation of Entailment Rule Acquisition. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL), June 2007, Prague, Czech Republic. [pdf]
  47. Roy Bar-Haim, Ido Dagan, Iddo Greental, Idan Szpektor and Moshe Friedman. Semantic Inference at the Lexical-Syntactic Level for Textual Entailment Rocognition. Proceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing, June 2007, Prague, Czech Republic. [pdf]
  48. Idan Szpektor and Ido Dagan. Learning Canonical Forms of Entailment Rules. Proceedings of the International Conference Recent Advantages in Natural Language Processing (RANLP), September 2007, Bulgaria. [pdf]
  49. Idan Szpektor, Ido Dagan, Roy Bar-Haim and Jacob Goldberger. 2008. Contextual Preferences. In Proceedings of ACL 2008. [pdf]
  50. Idan Szpektor and Ido Dagan. 2008. Learning Entailment Rules for Unary Templates. Accepted to COLING 2008 as a full oral paper. [pdf]
  51. Roy Bar Haim, Ido Dagan, Bill Dolan, Lisa Ferro, Danilo Giampiccolo, Bernardo Magnini and Idan Szpektor. The Second PASCAL Recognising Textual Entailment Challenge. Proceedings of The Second PASCAL Recognising Textual Entailment Challenge, 10 April 2006, Venice, Italy. [pdf]
  52. Danilo Giampiccolo; Bernardo Magnini; Ido Dagan; Bill Dolan. The Third PASCAL Recognizing Textual Entailment Challenge. Proceedings of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing, June 2007, Prague, Czech Republic. [pdf]
  53. Roy Bar-Haim, Jonathan Berant, Ido Dagan, Iddo Greental, Shachar Mirkin, Eyal Shnarch and Idan Szpektor. Efficient Semantic Deduction and Approximate Matching over Compact Parse Forests. In Proceedings of Text Analysis Conference (TAC), 2009. [pdf]
  54. Libby Barak, Ido Dagan, Eyal Shnarch. Text Categorization from Category Name via Lexical Reference. In Proceedings of North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT), 2009. [pdf]
  55. Shachar Mirkin, Ido Dagan, Eyal Shnarch. Evaluating the Inferential Utility of Lexical-Semantic Resources. 2009. EACL. Athens, Greece.  [pdf]
  56. Jonathan Berant, Ido Dagan and Jacob Goldberger. Global Learning of Focused Entailment Graphs. ACL 2010. [pdf]
  57. Shachar Mirkin, Jonathan Berant, Ido Dagan and Eyal Shnarch. 2010. Recognising Entailment within Discourse. COLING. [pdf]
  58. Shachar Mirkin, Ido Dagan and Sebastian Padó. 2010. Assessing the Role of Discourse References in Entailment Inference. ACL.  [pdf]
  59. Wilker Aziz, Marc Dymetmany, Shachar Mirkin, Lucia Specia, Nicola Cancedda and Ido Dagan. 2010. Learning an Expert from Human Annotations in Statistical Machine Translation: the Case of Out-of-Vocabulary Words. EAMT. [pdf]
  60. Azad Abad, Luisa Bentivogli, Ido Dagan, Danilo Giampiccolo, Shachar Mirkin, Emanuele Pianta and Asher Stern. 2010. A Resource for Investigating the Impact of Anaphora and Coreference on Inference. LREC.  [pdf]
  61. Roni Ben Aharon, Idan Szpektor and Ido Dagan. 2010. Generating Entailment Rules from FrameNet. ACL.  [pdf]