Pushpak Bhattacharyya


  1. Name and Designation:
    Dr. Pushpak Bhattacharyya, Director (IIT Patna) and Vijay and Sita Vashee Chair Professor (Department of Computer Science and Engineering, IIT Bombay)
  2. Organization: IIT Patna and IIT Bombay
  3. Educational Qualification (Bachelor to Doctoral):
    • B. Tech. - 1984, Electrical Engineering, Indian Institute of Technology, Kharagpur, INDIA
    • M. Tech. – 1986, Computer Science and Engineering, Indian Institute of Technology Kanpur, INDIA
    • Visiting Research Fellow – 1990, Computer Science and Engineering, Massachusetts Institute of Technology, Cambridge, USA
    • Ph. D. – 1994, Computer Science and Engineering, Indian Institute of Technology Bombay, INDIA
  4. Professional Experience:

    Dr. Bhattacharyya has made seminal contributions in Natural Language Processing (NLP) and Machine Learning (ML), working in these fields for last 25 years. In addition to being Professor at Dept. of CSE in IIT Bombay, he has been a visiting Professor at Stanford University (2004), University Joseph Fourier (2005, 2009, 2011, 2014), Distinguished lecturer at University of Houston, USA (2012), and Visiting Scholar- MIT (1990). Currently he is the President of Association of Computational Linguistics (ACL), the international apex body in NLP.  Dr. Bhattacharyya has carried out sponsored research projects on various NLP technologies, funded by the Ministry of Communication and Information Technology, Ministry of Human Resource Development and by IBM, Microsoft Research, Yahoo, HP Labs and NEC. He has written a text book called “Machine Translation”, published by CRC Press, USA (Taylor and Francis group) which brings to light foundational points pertaining to translation of Indian languages. Google Scholar Citation which is the benchmark for Computer Science and Engineering shows approx.. 2300 career citations to Dr. Bhattacharyya’s papers with an h-index of 26.

  5. Specialization and Expertise:

    Artificial Intelligence, Natural Language Processing (NLP), Machine Learning (ML), Cross Lingual IR, Information Extraction

  6. Awards and Distinctions:
    1. FNAE: Elected Fellow of Indian National Academy of Engineering, 2015
    2. President of Association of Computational Linguistics (ACL), the highest international body overseeing Natural Language Processing and related Machine learning activities in the world
    3. Technology Development awards from IIT Roorkee and IIT Bombay: VNMM 2014 and Patwardhan 2007, respectively
    4. Ministry of IT, India awards for technology development, Manthan Award- 2008
    5. Multiple periodic Faculty grant awards from IBM, Microsoft, Yahoo, Accenture, and Elsevier
    6. Speaker as expert in multilingual computation in the prestigious Dagstuhl Seminar, Germany in 2012
    7. United Nations Research Grant: 1996
  7. Top ten publications in last ten years:
    1. Abhijit Mishra, Kanojia, Kuntal Dey, Seema Nagar and Pushpak Bhattacharyya, Harnessing Cognitive Features for Sarcasm Detection, ACL 2016, Berlin, Germany, August 8-10, 2016.
    2. Abhijit Mishra, Diptesh Kanojia and Pushpak Bhattacharyya, Predicting Readers' Sarcasm Understandability by Modelling Gaze Behaviour, AAAI 2016, Phoenix, USA, Feb 12-17, 2016.
    3. Pushpak Bhattacharyya, Multilingual Projections, book chapter in Language Production, Cognition and the Lexicon (ed. Nuria Gala et al), Springer, Switzerland, 2015.
    4. Aditya Joshi, Vinita Sharma and Pushpak Bhattacharyya, Harnessing Context Incongruity for Sarcasm Detection, ACL 2015, Beijing, China, July 26-31, 2015.
    5. Anoop Kunchukuttan, Abhijit Mishra, Rajen Chatterjee, Ritesh Shah and Pushpak Bhattacharyya, Shata-Anuvadak: Tackling Multiway Translation of Indian Languages, LREC 2014, Rekjyavik, Iceland, 26-31 May, 2014
    6. Kashyap Popat, Balamurali A.R, Pushpak Bhattacharyya and Gholamreza Haffari, , ACL 2013, Sofia, Bulgaria, 4-9 August, 2013
    7. A.Kumaran, Mitesh Khapra and Pushpak Bhattacharyya, Compositional Machine Transliteration, ACM Transactions on Asian Language Information Processing (TALIP Journal), Vol. 9, Issue 4, December 2010.
    8. Manoj Chinnakotla, Karthik Raman and Pushpak Bhattacharyya, Multilingual PRF: English Lends a Helping HandSIGIR 2010, Geneva, Switzerland, July, 201
    9. Pushpak Bhattacharyya, IndoWordNet, Lexical Resources Engineering Conference 2010 (LREC 2010), Malta, May, 2010
    10. Ananthakrishnan Ramanathan, Hansraj Choudhary, Avishek Ghosh, Pushpak Bhattacharyya, August 2009, Case markers and morphology: addressing the crux of the fluency problem in English-Hindi SMT, ACL-IJCNLP 2009, Singapore.

      (More information: https://www.cse.iitb.ac.in/~pb/pubs-yearwise.html)

  8. Summary of research output (papers, patents, technology development):

    Prof. Bhattacharyya has co-authored approx. 230 research papers since 2006, in leading conferences, journals and other publications. He has contributed, not only to theoretical research in NLP, but also to the practical applications. His research spans lexical resources (like Wordnets), annotation tools, and NLP systems. NLP systems developed by him include Shata-Anuvadak (a translation engine from-and-to 110 different Indian languages), Sentiment Analysis Engines, and so on. Details of the research output are in Section 10.

  9. Major sponsored projects completed/ongoing:
    1. Sampark: Indian Language to Indian Language Machine Translation system, DEITY
    2. Sandhan: Cross Lingual Information Access (CLIA) - Phase II, DEITY
    3. Anuvadaksh: English to Indian Language Machine Translation (EILMT), DEITY
    4. LIS: Laboratory for Intelligent Systems, Tata Consultancy Services
    5. Indowordnet: Development and Integration of Wordnets of Indian Languages, DEITY
    6. Text Mining: Elsevier, Netherlands
    7. Artificial Intelligence in Question Answering, Virtual Agents etc.: Accenture Global R & D
    8. (More information: https://www.cse.iitb.ac.in/~pb/sponsor.html)

  10. Technology development/translation/initiation:
    1. Search Engine:  Sandhan (URL: http://www.tdil-dc,in/sandhan)
    2. Translation Engines: (a) Shata-Anuvadak,  (b) English to Indian Language Machine Translation (EILMT) SMT System for “English to Marathi”, (c) Indian to Indian Language Machine Translation (IL-IL-MT) system for “Marathi to Hindi” and “Hindi to Marathi”, (d) Brahmi-Net (Indian language transliteration system)
    3. Lexical Resources:  (a) Hindi, Marathi and Sanskrit WordNets, (b) Indradhanush: Integrated Wordnet for Bengali, Gujarati, Kashmiri,Konkani, Oriya,Punjabi & Urdu, (c) IndoWordNet (includes Hindi, Marathi and Sanskrit WordNets), (d) Dravidian Wordnet: Kannad, Malayalam, Tamil, Telugu, (e) Development and Integration of Wordnets of North East Languages, (f) Universal-Hindi Dictionary
  11. Any other relevant information:

    Professional Service:

    1. Associate Editor- ACM Transaction on Asian Language Information Processing (TALIP) 2010-16,
    2. Editorship, Program Chair, Program Committee member roles for top journals and conferences of the world (TALIP, ACL. COLING, EMNLP, NAACL, LREC, SIGIR, CIKM etc.),
    3. Organizing Chair - Computation Linguistics Conference (COLING) in 2012 at IIT Bombay (first time to take place in India).

    Advisorship: PhD: ongoing 14; completed 17, Masters and Undergraduate advisorship (more than 150)