- Assistant Professor, Department of Computer Science and Engineering, NIT Silchar, India – 2nd July 2018 – till date [Current Position]
- Visiting Assistant Professor, Department of CSE, IIIT Senapati, Manipur – 2nd May 2016 – 29th June 2018
- Lecturer, Dept of Computer Science, St. Anthony’s College, Shillong. 10th Aug 1998 – 18th Nov 2007
- Guest Lecturer, Dept of CSE, NERIST, Itanagar. 21st Aug 1997 – 6th May 1998.
- Post Doctoral Fellow Dept of Computer Science, University of Houston, USA. 2015 – 2016
- Post Doctoral Research Fellow, Dept of Computer Science, School of Computing, National University of Singapore (NUS), Singapore. 2013 – 2014
- Senior Staff Scientist/Senior Technical Officer, CDAC Mumbai (Formerly NCST). 20th Nov 2007 – 31st Aug 2013
ACADEMIC QUALIFICATIONS (FULL DETAILS )
- B. Tech.(Computer Science and Engineering) (NERIST)
- Ph.D. (Engg) (Jadavpur)
- PostDoc (NUS, Singapore), PostDoc(UH, USA)
DBLP: CLICK HERE
Google Scholar: CLICK HERE
Linkedin: CLICK HERE
SHORT BIO SKETCH
Dr. Thoudam Doren Singh was born in the state of Manipur in the northeast part of India. He attended Toubul High School for his early schooling. Currently, he is an Assistant Professor in the Department of Computer Science and Engineering at NIT Silchar, India. He was a research visitor at Universität des Saarlandes, Saarbrücken, Germany as part of SPARC project. He also worked as Visiting Assistant Professor, Department of Computer Science and Engineering, IIIT Senapati, Manipur from May 2016 – June 2018. Prior to this, he was a Post Doctoral Researcher, Department of Computer Science at University of Houston, TX, USA from March 2015 to March 2016 after a stint as Post Doctoral Research Fellow at Department of Computer Science, School of Computing, National University of Singapore, Singapore from September 2013 to July 2014. Earlier, he served as Senior Staff Scientist later re-designated as Senior Technical Officer at Centre for Development of Advanced Computing (CDAC) – formerly known as NCST, Mumbai, India from November 2007 to August 2013 after working as Lecturer of Dept of Computer Science, St. Anthonys College, Shillong, India from August 1998 to November 2007. His first service was Guest Lecturer of Dept of CSE, NERIST, Itanagar, India from August 1997 to May 1998.
AREA OF INTEREST AND SPECIALISATION (INCLUDING RESEARCH AREA)
- Human Language Technology
- Applied Machine Learning
- Big Data
- Cloud Security
- Social Media Analytics
RESEARCH PUBLICATIONS
INTERNATIONAL JOURNAL
- Loitongbam Sanayai Meetei, Thoudam Doren Singh, Sivaji Bandyopadhyay, An empirical study of a novel multimodal dataset for low-resource machine translation. Knowl Inf Syst (2024). https://doi.org/10.1007/s10115-024-02087-6 (SCIE)
- Rinki Das, Thoudam Doren Singh, Which words are important?: an empirical study of Assamese sentiment analysis. Lang Resources & Evaluation (2024). https://doi.org/10.1007/s10579-024-09756-6 (SCIE)
- Saurav Kumar, Mrinmoy Mondal, Tanuja Dutta, Thoudam Doren Singh. Cyberbullying detection in Hinglish comments from social media using machine learning techniques. Multimed Tools Appl (2024). https://doi.org/10.1007/s11042-024-19031-z (SCIE)
- Aiusha Vellintihun Hujon, Thoudam Doren Singh, Khwairakpam Amitab, Neural machine translation systems for English to Khasi: A case study of an Austroasiatic language, Expert Systems with Applications, Volume 238, Part A, 2024, 121813, https://doi.org/10.1016/j.eswa.2023.121813 (SCIE)
- Nongmaithem Nandini Devi, Surmila Thokchom, Thoudam Doren Singh, Gayadhar Panda, and Ramasamy Thaiyal Naayagi. 2023. Multi-Stage Bargaining of Smart Grid Energy Trading Based on Cooperative Game Theory, Energies 16, no. 11: 4278. https://doi.org/10.3390/en16114278 (SCIE)
- Loitongbam Sanayai Meetei, Thoudam Doren Singh, Sivaji Bandyopadhyay. Exploiting multiple correlated modalities can enhance low-resource machine translation quality. Multimed Tools Appl (2023). https://doi.org/10.1007/s11042-023-15721-2 (SCIE)
- Loitongbam Sanayai Meetei, Alok Singh, Thoudam Doren Singh, Sivaji Bandyopadhyay. Do cues in a video help in handling rare words in a machine translation system under a low-resource setting?, Natural Language Processing Journal, Volume 3, (2023), https://doi.org/10.1016/j.nlp.2023.100016
- Ringki Das and Thoudam Doren Singh. Multimodal Sentiment Analysis: A Survey of Methods, Trends and Challenges. ACM Computing Surveys. (2023) https://doi.org/10.1145/3586075 . (SCI Journal)
- Ringki Das and Thoudam Doren Singh. Image-Text Multimodal Sentiment Analysis Framework of Assamese News Articles using Late Fusion. ACM Transactions on Asian and Low-Resource Language Information Processing (ACM-TALLIP) (2023). https://doi.org/10.1145/3584861 (SCIE journal)
- Ringki Das and Thoudam Doren Singh. A multi-stage multimodal framework for sentiment analysis of Assamese in low resource setting. Expert Systems with Applications. Vol. 204. (2022). https://doi.org/10.1016/j.eswa.2022.117575. (SCIE Journal)
- Ringki Das and Thoudam Doren Singh. Assamese news image caption generation using attention mechanism. Multimed Tools Appl 81, 10051–10069 (2022). https://doi.org/10.1007/s11042-022-12042-8 (SCIE journal)
- Ringki Das and Thoudam Doren Singh. A hybrid fusion-based machine learning framework to improve sentiment prediction of Assamese in low resource setting. Multimed Tools Appl (2023). https://doi.org/10.1007/s11042-023-15356-3 (SCIE journal)
- Salam Michael Singh and Thoudam Doren Singh. An empirical study of low-resource neural machine translation of manipuri in multilingual settings. Neural Comput & Applic (2022). https://doi.org/10.1007/s00521-022-07337-8 (SCIE journal)
- Salam Michael Singh and Thoudam Doren Singh. Low resource machine translation of english–manipuri: A semi-supervised approach, Expert Systems with Applications, (2022), https://doi.org/10.1016/j.eswa.2022.118187 (SCIE journal)
- Alok Singh, Thoudam Doren Singh, Sivaji Bandyopadhyay. V2T: video to text framework using a novel automatic shot boundary detection algorithm. Multimed Tools Appl (2022). https://doi.org/10.1007/s11042-022-12343-y (SCIE journal)
- Alok Singh, Thoudam Doren Singh, Sivaji Bandyopadhyay. An encoder-decoder based framework for Hindi image caption generation. Multimed Tools Appl (2021). https://doi.org/10.1007/s11042-021-11106-5 (SCIE journal)
- Alok Singh, Thoudam Doren Singh, Sivaji Bandyopadhyay. Attention based video captioning framework for Hindi. Multimedia Systems (2021). https://doi.org/10.1007/s00530-021-00816-3 (SCI journal)
- Loitongbam Sanayai Meetei, Thoudam Doren Singh, Borgohain, S.K. et al. Low resource language specific pre-processing and features for sentiment analysis task. Lang Resources & Evaluation (2021). https://doi.org/10.1007/s10579-021-09541-9 (SCIE journal)
- Thoudam Doren Singh, Abdullah Faiz Ur Rahman Khilji, Divyansha, Apoorva Vikram Singh, Surmila Thokchom, Sivaji Bandyopadhyay, Predictive Approaches for the UNIX Command Line: Curating and Exploiting Domain Knowledge in Semantics Deficit Data, Multimed Tools Appl (2020). https://doi.org/10.1007/s11042-020-10109-y (SCIE journal)
- Sabuzima Nayak, Ripon Patgiri, Thoudam Doren Singh, Big computing: Where are we heading?, EAI Endorsed Transactions on Scalable Information Systems, 2020, Page 1-10, doi:10.4108/eai.13-7-2018.163972 (Web of Science)
- Aiusha Vellintihun Hujon, Thoudam Doren Singh, Existing English to Khasi Translated Documents for Parallel Corpora Development : A Survey. International Journal on Natural Language Computing (IJNLC), Vol 7, Issue 5, Pages 81-91, 2018 https://doi.org/10.5121/IJNLC.2018.7508
- Thoudam Doren Singh, Building Parallel Corpora for SMT System: A Case Study of English-Manipuri. International Journal of Computer Applications 52(14):47-51, ISSN 0975-8887, August 2012. Published by Foundation of Computer Science, New York, USA, Pages 47-51, 2012. https://doi.org/10.5120/8274-1876
- Thoudam Doren Singh and Sivaji Bandyopadhyay, Bidirectional Statistical Machine Translation of Manipuri English Language Pair using Morpho-Syntactic and Dependency Relations, International Journal of Translation, (Vol. 23, No. 1, Jan-Jun 2011), ISSN 0970-9819, 2011
- Thoudam Doren Singh and Sivaji Bandyopadhyay, Semi Automatic Parallel Corpora Extraction from Comparable News Corpora, In the International Journal of POLIBITS, Issue 41 (January – June 2010), ISSN 1870-9044, Pages 11-17, 2010. http://doi.org/10.17562/PB-41-2
- Thoudam Doren Singh and Sivaji Bandyopadhyay, Manipuri-English Example Based Machine Translation System, International Journal of Computational Linguistics and Applications (IJCLA) (ed.) A. Gelbukh, ISSN 0976-0962, Pages 147-158, 2010.
INTERNATIONAL CONFERENCE/WORKSHOPS
- Ringki Das and Thoudam Doren Singh. Encoder-Decoder Based Image Caption Generation Framework for Assamese. Proceedings of the 18th International Conference on Natural Language Processing (ICON-2021), https://aclanthology.org/2021.icon-main.28.pdf
- Ringki Das and Thoudam Doren Singh. A Step Towards Sentiment Analysis of Assamese News Articles Using Lexical Features. Proceedings of the International Conference on Computing and Communication Systems(2021) . Lecture Notes in Networks and Systems, vol 170. Springer, Singapore. https://doi.org/10.1007/978-981-33-4084-8_2
- Thoudam Doren Singh, Divyansha, A. V. Singh, A. Sachan and A. F. U. R. Khilji, “Debunking Fake News by Leveraging Speaker Credibility and BERT Based Model,” 2020 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), 2020, pp. 960-968, doi: 10.1109/WIIAT50758.2020.00147
- Loitongbam Sanayai Meetei, Thoudam Doren Singh, Sivaji Bandyopadhyay, Mihaela Vela and Josef van Genabith, English to Manipuri and Mizo Post-Editing Effort and its Impact on Low Resource Machine Translation, Proceedings of the 17th International Conference on Natural Language Processing (ICON), 2020, Pages 50–59, IIT Patna, India, https://aclanthology.org/2020.icon-main.7.pdf
- Salam Michael Singh and Thoudam Doren Singh. 2020. Unsupervised Neural Machine Translation for English and Manipuri. In the third Workshop on Technologies for MT of Low Resource Languages (LoResMT 2020), In Conjunction with AACL-IJCNLP-2020, Pages 69–78, https://aclanthology.org/2020.loresmt-1.10.pdf
- Salam Michael Singh, Thoudam Doren Singh and Sivaji Bandyopadhyay, The NITS-CNLP System for the Unsupervised MT Task at WMT 2020, WMT -2020 in Conjuction with EMNLP -2020
- Subhra Jyoti Baroi, Nivedita Singh, Ringki Das and Thoudam Doren Singh, NITS-Hinglish-SentiMix at SemEval-2020 Task 9: Sentiment Analysis ForCode-Mixed Social Media Text Using an Ensemble Model, 14th SemEval, COLING, Dec. 2020.
- Alok Singh, Thoudam Doren Singh and Sivaji Bandyopadhyay, NITS-VC System for VATEX Video Captioning Challenge 2020, (LVVU 2020) – In conjunction with CVPR 2020
- Thoudam Doren Singh, Divyansha, Apoorva Vikram Singh, Abdullah Faiz Ur Rahman Khilji, A Hybrid Classification Approach using Topic Modeling and Graph Convolution Networks, 2020 International Conference on Computational Performance Evaluation (ComPE), Pages 285-289, IEEE
- Thoudam Doren Singh, Aiusha Vellintihun Hujon, Low Resource and Domain Specific English to Khasi SMT and NMT Systems, 2020 International Conference on Computational Performance Evaluation (ComPE), Pages 733-737, IEEE
- Loitongbam Sanayai Meetei, Thoudam Doren Singh and Sivaji Bandyopadhyay, WAT2019: English-Hindi Translation on Hindi Visual Genome Dataset, Proceedings of the 6th Workshop on Asian Translation, pages 181–188, Hong Kong, China, November 4, 2019.
- Mirinso Shadang, Navanath Saharia, Thoudam Doren Singh, Towards the study of morphological processing of the Tangkhul language, In proceeding of Regional International Conference on Natural Language Processing (regICON) 2017, 3rd and 4th Novmber 2017, Imphal, India
- Thoudam Doren Singh, An Empirical Study of Diversity of Word Alignment and its Symmetrization Techniques for System Combination, In proceedings of the Twelfth International Conference on Natural Language Processing (ICON-2015), IIITM-Kerala, Trivandrum, India, December 11-14, 2015.
- Thoudam Doren Singh, Taste of Two Different Flavours: Which Manipuri Script works better for English-Manipuri Language pair SMT Systems? In proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST-7), NAACL HLT 2013 / SIGMT / SIGLEX Workshop 13 June 2013, Atlanta, GA, USA, Pages 11-18
- Thoudam Doren Singh , Bidirectional Bengali Script and Meetei Mayek Transliteration of Web Based Manipuri News Corpus, In the Proceedings of the 3rd Workshop on South and Southeast Asian Natural Language Processing (SANLP) of COLING 2012, IIT Bombay, Mumbai, India, pages 181-189, 8th December, 2012
- Thoudam Doren Singh, Addressing some Issues of Data Sparsity towards Improving English-Manipuri SMT using Morphological Information, In proceedings of The Tenth Biennial Conference of the Association for Machine Translation in the America (AMTA 2012), San Diego, USA, Pages 46-54, 1st November, 2012.
- Thoudam Doren Singh and Sivaji Bandyopadhyay, Integration of Reduplicated Multiword Expressions and Named Entities in a Phrase Based Statistical Machine Translation System, In proceedings of International Joint Conference on Natural Language Processing (IJCNLP), Chiang Mai, Thailand, Pages 1304-1312, November 2011.
- Thoudam Doren Singh and Sivaji Bandyopadhyay, Web Based Manipuri Corpus for Multiword NER and Reduplicated MWEs Identification using SVM, In Proceeding of the 1st Workshop on South and Southeast Asian Natural Language Processing (WSSANLP) of 23rd International Conference on Computational Linguistics (COLING), Beijing, Pages 35-42, August 2010.
- Thoudam Doren Singh and Sivaji Bandyopadhyay, Manipuri-English Bidirectional Statistical Machine Translation Systems using Morphology and Dependency Relations, In Proceeding of Syntax and Structure in Statistical Translation (SSST-4) of 23rd International Conference on Computational Linguistics (COLING), SIGMT / SIGLEX Workshop, Beijing, Pages 75-83, August 2010.
- Thoudam Doren Singh and Sivaji Bandyopadhyay, Statistical Machine Translation of English-Manipuri using Morpho-Syntactic and Semantic Information, In the proceedings of Ninth Conference of the Association for Machine Translation in Americas (AMTA 2010), Denver, Colorado, USA, Pages 333-340, 2010. [SCOPUS]
- Thoudam Doren Singh, Yengkhom Ranjan Singh and Sivaji Bandyopadhyay, Manipuri-English Semi Automatic Parallel Corpora Extraction from Web, In proceedings of 23rd International Conference on the Computer Processing of Oriental Languages (ICCPOL 2010) – New Generation in Asian Information Processing , San Francisco Bay, CA, USA, Pages 45-48, 2010.
- Thoudam Doren Singh, Kishorjit Nongmeikapam, Asif Ekbal, Sivaji Bandyopadhyay, Named Entity Recognition for Manipuri using Support Vector Machine, In proceedings of 23rd Pacific Asia Conference on Language, Information and Computation (PACLIC 23), Hong Kong, Pages 811-818, 2009. [SCOPUS]
- Thoudam Doren Singh, Asif Ekbal, Sivaji Bandyopadhyay, Manipuri POS Tagging Using CRF and SVM: A Language Independent Approach, In proceeding of 6th International conference on Natural Language Processing (ICON -2008), Pune, India, Pages 240-245, 2008.
- Thoudam Doren Singh and Sivaji Bandyopadhyay, Manipuri Morphological Analyzer, Platinum Jubilee International Conference of the LSI, Hyderabad, 6-8 December 2005.
NATIONAL CONFERENCE
- Thoudam Doren Singh and Sivaji Bandyopadhyay, Word Class and Sentence Type Identification in Manipuri Morphological Analyzer, In Proceedings of MSPIL, IIT Bombay, Pages 11-17, 2006.
BOOK CHAPTER
- Salam Michael Singh and Thoudam Doren Singh. (2021) Statistical and Neural Machine Translation Systems of English to Manipuri: A Preliminary Study. In: Reddy V.S., Prasad V.K., Wang J., Reddy K.T.V. (eds) Soft Computing and Signal Processing. Advances in Intelligent Systems and Computing, vol 1325. Springer, Singapore. https://doi.org/10.1007/978-981-33-6912-2_19 Pages 203-211, Print ISBN 978-981-33-6911-5
- Alok Singh, Loitongbam Sanayai Meetei, Thoudam Doren Singh, Sivaji Bandyopadhyay. (2021) Generation and Evaluation of Hindi Image Captions of Visual Genome. In: Proceedings of the International Conference on Computing and Communication Systems. Lecture Notes in Networks and Systems, vol 170. Springer, Singapore. https://doi.org/10.1007/978-981-33-4084-8_7 Pages 65-73, Print ISBN 978-981-33-4083-1
- Thoudam Doren Singh, Singh T.J., Shadang M., Thokchom S. (2021) Review Comments of Manipuri Online Video: Good, Bad or Ugly. In: Proceedings of the International Conference on Computing and Communication Systems. Lecture Notes in Networks and Systems, vol 170. Springer, Singapore. https://doi.org/10.1007/978-981-33-4084-8_5 Pages 45-53, Print ISBN 978-981-33-4083-1
- Ringki Das, Thoudam Doren Singh (2021) A Step Towards Sentiment Analysis of Assamese News Articles Using Lexical Features. In: Proceedings of the International Conference on Computing and Communication Systems. Lecture Notes in Networks and Systems, vol 170. Springer, Singapore. https://doi.org/10.1007/978-981-33-4084-8_2 Pages 15-23, Print ISBN 978-981-33-4083-1
- Anwesha Das, Thoudam Doren Singh (2021). Development of English-to-Bengali Neural Machine Translation Systems. In: Proceedings of the International Conference on Computing and Communication Systems. Lecture Notes in Networks and Systems, vol 170. Springer, Singapore. https://doi.org/10.1007/978-981-33-4084-8_6 Pages 55-64, Print ISBN 978-981-33-4083-1
- Loitongbam Sanayai Meetei, Ringki Das, Thoudam Doren Singh, Sivaji Bandyopadhyay. (2020) Automatic Extraction of Locations from News Articles Using Domain Knowledge. In: Big Data, Machine Learning, and Applications. BigDML 2019. Communications in Computer and Information Science, vol 1317. Springer, Cham. https://doi.org/10.1007/978-3-030-62625-9_4 Pages 36-47, Print ISBN 978-3-030-62624-2
- Loitongbam Sanayai Meetei, Thoudam Doren Singh, Sivaji Bandyopadhyay. (2019) Extraction and Identification of Manipuri and Mizo Texts from Scene and Document Images. In: Deka B., Maji P., Mitra S., Bhattacharyya D., Bora P., Pal S. (eds) Pattern Recognition and Machine Intelligence. PReMI 2019. Lecture Notes in Computer Science, vol 11941. Springer, Cham. https://doi.org/10.1007/978-3-030-34869-4_44 Pages 405-414, Print ISBN 978-3-030-34868-7
- Thoudam Doren Singh, Thamar Solorio (2018) Towards Translating Mixed-Code Comments from Social Media. In: Gelbukh A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2017. Lecture Notes in Computer Science, vol 10762. Springer, Cham, https://doi.org/10.1007/978-3-319-77116-8_34 Pages 457-468, Print ISBN 978-3-319-77115-1
Awards and Recognition:
- Asian-Nagao Fund Award for IJCNLP 2008 [Award number: D1010]
- Complimentary main conference registration and two-year AMTA membership [AMTA-SRW 2010]
- Post Doctoral Research Fellowship, NUS and CSIDM Phase 2: [R-252-100-372-490], National University of Singapore(NUS) [2013-2014]
- Post Doctoral Research Fellowship, University of Houston, TX, USA [Program Number: P-1-02650] [2015-2016]
Invited Talk(s):
- AICTE Sponsored Five Days Online Short Term Training Programme (STTP) On Artificial Intelligence and its Societal Applications, 22nd March-26th March 2021, NIT Meghalaya.
- TEQIP-III & NMHS Workshop on Data Science: Industry and Research Perspective, NIT Meghalaya, 29 Oct 2018 – 02 Nov 2018.
- TEQIP-III Workshop on Deep Learning and Applications, NIT Manipur, 12-16 March, 2018
- AICTE sponsored FDP on Machine Learning and Internet of Things (IoT), Tezpur University, 19-23 Feb, 2018
- TEQIP-III Workshop on NLP, IIIT Manipur, 13-15 Oct 2017.
Workshop/Conference Organised/Session Chair/Session Coordinator:
- Co-Chair, Machine Translation, ICON-2023, Goa University
- Session Chair, ICICSA-2022/2023, NIT Silchar
- Tutorial/Workshop Chair, ICON-2022, IIIT Delhi
- Panel Expert Member, 3rd Research Conclave, NIT Meghalaya
- Organizing Chair, ICON-2021: The eighteenth International Conference on Natural Language Processing, National Institute of Technology, Silchar, 16-19 December 2021.
- Organizer, MMTLRL-2021: First Workshop on Multimodal Machine Translation for Low Resource Languages, In conjunction with RANLP-2021, Varna, Bulgaria, 7 September 2021
- Co-ordinator, One Week AICTE sponsored Faculty. Development Program. (FDP) on “Recent Advances in. NLP using Deep. Learning”, (NLP-DL-2021), 8th March -12 March 2021., NIT Silchar
- Convenor, One Week Indo-German SPARC Symposium cum Workshop on “Recent Advances in Machine Translation”, (RAMT-2021), 15-19 March 2021, NIT Silchar
- Organizer, SPARC International Symposium on Mahatma Gandhi and Linguistic Diversity, Sep 23, 2020 [Online Mode]
- Technical Session Chair, ComPE-2020, IEEE Conference
- Session Coordinator, Session -5, MIND 2020
- Poster and Workshop Chair, 25th International Symposium on Frontiers of Research in Speech and Music (FRSM 2020), 8-9 October 2020, NIT Silchar, India.
- Program Chair, International Conference On Big Data, Machine Learning and Applications (BigDML 2019), National Institute of Technology Silchar, December 16-19, 2019.
- Convenor, Workshop on “Machine Learning using Python”, TEQIP-III, IIIT Manipur, 4th Dec 2017 – 10th Dec 2017.
- Organising Committee and PC Chair, regICON 2017, 3rd Nov 2017 – 4th Nov 2017, IIIT Senapati, Manipur.
- Convenor, Workshop on Web Application Development (4-23) July 2016, IIIT Senapati, Manipur.
List of Administrative Works:
- Faculty Co-Ordinator, Training and Placement, Dept of CSE, NIT Silchar, August 2023 till date
- Co-Faculty-In-Charge (Co-FIC) of CNLP, NIT Silchar, Dec 2022 till date
- Department PhD Co-Ordinator, NIT Silchar, Sept 2021 to Aug 2023
- DPC Member, NIT Silchar, Sept 2023 till date
- DPPC Member, NIT Silchar, Sept 2023 till date
- DPMC Member, NIT Silchar, Sept 2021 till Aug 2023
- DUPC Member, NIT Silchar, Sept 2019 – 2nd Sept 2021
- Faculty-Lab-in-Charge : CC07, NIT Silchar, 2019 till date
- Department NBA Committee member 2020, NIT Silchar
- Third Party Proctor, NIT Silchar, NPIU, 2019
- Convener, Technical Committee of Center for Natural Language Processing, NIT Silchar, 2019
- Committee Member of ARC, NIT Silchar, 2018
- Faculty-in-Charge – Computer Centre, IIIT Senapati, Manipur. [5 Feb 2017 – 29 June 2018]
- Presiding Officer, GATE-JAM 2017, IIIT Senapati, Manipur Centre
- Test Centre Administrator (TCA), TCS Examinations, IIIT Senapati, Manipur. [25 July 2015 – 19 May 2017]
- Past Member of Board of Undergraduate Studies (BUGS) of CSE, IT, ECE, North Eastern Hill University (NEHU), Shillong
- Past Member of Board of Undergraduate Studies (BUGS) of BCA, Mizoram University, Aizawl
Ph.D. Students Supervised:
- Mr. Alok Singh Main Supervisor
Scholar ID.: 19-3-05-127
Thesis title: Visual Description Generation: Bridging a Gap Between Vision and Natural Language - Mr. Loitongbam Sanayai Meetei Main Supervisor
Scholar ID.: 18-3-05-109
Thesis title: Multimodal Machine Translation – Convergence of Multiple Modes of Input - Mr. Salam Michael Singh Sole Supervisor
Scholar ID.: 19-3-05-132
Thesis title: Study of Machine Translation for Low Resource Languages with Multilingual Cues - Mrs. Ringki Das Sole Supervisor
Scholar ID.: 19-3-05-107
Thesis title: Multimodal Sentiment Analysis of Assamese News Articles
Projects Involved:
- Senior Team Member of Statistical Machine Translation (SMT) Group of the English to Indian Language Machine Translation (EILMT) Consortium funded by Department of Information Technology (DIT), Govt. of India at Center for Development of Advanced Computing (CDAC), Mumbai. (November 2007-May 2009)
- Senior Team Member of State Service Delivery Gateway (SSDG) project funded by Department of Information Technology (DIT), Government of India of National E-Governance Plan (NeGP) at CDAC, Mumbai. (June 2009- 31st August 2013)
Research Project(s):
Sl No | Name of Project | Sponsored By | Duration | Organization | Amount | Role | Status | |
1 | Multimodal Machine Translation – Convergence of Multiple Modes of Input | SPARC, MOE(erstwhile MHRD), Govt of India | 15 March, 2019-30 September, 2023 | NIT Silchar, India and Universität des Saarlandes, Germany | 49.58 lacs | Co-PI | Completed | |
2. | Project ISHAAN: A System for Bidirectional Machine Translation Between 1) English and Assamese, Bodo, Manipuri, Nepali 2) Manipuri and Hindi 3) Assamese and Bodo’ under the Project titled ‘National Language Translation Mission (NLTM) : BHASHINI’. | MEITY, Govt of India | April 2022- | NIT Silchar | Rs. 141.81 lakhs | PI | Ongoing |
Program/Course Organized
Sl No | Name of Course/Program |
Sponsored By | Duration | Organization | Role | Status | ||
1 | Machine Translation | SPARC, MHRD, Govt of India | 2nd December, 2019-13th Dec, 2019 | NIT Silchar, India and Universität des Saarlandes, Germany | Organiser | Completed |
Shared Task Participation:
- The 6th Workshop on Asian Translation (WAT-2019), Multimodal : English –> Hindi of Indic Translation Task, 4th Nov 2019, Hong Kong, China [Result] Name of Team [NITSNLP]
- Unsupervised MT and Very Low Resource Supervised MT of WMT-2020 in conjunction with EMNLP-2020: Nov 19-20, 2020, Online [Result] Name of the Team NITS-CNLP
- VATEX Video Captioning Challenge in conjunction with CPVR-2020, Workshop on Language & Vision with applications to Video Understanding (LVVU2020)
- SEMEVAL-2020, Task 9: Sentiment Analysis for Code-Mixed Social Media Text, Name of Team [rns2020]
SUBJECTS TAUGHT (INCLUDING SUBJECTS CURRENTLY TEACHING)
- Introduction to Computer Programming in C [CS-1101/CS-101 (B.Tech.)]
- Theory of Computation [CS-1402 (B.Tech.)]
- Machine Translation [CS-1483, CS-5149]
- Compiler Design [CS-1305 (B.Tech.)]
- Text Mining and Analytics [CS-1447 (B.Tech., M.Tech., PhD)]
- Natural Language Processing [CS-1540 (PhD) and CS-1436 (B.Tech.)]
- Artificial Intelligence [CS-5109 (M.Tech., PhD]
- Object Oriented Programming [CS-213 (B.Tech.)]
Reviewer /Served as TPC
- Reviewer/PC of top conferences like ACL-2023, EMNLP-2023, COLING-2022/2023, LREC-COLING-2024, AAAI-2024, NeurIPS fast track-2024, ASONAM 2024, WWW, ICON, PReMI, MICAI, DaSAA etc..
- Natural Language Engineering, Cambridge University Press
- Language Resources and Evaluation, Springer
- ACM Transactions on Asian and Low-Resource Language Information Processing (ACM TALLIP), ACM
- Expert Systems and Application, Elsevier
- Neurocomputing, Elsevier
- IEEE Transactions on Computational Social Systems
- SN Computer Science, Springer Nature
- Multimedia Tools and Applications, Springer
- Multimedia System, Springer
- Scientific Reports, Springer Nature
- PLOS ONE
- Natural Language Processing Journal, Elsevier
- SADHANA, Springer
- CSI Transactions on ICT (Springer)
- Advances in Language and Vision Research ALVR -2020, ALVR-2021, ALVR-2024
- ClimateNLP 2024
- eKNOW 2023, ICETCE 2019, ICCIS 2019
- International Conference on Natural Language Processing (ICON) 2015, 2017, 2018, 2022
- Conference on Information and Communication Technology (CICT), 2017, 2018
- ICITAM 2017, icSoftComp 2017, AAAI 2016 (Sub-Reviewer), WWW 2016 (Sub-Reviewer)
- TBTCIA 2013 (ICON 2013 Workshop)
- Reviewer of International Journal of Computational Linguistics and Chinese Language Processing (IJCLCLP) Journal, 2013
- Book chapter reviewer of “Emerging Applications of Natural Language Processing: Concepts and New Research”, IGI Global, 2012.
- Reviewer of 24th Florida Artificial Intelligence Research Society Conference (FLAIRS-24), Palm Beach, Florida, May 18-20, 2011
- Many More to be UPDATED ….
Workshop/STTP Attended:
- One-week Faculty Development Program on “Cyber Security”, NIT Silchar, 14th Oct – 19th Oct 2019.
- One-week Workshop on “Data Analytics with Machine Learning Techniques” under TEQIP – III, NIT Silchar and GUIST, Guwahati, Assam, 29th July’19 – 02nd August 2019
- TEQIP-III sponsored One Week Workshop on “ Recent Research Trends & Future Perspective of Machine Learning in Academics and Industry” NIT Silchar and GUIST, Guwahati, Assam. 01-05 October, 2018.
- TEQIP-III sponsored workshop on “ Outcome Based Education & Accreditation (WOBEA 2018) Organised by NIT Silchar and GUIST, Guwahati, Assam. 30th September 2018-01st October 2018.
- Workshop on Teaching Computer Networks, 15-16 September 2018, IIT Kanpur
- Workshop on Teaching Software Architecture, 15-16 September 2018, IIT Kanpur
- Summer Training Program on Active Learning for Faculty, 4 June 2018- 8June 2018, IIT Bombay
- FDP on Deep Learning and HPC, 17 – 22 March 2017, IIIT Senapati, Manipur
- Workshop on Cyber Security and Forensics, 03 – 08 April 2017, IIIT Senapati, Manipur
- Seminar on “Towards Standardizing Khasi for Computational purposes”, 28th – 29th October, 2014, St. Anthony’s College, Shillong
- Workshop on Ontology, NLP, IE and IR, 16 th to 18 th July, 2008, IIT Bombay
- Tutorial on “How to Add a New Language on the NLP Map: Building Resources and Tools for Languages with Scarce Resources”, 7 Jan 2008, IIIT Hyderabad.
- ISI-NERIST winter School on Soft Computing, Data Mining and Bioinformatics, February 14-18, 2005, NERIST, Itanagar
- Workshop on trends and issues in Wireless Networks and Mobile Computing, 26-28 August 2004, Tezpur University
- Seminar on “Intellectual Property Rights”, 2004, St. Anthony’s College, Shillong
- UGC sponsored refresher course in the subject “Natural Language Processing: from Fundamentals to Research Roadmaps”, 25 Nov -16 Dec, 2003, Jadavpur University, Kolkata
- Autumn School on Information Technology: Emerging Research Issues, 22-26 September, 2003, NERIST, Itanagar
- Workshop on Vision, Autonomy, Accreditation, 10-12 February 2000, St. Anthony’s College, Shillong
- Seminar on “Personal growth, self awareness and Vision 2020”, 8-9 February 1999, St. Anthony’s College, Shillong
MEMBER
- Member of AFNLP
- Member of ACL
STUDENTS GUIDANCE
Title of Project | Program | Name of Students (Scholar ID) | Year of Passing |
---|---|---|---|
Chatbot Powered by Large Language Models for Personalized Recommendations | B.Tech.(CSE) | Jaswanth Kumar Polisetty(2012071),Bikkina Uday Sathyanarayana(2012108),Prabhudas Avanigadda(2012192) | 2024 |
Fake News Detection in Bangla using Machine Learning and Deep Learning Techniques | B.Tech.(CSE) | Debopriya Das(2012035),Jyotirmoy Das(2012054),Depayon Ghosh(2012082) | 2024 |
Propaganda Detection and Classification Using Pre-trained Models and Simple NLP Techniques | B.Tech.(CSE) | Reeya Hazarika(1912027), Santanu Baruah(1912029), Laharjit Das(1912054) | 2023 |
Image Caption Generation in Bengali Using Deep Learning Techniques | B.Tech.(CSE) | Sanchayita Purkayastha(1912013), Dude Saketh Krishna(1912025), Rituparna Kagyung(1912026) | 2023 |
Cyberbullying Detection in Hinglish Comments from Social Media using Machine Learning Techniques | B.Tech.(CSE) | Mrinmoy Mondal(1912049), Tanuja Dutta(1912066), Saurav Kumar(1912070) | 2023 |
Spoken Language Identification: A Case Study of Indo-Aryan and Dravidian Language Family | B.Tech.(CSE) | Anirban Ghosh(1912180), Amal Kuniyil Parambath(1912179), Arpita Bhattacharjee(1912003) | 2023 |
Prediction of Research Trends using LDA based Topic Modelling | B.Tech.(CSE) | Rahul Kumar Gupta(1815125), Joythish Reddy Evuri(1815079), Ritu Agarwalla(1815069), Bukya Hemanth Naik(1815111), Apil Thapa(1815084) | 2022 |
Dynamic Framedrop for Speech Classification using Deep Reinforcement Learning | B.Tech.(CSE) | Divyansha Lachi(1715094), Aniket Agarwalla(1715093), Yash Bajoria(1715092), Raushan Gandhi(1715054) | 2021 |
Smart Health Centre Management System | B.Tech.(CSE) | Atmashree Ray(1615004), Manjit Borah(1615009), Shubham Prasad(1615025), Ganesh Shah(1615059) | 2020 |
A Deep Dive and Investigation into Sentiment Analysis for the Mizo Language | M.Tech.(CSE) | Mercy Lalthangmawii (2222205) | 2024 |
Improving Named Entity Recognition Tasks in Biomedical Text Using NLP Techniques | M.Tech.(CSE) | Hunnan Hussain(2122208) | 2023 |
Building an Open-Domain Hindi Dialog System using Various Sequence-to-Sequence Architectures | M.Tech.(AI) | Sandeep Kumar Rana(2122101) | 2023 |
Fake News Detection of Hindi News Dataset using Various Machine Learning Based Approaches | M.Tech.(CSE) | Sudhanshu Kumar((2022107)) | 2022 |
Grammatical Error Correction with Pre-trained Language Model | M.Tech.(CSE) | Debartha Saha(1922118) | 2021 |
Study on Development of English to Bengali Neural Machine Translation Systems | M.Tech.(CSE) | Anwesha Das(1825102) | 2020 |
CALL FOR PAPERS:
1. First CFP of 18th International Conference on Natural Language Processing (ICON-2021)
Important Dates:
- Paper Submission Deadline:
October 15, 2021 - Paper Acceptance Notification:
November 15, 2021 - Paper Camera Ready Paper Submission:
December 5, 2021 - Doctoral Consortium Deadline:
October 15, 2021 - Paper Acceptance Notification (Doctoral Consortium):
November 15, 2021 - Workshop Proposal Submission:
September 15, 2021 - Workshop Acceptance Notification: September 30, 2021
- Tutorial Proposal Submission:
October 10, 2021 - Tutorial Acceptance Notification:
October 30, 2021 - Conference:
December 16-19, 2021
Publication: ACL Anthology
Proceedings of ICON-2021 Main Conference: https://aclanthology.org/volumes/2021.icon-main/
Proceedings of the co-located events of ICON-2021:
- Parsing and its Applications for Indian Languages (PAIL-2021)
- Speech and Music Processing (SMP-2021)
- Natural Language Processing for Digital Humanities (NLP4DH-2021)
- Multilingual Gender Biased and Communal Language Identification (MultiGen-2021)
2. Proceedings of First Workshop on Multimodal Machine Translation for Low Resource Languages (MMTLRL 2021): https://aclanthology.org/2021.mmtlrl-1.0.pdf
Journal Guest Editor(s):
1. Recent Advances on Social Media Analytics and Multimedia Systems: Issues and Challenges [SCIE Indexed]
Important Dates
Submission deadline: extended to December 31, 2020
First notification to authors: February 28, 2021
Final manuscript due: April 15, 2021
Tentative publication date: Autumn 2021
Publication: Multimedia Tools and Applications, Springer
Other Students Guidance:
- Divyansha (Undergrad- Graduated 2021)
- Abdullah Faiz Ur Rahman Khilji (Undergrad- Graduated 2021)
- Apoorva Vikram Singh (Undergrad-Graduated 2021)
More to be updated…