"Programming isn't about what you know, it's about what you can figure out."
—Chris Pine, Learn to Program
Tonmoy Talukder (He/Him)
[tɒnˈmɔɪ təˈluːkdər]
Resume | LinkedIn | GitHub | Google Scholar
I pursued my Bachelor of Science degree in Computer Science and Engineering from Ahsanullah University of Science and Technology. At present, I am deeply involved in research on Machine Learning, advised by Mr. G. M. Shahariar Shibli. My aspiration is to specialize in research, aiming to work in renowned AI research institutions as a scientist. These days I am specifically dedicated to advancing NLP and Computer Vision solutions that intersect with human interaction.
Research. Broadly, my research explores how to enhance various aspects of Natural Language Processing, particularly in "low resource" languages like Bangla. Throughout my work, I thoroughly investigate the primary challenges faced by these languages, which mainly arise from constraints in data processing techniques. My constant devotion lies in the innovation of state-of-the-art solutions aimed at tackling these challenges head-on, thereby amplifying the data processing capabilities of "low resource" languages, particularly Bangla. Additionally, I think about how to integrate Computer Vision techniques with Natural Language Processing solutions for these resource-constrained languages. I am curious about the prospect of creating generative models that combine NLP and Computer Vision, driven by human interaction.
Collaboration. From a collaborative perspective, there are numerous opportunities to learn from others. As of now, I am actively engaged in research with my undergraduate advisor. Additionally, I have experience in collaborative research with various faculty members at my undergraduate institution, including Md. Tanvir Rouf Shawon.
I would greatly appreciate receiving an invitation for a research collaboration in the fields of Natural Language Generation, Computer Vision, and Human-Computer Interaction.
"Programming isn't about what you know, it's about what you can figure out."
—Chris Pine, Learn to Program
Research Quest
I am deeply passionate about researching the utilization of natural language processing, computer vision, and multimodal learning, all engaged with human interaction. However, my particular focus is on utilizing these technologies to enhance the understanding and use of low-resource languages.
Recently, I have been actively engaged in multiple projects centered on text summarization, text generation, text classification, question answering, and image captioning within this domain. Additionally, my curiosity extends to understanding how machine learning models interact with data representations during training and how this interaction can be used to improve their performance.
What truly captivates my curiosity is the exploration of how computers could learn from both languages and images simultaneously, mirroring the multifaceted nature of human learning — learning through listening, watching, and even feeling. This captivating exploration fuels my enthusiasm, and I am eager to witness the boundless possibilities that await.
Research Interest. My research interest lies in Natural Language Processing, Natural Language Generation, "low resource" language, Multimodal Deep Learning, Computer Vision, and Human-Computer Interaction.
RESEARCH
Ongoing Works 📢
🔨 Image2Cap: Bengali Caption Generation from Images using Pre-trained Transformers
🔨 Multimodal Image and Text Classification using Pre-trained Transformers
🔨 Bangla Hate Speech Classification using Pre-Trained Transformers on a Benchmark Dataset
Find Me: 👉
I write in: 🖊️
Write me: ✍️
© 2024, Tonmoy Talukder. All rights reserved.
This site has been developed by myself using NEXT.JS.