Research Interests: Causality in RL; RL; RL for LLMs; Multi-Agent RL; Multi-Agent Reasoning; Representation Learning
I am an M.Tech graduate in Data Science from IIT Madras (2024), and hold a B.Tech in Computer Science and Engineering from VIT (2022). I previously worked as a Project Associate at RBCDSAI, IIT Madras, under the guidance of Prof. B. Ravindran.
My research interests focus on reinforcement learning (RL), especially its applications to large language models (LLMs).
RL Assignments (IITM CS6700): Implementations including SARSA, Q-Learning, Hierarchical RL (SMDP & Intra-Option Q-Learning), Dueling DQN, and REINFORCE.
RL4LLM Framework: Toolkit for RL-based Large Language Model fine-tuning.
NLP Information Retrieval System (IITM CS6370): Project focusing on building an Information Retrieval system.
Causal-NLP: Development of causal inference tools tailored for Natural Language Processing tasks.
BSc Thesis - COVID-19 CXR Classification: Classification of COVID-19 from Chest X-Rays using Transfer Learning.
Feel free to reach out at ahmecse@gmail.com or connect with me on LinkedIn!