Ahmed Shmels Muhe

Research Interests: Causality in RL; RL; RL for LLMs; Multi-Agent RL; Multi-Agent Reasoning; Representation Learning

Ahmed Shmels Muhe

About Me

I am an M.Tech graduate in Data Science from IIT Madras (2024), and hold a B.Tech in Computer Science and Engineering from VIT (2022). I previously worked as a Project Associate at RBCDSAI, IIT Madras, under the guidance of Prof. B. Ravindran.

My research interests focus on reinforcement learning (RL), especially its applications to large language models (LLMs).

  • Current Focus: Reinforcement Learning for Large Language Models (RL for LLMs)
  • Open to Collaboration: Projects involving RL, LLMs, Causality, and Multi-Agent Systems
  • Expertise Areas: Machine Learning · Deep Learning · Computer Vision · Natural Language Processing (NLP) · Reinforcement Learning

Projects

Reinforcement Learning (RL)

RL Assignments (IITM CS6700): Implementations including SARSA, Q-Learning, Hierarchical RL (SMDP & Intra-Option Q-Learning), Dueling DQN, and REINFORCE.

RL4LLM Framework: Toolkit for RL-based Large Language Model fine-tuning.

Natural Language Processing (NLP)

NLP Information Retrieval System (IITM CS6370): Project focusing on building an Information Retrieval system.

Causal-NLP: Development of causal inference tools tailored for Natural Language Processing tasks.

Computer Vision (CV)

BSc Thesis - COVID-19 CXR Classification: Classification of COVID-19 from Chest X-Rays using Transfer Learning.

Contact

Feel free to reach out at ahmecse@gmail.com or connect with me on LinkedIn!