Shubham Pateria

Tagline:AI Scientist | Machine Learning, Reinforcement Learning, Multi-agent Systems, Federated Learning

Education

Doctor of Philosophy - PhD
from: 2017, until: 2021
Field of study:Computer ScienceSchool:Nanyang Technological University Singapore
Description
Working on algorithms for autonomous decision-making via hierarchical and deep reinforcement learning.

Reinforcement learning refers to the training of machine learning models to make a sequence of decisions that optimize an objective in the future.
Bachelor of Technology (B.Tech.)
from: 2009, until: 2013
Field of study:Electrical, Electronics and Communications EngineeringSchool:National Institute of Technology Durgapur
Bachelor of Technology - BTech
from: 2025, until: present
School:National Institute of Technology Durgapur

Publications

FedART: A neural model integrating federated learning and adaptive resonance theory
Journal ArticlePublisher:Neural NetworksDate:2025
Authors:
Shubham PateriaBudhitama SubagdjaAh-Hwee Tan
Explaining sequences of actions in multi-agent deep reinforcement learning models
Journal ArticlePublisher:[ "International Foundation for Autonomous Agents", "Multiagent Systems" ]Date:2024
Authors:
Phyo Wai KHAINGMinghong GENGShubham PATERIABudhitama SUBAGDJAAh-hwee TAN
Explaining Sequences of Actions in Multi-agent Deep Reinforcement Learning Models
Conference PaperPublisher:Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsDate:2024
Authors:
Khaing Phyo WaiMinghong GengShubham PateriaBudhitama SubagdjaAh-Hwee Tan
FedSTEM-ADL: A federated spatial-temporal episodic memory model for ADL prediction
Conference PaperPublisher:2024 International Joint Conference on Neural Networks (IJCNN)Date:2024
Authors:
Doudou WuShubham PateriaBudhitama SubagdjaAh-Hwee Tan
HiSOMA: A hierarchical multi-agent model integrating self-organizing neural networks with multi-agent deep reinforcement learning
Journal ArticlePublisher:Expert Systems with ApplicationsDate:2024
Authors:
Minghong GengShubham PateriaBudhitama SubagdjaAh-Hwee Tan
Benchmarking MARL on long horizon sequential multi-objective tasks
Journal ArticlePublisher:[ "International Foundation for Autonomous Agents", "Multiagent Systems" ]Date:2024
Authors:
Minghong GengShubham PateriaBudhitama SubagdjaAh-Hwee Tan
Value-Based Subgoal Discovery and Path Planning for Reaching Long-Horizon Goals
Journal ArticlePublisher:IEEE Transactions on Neural Networks and Learning SystemsDate:2023
Authors:
Shubham PateriaBudhitama SubagdjaAh-Hwee TanChai Quek
Towards explaining sequences of actions in multi-agent deep reinforcement learning models
Journal ArticlePublisher:ACMDate:2023
Authors:
Phyo Wai KHAINGMinghong GengBudhitama SubagdjaShubham PateriaAh-Hwee Tan
Methods for autonomously decomposing and performing long-horizon sequential decision tasks
Journal ArticlePublisher:Nanyang Technological UniversityDate:2022
Authors:
Shubham Pateria
Hierarchical reinforcement learning: A comprehensive survey
Journal ArticlePublisher:ACM Computing Surveys (CSUR)Date:2021
Authors:
Shubham PateriaBudhitama SubagdjaAh-hwee TanChai Quek

Contact

https://www.linkedin.com/in/shubham-pateria-2091/

Work Experiences

AI Research Scientist
from: 2022, until: present
Organization:Singapore Management UniversityLocation:Singapore
Description:
 Leading multiple funded projects in complex long-horizon multi-agent Reinforcement Learning and privacy-preserving Federated Learning.
 Strong expertise in Deep Learning, Reinforcement Learning, Federated Learning, Python, and Pytorch.
 Developed and successfully licensed a deep multi-agent learning and self-organizing neural networks hybrid system achieving above 90% success rates in large-scale defense forces automation tasks. Technology transferred to DSO Laboratories.
 Built domain knowledge of Federated Learning from scratch without prior experience and launched multiple ongoing projects on federated human activity recognition, distributed data classification, distributed clustering, and time-series domain adaptation.
Creative AI Advisor
from: 2022, until: 2022
Organization:NewYork.SGLocation:Singapore
Description:
Introducing AI to creatives such as designers, writers, creative directors, artists, etc.
Co-Founder
from: 2021, until: 2022
Organization:Maargo Technologies
Description:
Digital mental health startup. Discontinued.

gomaargo.com (mobile).
Founder in Residence (EFSG10)
from: 2021, until: 2022
Organization:Entrepreneur FirstLocation:Singapore
20172021
Ph.D. Scholar
from: 2017, until: 2021
Organization:Nanyang Technological UniversityLocation:Singapore
Description:
https://dblp.org/pid/186/7303.html

Working on algorithms for autonomous decision-making via hierarchical and deep reinforcement learning. Robotic navigation. AI agents for games.

Reinforcement learning refers to the training of machine learning models to make a sequence of decisions that optimize an objective in the future.

Developed a multi-agent hierarchical reinforcement learning approach for robot coordination for Search & Rescue (in simulation), as part of the STE-NTU Corporate Lab project ‘Joint Situation Awareness and Cooperative Reinforcement Learning’.

Wrote the first-ever comprehensive survey covering the whole breadth of the Hierarchical Reinforcement Learning research up to the recent years.

Developed novel hierarchical learning algorithms using topological/graphical memory and goal-decomposition for long-horizon decision-making, which can be applied in tasks such as robot navigation, manipulation, and AI-based game playing.
20182021
Graduate Teaching Assistant
from: 2018, until: 2021
Organization:Nanyang Technological University
Description:
Served as a teaching assistant for the following courses:

CZ2001: Algorithms, Semester 2, 2018.

CZ3003: Software System Analysis and Design, Semester 1, 2019.

CZ2001: Algorithms, Semester 2, 2019.

CZ1015: Introduction to Data Science, Semester 1, 2020.

CZ3004: Multidisciplinary Design Project (MDP), Semester 2, 2020.
Senior Software Engineer
from: 2016, until: 2017
Organization:Samsung ElectronicsLocation:Bengaluru Area, India
Description:
R&D Contribution: Novel methods for optimizing power consumption and visual quality of the Samsung smartphone display modules. The work led to two issued patents.
20142016
Software Engineer
from: 2014, until: 2016
Organization:Samsung ElectronicsLocation:Bangaon Area, India
Description:
Worked with Display technology teams in SRI-B and Samsung HQ responsible for board bring-up and device driver upgrades critical for the successful commercial launch of Samsung smartphones in the worldwide market (Galaxy S4-variants, A7, Tab4, and other mid-tier smartphone variants).

R&D Contribution: Machine learning for power-efficient and fault-tolerant sensor management system for Smart Home IoT sensors with minimal error (average 0.24$^{\circ}$C) in temperature prediction. Published in IEEE IACC.
Volunteer
from: 2014, until: 2014
Organization:Agastya International FoundationLocation:Bengaluru Area, India
Description:
Created didactic content for virtual Chemistry lab as part of the Lab-on-a-Tab project.
Technology Associate - Trainee
from: 2013, until: 2014
Organization:Sapient Global MarketsLocation:Bangalore

contents:

Shubham Pateria

Education

Doctor of Philosophy - PhD

Bachelor of Technology (B.Tech.)

Bachelor of Technology - BTech

Publications

FedART: A neural model integrating federated learning and adaptive resonance theory

Explaining sequences of actions in multi-agent deep reinforcement learning models

Explaining Sequences of Actions in Multi-agent Deep Reinforcement Learning Models

FedSTEM-ADL: A federated spatial-temporal episodic memory model for ADL prediction

HiSOMA: A hierarchical multi-agent model integrating self-organizing neural networks with multi-agent deep reinforcement learning

Benchmarking MARL on long horizon sequential multi-objective tasks

Value-Based Subgoal Discovery and Path Planning for Reaching Long-Horizon Goals

Towards explaining sequences of actions in multi-agent deep reinforcement learning models

Methods for autonomously decomposing and performing long-horizon sequential decision tasks

Hierarchical reinforcement learning: A comprehensive survey

Contact

Work Experiences

AI Research Scientist

Creative AI Advisor

Co-Founder

Founder in Residence (EFSG10)

Ph.D. Scholar

Graduate Teaching Assistant

Senior Software Engineer

Software Engineer

Volunteer

Technology Associate - Trainee