Shubham Pateria
Tagline:AI Scientist | Machine Learning, Reinforcement Learning, Multi-agent Systems, Federated Learning
Education
Doctor of Philosophy - PhD
from: 2017, until: 2021Field of study:Computer ScienceSchool:Nanyang Technological University Singapore
DescriptionWorking on algorithms for autonomous decision-making via hierarchical and deep reinforcement learning.
Reinforcement learning refers to the training of machine learning models to make a sequence of decisions that optimize an objective in the future.
Bachelor of Technology (B.Tech.)
from: 2009, until: 2013Field of study:Electrical, Electronics and Communications EngineeringSchool:National Institute of Technology Durgapur
Bachelor of Technology - BTech
from: 2025, until: presentSchool:National Institute of Technology Durgapur
Publications
FedART: A neural model integrating federated learning and adaptive resonance theory
Journal ArticlePublisher:Neural NetworksDate:2025Authors:Shubham PateriaBudhitama SubagdjaAh-Hwee TanExplaining sequences of actions in multi-agent deep reinforcement learning models
Journal ArticlePublisher:[ "International Foundation for Autonomous Agents", "Multiagent Systems" ]Date:2024Authors:Phyo Wai KHAINGMinghong GENGShubham PATERIABudhitama SUBAGDJAAh-hwee TANExplaining Sequences of Actions in Multi-agent Deep Reinforcement Learning Models
Conference PaperPublisher:Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsDate:2024Authors:Khaing Phyo WaiMinghong GengShubham PateriaBudhitama SubagdjaAh-Hwee TanFedSTEM-ADL: A federated spatial-temporal episodic memory model for ADL prediction
Conference PaperPublisher:2024 International Joint Conference on Neural Networks (IJCNN)Date:2024Authors:Doudou WuShubham PateriaBudhitama SubagdjaAh-Hwee TanHiSOMA: A hierarchical multi-agent model integrating self-organizing neural networks with multi-agent deep reinforcement learning
Journal ArticlePublisher:Expert Systems with ApplicationsDate:2024Authors:Minghong GengShubham PateriaBudhitama SubagdjaAh-Hwee TanBenchmarking MARL on long horizon sequential multi-objective tasks
Journal ArticlePublisher:[ "International Foundation for Autonomous Agents", "Multiagent Systems" ]Date:2024Authors:Minghong GengShubham PateriaBudhitama SubagdjaAh-Hwee TanValue-Based Subgoal Discovery and Path Planning for Reaching Long-Horizon Goals
Journal ArticlePublisher:IEEE Transactions on Neural Networks and Learning SystemsDate:2023Authors:Shubham PateriaBudhitama SubagdjaAh-Hwee TanChai QuekTowards explaining sequences of actions in multi-agent deep reinforcement learning models
Journal ArticlePublisher:ACMDate:2023Authors:Phyo Wai KHAINGMinghong GengBudhitama SubagdjaShubham PateriaAh-Hwee TanMethods for autonomously decomposing and performing long-horizon sequential decision tasks
Journal ArticlePublisher:Nanyang Technological UniversityDate:2022Authors:Shubham PateriaHierarchical reinforcement learning: A comprehensive survey
Journal ArticlePublisher:ACM Computing Surveys (CSUR)Date:2021Authors:Shubham PateriaBudhitama SubagdjaAh-hwee TanChai Quek
Work Experiences
AI Research Scientist
from: 2022, until: presentOrganization:Singapore Management UniversityLocation:Singapore
Description: Leading multiple funded projects in complex long-horizon multi-agent Reinforcement Learning and privacy-preserving Federated Learning.
Strong expertise in Deep Learning, Reinforcement Learning, Federated Learning, Python, and Pytorch.
Developed and successfully licensed a deep multi-agent learning and self-organizing neural networks hybrid system achieving above 90% success rates in large-scale defense forces automation tasks. Technology transferred to DSO Laboratories.
Built domain knowledge of Federated Learning from scratch without prior experience and launched multiple ongoing projects on federated human activity recognition, distributed data classification, distributed clustering, and time-series domain adaptation.Creative AI Advisor
from: 2022, until: 2022Organization:NewYork.SGLocation:Singapore
Description:Introducing AI to creatives such as designers, writers, creative directors, artists, etc.
Co-Founder
from: 2021, until: 2022Organization:Maargo Technologies
Description:Digital mental health startup. Discontinued.
gomaargo.com (mobile).
Founder in Residence (EFSG10)
from: 2021, until: 2022Organization:Entrepreneur FirstLocation:Singapore
Ph.D. Scholar
from: 2017, until: 2021Organization:Nanyang Technological UniversityLocation:Singapore
Description:https://dblp.org/pid/186/7303.html
Working on algorithms for autonomous decision-making via hierarchical and deep reinforcement learning. Robotic navigation. AI agents for games.
Reinforcement learning refers to the training of machine learning models to make a sequence of decisions that optimize an objective in the future.
-
Developed a multi-agent hierarchical reinforcement learning approach for robot coordination for Search & Rescue (in simulation), as part of the STE-NTU Corporate Lab project ‘Joint Situation Awareness and Cooperative Reinforcement Learning’.
-
Wrote the first-ever comprehensive survey covering the whole breadth of the Hierarchical Reinforcement Learning research up to the recent years.
-
Developed novel hierarchical learning algorithms using topological/graphical memory and goal-decomposition for long-horizon decision-making, which can be applied in tasks such as robot navigation, manipulation, and AI-based game playing.
-
Graduate Teaching Assistant
from: 2018, until: 2021Organization:Nanyang Technological University
Description:Served as a teaching assistant for the following courses:
- CZ2001: Algorithms, Semester 2, 2018.
- CZ3003: Software System Analysis and Design, Semester 1, 2019.
- CZ2001: Algorithms, Semester 2, 2019.
- CZ1015: Introduction to Data Science, Semester 1, 2020.
- CZ3004: Multidisciplinary Design Project (MDP), Semester 2, 2020.
Senior Software Engineer
from: 2016, until: 2017Organization:Samsung ElectronicsLocation:Bengaluru Area, India
Description:R&D Contribution: Novel methods for optimizing power consumption and visual quality of the Samsung smartphone display modules. The work led to two issued patents.
Software Engineer
from: 2014, until: 2016Organization:Samsung ElectronicsLocation:Bangaon Area, India
Description:-
Worked with Display technology teams in SRI-B and Samsung HQ responsible for board bring-up and device driver upgrades critical for the successful commercial launch of Samsung smartphones in the worldwide market (Galaxy S4-variants, A7, Tab4, and other mid-tier smartphone variants).
-
R&D Contribution: Machine learning for power-efficient and fault-tolerant sensor management system for Smart Home IoT sensors with minimal error (average 0.24$^{\circ}$C) in temperature prediction. Published in IEEE IACC.
-
Volunteer
from: 2014, until: 2014Organization:Agastya International FoundationLocation:Bengaluru Area, India
Description:Created didactic content for virtual Chemistry lab as part of the Lab-on-a-Tab project.
Technology Associate - Trainee
from: 2013, until: 2014Organization:Sapient Global MarketsLocation:Bangalore