Shubham Pateria

Tagline:AI Scientist | Machine Learning, Reinforcement Learning, Multi-agent Systems, Federated Learning

personal photo of Shubham Pateria

Education

  • Doctor of Philosophy - PhD

    from: 2017, until: 2021

    Field of study:Computer ScienceSchool:Nanyang Technological University Singapore

    Description

    Working on algorithms for autonomous decision-making via hierarchical and deep reinforcement learning.

    Reinforcement learning refers to the training of machine learning models to make a sequence of decisions that optimize an objective in the future.

  • Bachelor of Technology (B.Tech.)

    from: 2009, until: 2013

    Field of study:Electrical, Electronics and Communications EngineeringSchool:National Institute of Technology Durgapur

  • Bachelor of Technology - BTech

    from: 2025, until: present

    School:National Institute of Technology Durgapur

Publications

  • FedART: A neural model integrating federated learning and adaptive resonance theory

    Journal ArticlePublisher:Neural NetworksDate:2025
    Authors:
    Shubham PateriaBudhitama SubagdjaAh-Hwee Tan
  • Explaining sequences of actions in multi-agent deep reinforcement learning models

    Journal ArticlePublisher:[ "International Foundation for Autonomous Agents", "Multiagent Systems" ]Date:2024
    Authors:
    Phyo Wai KHAINGMinghong GENGShubham PATERIABudhitama SUBAGDJAAh-hwee TAN
  • Explaining Sequences of Actions in Multi-agent Deep Reinforcement Learning Models

    Conference PaperPublisher:Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsDate:2024
    Authors:
    Khaing Phyo WaiMinghong GengShubham PateriaBudhitama SubagdjaAh-Hwee Tan
  • FedSTEM-ADL: A federated spatial-temporal episodic memory model for ADL prediction

    Conference PaperPublisher:2024 International Joint Conference on Neural Networks (IJCNN)Date:2024
    Authors:
    Doudou WuShubham PateriaBudhitama SubagdjaAh-Hwee Tan
  • HiSOMA: A hierarchical multi-agent model integrating self-organizing neural networks with multi-agent deep reinforcement learning

    Journal ArticlePublisher:Expert Systems with ApplicationsDate:2024
    Authors:
    Minghong GengShubham PateriaBudhitama SubagdjaAh-Hwee Tan
  • Benchmarking MARL on long horizon sequential multi-objective tasks

    Journal ArticlePublisher:[ "International Foundation for Autonomous Agents", "Multiagent Systems" ]Date:2024
    Authors:
    Minghong GengShubham PateriaBudhitama SubagdjaAh-Hwee Tan
  • Value-Based Subgoal Discovery and Path Planning for Reaching Long-Horizon Goals

    Journal ArticlePublisher:IEEE Transactions on Neural Networks and Learning SystemsDate:2023
    Authors:
    Shubham PateriaBudhitama SubagdjaAh-Hwee TanChai Quek
  • Towards explaining sequences of actions in multi-agent deep reinforcement learning models

    Journal ArticlePublisher:ACMDate:2023
    Authors:
    Phyo Wai KHAINGMinghong GengBudhitama SubagdjaShubham PateriaAh-Hwee Tan
  • Methods for autonomously decomposing and performing long-horizon sequential decision tasks

    Journal ArticlePublisher:Nanyang Technological UniversityDate:2022
    Authors:
    Shubham Pateria
  • Hierarchical reinforcement learning: A comprehensive survey

    Journal ArticlePublisher:ACM Computing Surveys (CSUR)Date:2021
    Authors:
    Shubham PateriaBudhitama SubagdjaAh-hwee TanChai Quek

Work Experiences

  • AI Research Scientist

    from: 2022, until: present

    Organization:Singapore Management UniversityLocation:Singapore

    Description:

     Leading multiple funded projects in complex long-horizon multi-agent Reinforcement Learning and privacy-preserving Federated Learning.
     Strong expertise in Deep Learning, Reinforcement Learning, Federated Learning, Python, and Pytorch.
     Developed and successfully licensed a deep multi-agent learning and self-organizing neural networks hybrid system achieving above 90% success rates in large-scale defense forces automation tasks. Technology transferred to DSO Laboratories.
     Built domain knowledge of Federated Learning from scratch without prior experience and launched multiple ongoing projects on federated human activity recognition, distributed data classification, distributed clustering, and time-series domain adaptation.

  • Creative AI Advisor

    from: 2022, until: 2022

    Organization:NewYork.SGLocation:Singapore

    Description:

    Introducing AI to creatives such as designers, writers, creative directors, artists, etc.

  • Co-Founder

    from: 2021, until: 2022

    Organization:Maargo Technologies

    Description:

    Digital mental health startup. Discontinued.

    gomaargo.com (mobile).

  • Founder in Residence (EFSG10)

    from: 2021, until: 2022

    Organization:Entrepreneur FirstLocation:Singapore

  • Ph.D. Scholar

    from: 2017, until: 2021

    Organization:Nanyang Technological UniversityLocation:Singapore

    Description:

    https://dblp.org/pid/186/7303.html

    Working on algorithms for autonomous decision-making via hierarchical and deep reinforcement learning. Robotic navigation. AI agents for games.

    Reinforcement learning refers to the training of machine learning models to make a sequence of decisions that optimize an objective in the future.

    • Developed a multi-agent hierarchical reinforcement learning approach for robot coordination for Search & Rescue (in simulation), as part of the STE-NTU Corporate Lab project ‘Joint Situation Awareness and Cooperative Reinforcement Learning’.

    • Wrote the first-ever comprehensive survey covering the whole breadth of the Hierarchical Reinforcement Learning research up to the recent years.

    • Developed novel hierarchical learning algorithms using topological/graphical memory and goal-decomposition for long-horizon decision-making, which can be applied in tasks such as robot navigation, manipulation, and AI-based game playing.

  • Graduate Teaching Assistant

    from: 2018, until: 2021

    Organization:Nanyang Technological University

    Description:

    Served as a teaching assistant for the following courses:

    1. CZ2001: Algorithms, Semester 2, 2018.
    2. CZ3003: Software System Analysis and Design, Semester 1, 2019.
    3. CZ2001: Algorithms, Semester 2, 2019.
    4. CZ1015: Introduction to Data Science, Semester 1, 2020.
    5. CZ3004: Multidisciplinary Design Project (MDP), Semester 2, 2020.
  • Senior Software Engineer

    from: 2016, until: 2017

    Organization:Samsung ElectronicsLocation:Bengaluru Area, India

    Description:

    R&D Contribution: Novel methods for optimizing power consumption and visual quality of the Samsung smartphone display modules. The work led to two issued patents.

  • Software Engineer

    from: 2014, until: 2016

    Organization:Samsung ElectronicsLocation:Bangaon Area, India

    Description:
    1. Worked with Display technology teams in SRI-B and Samsung HQ responsible for board bring-up and device driver upgrades critical for the successful commercial launch of Samsung smartphones in the worldwide market (Galaxy S4-variants, A7, Tab4, and other mid-tier smartphone variants).

    2. R&D Contribution: Machine learning for power-efficient and fault-tolerant sensor management system for Smart Home IoT sensors with minimal error (average 0.24$^{\circ}$C) in temperature prediction. Published in IEEE IACC.

  • Volunteer

    from: 2014, until: 2014

    Organization:Agastya International FoundationLocation:Bengaluru Area, India

    Description:

    Created didactic content for virtual Chemistry lab as part of the Lab-on-a-Tab project.

  • Technology Associate - Trainee

    from: 2013, until: 2014

    Organization:Sapient Global MarketsLocation:Bangalore