Sai Mitheran Jagadesh Kumar

Princeton, New Jersey, United States
4K followers 500+ connections

View mutual connections with Sai Mitheran

Sai Mitheran can introduce you to 10+ people at Latent AI

Email or phone

Password

Forgot password?

or

New to LinkedIn? Join now

By clicking Continue to join or sign in, you agree to LinkedIn’s User Agreement, Privacy Policy, and Cookie Policy.

Join to view profile

Latent AI, Inc.

Carnegie Mellon University

About

I am currently a part of the Machine Learning team at Latent AI, Inc. after graduating…

Activity

You make Claude work hard. I let it play Pokemon Red for 10 hours. Along the way, Karthik and I wrote the field guide to RL environments—what they…

You make Claude work hard. I let it play Pokemon Red for 10 hours. Along the way, Karthik and I wrote the field guide to RL environments—what they…

Liked by Sai Mitheran Jagadesh Kumar
1987: AI can't win at chess—planning is uniquely human 1997: AI can't win at Go—intuition is uniquely human 2016: AI can't win at poker—bluffing is…

1987: AI can't win at chess—planning is uniquely human 1997: AI can't win at Go—intuition is uniquely human 2016: AI can't win at poker—bluffing is…

Liked by Sai Mitheran Jagadesh Kumar
Transformers v5 is officially out 🔥🎉 With v5, Transformers is cementing its position as the ground truth for model architecture definitions…

Transformers v5 is officially out 🔥🎉 With v5, Transformers is cementing its position as the ground truth for model architecture definitions…

Liked by Sai Mitheran Jagadesh Kumar

Join now to see all activity

Experience

Latent AI, Inc.

New Jersey, United States
-

Pittsburgh, Pennsylvania, United States
-

Pittsburgh, Pennsylvania, United States
-

Princeton, New Jersey, United States
-

Pittsburgh, Pennsylvania, United States
-

Pittsburgh, Pennsylvania, United States
-
-
-

Singapore
-

Saarbrücken, Saarland, Germany
-
-
-
-
-

Tiruchirappalli, Tamil Nadu, India
-
-
-

Tiruchirappalli, Tamil Nadu, India
-
-
-

Coimbatore, Tamil Nadu, India
-

Education

Carnegie Mellon University

4.0/4.0

2022 - 2023

Activities and Societies: Graduate Teaching Assistant (18-794, Pattern Recognition Theory; 18-290, Signals and Systems), Graduate Research Assistant (CyLab, AirLab)

Coursework:

Fall 2022:
- Applied Stochastic Processes for AI and ML
- Machine Learning for Signal Processing
- Building Reliable Distributed Systems
- Introduction to Graduate Studies

Spring 2023:
- Advanced Deep Learning
- Visual Learning and Recognition
- Optimization
- Career and Professional Development for Engineering Masters Students

Fall 2023:
- Embedded Deep Learning
- Speech Recognition and Understanding
- Data, Inference and…

Coursework:

Fall 2022:
- Applied Stochastic Processes for AI and ML
- Machine Learning for Signal Processing
- Building Reliable Distributed Systems
- Introduction to Graduate Studies

Spring 2023:
- Advanced Deep Learning
- Visual Learning and Recognition
- Optimization
- Career and Professional Development for Engineering Masters Students

Fall 2023:
- Embedded Deep Learning
- Speech Recognition and Understanding
- Data, Inference and Applied Machine Learning
- Research (MS Graduate Project)
-

2018 - 2022
-

2011 - 2018

Licenses & Certifications

Responsible Conduct of Research

CITI Program

Issued Aug 2022

See credential
AWS Machine Learning Scholarship

AWS Elemental, an Amazon Web Services Company

Issued Jun 2021
Virtual Internship Experience

JPMorgan Chase & Co.

Issued Jun 2020

Credential ID bX6jdHgLCRckGhdXG

See credential
Deep Learning Specialization

Coursera

Issued May 2020

Credential ID BW7NTZGNDRN4

See credential

Volunteer Experience

Reviewer

IEEE

Jun 2023 - Present 2 years 8 months

Education

Reviewer for IEEE Transactions on Neural Networks and Learning Systems (Impact Factor: 14.255)
Community Volunteer

Hope, NIT Trichy

Oct 2020 - Apr 2023 2 years 7 months

Health
Community Volunteer

Hope Organisation

Aug 2021 - Jul 2022 1 year

Civil Rights and Social Action
Teaching Volunteer

U&I Trust

Sep 2021 - Mar 2022 7 months

Education
Student Volunteer

ICLR

Mar 2021 - May 2021 3 months

Science and Technology

Selected as a Student Volunteer for the 9th International Conference on Learning Representations, that was conducted virtually for 2021. Contributed to the tech team to resolve issues and stress-test the platform.

Certificate - https://drive.google.com/file/d/1MBHV4x9ERS6WLKsL1Sbh7jCbQPMw9krJ/view?usp=drivesdk
Co-Founder

GAIA Organization

May 2016 - Jul 2018 2 years 3 months

Environment

• Planted a new forest called the 'Miyawaki Forest' in the unused barren land of a school campus and maintained it regularly as the saplings came to life, to promote greenery and reduce pollution
• Visited 'Snehalaya for Animals' and donated the funds raised to help provide shelter to homeless stray animals, associating with 'Snehalaya Dog Shelter' to carry out the needful.
• Organised and carried out waste collection campaigns in several parts of the city, promoting Swachh Bharat and…

• Planted a new forest called the 'Miyawaki Forest' in the unused barren land of a school campus and maintained it regularly as the saplings came to life, to promote greenery and reduce pollution
• Visited 'Snehalaya for Animals' and donated the funds raised to help provide shelter to homeless stray animals, associating with 'Snehalaya Dog Shelter' to carry out the needful.
• Organised and carried out waste collection campaigns in several parts of the city, promoting Swachh Bharat and raised funds to donate to rural schools and homes for education.
Student Volunteer

ICML

Jun 2021 - Jul 2021 2 months

Education

Selected as a Student Volunteer for the Thirty-eighth International Conference on Machine Learning, that was conducted virtually for 2021.
Tennis Lead and Mentor

National Sports Organisation

Sep 2018 - May 2022 3 years 9 months

Health

Publications

Rethinking Feature Extraction: Gradient-Based Localized Feature Extraction for End-To-End Surgical Downstream Tasks

IEEE International Conference on Robotics and Automation (2023) January 31, 2023
Previously accepted at IEEE RA-L, also accepted for publication and presentation at IEEE ICRA 2023.

Several approaches have been introduced to understand surgical scenes through downstream tasks like captioning and surgical scene graph generation. However, most of them heavily rely on an independent object detector and region-based feature extractor. Encompassing computationally expensive detection and feature extraction models, these multi-stage methods suffer from slow inference…

Previously accepted at IEEE RA-L, also accepted for publication and presentation at IEEE ICRA 2023.

Several approaches have been introduced to understand surgical scenes through downstream tasks like captioning and surgical scene graph generation. However, most of them heavily rely on an independent object detector and region-based feature extractor. Encompassing computationally expensive detection and feature extraction models, these multi-stage methods suffer from slow inference speed, making them less suitable for real-time surgical applications. The performance of the downstream tasks also degrades from inheriting errors of the earlier modules of the pipeline. This work develops a detector-free gradient-based localized feature extraction approach that enables end-to-end model training for downstream surgical tasks such as report generation and tool-tissue interaction graph prediction. We eliminate the need for object detection or region proposal and feature extraction networks by extracting the features of interest from the discriminative regions in the feature map of the classification models. Here, the discriminative regions are localized using gradient-based localization techniques (e.g. Grad-CAM). We show that our proposed approaches enable the real-time deployment of end-to-end models for surgical downstream tasks. We extensively validate our approach on two surgical tasks: captioning and scene graph generation. The results prove that our gradient-based localized feature extraction methods effectively substitute the detector and feature extractor networks, allowing end-to-end model development with faster inference speed, essential for real-time surgical scene understanding tasks.

Other authors
See publication
Rethinking Feature Extraction: Gradient-Based Localized Feature Extraction for End-To-End Surgical Downstream Tasks

IEEE Robotics and Automation Letters October 25, 2022
Several approaches have been introduced to understand surgical scenes through downstream tasks like captioning and surgical scene graph generation. However, most of them heavily rely on an independent object detector and region-based feature extractor. Encompassing computationally expensive detection and feature extraction models, these multi-stage methods suffer from slow inference speed, making them less suitable for real-time surgical applications. The performance of the downstream tasks…

Several approaches have been introduced to understand surgical scenes through downstream tasks like captioning and surgical scene graph generation. However, most of them heavily rely on an independent object detector and region-based feature extractor. Encompassing computationally expensive detection and feature extraction models, these multi-stage methods suffer from slow inference speed, making them less suitable for real-time surgical applications. The performance of the downstream tasks also degrades from inheriting errors of the earlier modules of the pipeline. This work develops a detector-free gradient-based localized feature extraction approach that enables end-to-end model training for downstream surgical tasks such as report generation and tool-tissue interaction graph prediction. We eliminate the need for object detection or region proposal and feature extraction networks by extracting the features of interest from the discriminative regions in the feature map of the classification models. Here, the discriminative regions are localized using gradient-based localization techniques (e.g. Grad-CAM). We show that our proposed approaches enable the real-time deployment of end-to-end models for surgical downstream tasks. We extensively validate our approach on two surgical tasks: captioning and scene graph generation. The results prove that our gradient-based localized feature extraction methods effectively substitute the detector and feature extractor networks, allowing end-to-end model development with faster inference speed, essential for real-time surgical scene understanding tasks.

Other authors
See publication
Rich Feature Distillation with Feature Affinity Module for Efficient Image Dehazing

Optik July 18, 2022
Single-image haze removal is a long-standing hurdle for computer vision applications. Several works have been focused on transferring advances from image classification , detection, and segmentation to the niche of image dehazing, primarily focusing on contrastive learning and knowledge distillation. However, these approaches prove computationally expensive, raising concern regarding their applicability to on-the-edge use-cases. This work introduces a simple, lightweight, and efficient…

Single-image haze removal is a long-standing hurdle for computer vision applications. Several works have been focused on transferring advances from image classification , detection, and segmentation to the niche of image dehazing, primarily focusing on contrastive learning and knowledge distillation. However, these approaches prove computationally expensive, raising concern regarding their applicability to on-the-edge use-cases. This work introduces a simple, lightweight, and efficient framework for single-image haze removal, exploiting rich "dark-knowledge" information from a lightweight pre-trained super-resolution model via the notion of heterogeneous knowledge distillation. We designed a feature affinity module to maximize the flow of rich feature semantics from the super-resolution teacher to the student dehazing network. In order to evaluate the efficacy of our proposed framework, its performance as a plug-and-play setup to a baseline model is examined. Our experiments are carried out on the RESIDE-Standard dataset to demonstrate the robustness of our framework to the synthetic and real-world domains. The extensive qualitative and quantitative results provided establish the effectiveness of the framework, achieving gains of upto 15% (PSNR) while reducing the model size by∼20 times.

Other authors
See publication
Not All Lotteries Are Made Equal

International Conference on Machine Learning (ICML) June 13, 2022
The Lottery Ticket Hypothesis (LTH) states that for a reasonably sized neural network, a sub-network within the same network yields no less performance than the dense counterpart when trained from the same initialization. This work investigates the relation between model size and the ease of finding these sparse sub-networks. We show through experiments that, surprisingly, under a finite budget, smaller models benefit more from Ticket Search (TS).

Other authors
See publication
Global-Reasoned Multi-Task Learning Model for Surgical Scene Understanding

IEEE International Conference on Robotics and Automation (ICRA) February 1, 2022
Other authors
See publication
CADSketchNet - An Annotated Sketch dataset for 3D CAD Model Retrieval with Deep Neural Networks

Special Section on 3DOR2021 - 14th EG 3D Object Retrieval Workshop of the Computers & Graphics Journal July 1, 2021
The paper is accepted for publication in the Special Section on 3DOR2021 - 14th EG 3D Object Retrieval Workshop of the Computers & Graphics Journal.

Other authors
See publication
Audiomer: A Convolutional Transformer for Keyword Spotting

AAAI Conference on Artificial Intelligence, 2022
Other authors
See publication
Global-Reasoned Multi-Task Learning Model for Surgical Scene Understanding

IEEE Robotics and Automation Letters
Other authors
See publication
Introducing Self-Attention to Target Attentive Graph Neural Networks

International Conference on Artificial Intelligence and Signal Processing '22
Other authors
See publication
User-Friendly Waveguide Modes Visualiser

IEEE Microwave Magazine (IEEE MTT-S)
Other authors
See publication

Join now to see all publications

Projects

Fine-Tuning Pre-trained Language Models for Discrete Unit-based ASR

Oct 2023 - Dec 2023

Our research combines self-supervised speech models (S3Ms) and large language models (LLMs) to enhance discrete speech unit-based Automatic Speech Recognition (ASR). By having S3Ms handle acoustic modeling and LLMs manage language modeling, we achieve superior results with up to a 21.7% relative word error rate reduction on LibriSpeech-100 compared to existing baselines.
Smoothed Gradient Descent-Ascent for Min-Max Optimization

Mar 2023 - Jun 2023

- We leverage Smoothed Gradient Descent Algorithm (SGDA) to effectively tackle minimax optimization problems, mitigating convergence issues inherent in non-convex and non-concave scenarios.

- SGDA outperforms Gradient Descent Ascent (GDA) across popular datasets, notably excelling in Generative Adversarial Networks (GANs) where it successfully avoids mode collapse and generates diverse, high-quality samples in MNIST, CIFAR100, and ImageNet200 experiments.

- Explores SGDA's…

- We leverage Smoothed Gradient Descent Algorithm (SGDA) to effectively tackle minimax optimization problems, mitigating convergence issues inherent in non-convex and non-concave scenarios.

- SGDA outperforms Gradient Descent Ascent (GDA) across popular datasets, notably excelling in Generative Adversarial Networks (GANs) where it successfully avoids mode collapse and generates diverse, high-quality samples in MNIST, CIFAR100, and ImageNet200 experiments.

- Explores SGDA's potential with more expressive models and dives deeper into a double-smoothed version of SGDA to improve generated sample diversity while pushing forward the realistic appearance of generated samples.
SurgIQ: Rethinking Visual Question Answering in Surgical Scenes

Feb 2023 - May 2023

- SurgiQ revolutionizes Surgical Visual Question Answering (S-VQA) by leveraging Grad-CAM++ guided features for enhanced instance-level localization.

- This approach eliminates the need for bounding box annotations and multi-stage processing, streamlining S-VQA while improving accuracy.

- SurgiQ integrates Grad-CAM++ guided visual features, and Bio-BERT word embeddings (priors) with VisualBERT-ResMLP, resulting in robust visual-text representations for accurate answers.

-…

- SurgiQ revolutionizes Surgical Visual Question Answering (S-VQA) by leveraging Grad-CAM++ guided features for enhanced instance-level localization.

- This approach eliminates the need for bounding box annotations and multi-stage processing, streamlining S-VQA while improving accuracy.

- SurgiQ integrates Grad-CAM++ guided visual features, and Bio-BERT word embeddings (priors) with VisualBERT-ResMLP, resulting in robust visual-text representations for accurate answers.

- The project simplifies the S-VQA pipeline with efficient inference and a straightforward classification head, offering a promising solution for medical domain challenges.
Compressing Vision Transformers for Low-Resource Visual Learning

Jan 2023 - Apr 2023

- The project aims to optimize Vision Transformers (ViTs) for edge deployment on UAVs, targeting memory-efficient solutions without sacrificing accuracy.

- Knowledge distillation from Swin-v2-T to MobileViT enhances IoU by a substantial 0.16, while ResNet18 + UPerNet improves by 0.12, showcasing the effectiveness of this technique.

- On the Jetson Nano, MobileViT exhibits a 1.8x throughput increase when quantized to half-precision floats (fp16). However, ResNet18 maintains better…

- The project aims to optimize Vision Transformers (ViTs) for edge deployment on UAVs, targeting memory-efficient solutions without sacrificing accuracy.

- Knowledge distillation from Swin-v2-T to MobileViT enhances IoU by a substantial 0.16, while ResNet18 + UPerNet improves by 0.12, showcasing the effectiveness of this technique.

- On the Jetson Nano, MobileViT exhibits a 1.8x throughput increase when quantized to half-precision floats (fp16). However, ResNet18 maintains better runtime due to PyTorch's convolution vs. attention kernel implementation.

- The final framework (distilled and quantized UPerNet + MobileViT) consumes 3742 MB of RAM with 1030 MB of swap on the Jetson Nano, providing a promising framework for edge applications in surveillance, search-and-rescue, and more, with potential further optimization through bare-metal solutions and inference engines.
Attention-Guided Saliency-Enhanced Product Detector

Sep 2022 - Jan 2023

- Designed an efficient, lightweight saliency detection framework (27.184 FPS, 7.45M params) for real-time retail item detection utilizing the attention mechanism for feature aggregation with edge refinement in the frequency domain.

- Achieved a balance between performance and computational efficiency by utilizing masked edge, union, and object attention modules. This approach leads to a significant improvement in SOD performance, with a mean F-measure score of 0.875 on the DUTS dataset…

- Designed an efficient, lightweight saliency detection framework (27.184 FPS, 7.45M params) for real-time retail item detection utilizing the attention mechanism for feature aggregation with edge refinement in the frequency domain.

- Achieved a balance between performance and computational efficiency by utilizing masked edge, union, and object attention modules. This approach leads to a significant improvement in SOD performance, with a mean F-measure score of 0.875 on the DUTS dataset and significant improvements on in-house product datasets.

- Designed an adaptive pixel intensity loss function that helps to improve the robustness of the network to noisy labels and improve local-global structure awareness.
Exploring the Lottery Ticket Hypothesis in Neural Networks

Dec 2021 - Feb 2022

The Lottery Ticket Hypothesis states that neural networks contain much smaller sparse subnetworks capable of training to the same (or better) accuracy than the full network.

- Applied Layer-Adaptive Magnitude Pruning (LAMP) to Lottery Ticket Hypothesis.

- Showed that LAMP can be used to find Winning Tickets and analyze the distribution of weights.

- Analyzed the expressive power of pruned networks with Trajectory Length and Activation Patterns v/s. their dense counterparts.

See project
Algorithmic applications in 5G MIMO Systems

Sep 2021 - Dec 2021
The contribution in this project is an experimental overview of designing algorithms from both software paradigms that are leveraged to provide solutions for various applications involving MMIMO technology and antenna design in 5G systems. Specifically:

- A rule-based method for Obstacle Detection. This method involves MMIMO radar and the theory of cognition. (Software 1.0)

- Learning algorithms for MIMO-based Antenna Applications - Machine Learning and Deep Learning algorithms…

The contribution in this project is an experimental overview of designing algorithms from both software paradigms that are leveraged to provide solutions for various applications involving MMIMO technology and antenna design in 5G systems. Specifically:

- A rule-based method for Obstacle Detection. This method involves MMIMO radar and the theory of cognition. (Software 1.0)

- Learning algorithms for MIMO-based Antenna Applications - Machine Learning and Deep Learning algorithms used for Human Activity Classification, using human movement data captured by a radar device. (Software 2.0)

𝐔𝐧𝐝𝐞𝐫 𝐑𝐞𝐯𝐢𝐞𝐰 𝐚𝐭 𝐈𝐄𝐄𝐄 𝐀𝐧𝐭𝐞𝐧𝐧𝐚𝐬 𝐚𝐧𝐝 𝐖𝐢𝐫𝐞𝐥𝐞𝐬𝐬 𝐏𝐫𝐨𝐩𝐚𝐠𝐚𝐭𝐢𝐨𝐧 𝐋𝐞𝐭𝐭𝐞𝐫𝐬

Other creators
AI For Good - The "No Poverty Challenge"

Aug 2021 - Oct 2021

- Analyzed extreme poverty population across the globe while comparing the efforts of the United Nations and other partners towards eradicating poverty.

- Studied patterns in the data to draw inferences in an Unsupervised manner, building a predictive model to identify funding required by countries.

𝐑𝐞𝐜𝐞𝐢𝐯𝐞𝐝 𝐚𝐧 𝐇𝐨𝐧𝐨𝐫𝐚𝐫𝐲 𝐦𝐞𝐧𝐭𝐢𝐨𝐧 𝐟𝐨𝐫 𝐭𝐡𝐞 𝐢𝐧𝐧𝐨𝐯𝐚𝐭𝐢𝐯𝐞 𝐬𝐨𝐥𝐮𝐭𝐢𝐨𝐧 𝐩𝐫𝐨𝐩𝐨𝐬𝐞𝐝, 𝐚𝐦𝐨𝐧𝐠𝐬𝐭 𝟏𝟎𝟎+ 𝐩𝐚𝐫𝐭𝐢𝐜𝐢𝐩𝐚𝐧𝐭𝐬.
Learning with Retrospection

Apr 2021 - May 2021

The first PyTorch Implementation of Learning with Retrospection.

- Learning with retrospection (LWR) makes use of the learned information in the past epochs to guide the subsequent training.

- LWR is a simple yet effective training framework to improve accuracies, calibration, and robustness of DNNs without introducing any additional network parameters or inference cost, only with a negligible training overhead.

𝐆𝐢𝐭𝐇𝐮𝐛 𝐂𝐨𝐝𝐞:…

The first PyTorch Implementation of Learning with Retrospection.

- Learning with retrospection (LWR) makes use of the learned information in the past epochs to guide the subsequent training.

- LWR is a simple yet effective training framework to improve accuracies, calibration, and robustness of DNNs without introducing any additional network parameters or inference cost, only with a negligible training overhead.

𝐆𝐢𝐭𝐇𝐮𝐛 𝐂𝐨𝐝𝐞: https://github.com/The-Learning-Machines/LearningWithRetrospection

See project
MiraiShield

Apr 2021 - May 2021

A steady increase in usage of IoT devices that are more vulnerable to security compromises than desktop computers has led to an increase in the occurrence of IoT-based botnet attacks. To mitigate this new threat, a deep learning model is designed in this project. This deep learning model uses a publicly available dataset for pre-training, which was further fine-tuned on scraped data and smaller datasets. Furthermore, an IoT network was designed, to attack the network with malware and show the…

A steady increase in usage of IoT devices that are more vulnerable to security compromises than desktop computers has led to an increase in the occurrence of IoT-based botnet attacks. To mitigate this new threat, a deep learning model is designed in this project. This deep learning model uses a publicly available dataset for pre-training, which was further fine-tuned on scraped data and smaller datasets. Furthermore, an IoT network was designed, to attack the network with malware and show the effectiveness of the model.

𝐆𝐢𝐭𝐇𝐮𝐛 𝐂𝐨𝐝𝐞 - https://github.com/smj007/MiraiShield
𝐃𝐞𝐦𝐨 𝐕𝐢𝐝𝐞𝐨 - https://www.youtube.com/watch?v=In_BqB0dU_0

See project
Generative Modelling - Architecture Zoo

Sep 2020 - Mar 2021

Developed an experimental codebase for various papers researched, some of which include DCGAN, StyleGAN, InfoGAN, BiGAN, CycleGAN, DiscoGAN and LS-GAN.

𝐅𝐫𝐚𝐦𝐞𝐰𝐨𝐫𝐤𝐬: 𝐏𝐲𝐓𝐨𝐫𝐜𝐡 𝐚𝐧𝐝 𝐓𝐞𝐧𝐬𝐨𝐫𝐅𝐥𝐨𝐰

See project
Waveguide Visualiser

Feb 2021 - Feb 2021

A novel interactive Waveguide Visualizer using Python, depicting the complex Electric and Magnetic Vector fields in TE, TM, TEM modes, for Rectangular and Cylindrical waveguides. Presented to 100+ faculty of IITs and NITs in SPARC-2021. Published on Microwaves101. Was presented in the workshop titled - "Recent Trends on Metamaterial Antennas for Wireless Applications and Deep Learning Techniques", in association with the University of Saskatchewan, Canada.

𝐀𝐜𝐜𝐞𝐩𝐭𝐞𝐝 𝐚𝐭 𝐭𝐡𝐞…

A novel interactive Waveguide Visualizer using Python, depicting the complex Electric and Magnetic Vector fields in TE, TM, TEM modes, for Rectangular and Cylindrical waveguides. Presented to 100+ faculty of IITs and NITs in SPARC-2021. Published on Microwaves101. Was presented in the workshop titled - "Recent Trends on Metamaterial Antennas for Wireless Applications and Deep Learning Techniques", in association with the University of Saskatchewan, Canada.

𝐀𝐜𝐜𝐞𝐩𝐭𝐞𝐝 𝐚𝐭 𝐭𝐡𝐞 𝐈𝐄𝐄𝐄 𝐌𝐢𝐜𝐫𝐨𝐰𝐚𝐯𝐞 𝐌𝐚𝐠𝐚𝐳𝐢𝐧𝐞 (𝐈𝐄𝐄𝐄 𝐌𝐓𝐓-𝐒)

See project
Data-driven Low Altitude UAV Channel Model using Generative Adversarial Networks

Aug 2020 - Feb 2021

- Demonstrated low altitude UAV Channel modeling using the data generated under four distinct environments, and hypothesize the use of a prior distribution of sample channel gains at different launch characteristics of a UAV.

- Introduced the 𝑳𝑨-𝑼𝑨𝑽 𝑪𝒉𝒂𝒏𝒏𝒆𝒍 𝑫𝒂𝒕𝒂𝒔𝒆𝒕, consisting of normalized channel coefficients obtained from the four environments selected.

- We verify the claim of modeling the channel using sample gains at a single angular level by developing a…

- Demonstrated low altitude UAV Channel modeling using the data generated under four distinct environments, and hypothesize the use of a prior distribution of sample channel gains at different launch characteristics of a UAV.

- Introduced the 𝑳𝑨-𝑼𝑨𝑽 𝑪𝒉𝒂𝒏𝒏𝒆𝒍 𝑫𝒂𝒕𝒂𝒔𝒆𝒕, consisting of normalized channel coefficients obtained from the four environments selected.

- We verify the claim of modeling the channel using sample gains at a single angular level by developing a Vanilla GAN.

- Further, we extend the system proposed for modeling the UAV channel to a cGAN, conditioned on launch angle information. We train the cGAN in different environments to ensure robustness to environmental change and any provided prior.

𝐔𝐧𝐝𝐞𝐫 𝐑𝐞𝐯𝐢𝐞𝐰 𝐚𝐭 𝐭𝐡𝐞 𝐏𝐡𝐲𝐬𝐢𝐜𝐚𝐥 𝐂𝐨𝐦𝐦𝐮𝐧𝐢𝐜𝐚𝐭𝐢𝐨𝐧 𝐉𝐨𝐮𝐫𝐧𝐚𝐥
Deep RL agent to beat Atari's Breakout - Modified A3C

Aug 2020 - Aug 2020

Implemented an Asynchronous Advantage actor-critic model (A3C) to ace the Breakout game by Atari, modified with an LSTM for improved experience replay.

𝐆𝐢𝐭𝐇𝐮𝐛 𝐂𝐨𝐝𝐞: https://github.com/smj007/Breakout_A3C

See project
Self Driving Car using Deep Q-Learning

Jul 2020 - Jul 2020

Implemented an ANN to learn a path to be followed in a custom-drawn environment using the Kivy Language. Adopted the Q-Learning approach to solving the task at hand.

𝐆𝐢𝐭𝐇𝐮𝐛 𝐂𝐨𝐝𝐞: https://github.com/smj007/Self-Driving-Car

See project
Luminosity and Contrast Improvement in Color Fundus Images

Feb 2020 - Apr 2020

Worked on Enhanced Fundus Imaging, to improve the Local Contrast and Luminance in Images of the Eye. Implemented the Discrete Wavelet Transform, and further implemented the Curvelet Transform as a part of the algorithmic approach to aid ophthalmologists, in easier detection of Exudates.

𝐏𝐫𝐞𝐬𝐞𝐧𝐭𝐞𝐝 𝐚𝐭 𝐭𝐡𝐞 𝐒𝐢𝐠𝐧𝐚𝐥 𝐏𝐫𝐨𝐜𝐞𝐬𝐬𝐢𝐧𝐠 𝐋𝐚𝐛𝐨𝐫𝐚𝐭𝐨𝐫𝐲, 𝐍𝐈𝐓 𝐓𝐫𝐢𝐜𝐡𝐲
I-Travel: The one-stop travel ticket booking app

May 2017 - Oct 2017

The project’s aim is to create a travel ticket booking application, which allows
customers to book travel tickets with ease. Here the application 𝐈-𝐓𝐫𝐚𝐯𝐞𝐥 is depicted in such
an attractive and user-friendly manner which makes the customer inwardly use the
application.

- By using Netbeans and its efficient feature JFrame as Form, the application is easily
able to get sufficient information from the user and makes the process of ticket booking
easier. Attractive…

The project’s aim is to create a travel ticket booking application, which allows
customers to book travel tickets with ease. Here the application 𝐈-𝐓𝐫𝐚𝐯𝐞𝐥 is depicted in such
an attractive and user-friendly manner which makes the customer inwardly use the
application.

- By using Netbeans and its efficient feature JFrame as Form, the application is easily
able to get sufficient information from the user and makes the process of ticket booking
easier. Attractive backgrounds have also been attached that not only make the project look
presentable but also enhance its functioning by luring more customers.

- The Project is linked with Paytm so that for every referral and ticket booking made by the
users, they gain some amount of money, which is credited to their Paytm account.

- In order to make the application look more official, a unique website for 𝐈-𝐓𝐫𝐚𝐯𝐞𝐥
has been used and the forms have been hyperlinked.

- The usage of various components such as buttons, labels, text fields, radio
buttons, combo boxes, date choosers, and text areas make the form easy to operate and the
functioning is very transparent to the users.

𝐏𝐫𝐞𝐬𝐞𝐧𝐭𝐞𝐝 𝐚𝐭 𝐭𝐡𝐞 𝐂𝐨𝐦𝐩𝐮𝐭𝐞𝐫 𝐒𝐜𝐢𝐞𝐧𝐜𝐞 𝐋𝐚𝐛𝐨𝐫𝐚𝐭𝐨𝐫𝐲, 𝐘𝐮𝐯𝐚𝐛𝐡𝐚𝐫𝐚𝐭𝐡𝐢 𝐏𝐮𝐛𝐥𝐢𝐜 𝐒𝐜𝐡𝐨𝐨𝐥

Honors & Awards

Reviewer

IEEE Transactions on Neural Networks and Learning Systems

Jun 2023

Selected as a reviewer for IEEE Transactions on Neural Networks and Learning Systems (Impact Factor: 14.255), contributing to the advancement of knowledge in the field of applied ML.
Sri. Janardhana Iyengar Memorial Award

National Institute of Technology, Tiruchirappalli

May 2023
Institute Medal, 2022

NIT Trichy

Aug 2022

Gold medalist, rank 1 in the Department of ECE
Research Week with Google

Google Research India

Jan 2022

Shortlisted amongst 10000+ applicants to participate in Research Week with Google. Google Research India is hosting a four-day Research Week with Google. The program structure is to discuss the state of the art of a variety of ML Techniques and also discuss their limitations and critiques.
Dr. A.L Abdus Sattar Memorial Award

National Institute of Technology, Tiruchirappalli

Aug 2021

Outstanding student in 3rd year, ECE
Institute Day Prize Winner (2021) - Rank 1 in ECE

National Institute of Technology, Tiruchirappalli

Aug 2021

Awarded for 2nd year performance - Rank 1 in the Department of ECE
Mentorship Program

Illuminate AI

Aug 2021

Selected for the 2021 cohort of the Mentorship Program at Illuminate AI. Working with peers to target 'AI For Good', developing Machine Learning pipelines to analyze poverty and hunger trends across the world, while deploying a mechanism to help deal with the crisis we are facing.
The Graphics Replicability Stamp Award

Graphics Replicability Stamp Initiative

Aug 2021

Recognition of the service provided to the scientific community by releasing the CADSketchNet dataset for free, non-commercial use.
IASc-INSA-NASI Summer Research Fellowship

Indian National Science Academy

Jun 2021

Nominated for the SRFP, Summer '21
DAAD-WISE Scholarship

DAAD

May 2021

Selected for the WISE program in Germany, which targets Indian students pursuing a degree in Science/Engineering, and wish to do a research internship at a German Institute
AWS Machine Learning Scholarship

AWS, Udacity

Jan 2021
Annual Day Award for Academic Excellence

Yuvabharathi Public School

Jun 2018

State Rank 1 in Informatics Practices (Computer Science), securing 100/100 in boards. Best outgoing student in the Engineering Stream of 2018
Japan Country Ambassador

World Maths Day - 3P Learning

Mar 2011

Was selected as the representative for Japan, to help my peers and spread awareness, about the online competition, to ensure active and healthy participation.
World Rank 2

World Maths Day - 3P Learning

Mar 2011

Finished as the category winner in Asia in a 48-hour long online international speed-math competition, with over 3,000,000 participants from 200+ countries

More activity by Sai Mitheran

The demand for silver is sky-rocketing! 1) Industry demands silver 2) Silver prices go up 3) The cost of manufacturing goes up 4) The price is…

The demand for silver is sky-rocketing! 1) Industry demands silver 2) Silver prices go up 3) The cost of manufacturing goes up 4) The price is…

Liked by Sai Mitheran Jagadesh Kumar
🚀 Introducing Latent Linguist: conversation-ready translation at the tactical edge. In remote and contested environments, reliance on the cloud…

🚀 Introducing Latent Linguist: conversation-ready translation at the tactical edge. In remote and contested environments, reliance on the cloud…

Liked by Sai Mitheran Jagadesh Kumar
I’ve recently joined OpenAI to build out an AI/ML team in a key area of ChatGPT. The first several days have felt like my early days at Google — a…

I’ve recently joined OpenAI to build out an AI/ML team in a key area of ChatGPT. The first several days have felt like my early days at Google — a…

Liked by Sai Mitheran Jagadesh Kumar
Turning a model into a reliable software agent starts with the loop. Codex CLI is our local, cross-platform agent for making high-quality code…

Turning a model into a reliable software agent starts with the loop. Codex CLI is our local, cross-platform agent for making high-quality code…

Liked by Sai Mitheran Jagadesh Kumar
🚀📉 Efficient Inference Just Got a Major Upgrade #NVIDIAResearch We’ve just released Qwen3-8B-DMS-8x fine-tuned for 8x KV cache compression. It…

🚀📉 Efficient Inference Just Got a Major Upgrade #NVIDIAResearch We’ve just released Qwen3-8B-DMS-8x fine-tuned for 8x KV cache compression. It…

Liked by Sai Mitheran Jagadesh Kumar
I'll make it quick: series E, $300m at $5b valuation. Huge revenue growth since the last round 5 months ago with no customer churn. Well-known AI…

I'll make it quick: series E, $300m at $5b valuation. Huge revenue growth since the last round 5 months ago with no customer churn. Well-known AI…

Liked by Sai Mitheran Jagadesh Kumar
I am thrilled to share that I have been awarded the prestigious Pre-doctoral Fellowship from the American Heart Association (AHA) to support my…

I am thrilled to share that I have been awarded the prestigious Pre-doctoral Fellowship from the American Heart Association (AHA) to support my…

Liked by Sai Mitheran Jagadesh Kumar
On the GPTZero controversy: As a past reviewer at ACL, ICLR, AAAI, WSDM, SIGIR, COLM, and an Area Chair at internal conferences, I feel like I can…

On the GPTZero controversy: As a past reviewer at ACL, ICLR, AAAI, WSDM, SIGIR, COLM, and an Area Chair at internal conferences, I feel like I can…

Liked by Sai Mitheran Jagadesh Kumar
Here is the math why your portfolio is not performing. A portfolio grows because stocks grow. Stocks grow: because there is EPS expansion (i.e…

Here is the math why your portfolio is not performing. A portfolio grows because stocks grow. Stocks grow: because there is EPS expansion (i.e…

Liked by Sai Mitheran Jagadesh Kumar
If you are looking for reading material for the upcoming weekend, (the long-promised) Chapter 5 on LLM self-refinement is now finally out in the…

If you are looking for reading material for the upcoming weekend, (the long-promised) Chapter 5 on LLM self-refinement is now finally out in the…

Liked by Sai Mitheran Jagadesh Kumar
It still feels a little unreal to look back at how far vLLM has come. What started as a small project that Zhuohan and I launched ended up receiving…

It still feels a little unreal to look back at how far vLLM has come. What started as a small project that Zhuohan and I launched ended up receiving…

Liked by Sai Mitheran Jagadesh Kumar
Join xAI to build the future of multimodal AI. We're hiring across image editing, video generation, world modeling, visual understanding and…

Join xAI to build the future of multimodal AI. We're hiring across image editing, video generation, world modeling, visual understanding and…

Liked by Sai Mitheran Jagadesh Kumar
Emergent is operating at a speed rarely seen in history: $50M ARR in 7 months. $70M Series B... and yes, we're hiring! Raj Shamani and I discussed…

Emergent is operating at a speed rarely seen in history: $50M ARR in 7 months. $70M Series B... and yes, we're hiring! Raj Shamani and I discussed…

Liked by Sai Mitheran Jagadesh Kumar
I'm often asked how to land a research job at a frontier AI lab like OpenAI. It's hard without a research background, but I like to point to Keller…

I'm often asked how to land a research job at a frontier AI lab like OpenAI. It's hard without a research background, but I like to point to Keller…

Liked by Sai Mitheran Jagadesh Kumar
Stop predicting words one by one. A new paper just broke the next-token rule every LLM follows. It cuts generation steps by 4× and training compute…

Stop predicting words one by one. A new paper just broke the next-token rule every LLM follows. It cuts generation steps by 4× and training compute…

Liked by Sai Mitheran Jagadesh Kumar
I am eager to share that we have officially kicked off the case projects within the Accenture Strategic Business Analytics Chair at ESSEC Business…

I am eager to share that we have officially kicked off the case projects within the Accenture Strategic Business Analytics Chair at ESSEC Business…

Liked by Sai Mitheran Jagadesh Kumar

View Sai Mitheran’s full profile

See who you know in common
Get introduced
Contact Sai Mitheran directly

Join to view full profile

Other similar profiles

Sonu Lohar

Sonu Lohar

Pune

Connect
Rajni Arora

Rajni Arora

Bengaluru

Connect
Hina S.

Hina S.

Pune

Connect
Kshitij Bansal

Kshitij Bansal

Dubai, United Arab Emirates

Connect
Naman Kapur

Naman Kapur

Gurgaon

Connect
Vishal Sharma

Vishal Sharma

Gurugram

Connect
Sourav Aggarwal

Sourav Aggarwal

New Delhi

Connect
Aishwarya Jadhav

Aishwarya Jadhav

Mountain View, CA

Connect
Brijesh Saxena

Brijesh Saxena

Karnataka, India

Connect
SOWMYA AARI

SOWMYA AARI

Bengaluru

Connect
Himani Gulati

Himani Gulati

Bloomington, IN

Connect
Sri Hari

Sri Hari

Hyderabad

Connect
Vijay Prakash Saini

Vijay Prakash Saini

Bengaluru

Connect
Dhilip maharish

Dhilip maharish

Madurai

Connect
Rahul Dubey

Rahul Dubey

Gurugram

Connect
Mayank Dawar

Mayank Dawar

Bengaluru

Connect
Surya Teja Gutta

Surya Teja Gutta

Netherlands

Connect
Sharoon Hussain

Sharoon Hussain

Bengaluru

Connect
Aman Shrivastava

Aman Shrivastava

Bengaluru

Connect
Yogesh Tekale

Yogesh Tekale

Maharashtra, India

Connect

Explore top content on LinkedIn

Find curated posts and insights for relevant topics all in one place.

View top content

Add new skills with these courses

See all courses

Sai Mitheran Jagadesh Kumar

Princeton, New Jersey, United States 4K followers 500+ connections

About

Activity

You make Claude work hard. I let it play Pokemon Red for 10 hours. Along the way, Karthik and I wrote the field guide to RL environments—what they…

Liked by Sai Mitheran Jagadesh Kumar

1987: AI can't win at chess—planning is uniquely human 1997: AI can't win at Go—intuition is uniquely human 2016: AI can't win at poker—bluffing is…

Liked by Sai Mitheran Jagadesh Kumar

Transformers v5 is officially out 🔥🎉 With v5, Transformers is cementing its position as the ground truth for model architecture definitions…

Liked by Sai Mitheran Jagadesh Kumar

Experience

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

Education

4.0/4.0

-

-

Licenses & Certifications

AWS Machine Learning Scholarship

Volunteer Experience

Reviewer

Community Volunteer

Hope, NIT Trichy

Community Volunteer

Hope Organisation

Teaching Volunteer

Student Volunteer

Co-Founder

GAIA Organization

Student Volunteer

ICML

Tennis Lead and Mentor

National Sports Organisation

Publications

IEEE International Conference on Robotics and Automation (2023) January 31, 2023

IEEE Robotics and Automation Letters October 25, 2022

Optik July 18, 2022

International Conference on Machine Learning (ICML) June 13, 2022

IEEE International Conference on Robotics and Automation (ICRA) February 1, 2022

Special Section on 3DOR2021 - 14th EG 3D Object Retrieval Workshop of the Computers & Graphics Journal July 1, 2021

AAAI Conference on Artificial Intelligence, 2022

IEEE Robotics and Automation Letters

International Conference on Artificial Intelligence and Signal Processing '22

IEEE Microwave Magazine (IEEE MTT-S)

Projects

Fine-Tuning Pre-trained Language Models for Discrete Unit-based ASR

Oct 2023 - Dec 2023

Smoothed Gradient Descent-Ascent for Min-Max Optimization

Mar 2023 - Jun 2023

SurgIQ: Rethinking Visual Question Answering in Surgical Scenes

Feb 2023 - May 2023

Compressing Vision Transformers for Low-Resource Visual Learning

Jan 2023 - Apr 2023

Attention-Guided Saliency-Enhanced Product Detector

Sep 2022 - Jan 2023

Dec 2021 - Feb 2022

Algorithmic applications in 5G MIMO Systems

Sep 2021 - Dec 2021

AI For Good - The "No Poverty Challenge"

Aug 2021 - Oct 2021

Apr 2021 - May 2021

Princeton, New Jersey, United States
4K followers 500+ connections