Sai Dheeraj Gummadi

Machine Learning Engineer @ Motorola | Ex-Deep Learning Engineer @ Brane | Ex-Data Scientist @ HighRadius (Resume)


Jul 2024 - Present
• AI Software Engineer, Working on Designing and Implementing Inhouse MultiModal RAG System for Internal Employees to understand different products based on their documentations(Text, Image, Video formats).
Nov 2023 - Jul 2024
• As a Deep Learning Engineer at Brane, I Designed and Developed the pipeline for freeflow text based No Code Low Code Platform using Natural Language Processing techniques and LLMs which can build the complete solution from the given set of sentences in natural language
• Synthesised data of workflows in different domains using ChatGPT, LLama2, and Mistral for training T5/BART/GPT models for coreference resolution, paraphrasing, splitting complex sentences, span extraction
• Implemented knowledge distillation to create distilled versions of LLMs and deployed the models as RESTful APIs using Docker and Ray Serve.
• Training BART for paraphrasing and Splitting the complex sentences.
• Fine-Tuned GPT2 (1.5 billion parameter) for next task prediction using LORA and Bitsandbytes on multiple GPUs (Tesla V100) using model and data parallelization. A virtual assistant tailored for solution developers, aimed at enhancing their productivity.
• Deployed the model with vLLM Inference Engine and achieved low-latency (0.06 sec) and high-throughput (71 rps) inferences by optimizing real-time interactions.
• Prototyping RAG based approach for next task prediction using LlamaIndex for getting less hallucinated responses from LLMs.
Aug 2021 - Oct 2023
• As Data Scientist at HighRadius, Designed and Developed an end-to-end pipeline for a Financial Document Scanner product using CNN, YOLO, and OCR for extracting the required tabular regions like balance sheets, and income statements from Financial Statements and analyzing the customer’s credit risk.
• Trained and Improved the 6-layered unconventional CNN using Explainable AI tools like Lime and GradCam
• Achieved an overall automation rate of 80% and Reduced 40% of the time spent on manual analysis of financial documents with 100–200 pages.
• Improved the latency of the Generic Parser product which is one point solution for parsing Claims/Invoices/Remittences/ POD’s using NLP transformer-based quantized LayoutLM token classification model and unsupervised algorithms with an overall accuracy of 90% and achieved automation rate of 75%.
• Worked on POC for classifying the pages in input documents into Sales Invoice, Remittance, Check, and Claim for improving the accuracy of downstream parsing modules by Fine Tuning the LayoutLM Sequence Classification model for classifying the images based on both content and layout.
• Developed Payment date prediction models using regression models like Random Forest / XGBoost for FORTUNE 500 CPG companies to predict invoice payment date with an average +/-3 Day accuracy of 85%.
2023 - 2024
Master of Science in Data Science at the International University of Applied Sciences, Berlin, Germany. I studied masters parallely while working as data scientist at HighRadius.
2018 - 2022
B.Tech at the SRM University, AP with a major in Electronics and Communication Engineering. I Published 8 conference papers on deep learning published in IEEE, Springer, Scopus databases and Patented a pipeline for disease diagnosis under Anirban Ghosh.
publications
International Conference on Intelligent Systems and Applications, 2023
G.S.Dheeraj, Anirban Ghosh.
International Conference on Innovations in Computational Intelligence and Computer Vision, 2023
G.S.Dheeraj, Anirban Ghosh
World Conference on Applied Intelligence and Computing, 2022
G.S.Dheeraj, Anirban Ghosh.
International Symposium on Digital Forensics and Security, 2022
G.S.Dheeraj, Anirban Ghosh, V.Yeswanth
International Conference on Computational Intelligence and Communication Networks, 2021
G.S.Dheeraj, V.Yeswanth, Anirban Ghosh, P.N.Kartheek, K.A.Krishna.
International Conference on Computing, Communication, and Networking Technologies, 2021
G.S.Dheeraj, Anirban Ghosh
Ripublications, International Journal of Engineering Research and Technology (Journal)
Ch.S.Nived, A.R.Kumar, G.S.Dheeraj, P.Jithendra
IRJET (Journal), 2020
G.S.Dheeraj, K.B.V.Lakshmi, K.A.Krishna

Also on Google Scholar
patents
S. D. Gummadi, V. Yeswanth, A. Ghosh, P. N. Karhteek, K. A. Krishna. 2022. "A System for Analyzing Medical Images". India Patent 202241049544. filed (August 30, 2022), and published (September 16, 2022).
V. Lohith, S. D. Gummadi, S. Niladri, S. Nilotpal, G. Ansumun, A. Pratyush. 2023 "Machine Learning based Systems and Methods for Data Extraction from Financial Statements". US Patent 18/396,772. filed (2023).
featured talks
pet projects