// AI Engineer × Researcher

DHRUV.
KUMAR

Multimodal  ·  AI/ML  ·  NLP  ·  Computer Vision

Let's Collaborate →

ABOUT

photo

I am an AI Engineer and Researcher from India with 1+ years of professional experience. Passionate about building large-scale systems and advanced AI, pursuing cutting-edge research in Deep Learning, Multimodal AI, NLP & Computer Vision, developing multi-agent frameworks, knowledge-retrieval pipelines, and data-driven models.

Currently working at SRIC, IIT Kharagpur, I focus on bridging research with real-world applications through scalable AI systems built with Python and modern ML frameworks. I enjoy solving complex problems with a competitive programming mindset and exploring how AI can transform ideas into impactful technology.

WHAT I DO

[ AI Development ]

Building Intelligent Systems

Developing ML models, AI agents, and intelligent automation tools. Focused on designing scalable AI systems, working with LLMs, deep learning, and data-driven models to solve real-world problems.

// Languages

Python
JS
SQL

// ML Frameworks

PyTorch
TF
Keras
sklearn
OpenCV

// Libraries

NumPy
Pandas
Matplotlib
Sb
NLTK
REQ
BS4
ONNX

[ Research ]

Advanced ML Research

Working on multimodal AI, multi-agent systems, and applied machine learning research to push the boundaries of intelligent systems. Focused on experimenting with novel architectures, data-driven modeling, and bridging academic research with practical AI applications.

// AI / LLM & Frameworks

HF
LangChain
FastAPI
Streamlit
Poetry
GEE

// Developer Tools

Git
GitHub
Docker
GCP
MySQL
MongoDB
Django
Linux

// Focus Areas

LLMsNLPComputer Vision TransformersGANsDiffusion Fine-TuningMulti-AgentMultimodal

EXPERIENCE

Indian Institute of Technology Kharagpur

Kharagpur, West Bengal, India

Jul 2025 – Present Contract

Junior Project Engineer

  • Authored an ISRO research proposal on interpretable crop health monitoring and early disease detection using hyperspectral satellite imagery, proposing a contrastive learning framework to address satellite–ground resolution mismatch.
  • Implemented RGB-to-Hyperspectral reconstruction architectures to reduce hyperspectral acquisition costs and leveraged Google Earth Engine for large-scale agricultural data extraction and preprocessing.
  • Developed a multimodal persuasive image ranking framework by fine-tuning vision–language models on a 450-group human-evaluated benchmark dataset, resulting in a research manuscript currently in preparation.
  • Implemented and benchmarked 7 state-of-the-art Hindi machine-generated text detection models on curated and paraphrased datasets generated using Gemini, Llama, Qwen, and GPT-4o, enabling cross-LLM evaluation across multiple generation scenarios; results documented in a manuscript under review at SIGIR 2026 Reproducibility Track.
  • Designed a hybrid Hindi MGT detection architecture combining middle-layer embeddings with KL-divergence signals from two auxiliary LLMs, achieving ~95% detection accuracy across evaluation setups.
ResearchNLPLLMs Computer VisionGoogle Earth EngineMGT Detection

Prodigal AI Technologies

New Delhi, India

Sep 2024 – Dec 2024 Internship

Machine Learning Intern [PyPI] [Source Code] [Web App]

  • Accelerated LinkedIn content production by 40% by developing Co-Agent, an open-source multi-agent conversational framework using Google Gemini API for blog-to-post transformation.
  • Processed 6,000+ blog posts via full-stack NLP pipeline: Web Scraping, Data Cleaning, API Integration, Summarisation, AI Agent-based Validation, and LinkedIn-specific Formatting.
  • Managed a 5-intern team for R&D; launched a PyPI package with 3,500+ downloads in 3 months, backed by a proof-of-concept WebApp.
Multi-AgentLangChain Gemini APINLPPyPI

ISRO – SDSC SHAR

Sriharikota, Andhra Pradesh, India

Jul 2024 – Aug 2024 Internship

Machine Learning Intern [View Website]

  • Achieved 97.8% accuracy (↑10%) and 95.3% recall by designing a "Multi-model ML Architecture" ensembling ANN & CNN for space-domain classification.
  • Reduced false negatives to 4.7% via class balancing, EDA, feature engineering, normalisation, and augmenting 90K+ samples.
  • Deployed a scalable Research-Aid SaaS with real-time API integration via Django REST Framework in collaboration with 5 Astronomers, a full MLOps pipeline.
CNNANN MLOpsDjango RESTSaaS

PROJECTS

SPECIOUS

Mar–Jun 2025

Spectral Perturbation Engine for Contrastive Inference Over Universal Surrogates

snookydru.github.io/SPECIOUS

SPECIOUS — Spectral Perturbation Engine for Contrastive Inference Over Universal Surrogates

A universal, multi-model defensive technique that embeds imperceptible high-frequency perturbations into the luminance (Y) channel of YCbCr representations — invisible to humans but systematically degrades feature embeddings across ResNet-50, CLIP ViT-B/32, and other surrogate models, preventing generative AI from reproducing copyrighted artistic styles.

✓ >80% fooling — ResNet-50 ✓ >70% — CLIP zero-shot ✓ LPIPS < 0.01 ✓ CLIP similarity ↓ ~0.345
  • Designed SpeciousLoss — a novel bi-objective loss minimising LPIPS perceptual distance while maximising surrogate feature distortion, ensuring adversarial stealth and efficacy simultaneously.
  • U-Net generator operating in Fourier domain with a learnable high-pass mask, concentrating perturbations on high-frequency edges & textures — components generative AI exploits most.
  • Outperforms Glaze & Nightshade by being label-agnostic, multi-model, and frequency-domain aware, without requiring target-specific training.

SnookyDru / SPECIOUS

Loading repository data...

Hangman Transformer

2024

Encoder-Decoder Transformer for the Hangman Word-Guessing Game

  • Trained a full encoder-decoder Transformer ("Attention is All You Need" architecture) end-to-end to play Hangman — the encoder ingests the partially revealed word (blanks + guessed letters), while the decoder auto-regressively predicts the most probable missing letter at each step.
  • Framed letter-guessing as a sequence-to-sequence masked language modelling task: blank positions act as mask tokens, enabling the model to leverage global context from the entire word rather than relying on n-gram statistics or frequency tables.
  • Employs positional encoding + multi-head self-attention so the model attends across all positions simultaneously, capturing long-range character co-occurrence patterns that rule-based bots cannot model.
  • Achieved a significantly higher win rate than traditional frequency-based Hangman solvers, demonstrating that sequence modelling with attention is highly effective for combinatorial word-completion tasks.

SnookyDru / Hangman-Encoder-Decoder-Transformer

Loading repository data...

View All Projects on GitHub →

PUBLICATIONS

SIGIR paper
SIGIR 2026 · Reproducibility Track Under Review

Understanding the Challenges in Detecting Machine Generated Hindi Text

Dhruv Kumar et al. · SIGIR 2026, Melbourne | Naarm, Australia

Comprehensive evaluation of SOTA MGT detection methods adapted for Hindi across 6 RQs. Benchmarks XLM-RoBERTa, RADAR, ConDA, IndicBERT, DetectGPT, GPTZero, and Text Fluoroscopy on Hindi datasets with Gemini 1.5, Llama 3, and Qwen3. ConDA was the most stable across domains; zero-shot detection fundamentally fails for Hindi due to perplexity inversion.

[Conference]
cryptoQA paper
ACM · FIRE 2024 Published

Overview of CryptOQA: Opinion Extraction and Question Answering from CryptoCurrency-Related Tweets and Reddit Posts

Dhruv Kumar, Gourav Sen, Sougata Sarkar, Subham Kumar Nigam, Koustav Rudra, Kripabandhu Ghosh · FIRE 2024, Gandhinagar, India

Overview of the CryptOQA shared task at FIRE 2024, evaluating post classification and QnA on cryptocurrency social media (Twitter & Reddit). Tasks include 8-class opinion classification and question-answer relevance detection. Transformer models (RoBERTa, XLM-RoBERTa) outperformed baselines; prompt-based GPT-4-Turbo approaches led QnA performance.

[ACM DL]
Algae paper
IJERT · Vol. 13, Issue 05 · May 2024 Published

Machine Learning Methods: Application for Amido Black Dye Adsorption Prediction on to Green Algae Powder-Activated Carbon

Sai Venkata Surya Punugoti, Dhruv Kumar, Meena Vangalapati · GGSIPU & Andhra University

Applied ML methods to predict amido black dye adsorption onto green algae powder-activated carbon adsorbent. Demonstrates ML applications in environmental engineering for sustainable wastewater treatment solutions.

[IJERT] [PDF]

CERTIFICATIONS

naukri young turks certificate

Naukri Campus Young Turks

Naukri Campus · Oct 2024

95.25% percentile — India's Largest Skill Contest

[View Certificate]
AI Summarizer app certificate

Project-Based Learning: Build an AI Text Summarizer App

Postman · Jun 2024

Skills: Python · HTML5 · CSS · ML · AI

[Verify Credential]
JETM certificate

Certification of Publication — JETM

Elsevier · May 2024 · ID: JETM/8210

Extra Tree Regression Model for Atrazine Herbicide Removal using ML

[View Certificate]
ICMSMT certificate

Research Paper Presentation — ICMSMT 2024

Diligentec Solutions · May 2024

Skills: Research · AI · Presentation · Communication

[View Certificate]
postman certificate

Postman API Fundamentals Student Expert

Postman · May 2024 · ID: 6658cf44

Skills: APIs · Computer Science

[View Badge]

BLOGS

Medium Feb 2024 · 7 min read

"Viola-Jones Algorithm" ~ A Miracle

A deep dive into the landmark 2001 CV paper by Paul Viola and Michael Jones — HAAR features, integral images, AdaBoost, and cascaded classifiers explained with clarity and storytelling.

Read on Medium →
Medium 2023

Detecting the RGB Color from a Webcam using Python OpenCV

A hands-on guide to extracting real-time RGB colour values from a webcam feed using Python and OpenCV — a practical project walkthrough for CV beginners.

Read on Medium →
Medium 2023

My First Case Study

An introspective first case study exploring problem-solving, analytical thinking, and applying ML concepts to real-world scenarios.

Read on Medium →
Read More on Medium →

DSA & COMPETITIVE PROGRAMMING

pvt.dhruvkumar

Loading...

Rating
Max Rating
Contests
Solved
View on Codeforces ↗
Loading activity...
Less More

EDUCATION

B.Tech — Artificial Intelligence & Machine Learning

Guru Gobind Singh Indraprastha University, New Delhi

Aug 2022 – Jul 2025 🏆 $500 HANA Bank Scholarship

Diploma — Computer Engineering

Ambedkar DSEU Shakarpur Campus 1, New Delhi

Aug 2017 – Jul 2020