INITIALIZING SYSTEM...
Available for opportunities

AdityaVelpula

·
RAG PipelinesLLM SystemsCloud Data Infrastructure
25,565
Chunks Processed
1,192
Policy Sources
75-80%
LLM Cost Reduction

Idon'tjustanalyzedata.Iarchitectsystemsthatthink.

M.S. Data Analytics · George Mason University · 2024 to 2026

Building AI Systems That Scale

1,192 Policy Sources → Actionable Intelligence

SYSTEM PROFILE
ENGINEERAditya Velpula
FOCUSIntelligent Systems
DOMAINSRAG • LLM • Cloud Infra
EDUCATIONM.S. Data Analytics (GMU)
STATUSACTIVE

Specializing in building scalable intelligent systems that bridge raw data and actionable insight. From RAG architectures that process thousands of policy documents to LLM orchestration systems that cut costs by 80%. I engineer solutions at the intersection of AI and infrastructure.

ACT III

The Architecture

Roles where I built intelligence systems from first principles. Not dashboards, not demos. Real systems with real impact.

FEATURED ROLE · Feb 2026 to Present

AI Engineer

DAPSE · NSI · Arlington, VA
Built Arctic policy intelligence system using RAG architecture for legal and strategic analysis across 9 nations.

Hybrid RAG Pipeline

Hybrid retrieval combining BM25 lexical search with FAISS dense vector search and neural reranking. Multi-tier LLM verification system ensures factual accuracy with citation tracking.

Built intelligence system for JAG officers analyzing Arctic policy across international legal frameworks.

1,192 Policy DocumentsRAW INPUTChunking Engine25,565 CHUNKSVector StoreFAISS + BM25Hybrid RetrievalDENSE + LEXICALMulti-tier LLM VerificationFACT CHECKStructured Response + CitationsOUTPUTUser QueryINPUT

Graduate Teaching Assistant

George Mason University · Fairfax, VA
Aug 2025 to Present

Mentored 50+ students in data analytics, machine learning, and statistical methods.

0+
Students Mentored
PythonRSQLTableau

AI Engineer

Indgeos Geospatial · Telangana, India
Nov 2023 to Jul 2024

Built geospatial ML pipelines and full-stack web applications for location intelligence.

PythonTensorFlowAWSReactPostgreSQL
ACT III · PROOF

Selected Projects

Each of these was shipped end-to-end. Click any card to see the architecture, metrics, and stack.

SLA · 78%
ITSM Analytics · Oct to Nov 2025

Ticket Resolution & SLA Breach Prediction

End-to-end ITSM analytics pipeline predicting ticket resolution time and flagging SLA breach risk before closure. Built on a realistic 5,000-ticket synthetic dataset simulating ServiceNow/Jira logs. Gradient-boosting models beat baselines for both regression and classification; results surface through a Power BI dashboard for proactive service management.

5,000+
Tickets Modelled
PythonXGBoostScikit-learnPandasPower BI
Climate Data × Machine Learning

Wildfire Risk Prediction

ML pipeline fusing MODIS satellite fire data, NOAA climate variables, and NDVI vegetation indices to predict wildfire risk. Random Forest + XGBoost with careful feature engineering reached AUC-ROC 0.99. Python visualisations of high-risk zones and key predictors (elevation, humidity, thermal anomalies) support proactive response.

0.99
AUC-ROC
PythonXGBoostRandom ForestGeoPandasNOAA
DIKW-Driven IOU vs Non-IOU Pricing Study

U.S. Electricity-Rate Analytics

Analysed 320K+ electricity rate records (2020 to 2023) through the DIKW framework, using Python, SQL, and statistical testing (t-tests, regression) to expose material pricing differences between IOU and Non-IOU utilities across sectors and states. Cluster models + forecasts highlight geographic trends and inflation effects for regulators.

320,000+
Records Analysed
PythonSQLStatistical ModelingClusteringForecasting
AV·20260.97 CONF
Real-Time YOLO + OCR Pipeline

License Plate Detection

Real-time license plate recognition combining YOLO object detection, OCR, and OpenCV image preprocessing. Peak detection accuracy achieved through dataset augmentation, bounding-box refinement, and localisation tuning. Senior-year capstone.

PythonYOLOOCROpenCVComputer Vision
90%
91%
92%
93%
94%
95%
96%
97%
98%
Collaborative + Content + Sentiment

Hybrid Movie Recommender

Hybrid recommendation engine blending content-based filtering, collaborative filtering, sentiment analysis, and Jaccard similarity for diverse personalised suggestions. Responsive Flask web app with real-time search, tunable parameters, and evaluation metrics for diversity, novelty, and serendipity.

PythonFlaskRecommender SystemsNLPSentiment Analysis
S3GlueAthenaBI
End-to-End AWS Data Pipeline

Obesity Risk Analytics

Cloud-native data pipeline for health-risk prediction using AWS services. Raw records flow through S3 → Glue ETL → Athena queries → QuickSight dashboards, enabling real-time risk scoring and visualisation.

AWSS3GlueAthenaQuickSight
Virtual Addiction-Support Platform

Support Circle

Full-stack virtual support platform helping people fight addictions. React frontend for a responsive chat-first UX, Python backend handling auth, data management, and secure real-time messaging + notifications so users get continuous peer support during recovery.

ReactPythonFlaskWebSocketsAuthentication
WHAT I WORK WITH

Skills & Stack

The tools I use daily, grouped by domain. Bars indicate depth. Primary marks my daily-driver, Working denotes hands-on familiarity.

Total Skills
0
Expert / Primary
0
Advanced
0
Domains
0
PrimaryExpertAdvancedProficientIntermediateWorking

AI / ML & Data

Production AI systems and orchestration
12
LLMs
GPT-4, Claude, fine-tuning
Expert
RAG
Production RAG systems
Expert
LangChain
Agent orchestration
Expert
FAISS
Vector search at scale
Expert
Pinecone
Managed vector index
Advanced
Hugging Face
Transformers, model hub
Advanced
PyTorch
Research + production
Advanced
TensorFlow
Model training
Advanced
scikit-learn
Classical ML pipelines
Expert
XGBoost
Gradient boosting
Advanced
NLP
Text processing pipelines
Advanced
OpenAI API
Completions, embeddings, tools
Expert

Languages

Daily-driver and supporting
08
Python
5+ years, every project
Primary
SQL
Complex queries, optimization
Advanced
TypeScript
Typed full-stack web
Proficient
JavaScript
Full-stack web
Proficient
C++
Systems programming
Proficient
Java
Enterprise applications
Proficient
R
Statistical analysis
Proficient
Bash
Automation, ops scripts
Proficient

Cloud & Infrastructure

AWS-certified, cloud-native delivery
09
AWS
Certified, primary platform
Advanced
S3 / Glue / Athena
Lake-house pipelines
Advanced
Lambda / API Gateway
Serverless APIs
Advanced
Docker
Containerized deployments
Advanced
Kubernetes
Orchestration
Intermediate
Terraform
Infrastructure-as-code
Intermediate
CI / CD
GitHub Actions, deploys
Advanced
Azure
Working knowledge
Working
GCP
Working knowledge
Working

Tools & Frameworks

Building, shipping, monitoring
12
FastAPI
Production APIs
Expert
Flask
Lightweight services
Advanced
Pandas
Data manipulation
Expert
NumPy
Numerical computing
Expert
Plotly / Matplotlib
Visualization
Advanced
Power BI / Tableau
BI dashboards
Advanced
Git
Branching, code review
Advanced
Spark
Distributed computing
Intermediate
Airflow
Workflow orchestration
Intermediate
MLflow
Experiment tracking
Intermediate
PostgreSQL
Relational data
Advanced
Redis
Caching, sessions
Intermediate

Let'sbuildintelligentsystemstogether.

Master of Science in Data Analytics · George Mason University · 2024 to 2026