Skip to content
View SunilReddy93's full-sized avatar
  • Bangalore

Block or report SunilReddy93

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
SunilReddy93/README.md

Sunil Reddy

Senior Java Backend Engineer

Banking · Insurance · Fintech · 7+ Years

I design and build distributed backend systems that handle real financial workloads — concurrent transactions, fraud detection, event-driven pipelines, and AI-augmented decision making. My work sits at the intersection of enterprise Java engineering and applied AI integration.

LinkedIn Email


What I Work On

I specialise in the hard parts of backend engineering — designing systems that stay correct under concurrency, stay available under failure, and stay maintainable as they grow. Most of my production experience is in the BFSI domain, where data integrity and regulatory compliance are non-negotiable.

Over the past year I've been going deep on AI integration — not surface-level API calls, but production patterns: RAG pipelines with vector search, agentic systems with tool-calling, circuit breakers on LLM calls, caching strategies for AI responses. The goal is AI that behaves predictably in a distributed system, not just a chatbot bolted onto an API.


Tech Stack

Core

Java Spring Boot Spring Security Spring AI

Data

PostgreSQL pgvector MySQL Oracle Redis

Infrastructure

Apache Kafka Docker Kubernetes Jenkins

AI & Resilience

Groq Ollama Resilience4j


Projects

🧠 Agentic AI + RAG Claims Processing System

→ View Repository

A microservices platform where an AI agent autonomously evaluates insurance claims using RAG — retrieving historical fraud cases, policy coverage rules, and compliance guidelines from a vector store before making a decision. No hardcoded business rules. The agent reasons through context and acts.

The architecture problem I solved:

Most AI integrations in this space are stateless — send claim text, get a risk score back. That doesn't scale when you need decisions grounded in your own historical data, auditable reasoning trails, and downstream action execution. I built a dedicated claims-decision-engine service that owns the entire AI reasoning pipeline, isolated from the claims submission service so each can evolve independently.

Technical highlights:

  • Spring AI 1.0 tool-calling — agent has 7 annotated tools (3 RAG search, 4 action). @Tool descriptions drive LLM behaviour — the decision logic lives in the model's reasoning, not in application conditionals
  • pgvector with HNSW index — chose pgvector over Pinecone/Weaviate to avoid operational overhead; cosine similarity search across 768-dimension embeddings generated by Ollama locally
  • Fully async pipelineai-claims-service fires a non-blocking WebClient call to the decision engine with .subscribe(), returns the submission response immediately. Decision arrives out-of-band via email with complete agent reasoning
  • Dual Kafka topicsclaim-events for submission notifications, claim-decisions for agent decision notifications. notification-service consumes both independently
  • Resilience4j circuit breaker on all Groq calls — rule-based fallback scoring if AI is unavailable, claim submission never blocked

Stack: Java 17 Spring Boot 3.4 Spring AI 1.0 Groq llama-3.3-70b Ollama nomic-embed-text pgvector PostgreSQL Kafka Redis Resilience4j Docker Compose JWT


🤖 AI Integrated Claims Processing API

→ View Repository

An event-driven insurance claims microservice with real-time AI fraud scoring. This was the foundation project before the agentic system — focused on reliable AI API integration patterns in a production microservice context.

Technical highlights:

  • Groq AI fraud scoring (llama-3.3-70b-versatile) — structured prompt returns fraud risk (LOW/MEDIUM/HIGH), priority, and reasoning summary
  • Circuit breaker with fallback — if Groq is down, rule-based scoring kicks in automatically. Claim submission never fails due to AI unavailability
  • Redis caching on AI responses — identical claim descriptions return cached assessments, reducing Groq API calls by ~60%
  • Kafka event pipelineCLAIM_SUBMITTED, CLAIM_APPROVED, CLAIM_SETTLED events trigger downstream notification service with zero coupling
  • Idempotency + rate limiting — prevents duplicate claims and abuse of the AI scoring endpoint

Stack: Java 17 Spring Boot MySQL Redis Kafka Groq AI Resilience4j OpenFeign Docker JWT


🏦 Loan Application System

→ View Repository

A production-grade loan lifecycle management system built for the banking domain, designed around the concurrency and data integrity challenges that make financial systems genuinely hard to build correctly.

The architecture problem I solved:

Loan approval is a classic distributed systems problem — two concurrent requests approving the same loan simultaneously must not both succeed, and a user submitting the same application twice must not create duplicate records. I solved both with pessimistic locking at the database level and idempotency keys at the API level, then stress-tested both paths.

Technical highlights:

  • Pessimistic row locking (SELECT FOR UPDATE) on loan approval — prevents double disbursement under concurrent load
  • Idempotency keys with unique constraint enforcement — duplicate submissions return the existing record, never create a new one
  • Redis caching on credit score lookups and eligibility results — eliminates repeated DB calls for the same user profile
  • Bucket4j rate limiting — per-user application limits enforced at the API gateway layer
  • Full Kubernetes deployment on minikube — Deployments, Services, ConfigMaps, PersistentVolumeClaims; proper resource limits and health probes
  • Loan lifecycle state machine: APPLIED → UNDER_REVIEW → APPROVED → DISBURSED → CLOSED with audit trail

Stack: Java 17 Spring Boot MySQL Redis Docker Kubernetes Eureka Resilience4j Bucket4j JWT


Engineering Focus Areas

Distributed Systems      Microservices, service mesh, inter-service communication patterns
Data Integrity           Transactions, ACID guarantees, row locking, idempotency at scale
AI Integration           RAG pipelines, agentic systems, vector search, LLM reliability patterns
Resilience Engineering   Circuit breakers, bulkheads, retries, graceful degradation
Performance              Redis caching, DB indexing, query optimisation, connection pooling
Infrastructure           Docker, Kubernetes, CI/CD, observability

Open to Senior Java Backend Engineer roles in Banking, Insurance, and Fintech.

Pinned Loading

  1. Agentic-AI-with-RAG-Claims-Processing-System Agentic-AI-with-RAG-Claims-Processing-System Public

    AI-powered insurance claims processing system with RAG-based fraud detection, agentic decision-making (auto-approve, escalate, report), and Groq LLM integration — built with Spring Boot, pgvector, …

    Java

  2. AI-Integrated-Claims-Processing-System AI-Integrated-Claims-Processing-System Public

    AI-powered insurance claims processing system built with Spring Boot microservices, featuring Groq AI fraud detection, Kafka event-driven notifications, Redis caching, rate limiting, and circuit br…

    Java

  3. Loan-Application-System Loan-Application-System Public

    Production-grade loan application system using Spring Boot microservices. Covers core backend concepts: distributed transactions, pessimistic locking, idempotency, Redis caching, rate limiting, Res…

    Java

  4. Hotel-Management-System Hotel-Management-System Public

    A Hotel Management REST API built with Spring Boot, Spring Security (JWT), and MySQL. Features room management, guest registration, booking with date validation, check-in/check-out, and role-based …

  5. Library-Management-System-API Library-Management-System-API Public

    Secure RESTful Library Management System API built with Spring Boot, Spring Security, and JWT authentication, featuring role-based access control, CRUD operations for books and users, and productio…

    Java