📝 Publications
🤖 AI Safety & Hallucination Detection
AAAI 2025

HalluMat: Detecting Hallucinations in Domain-Specific Large Language Models for Materials Science
Bhanu Prakash Vangala, Jianlin Cheng
| Paper | Code |
- Key Innovation: Introduced HalluMatData benchmark dataset and HalluMatDetector framework for comprehensive hallucination detection in scientific LLMs
- Technical Contributions: Multi-stage pipeline combining intrinsic verification, multi-source retrieval, contradiction graph analysis, and metric-based assessment
- Impact: Achieved a 30% reduction in hallucination rates compared to baseline LLM outputs in the materials science domain
- Novel Metric: Introduced Paraphrased Hallucination Consistency Score (PHCS) to quantify response inconsistencies across semantically equivalent queries
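The exact PHCS formula is not given on this page. As a minimal sketch of the idea behind it, the snippet below scores how much a set of responses to paraphrased queries disagree. It assumes the score is one minus the mean pairwise similarity and uses token-set Jaccard overlap as a stand-in for a semantic similarity model. The names `jaccard` and `inconsistency_score` are hypothetical and not taken from the HalluMat code.

```python
from itertools import combinations


def jaccard(a: str, b: str) -> float:
    """Token-set Jaccard similarity (assumed stand-in for a semantic model)."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta and not tb:
        return 1.0  # two empty responses are treated as identical
    return len(ta & tb) / len(ta | tb)


def inconsistency_score(responses: list[str]) -> float:
    """Hypothetical PHCS-style score: 1 - mean pairwise similarity.

    0.0 means every response has the same token set; values near 1.0
    mean the responses to the paraphrased queries share almost no tokens.
    """
    pairs = list(combinations(responses, 2))
    if not pairs:
        return 0.0  # a single response cannot be inconsistent with itself
    mean_similarity = sum(jaccard(a, b) for a, b in pairs) / len(pairs)
    return 1.0 - mean_similarity


if __name__ == "__main__":
    # Responses an LLM might give to three paraphrases of one question.
    answers = [
        "the band gap is 1.1 eV",
        "the band gap is 1.1 eV",
        "the band gap is 3.4 eV",
    ]
    print(round(inconsistency_score(answers), 3))
```

A real implementation would likely replace `jaccard` with an embedding- or NLI-based similarity, since token overlap cannot tell a paraphrase from a contradiction that shares most words.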
AAAI 2025

HalluFormer: Transformer-based Faithfulness Evaluation Framework for Large Language Models
Bhanu Prakash Vangala, Jianlin Cheng
| Paper | Project |
- Architecture: Transformer-based model for multi-dimensional consistency checking between questions, answers, and retrieved knowledge contexts
- Performance: Achieved an F1 score of 0.9471 on the MultiNLI test set and 0.7285 on the blind ANAH test set
- Generalization: Demonstrates strong cross-domain generalization capabilities for detecting LLM hallucinations
- Applications: Deployed in critical scientific and clinical domains requiring high factual accuracy
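For readers unfamiliar with the F1 metric reported above, here is a minimal sketch of how it is computed from predictions. It assumes a binary setting (1 = hallucinated, 0 = faithful). The page does not state which averaging scheme was used for the multi-class MultiNLI result, so this is illustrative only, and the function name `f1_score` is hypothetical.

```python
def f1_score(y_true: list[int], y_pred: list[int], positive: int = 1) -> float:
    """Binary F1: harmonic mean of precision and recall for the positive class."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0  # no true positives: precision or recall is zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    labels = [1, 1, 0, 0, 1]       # ground truth (1 = hallucinated)
    predictions = [1, 0, 0, 1, 1]  # detector output
    print(round(f1_score(labels, predictions), 4))
```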
🚀 Scalable AI Infrastructure
Master's Thesis

Deploying LLM-as-a-Service in Kubernetes HPC Clusters
Bhanu Prakash Vangala, Grant Scott, Jianlin Cheng
| Code | Thesis (Coming Soon) |
- Infrastructure Innovation: Designed Helm-based GPU-aware deployment pipeline for seamless LLM inference in research clusters
- Scalability Features: Implemented auto-scaling, resource throttling, and multi-tenant access configurations
- Performance: Reduced deployment time by 60% and improved resource utilization by 40% in HPC environments
- Real-world Deployment: Successfully deployed on Nautilus cluster serving 100+ concurrent users
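The page does not say which auto-scaling mechanism the pipeline uses. Assuming it relies on the standard Kubernetes Horizontal Pod Autoscaler, the replica count follows the documented rule `desired = ceil(current * currentMetric / targetMetric)`, skipped when the ratio is within a tolerance of 1. The sketch below reproduces that rule. The function name, the replica bounds, and the GPU-utilization example values are assumptions for illustration.

```python
import math


def desired_replicas(
    current_replicas: int,
    current_metric: float,
    target_metric: float,
    min_replicas: int = 1,
    max_replicas: int = 8,
    tolerance: float = 0.1,
) -> int:
    """HPA-style scaling rule, clamped to [min_replicas, max_replicas]."""
    if target_metric <= 0:
        raise ValueError("target_metric must be positive")
    ratio = current_metric / target_metric
    # Within the tolerance band, the autoscaler leaves the replica count alone.
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    desired = math.ceil(current_replicas * ratio)
    return max(min_replicas, min(max_replicas, desired))


if __name__ == "__main__":
    # Assumed example: 2 inference pods at 90% average GPU utilization,
    # with a 60% utilization target.
    print(desired_replicas(2, current_metric=90, target_metric=60))
```

The clamp matters on shared research clusters, where `max_replicas` is effectively bounded by the GPU quota granted to a tenant.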
🏥 Medical AI & Computer Vision
Research Project

Brain Tumor Detection in MRI Images Using Deep Learning
Bhanu Prakash Vangala
| Project Report | Code |
- Model Architecture: Custom CNN and ResNet50V2 with advanced data augmentation techniques
- Performance Metrics: Achieved 96.8% accuracy and a 0.95 F1-score on brain tumor classification
- Technical Innovations: Implemented learning rate optimization and ensemble methods for improved robustness
- Clinical Impact: Developed interpretable visualization techniques for medical practitioners
MICCAI Workshop

Pneumonia Detection in Chest X-rays Using Deep Learning
Bhanu Prakash Vangala
- Model Comparison: Evaluated five architectures: a custom CNN, ResNet18, VGG16, ResNet50 with K-fold cross-validation, and EfficientNet
- Best Performance: EfficientNet achieved the highest classification metrics, with 94.2% accuracy
- Training Innovations: K-fold cross-validation and multi-GPU acceleration for robust model training
- Dataset: Trained on 5,216 chest X-ray images with rigorous validation protocols
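The K-fold cross-validation mentioned above can be sketched as follows. This is a generic index-splitting routine, not the project's training code. The fold count `k=5`, the shuffle seed, and the function name `k_fold_indices` are assumptions, since the page does not state them.

```python
import random
from typing import Iterator


def k_fold_indices(
    n_samples: int, k: int = 5, seed: int = 0
) -> Iterator[tuple[list[int], list[int]]]:
    """Yield (train_indices, val_indices) for each of k shuffled folds."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must satisfy 2 <= k <= n_samples")
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)

    # Spread the remainder over the first folds so sizes differ by at most 1.
    base, extra = divmod(n_samples, k)
    folds, start = [], 0
    for i in range(k):
        size = base + (1 if i < extra else 0)
        folds.append(indices[start:start + size])
        start += size

    for i in range(k):
        val = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, val


if __name__ == "__main__":
    # 5,216 images, as in the dataset described above.
    for fold, (train, val) in enumerate(k_fold_indices(5216, k=5)):
        print(fold, len(train), len(val))
```

Each image lands in exactly one validation fold, so every sample contributes to both training and evaluation across the k runs.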
🎨 Creative AI & Image Processing
IJARESM 2023

AI-Powered Image Colorization: From Grayscale to Photorealistic Color
Bhanu Prakash Vangala
- Architecture: Convolutional autoencoders and GAN-based models for photorealistic colorization
- Innovation: Novel loss functions combining perceptual and adversarial losses for natural color generation
- Applications: Successfully applied to historical photograph restoration and film remastering
- Performance: Achieved SSIM score of 0.92 and user preference rating of 87% in blind studies
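For context on the SSIM figure above, the sketch below computes a simplified, single-window (global) version of the structural similarity index on flattened pixel lists. Standard SSIM averages this quantity over local windows, so the numbers are not directly comparable to the reported 0.92. The constants use the conventional choice of K1 = 0.01 and K2 = 0.03, and the function name `global_ssim` is hypothetical.

```python
def global_ssim(x: list[float], y: list[float], data_range: float = 255.0) -> float:
    """Single-window SSIM over two equally sized, flattened grayscale images."""
    if len(x) != len(y) or not x:
        raise ValueError("images must be non-empty and the same size")
    n = len(x)
    mu_x, mu_y = sum(x) / n, sum(y) / n
    var_x = sum((a - mu_x) ** 2 for a in x) / n
    var_y = sum((b - mu_y) ** 2 for b in y) / n
    cov = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / n

    # Stabilizing constants from the conventional SSIM definition.
    c1 = (0.01 * data_range) ** 2
    c2 = (0.03 * data_range) ** 2

    numerator = (2 * mu_x * mu_y + c1) * (2 * cov + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


if __name__ == "__main__":
    reference = [10.0, 50.0, 120.0, 200.0]   # ground-truth channel values
    colorized = [12.0, 48.0, 125.0, 190.0]   # model output for the same pixels
    print(round(global_ssim(reference, colorized), 4))
```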
💬 NLP & Social Media Analytics
B.Tech Thesis

Multilingual Sentiment Analysis on KOO: Uncovering User Sentiments and Trends
Bhanu Prakash Vangala
| Thesis | Best Department Thesis Award |
- Multilingual Support: Developed sentiment analysis pipeline supporting 10+ Indian languages
- Real-time Processing: Implemented streaming architecture for real-time sentiment detection
- Scale: Processed over 1M posts with sub-second latency for content moderation
- Impact: Deployed in production to improve user experience and safety on the platform
📚 Additional Research Projects
- ReflectMemory (2025): Persistent memory control system for long-context LLM reasoning with chain-of-thought embedding storage
- VisionAI (2025): Real-time hazard detection for visually impaired users; runner-up at the MUIDSI Hackathon ($1,000 prize)
- ChatMed (2024): Medical chatbot trained on BioGPT and PubMed articles for symptom-based health guidance
- CropInsight (2024): Computer vision and sequence models for crop health monitoring and yield forecasting
- SocialSift (2023): Crisis-aware multilingual sentiment analysis during natural disasters using transformers
🎯 Under Review / In Preparation
- Self-Correcting LLMs: Framework for automatic error detection and correction in language model outputs (Submitting to ACL 2025)
- Factuality Benchmarks: Comprehensive evaluation suite for scientific domain LLMs (Preparing for NeurIPS 2025)
- Reproducible AI Containers: Provenance-tracking containers for collaborative model development (NASA-funded, targeting ICML 2025)