📝 Publications

🤖 AI Safety & Hallucination Detection

AAAI 2025
HalluMat

HalluMat: Detecting Hallucinations in Domain-Specific Large Language Models for Materials Science
Bhanu Prakash Vangala, Jianlin Cheng

Paper | Code
  • Key Innovation: Introduced the HalluMatData benchmark dataset and the HalluMatDetector framework for comprehensive hallucination detection in scientific LLMs
  • Technical Contributions: Multi-stage pipeline combining intrinsic verification, multi-source retrieval, contradiction graph analysis, and metric-based assessment
  • Impact: Achieved a 30% reduction in hallucination rates compared with baseline LLM outputs in the materials science domain
  • Novel Metric: Introduced Paraphrased Hallucination Consistency Score (PHCS) to quantify response inconsistencies across semantically equivalent queries
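
As a rough illustration of the idea behind a paraphrase-consistency score, the sketch below averages pairwise similarity between responses to semantically equivalent queries. The function names, the token-level Jaccard similarity, and the example responses are assumptions made for illustration only; the actual PHCS definition is given in the paper and may differ.

```python
from itertools import combinations


def jaccard(a: str, b: str) -> float:
    """Token-level Jaccard similarity (an illustrative stand-in for a semantic metric)."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta and not tb:
        return 1.0
    return len(ta & tb) / len(ta | tb)


def paraphrase_consistency(responses: list[str]) -> float:
    """Mean pairwise similarity of responses to paraphrased queries.

    1.0 means all responses are identical; lower values indicate inconsistency.
    """
    pairs = list(combinations(responses, 2))
    if not pairs:
        return 1.0  # a single response is trivially self-consistent
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)


# Hypothetical responses to three paraphrases of the same question.
responses = [
    "the band gap of silicon is 1.12 eV",
    "the band gap of silicon is 1.12 eV",
    "silicon has a band gap of 3.4 eV",
]
score = paraphrase_consistency(responses)
print(f"consistency: {score:.3f}")
```

A lower score flags a query whose answers drift across rewordings, which is the kind of inconsistency the metric is meant to surface.
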
AAAI 2025
HalluFormer

HalluFormer: Transformer-based Faithfulness Evaluation Framework for Large Language Models
Bhanu Prakash Vangala, Jianlin Cheng

Paper | Project
  • Architecture: Transformer-based model for multi-dimensional consistency checking between questions, answers, and retrieved knowledge contexts
  • Performance: Achieved an F1 score of 0.9471 on the MultiNLI test dataset and 0.7285 on the blind ANAH test dataset
  • Generalization: Demonstrates strong cross-domain generalization capabilities for detecting LLM hallucinations
  • Applications: Deployed in critical scientific and clinical domains requiring high factual accuracy
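
For readers unfamiliar with the F1 metric reported above, the following minimal sketch computes it from confusion counts as the harmonic mean of precision and recall. The function name and the counts are hypothetical, and the source does not state whether the reported scores are binary, macro-averaged, or weighted, so this shows only the basic binary form.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """Binary F1: harmonic mean of precision and recall."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts: 8 true positives, 2 false positives, 2 false negatives.
print(f1_score(tp=8, fp=2, fn=2))
```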

🚀 Scalable AI Infrastructure

Master's Thesis
LLM-as-Service

Deploying LLM-as-a-Service in Kubernetes HPC Clusters
Bhanu Prakash Vangala, Grant Scott, Jianlin Cheng

Code | Thesis (Coming Soon)
  • Infrastructure Innovation: Designed a Helm-based, GPU-aware deployment pipeline for seamless LLM inference in research clusters
  • Scalability Features: Implemented auto-scaling, resource throttling, and multi-tenant access configurations
  • Performance: Reduced deployment time by 60% and improved resource utilization by 40% in HPC environments
  • Real-world Deployment: Successfully deployed on the Nautilus cluster, serving 100+ concurrent users
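
To make the auto-scaling feature concrete, the sketch below applies the replica formula documented for the Kubernetes Horizontal Pod Autoscaler: desired replicas equal the ceiling of current replicas times the ratio of observed to target metric. Whether this project uses the HPA specifically is an assumption; the function name, the bounds, and the GPU-utilization numbers are hypothetical, and the tolerance band the real controller applies is omitted.

```python
import math


def desired_replicas(current_replicas: int, current_metric: float,
                     target_metric: float, min_replicas: int,
                     max_replicas: int) -> int:
    """Scale replicas by the observed/target metric ratio, clamped to the allowed range."""
    if target_metric <= 0:
        raise ValueError("target_metric must be positive")
    raw = math.ceil(current_replicas * current_metric / target_metric)
    return max(min_replicas, min(max_replicas, raw))


# Hypothetical case: 2 inference pods at 90% average GPU utilization, target 60%.
print(desired_replicas(2, 90.0, 60.0, min_replicas=1, max_replicas=10))
```

Clamping to a minimum and maximum is one simple way to combine scaling with the resource throttling mentioned above, since no single tenant's load can grow the deployment past the cap.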

🏥 Medical AI & Computer Vision

Research Project
Brain Tumor Detection

Brain Tumor Detection in MRI Images Using Deep Learning
Bhanu Prakash Vangala

Project Report | Code
  • Model Architecture: Custom CNN and ResNet50V2 with advanced data augmentation techniques
  • Performance Metrics: Achieved 96.8% accuracy and a 0.95 F1 score on brain tumor classification
  • Technical Innovations: Implemented learning rate optimization and ensemble methods for improved robustness
  • Clinical Impact: Developed interpretable visualization techniques for medical practitioners
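
One common way to ensemble classifiers such as a custom CNN and ResNet50V2 is soft voting, where per-class probabilities are averaged before taking the argmax. The sketch below shows that technique in plain Python; it is an assumption that the project ensembles this way, and the function name and probability values are hypothetical.

```python
def ensemble_predict(prob_sets: list[list[float]]) -> tuple[int, list[float]]:
    """Soft voting: average class probabilities across models, then pick the argmax."""
    n_models = len(prob_sets)
    n_classes = len(prob_sets[0])
    avg = [sum(p[c] for p in prob_sets) / n_models for c in range(n_classes)]
    label = max(range(n_classes), key=lambda c: avg[c])
    return label, avg


# Hypothetical outputs from two models for one MRI slice: [no tumor, tumor].
label, avg = ensemble_predict([[0.6, 0.4], [0.2, 0.8]])
print(label, avg)
```
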
MICCAI Workshop
Pneumonia Detection

Pneumonia Detection in Chest X-rays Using Deep Learning
Bhanu Prakash Vangala

Project Report

  • Model Comparison: Evaluated five architectures: a custom CNN, ResNet18, VGG16, ResNet50 (with K-fold cross-validation), and EfficientNet
  • Best Performance: EfficientNet achieved the highest classification metrics, with 94.2% accuracy
  • Training Innovations: K-fold cross-validation and multi-GPU acceleration for robust model training
  • Dataset: Trained on 5,216 chest X-ray images with rigorous validation protocols
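
The K-fold cross-validation mentioned above can be sketched as follows: the dataset is split into k contiguous folds, and each fold serves once as the validation set while the rest are used for training. The choice of k = 5 and the function name are assumptions for illustration; the sketch also omits the shuffling and stratification a real pipeline would likely apply.

```python
from typing import Iterator


def k_fold_indices(n: int, k: int) -> Iterator[tuple[list[int], list[int]]]:
    """Yield (train_indices, val_indices) for each of k contiguous folds over n samples."""
    if not 1 < k <= n:
        raise ValueError("k must satisfy 1 < k <= n")
    # Spread the remainder over the first n % k folds so sizes differ by at most 1.
    sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    start = 0
    for size in sizes:
        val = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n))
        yield train, val
        start += size


# 5,216 images as stated above; k = 5 is an assumed value.
for fold, (train, val) in enumerate(k_fold_indices(5216, 5)):
    print(fold, len(train), len(val))
```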

🎨 Creative AI & Image Processing

IJARESM 2023
Image Colorization

AI-Powered Image Colorization: From Grayscale to Photorealistic Color
Bhanu Prakash Vangala

Paper

  • Architecture: Convolutional autoencoders and GAN-based models for photorealistic colorization
  • Innovation: Novel loss functions combining perceptual and adversarial losses for natural color generation
  • Applications: Successfully applied to historical photograph restoration and film remastering
  • Performance: Achieved an SSIM score of 0.92 and a user preference rating of 87% in blind studies
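
A loss that combines perceptual and adversarial terms is often written as a weighted sum, with the perceptual term comparing feature vectors and the adversarial term rewarding outputs the discriminator rates as real. The sketch below shows that generic form in plain Python; the mean-squared feature distance, the non-saturating adversarial term, the weight `adv_weight`, and all names are assumptions, not the paper's actual formulation.

```python
import math


def perceptual_loss(feat_pred: list[float], feat_target: list[float]) -> float:
    """Mean squared distance between feature vectors (e.g. from a pretrained network)."""
    return sum((p - t) ** 2 for p, t in zip(feat_pred, feat_target)) / len(feat_pred)


def adversarial_loss(d_score: float, eps: float = 1e-12) -> float:
    """Non-saturating generator loss: -log D(G(x)), with D's output in (0, 1]."""
    return -math.log(min(max(d_score, eps), 1.0))


def generator_loss(feat_pred: list[float], feat_target: list[float],
                   d_score: float, adv_weight: float = 0.01) -> float:
    """Weighted sum of the perceptual and adversarial terms."""
    return perceptual_loss(feat_pred, feat_target) + adv_weight * adversarial_loss(d_score)


# Hypothetical features and discriminator score for one colorized image.
print(generator_loss([0.2, 0.7], [0.25, 0.65], d_score=0.8))
```

The weight controls the trade-off: a larger `adv_weight` pushes toward outputs that fool the discriminator, while a smaller one keeps colors closer to the target features.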

💬 NLP & Social Media Analytics

B.Tech Thesis
KOO Sentiment Analysis

Multilingual Sentiment Analysis on KOO: Uncovering User Sentiments and Trends
Bhanu Prakash Vangala

Thesis | Best Department Thesis Award
  • Multilingual Support: Developed a sentiment analysis pipeline supporting 10+ Indian languages
  • Real-time Processing: Implemented a streaming architecture for real-time sentiment detection
  • Scale: Processed over 1M posts with sub-second latency for content moderation
  • Impact: Deployed in production to improve user experience and safety on the social platform

📚 Additional Research Projects

  • 2025 ReflectMemory: Persistent memory control system for long-context LLM reasoning with chain-of-thought embedding storage
  • 2025 VisionAI: Real-time hazard detection for visually impaired users (runner-up at the MUIDSI Hackathon, $1,000 prize)
  • 2024 ChatMed: Medical chatbot trained on BioGPT and PubMed articles for symptom-based health guidance
  • 2024 CropInsight: Computer vision and sequence models for crop health monitoring and yield forecasting
  • 2023 SocialSift: Crisis-aware multilingual sentiment analysis during natural disasters using transformers

🎯 Under Review / In Preparation

  • Self-Correcting LLMs: Framework for automatic error detection and correction in language model outputs (Submitting to ACL 2025)
  • Factuality Benchmarks: Comprehensive evaluation suite for scientific domain LLMs (Preparing for NeurIPS 2025)
  • Reproducible AI Containers: Provenance-tracking containers for collaborative model development (NASA-funded, targeting ICML 2025)