CV | Yuetian Chen

Basics

Name	Yuetian Chen
Label	Ph.D. Student in Computer Science
Email	yuetian@purdue.edu
Phone	(518) 244-0845
Url	https://stry233.github.io/
Summary	Ph.D. student at Purdue University specializing in AI privacy and security, with focus on membership inference attacks against large language models. Published researcher in top venues including USENIX Security, CCS, NDSS, and ICLR.

Work

2024.07 - Present
Graduate Research Assistant

TruSe Lab, Purdue University

Leading research on membership inference attacks against fine-tuned LLMs and diffusion models under Professor Ninghui Li.
- Developed novel window-based MIA achieving 30% higher AUC than baselines
- Contributing to 5 papers at top security venues (ICLR, USENIX Security, NDSS)
- Co-developed SOFT framework reducing privacy leakage by 60% while maintaining 95% model utility
2023.05 - 2023.12
Undergraduate Research Assistant

Intelligent Systems Lab, RPI

Computer vision and robotics research under Professor Qiang Ji.
- Reduced MS-COCO object detection latency by 18% through novel backbone pruning
- Integrated real-time emotion/pose recognition on Pepper robot
- Successfully deployed system in 150-participant study
2023.05 - 2024.08
Undergraduate Research Assistant

Data Security and Privacy Lab, RPI

Pioneered new evaluation methodology for membership inference attacks under Professor Lei Yu.
- Published 2 papers (CCS'25, arXiv) on privacy attack research
- Co-developed MIAE toolkit with IBM Research featuring 8 attack algorithms
- Discovered critical vulnerability affecting 70% of existing MIA defenses
2022.03 - 2023.12
Undergraduate Research Assistant

Cognitive and Immersive Systems Lab, RPI

Research on LLM applications and creative AI under Professor Mei Si.
- Published 5 papers on creative AI applications (LREC-COLING, ICCC, AHFE, AIIDE)
- Developed prompt-chaining technique reducing GPT-3.5 hallucination by 40%
- Built multimodal pipeline deployed for 500+ users with 4.2/5 satisfaction rating

Education

2024.07 - Present

West Lafayette, Indiana
Ph.D.

Purdue University

Computer Science
- AI Privacy
- Large Language Models
- Membership Inference Attacks
- Differential Privacy
2020.07 - 2023.12

Troy, New York
Bachelor of Science

Rensselaer Polytechnic Institute

Computer Science
- Machine Learning from Data
- Intelligent Virtual Agent
- Computational Creativity
- Machine Learning & Optimization

Awards

2023.12.01

Dean's Honor List

Rensselaer Polytechnic Institute

Awarded for 6 consecutive semesters (Fall 2020 - Fall 2023)
2022.05.01

Academic Recognition Letter

Charles V. Stewart, RPI

Outstanding academic performance recognition
2022.08.01

Academic Recognition Letter

Mohammed J. Zaki, RPI

Outstanding academic performance recognition

Publications

2026.01.01

Cascading and Proxy Membership Inference Attack

Network and Distributed System Security Symposium (NDSS)

Attack-agnostic framework incorporating membership dependencies via conditional shadow training to boost MIA performance.
2026.01.01

Imitative Membership Inference Attack

USENIX Security Symposium

Novel imitative training technique requiring less than 5% computational cost of state-of-the-art approaches while outperforming existing MIAs.
2026.01.01

Window-based Membership Inference Attacks Against Fine-tuned Large Language Models

USENIX Security Symposium

Introduced WBC method using sliding windows and sign-based aggregation, achieving 2-3x improvements in detection rates at low false positive thresholds.
2026.01.01

Membership Inference Attacks on Finetuned Diffusion Language Models

International Conference on Learning Representations (ICLR)

First systematic investigation of MIA vulnerabilities in Diffusion Language Models. Introduced SAMA attack achieving 30% relative AUC improvement over baselines.
2025.01.01

Evaluating the Dynamics of Membership Privacy in Deep Learning

arXiv preprint

Dynamic analytical framework for dissecting privacy leakage dynamics at individual sample level throughout training.
2025.01.01

Membership Inference Attacks as Privacy Tools: Reliability, Disparity and Ensemble

ACM Conference on Computer and Communications Security (CCS)

Systematic investigation of disparities in MIAs through coverage and stability analysis, proposing ensemble framework for robust privacy evaluation.
2025.01.01

SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks

USENIX Security Symposium

Novel defense technique that mitigates privacy leakage by 60% while maintaining 95% model utility through influential data selection.
2024.05.01

Reflections & Resonance: Two-Agent Partnership for Advancing LLM-based Story Annotation

Joint International Conference on Computational Linguistics (LREC-COLING)

Novel multi-agent system for automating story annotation, creating the 'StorySense' corpus with 615 annotated stories.
2023.11.01

Enhancing Sentiment Analysis Results through Outlier Detection Optimization

arXiv preprint

Deep SVDD algorithm for outlier detection in text-based emotion analysis, improving performance across multiple datasets.
2023.06.01

Prompt to GPT-3: Step-by-Step Thinking Instructions for Humor Generation

International Conference on Computational Creativity (ICCC)

3-stage template lifting humor ratings to 4.1/5 (+18%) while reducing token usage by 22%.
2023.01.01

Visual Story Generation Based on Emotion and Keywords

AAAI Conference on AI and Interactive Digital Entertainment (AIIDE)

Story generation pipeline allowing user control over events and emotions, combining LLMs with diffusion models.
2023.01.01

Automated Visual Story Synthesis with Character Trait Control

Applied Human Factors and Ergonomics (AHFE)

Novel technique for creating consistent human figures in visual stories using DreamBooth and Stable Diffusion.

Skills

	AI/ML Research
	PyTorch
	JAX/Flax
	Transformers
	LLMs (GPT/LLaMA/Mistral)
	LoRA/QLoRA/PEFT
	Diffusion Models
	RLHF
	Multi-GPU Training
	CUDA

	Privacy & Security
	Membership Inference Attacks
	Differential Privacy
	Adversarial ML
	Backdoor Attacks
	Model Extraction
	Federated Learning

	Programming
	Python (Expert)
	C++ (Proficient)
	CUDA
	SQL
	Bash
	Git
	LaTeX
	Docker

	Research Tools
	Weights & Biases
	TensorBoard
	Jupyter
	NumPy
	Pandas
	scikit-learn
	SHAP
	Statistical Analysis

	Infrastructure
	AWS (EC2, S3, SageMaker)
	GCP (Vertex AI)
	HPC Clusters
	DeepSpeed
	FSDP
	Ray
	Kubernetes

Languages

	English
	Proficient

	Chinese
	Native speaker

	Japanese
	Elementary

Interests

	AI Privacy & Security
	Membership Inference Attacks
	Privacy-Preserving ML
	LLM Security
	Differential Privacy

	Large Language Models
	Fine-tuning
	Prompt Engineering
	Model Alignment
	Creative AI Applications

	Computer Vision
	Object Detection
	Diffusion Models
	Visual Story Generation
	Human-Robot Interaction

References

	Professor Ninghui Li
	Ph.D. Advisor at Purdue University, Department of Computer Science. Leading expert in data privacy and security.

	Professor Lei Yu
	Research Advisor at RPI, Department of Computer Science. Expert in privacy-preserving machine learning and membership inference attacks.

	Professor Mei Si
	Research Advisor at RPI, Department of Cognitive Science. Expert in AI applications for creative content generation and interactive storytelling.

	Professor Qiang Ji
	Research Advisor at RPI, Department of Department of ECSE. Expert in AI applications for computer vision and human computer (robot) interaction.

Projects

2023.08 - 2024.08
MIAE: Membership Inference Attack Evaluation Framework

Comprehensive Python package with 8 state-of-the-art MIA algorithms and 3 evaluation metrics. Framework adopted by IBM Watson Research for internal privacy auditing of production models.
- 8 attack algorithms implemented
- Adopted by IBM Research
- Open-source contribution
2023.9 - 2023.10
LLM Science Exam - Kaggle Competition

Ranked top 19% (491/2,664) by fine-tuning GPT-J with LoRA and prompt ensemble techniques. Achieved 77% accuracy under 8GB GPU constraint.
- Top 19% ranking
- LoRA fine-tuning
- Resource-efficient implementation
2022.03 - 2023.12
Visual Story Generation Pipeline

End-to-end system combining LLMs with Stable Diffusion for interactive story creation. Deployed for 500+ users with 4.2/5 satisfaction rating.
- 500+ active users
- 4.2/5 user satisfaction
- Multimodal AI integration

Basics

Work

TruSe Lab, Purdue University

Leading research on membership inference attacks against fine-tuned LLMs and diffusion models under Professor Ninghui Li.

Intelligent Systems Lab, RPI

Computer vision and robotics research under Professor Qiang Ji.

Data Security and Privacy Lab, RPI

Pioneered new evaluation methodology for membership inference attacks under Professor Lei Yu.

Cognitive and Immersive Systems Lab, RPI

Research on LLM applications and creative AI under Professor Mei Si.

Education

Purdue University

Computer Science

Rensselaer Polytechnic Institute

Computer Science

Awards

Rensselaer Polytechnic Institute

Awarded for 6 consecutive semesters (Fall 2020 - Fall 2023)

Charles V. Stewart, RPI

Outstanding academic performance recognition

Mohammed J. Zaki, RPI

Outstanding academic performance recognition

Publications

Network and Distributed System Security Symposium (NDSS)

Attack-agnostic framework incorporating membership dependencies via conditional shadow training to boost MIA performance.

USENIX Security Symposium

Novel imitative training technique requiring less than 5% computational cost of state-of-the-art approaches while outperforming existing MIAs.

USENIX Security Symposium

Introduced WBC method using sliding windows and sign-based aggregation, achieving 2-3x improvements in detection rates at low false positive thresholds.

International Conference on Learning Representations (ICLR)

First systematic investigation of MIA vulnerabilities in Diffusion Language Models. Introduced SAMA attack achieving 30% relative AUC improvement over baselines.

arXiv preprint

Dynamic analytical framework for dissecting privacy leakage dynamics at individual sample level throughout training.

ACM Conference on Computer and Communications Security (CCS)

Systematic investigation of disparities in MIAs through coverage and stability analysis, proposing ensemble framework for robust privacy evaluation.

USENIX Security Symposium

Novel defense technique that mitigates privacy leakage by 60% while maintaining 95% model utility through influential data selection.

Joint International Conference on Computational Linguistics (LREC-COLING)

Novel multi-agent system for automating story annotation, creating the 'StorySense' corpus with 615 annotated stories.

arXiv preprint

Deep SVDD algorithm for outlier detection in text-based emotion analysis, improving performance across multiple datasets.

International Conference on Computational Creativity (ICCC)

3-stage template lifting humor ratings to 4.1/5 (+18%) while reducing token usage by 22%.

AAAI Conference on AI and Interactive Digital Entertainment (AIIDE)

Story generation pipeline allowing user control over events and emotions, combining LLMs with diffusion models.

Applied Human Factors and Ergonomics (AHFE)

Novel technique for creating consistent human figures in visual stories using DreamBooth and Stable Diffusion.

Skills

Languages

Interests

References

Projects

Comprehensive Python package with 8 state-of-the-art MIA algorithms and 3 evaluation metrics. Framework adopted by IBM Watson Research for internal privacy auditing of production models.

Ranked top 19% (491/2,664) by fine-tuning GPT-J with LoRA and prompt ensemble techniques. Achieved 77% accuracy under 8GB GPU constraint.

End-to-end system combining LLMs with Stable Diffusion for interactive story creation. Deployed for 500+ users with 4.2/5 satisfaction rating.