Research

Research Interest

Improving the efficiency and performance of large-scale Vision-Language Models (VLMs)

AI security, including agentic defensive AI for autonomous security systems and deepfake detection

Large-scale Data filtering and curation for efficient pre-training

Social AI, focusing on human-centric and socially aware intelligence

Reinforcement Learning for adaptive and generalizable decision-making systems

Autonomous driving-related perception and decision-making tasks

Generative modeling with Diffusion models

Improving the efficiency and performance of large-scale Vision-Language Models (VLMs)

[P4] What Does the Caption Really Say? Counterfactual Phrase Intervention for Compositional Data Selection in Vision-Language Pretraining
Hyejin Go, Semi Lee, and Hyesong Choi
arXiv preprint, 2026
[P3] AdaMerge: Salience-Aware Adaptive Token Merging for Training-Free Acceleration of Vision Transformers
Semi Lee, Hyejin Go, Hyesong Choi
arXiv preprint, 2026
[P2] When Eyes Betray AI: Social Gaze Consistency as a Semantic Cue for AI-Generated Image Detection
Jihyeon Kim, Sohee Kim, Soosan Lee, Souhwan Jung, James Matthew Rehg, Hyesong Choi
arXiv preprint, 2026
[P1] The Rescue Effect: Spatio-Semantic Early Exit Bypasses Quantization Collapse in CLIP
Kahyeon Nam, Hyesong Choi
arXiv preprint, 2026
[P10] Isotropic Embedding Perturbations for Robust Vision Language Encoders
Hyesong Choi, Daeun Kim, Song Park, Taekyung Kim, Byeongho Heo, Sangdoo Yun, Dongbo Min, and Dongyoon Han
under review, 2026
[P8] ECC: Encoder-Centric Corruption for Fine-Grained Vision in VLMs
Hyesong Choi, Daeun Kim, Sungmin Cha, Kwang Moo Yi, and Dongbo Min
under review, 2026
[P6] Enhancing Alignment for Unified Multimodal Models via Semantically-Grounded Supervision
Jiyeong Kim, Yerim So, Hyesong Choi, Uiwon Hwang, and Dongbo Min
under review, 2026
[P3] iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation
Hayeon Jo, Hyesong Choi, Minhee Cho, and Dongbo Min
arXiv preprint arXiv:2409.02838

AI security, including agentic defensive AI for autonomous security systems and deepfake detection

[P2] When Eyes Betray AI: Social Gaze Consistency as a Semantic Cue for AI-Generated Image Detection
Jihyeon Kim, Sohee Kim, Soosan Lee, Souhwan Jung, James Matthew Rehg, Hyesong Choi
arXiv preprint, 2026
[P5] Stabilizing Robustness Transfer in Adversarial Distillation with Controlled Teacher Adaptation
Hyejin Park, Hyesong Choi, and Dongbo Min
under review, 2026
[P10] Isotropic Embedding Perturbations for Robust Vision Language Encoders
Hyesong Choi, Daeun Kim, Song Park, Taekyung Kim, Byeongho Heo, Sangdoo Yun, Dongbo Min, and Dongyoon Han
under review, 2026

Large-scale Data filtering and curation for efficient pre-training

[P4] What Does the Caption Really Say? Counterfactual Phrase Intervention for Compositional Data Selection in Vision-Language Pretraining
Hyejin Go, Semi Lee, and Hyesong Choi
arXiv preprint, 2026
[P9] CORE: Corruption-Reconstruction based Data Filtering Network
Hyesong Choi, Daeun Kim, Seungmin Baek, Taekyung Kim, Byeongho Heo, Dongbo Min, and Dongyoon Han
under review, 2026
[C7] Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training
Hyesong Choi, Hyejin Park, Kwang Moo Yi, Sungmin Cha, and Dongbo Min
ECCV, 2024
[C6] Emerging Property of Masked TOken for Effective Pre-training
Hyesong Choi, Hunsang Lee, Seyoung Joung, Hyejin Park, Jiyeong Kim, and Dongbo Min
ECCV, 2024
[P2] SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction
Sumin Son, Hyesong Choi, and Dongbo Min
arXiv preprint arXiv:2409.02513

Social AI, focusing on human-centric and socially aware intelligence

[P2] When Eyes Betray AI: Social Gaze Consistency as a Semantic Cue for AI-Generated Image Detection
Jihyeon Kim, Sohee Kim, Soosan Lee, Souhwan Jung, James Matthew Rehg, Hyesong Choi
arXiv preprint, 2026

Reinforcement Learning for adaptive and generalizable decision-making systems

Autonomous driving-related perception and decision-making taskS

[C10] RobIA: Robust Instance-aware Continual Test-time Adaptation for Deep Stereo
Jueun Ko, Hyewon Park, Hyesong Choi, and Dongbo Min
NeurIPS, 2025
[C9] TADFormer: Task-Adaptive Dynamic TransFormer for Efficient Multi-Task Learning
Seungmin Baek, Soyul Lee, Hayeon Jo, Hyesong Choi, and Dongbo Min
CVPR, 2025
[C2] Sequential Cross Attention Based Multi-Task Learning
Sunkyung Kim, Hyesong Choi, and Dongbo Min
ICIP, 2022
[C1] Adaptive Confidence Thresholding for Monocular Depth Estimation
Hyesong Choi, Hunsang Lee, Sunok Kim, Seungryoung Kim, and Dongbo Min
ICCV, 2021
[J5] UniTT-Stereo: Unified Training of Transformer for Enhanced Stereo Matching
Soomin Kim, Hyesong Choi, Jihye Ahn, and Dongbo Min
IEEE ACCESS, 2025
[J4] Global Structural Knowledge Distillation for Semantic Segmentation
Hyejin Park, Keonhee Ahn, Hyesong Choi, and Dongbo Min
IEEE ACCESS, 2025
[J3] MaDis-Stereo: Enhanced Stereo Matching via Distilled Masked Image Modeling
Jihye Ahn, Hyesong Choi, Sommin Kim, and Dongbo Min
IEEE ACCESS, 2025

Generative modeling with Diffusion models

[P2] When Eyes Betray AI: Social Gaze Consistency as a Semantic Cue for AI-Generated Image Detection
Jihyeon Kim, Sohee Kim, Soosan Lee, Souhwan Jung, James Matthew Rehg, Hyesong Choi
arXiv preprint, 2026
[P7] Adaptive Noise Injection, Bootstrapping Denoising Diffusion for Generalized Recognition
Hyesong Choi, Daeun Kim, and Dongbo Min
under review, 2026

AI Principles – Study the fundamental structure and learning mechanisms of modern AI models

Scalability & Efficiency – Design architectures and training strategies for scalable and efficient AI

Generalization – Develop AI systems that adapt across domains, tasks, and data modalities

Foundation Model – Research large-scale models from a core architectural and training perspective

Academic Sustainability – Make large models trainable even with limited resources compared to industry