Welcome to VICLab!



We are Video & Image Computing Lab at KAIST.
Our research of interest includes deep-learning-based computer vision, computational image & video processing as well as image & video understanding and 2D/3D video coding.
Our recent intensive works focus on Computer Vision research in the fields of:
[1] 3D image/video reconstruction: (1) optical flow estimation, (2) camera pose estimation, (3) dynamic neural radiance field (NeRF) and static/dynamic Gaussian Splatting (3D/4D GS) learning of video for novel view synthesis;
[2] image/video generation and editing with visual foundation models: (1) object-level image/video editing, (2) image text editing and style changes,
[3] natural image and video restoration: (1) super-resolution, (2) video frame interpolation and motion deblurring, (3) depth estimation, (4) image deraining, dehazing,and in-painting,
[4] satellite images: (1) PAN sharpening, super-resolution and cloud removal of Electro-Optical (EO) images, (2) super-resolution, detection and classification of Synthetic Aperture Radar (SAR) image targets, (3) SAR-to-EO image-to-image translation learning, (4) remote-sensing visual language models (RS-VLM).

![[(under review)] "🎯 PRIMEdit: Probability Redistribution for Instance-aware Multi-object Video Editing with Benchmark Dataset"](https://static.wixstatic.com/media/5b1cac_94374dbf808848809c589d18512528ff~mv2.png/v1/fill/w_250,h_250,fp_0.50_0.50,q_35,blur_30,enc_avif,quality_auto/5b1cac_94374dbf808848809c589d18512528ff~mv2.webp)
![[(under review)] "🎯 PRIMEdit: Probability Redistribution for Instance-aware Multi-object Video Editing with Benchmark Dataset"](https://static.wixstatic.com/media/5b1cac_94374dbf808848809c589d18512528ff~mv2.png/v1/fill/w_100,h_100,fp_0.50_0.50,q_95,enc_avif,quality_auto/5b1cac_94374dbf808848809c589d18512528ff~mv2.webp)
![[ICCV2025] "Bridging the Skeleton-Text Modality Gap: Diffusion-Powered Modality Alignment for Zero-shot Skeleton-based Action Recognition"](https://static.wixstatic.com/media/5b1cac_d859019a092f4b6db6580caaac3af46a~mv2.png/v1/fill/w_250,h_250,fp_0.50_0.50,q_35,blur_30,enc_avif,quality_auto/5b1cac_d859019a092f4b6db6580caaac3af46a~mv2.webp)
![[ICCV2025] "Bridging the Skeleton-Text Modality Gap: Diffusion-Powered Modality Alignment for Zero-shot Skeleton-based Action Recognition"](https://static.wixstatic.com/media/5b1cac_d859019a092f4b6db6580caaac3af46a~mv2.png/v1/fill/w_100,h_100,fp_0.50_0.50,q_95,enc_avif,quality_auto/5b1cac_d859019a092f4b6db6580caaac3af46a~mv2.webp)
![[ICCV2025] "👀One Look is Enough: Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation on High-Resolution Images"](https://static.wixstatic.com/media/5b1cac_3deeadc6f7eb4d2f934c04a8834bcdb1~mv2.png/v1/fill/w_250,h_250,fp_0.50_0.50,q_35,blur_30,enc_avif,quality_auto/5b1cac_3deeadc6f7eb4d2f934c04a8834bcdb1~mv2.webp)
![[ICCV2025] "👀One Look is Enough: Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation on High-Resolution Images"](https://static.wixstatic.com/media/5b1cac_3deeadc6f7eb4d2f934c04a8834bcdb1~mv2.png/v1/fill/w_100,h_100,fp_0.50_0.50,q_95,enc_avif,quality_auto/5b1cac_3deeadc6f7eb4d2f934c04a8834bcdb1~mv2.webp)
![[ICCV2025] "🌕PAN-Crafter: Learning Modality-Consistent Alignment for PAN-Sharpening"](https://static.wixstatic.com/media/5b1cac_6400052ebcd24bfd94f92893555b1cdd~mv2.png/v1/fill/w_250,h_250,fp_0.50_0.50,q_35,blur_30,enc_avif,quality_auto/5b1cac_6400052ebcd24bfd94f92893555b1cdd~mv2.webp)
![[ICCV2025] "🌕PAN-Crafter: Learning Modality-Consistent Alignment for PAN-Sharpening"](https://static.wixstatic.com/media/5b1cac_6400052ebcd24bfd94f92893555b1cdd~mv2.png/v1/fill/w_100,h_100,fp_0.50_0.50,q_95,enc_avif,quality_auto/5b1cac_6400052ebcd24bfd94f92893555b1cdd~mv2.webp)
![[CVPR2025] "🌍U- Know- Diff-PAN: Uncertainty-aware Knowledge Distillation Diffusion Framework with Details Enhancement for PAN-Sharpening"](https://static.wixstatic.com/media/5b1cac_77410a056b294194b3b27f3fa5bd1870~mv2.png/v1/fill/w_250,h_250,fp_0.50_0.50,q_35,blur_30,enc_avif,quality_auto/5b1cac_77410a056b294194b3b27f3fa5bd1870~mv2.webp)
![[CVPR2025] "🌍U- Know- Diff-PAN: Uncertainty-aware Knowledge Distillation Diffusion Framework with Details Enhancement for PAN-Sharpening"](https://static.wixstatic.com/media/5b1cac_77410a056b294194b3b27f3fa5bd1870~mv2.png/v1/fill/w_100,h_100,fp_0.50_0.50,q_95,enc_avif,quality_auto/5b1cac_77410a056b294194b3b27f3fa5bd1870~mv2.webp)
![[CVPR2025] "SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video"](https://static.wixstatic.com/media/5b1cac_da86c5ed2a314c56bae6aaddeb2447df~mv2.webp)









































