Soumik Mukhopadhyay

PhD student, PI Lab, University of Maryland.
Previously: Amazon Studios | Samsung Research | IIT Guwahati.
Find me @ IRB3116, UMD.

hello world,

I’m a PhD student at the Department of Computer Science, University of Maryland, working with Prof. Abhinav Shrivastava. I completed my undergraduate degree at IIT Guwahati, majoring in Computer Science and Engineering. Before starting my PhD, I was a Research Engineer in the Medical Imaging Research Group at the Healthcare & Medical Equipment division of Samsung Research Institute - Bangalore.


My research interests lie in Computer Vision and Machine Learning, especially Generative Models. I’m currently working on generative model representations, generation in implicit neural representation (INR) space, and facial video generation. I have previously worked on visual forgery detection and privacy, as well as various medical image analysis algorithms (ultrasound).

I am thankful to have had the opportunity to do research with Prof. Tianyi Zhou, Mr. Nitin Singhal, Mr. Bhavya Ajani, Prof. Arijit Sur, Prof. Shubhamoy Maitra, Prof. Debjyoti Bera, and Prof. Sagarmoy Dutta.


My hobbies include singing, drawing, and hiking. I used to also like photo editing before it was easy.


Oct 24, 2023 One paper accepted at WACV 2024.
Jun 16, 2023 Awarded the Summer Research Fellowship 2023.
May 22, 2022 I will be interning at Amazon Studios & Prime Video in Summer 2022.
May 24, 2021 Awarded the Summer Research Fellowship 2021.
Sep 1, 2020 Awarded the Dean’s Fellowship 2020.

selected publications


  1. Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization
    Soumik Mukhopadhyay, Saksham Suri, Ravi Teja Gadde, and 1 more author
    In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Jan 2024


  1. Do text-free diffusion models learn discriminative visual representations?
    Soumik Mukhopadhyay*, Matthew Gwilliam*, Yosuke Yamaguchi, and 5 more authors
    Under Review, Jan 2023
  2. Diffusion Models Beat GANs on Image Classification
    Soumik Mukhopadhyay*, Matthew Gwilliam*, Vatsal Agarwal, and 5 more authors
    arXiv preprint, Jan 2023


  1. Deep learning based needle tracking in prostate fusion biopsy
    Soumik Mukhopadhyay, Praful Mathur, Aditya Bhardwaj, and 5 more authors
    In Medical Imaging 2021: Image-Guided Procedures, Robotic Interventions, and Modeling, Jan 2021


  1. Rigid and deformable corrections in real-time using deep learning for prostate fusion biopsy
    Aditya Bhardwaj, Jun-Sung Park, Soumik Mukhopadhyay, and 4 more authors
    In Medical Imaging 2020: Image-Guided Procedures, Robotic Interventions, and Modeling, Jan 2020