Achint Soni

I am a thesis-based Master's student in Computer Science at the Cheriton School of Computer Science, University of Waterloo, advised by Prof. Sirisha Rambhatla and Prof. Charles Clarke. Previously, I received my B.Tech in Electrical Engineering from the Indian Institute of Technology Kanpur, where I was advised by Prof. Faiz Hamid and Prof. Laxmidhar Behera.

My primary research areas are generative world models, image and video generation/editing, 3D reconstruction, and disentangled representation learning.

If you want to collaborate or have any questions, feel free to send me an email. I am always interested in connecting with people.

News

[June'25] My work "VideoAgent: Self-Improving Video Generation" was accepted to the Workshop on Reinforcement Learning Beyond Rewards at RLC 2025.
[June'25] My work "LOCATEdit: Graph Laplacian Optimized Cross Attention for Localized Text-Guided Image Editing" was accepted to ICCV 2025.
[May'24] Our work "Why Do Variational Autoencoders Really Promote Disentanglement?" was accepted to ICML 2024.
[Sep'23] Joined the David R. Cheriton School of Computer Science at the University of Waterloo as a graduate student.
[May'23] Graduated from IIT Kanpur with a B.Tech. degree majoring in Electrical Engineering, with minors in machine learning and theory of computation.
[Apr'23] Received an offer of admission to the MMath in Computer Science program at the University of Waterloo.

Publications

LOCATEdit: Graph Laplacian Optimized Cross Attention for Localized Text-Guided Image Editing

Achint Soni, Meet Soni, Sirisha Rambhatla

ICCV, 2025

VideoAgent: Self-Improving Video Generation

Achint Soni, Sreyas Venkataraman, Abhranil Chandra, Sebastian Fischmeister, Percy Liang, Bo Dai, Sherry Yang

Preprint, under review at NeurIPS 2025

Understanding and Enforcing Precise Control in Generative Models via Graph-Based Attention

Achint Soni

Master's Thesis

Opinion: Mental Health Research: To Augment or Not To Augment

Argyrios Perivolaris, Alice Rueda, Karisa B Parkington, Achint Soni, Sirisha Rambhatla, Reza Samavi, Rakesh Jetly, Andrew James Greenshaw, Yanbo Zhang, Bo Cao, Sri Krishnan, Venkat Bhat

Frontiers in Psychiatry, 2025

VIDEOSCORE: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation

Xuan He, Dongfu Jiang, Ge Zhang, Max Ku, Achint Soni, Sherman Siu, Haonan Chen, Abhranil Chandra, Ziyan Jiang, Aaran Arulraj, Kai Wang, Quy Duc Do, Yuansheng Ni, Bohan Lyu, Yaswanth Narsupalli, Rongqi Fan, Zhiheng Lyu, Yuchen Lin, Wenhu Chen

EMNLP Main, 2024

Why Do Variational Autoencoders Really Promote Disentanglement?

Pratik Bhowal, Achint Soni, Sirisha Rambhatla

International Conference on Machine Learning (ICML), 2024