I am a first year CS PhD student at the University of Michigan, advised by Justin Johnson. I am broadly interested in deep learning, with applications in computer vision and natural language processing.
Prior to joining UMich, I was a visiting scholar at the Georgia Institute of Technology, collaborating with awesome folks in the labs of Devi Parikh and Dhruv Batra. My work was focused on image captioning (nocaps), probabilistic neural-symbolic models for VQA, and visual dialog.
I graduated from Indian Institute of Technology Roorkee in 2018, with a major in Electrical Engineering and minor in Computer Science. I was selected twice as a Google Summer of Code student, with TARDIS Foundation (2016) and OpenCV (2017). Parallel to my second GSoC, I interned at Goldman Sachs, Bangalore. I was also an active member of the Mobile Development Group IITR, and started a reading group, now named Vision and Language Group.
Feel free to say hi: kdexd at umich dot edu
[Jun 2020] Paper out on arxiv: VirTex: Learning Visual Representations from Textual Annotations.
[May 2020] Serving as a reviewer for ECCV 2020.
[Oct 2019] Presented my work nocaps at ICCV 2019, Seoul.
[Sep 2019] Started as a PhD student at the University of Michigan!
[Aug 2019] nocaps: novel object captioning at scale accepted to ICCV 2019!
[Apr 2019] Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering accepted to ICML 2019!
[Feb 2019] Paper out on arxiv: Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering.
While I keep marching forward as years pass by, my first projects would always remind me of my humble beginnings: it has been five years on this roller-coaster, which I call my professional life.