Nan Xi

I am a final year Ph.D. student of Computer Science and Engineering at University at Buffalo. I am fortunate to be supervised by Prof. Junsong Yuan. Prior to UB, I obtained my M.D. degree in oncology, where I was initially trained as an internal oncologist. My current research interest lies in visual reasoning and vision-language learning. In the medical domain, my ultimate goal is to empower medical AI systems with the ability of visual reasoning akin to human experts.

Contact: nanxi [at] buffalo dot edu

Research Interest and Highlights

Video Reasoning of Human Activities

  1. [**New**]

    Yisong Wang $^ *$, Nan Xi $^ *$ $^ \dagger$, Jingjing Meng, Junsong Yuan. “Interaction-centric Spatio-Temporal Context Reasoning for Multi-person Video HOI Recognition”. in European Conference on Computer Vision (ECCV), 2024 ($*$ Equal contricution; $\dagger$ Corresponding author)

  2. Nan Xi, Jingjing Meng, Junsong Yuan. “Open Set Video HOI detection from Action- centric Chain-of-Look Prompting”. in Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2023

Surgical Scene Reasoning

  1. Nan Xi, Jingjing Meng, Junsong Yuan. “Chain-of-Look Prompting for Verb-centric Surgical Triplet Recognition in Endoscopic Videos”, in Proc. ACM International Conf. on Multimedia (ACM MM), 2023

  2. Nan Xi, Jingjing Meng, Junsong Yuan. “Forest Graph Convolution Network for Surgical Action Triplets Recognition in Endoscopic Videos”, in IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2022