Sitemap
A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.
Pages
Posts
Future Blog Post
This post will show up by default. To disable scheduling of future posts, edit _config.yml and set future: false.
Blog Post number 4
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 3
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 2
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Blog Post number 1
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Portfolio
Masked-Unmasked Face Recognition with Chain of Thoughts
This project aimed to develop a robust face recognition system for occluded faces. My contributions included designing the core adaptive feature selection module and implementing the data preprocessing pipeline.
Publications
HCNQA: Enhancing 3D Visual Question-Answering with Hierarchical Concentration Narrowing Supervision
Published in International Conference on Artificial Neural Networks (ICANN), 2025, 2024
This work introduces HCNQA, a method to enhance spatial reasoning in 3D-VQA. By integrating a Hierarchical Concentration Narrowing (HCN) module, we guide the model’s attention to suppress shortcuts and improve performance on the ScanQA benchmark.
Research
Enhancing Spatial Reasoning in 3D Scene Understanding with LLMs
Investigated and implemented a pipeline to fuse textual spatial embeddings from a 3D grounding model (EDA) into an LLM (LEO), providing critical insights into mitigating spatial information loss in multimodal models.
Neural Module Network for 3D-VQA
Leading a project to develop a Neural Module Network (NMN) for 3D-VQA. My work involves designing a framework to parse natural language questions into executable programs, aiming to improve model interpretability, structured reasoning, and generalization.
Learning Variational Physical Representations from Visual Features
Upcoming research on variational, more expressive latent representations of physical properties, moving beyond deterministic point estimates to better capture the material uncertainty inherent in visual data.