Research

My research focuses on advancing visual perception systems by integrating edge-cloud processing, ensuring robust privacy safeguards, and leveraging dynamic temporal modeling to enable scalable, real-time scene understanding. This work is still in progress, so what follows is a preview of what I have done so far.

Open Science & Resources

I have released my fine-tuned FastVLM-0.5b model and the custom COCO dataset used for validation. Explore the model and dataset on Hugging Face, and check out the complete codebase on GitHub. I plan to formally publish this research in about a year through a professor-led collaboration.