Goutam Yelluru Gopal

I am a PhD student in the VidPro Group, Dept. of Electrical and Computer Engineering, Concordia University. My main research interest lies at the intersection of Computer Vision and Machine Learning, particularly in designing lightweight, robust algorithms for Visual Object Tracking.

I hold an MSc in Computer Science from Saarland University, with a specialization in Machine Learning. I wrote my Master's thesis at the Machine Learning Group on tackling algorithmic bias in binary classifiers.

Email  /  Google Scholar  /  Github  /  LinkedIn  /  CV

profile photo

News

Research

I'm interested in computer vision, machine learning, deep learning, and optimization.

Separable Self and Mixed Attention Transformers for Efficient Object Tracking
Goutam Yelluru Gopal, Maria Amer,
IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024
paper / arXiv / code

Transformer-based models are under-utilized for tracking due to the computational complexity of their attention blocks. This paper proposes an efficient self- and mixed-attention transformer-based architecture for lightweight tracking. Experiments show that our SMAT tracker surpasses related lightweight trackers on the GOT10k, TrackingNet, LaSOT, NfS30, UAV123, and AVisT datasets, while running at 37 fps on CPU and 158 fps on GPU.

Mobile Vision Transformer-based Visual Object Tracking
Goutam Yelluru Gopal, Maria Amer,
The 34th British Machine Vision Conference (BMVC), 2023
paper / arXiv / code / poster / video

State-of-the-art Vision Transformer-based trackers are computationally expensive: they have a large number of model parameters and rely on specialized hardware (e.g., a GPU) for fast inference. We propose a lightweight, accurate, and fast tracking algorithm that, for the first time, uses Mobile Vision Transformers (MobileViT) as the backbone, along with a novel approach to fuse the template and search region representations within the MobileViT backbone.

Reliable interconnected channels for dynamic DCF based visual tracking
Goutam Yelluru Gopal, Maria Amer,
Springer Multimedia Tools and Applications (MTAP), 2023

In DCF-based trackers, several non-discriminative or unreliable channels produce ambiguous filter responses under the influence of various external factors. We address this problem by proposing a three-fold objective function that accounts for inter-channel relationships, per-channel reliability scores, and a temporal prior during channel weight learning. In addition, we present an algorithm that computes channel weights efficiently and maintains the tracking speed.

Reliable Temporally Consistent Feature Adaptation for Visual Object Tracking
Goutam Yelluru Gopal, Maria Amer
27th IEEE International Conference on Image Processing (ICIP), 2020

The use of multiple features and sophisticated learning methods has increased the accuracy of DCF-based tracking. However, unreliable features cause erroneous target localization, leading to tracking failures. To alleviate this problem, we propose a method for online adaptation of feature weights based on their reliability. Our objective is modeled as a convex optimization problem for robust learning of feature weights.

Dynamic Channel Pruning For Correlation Filter Based Object Tracking
Goutam Yelluru Gopal, Maria Amer
45th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

In DCF-based tracking, not all channels contain information useful for target localization. In challenging scenarios, non-discriminative channels lead to erroneous tracking results. To mitigate this problem, we propose a method for dynamic channel pruning, modeling channel weight learning as a non-smooth convex optimization problem. We also propose an algorithm that solves the resulting problem more efficiently than off-the-shelf solvers.


Feel free to steal this website's source code. Do not scrape the HTML from this page itself, as it includes analytics tags that you do not want on your own website; use the GitHub code instead.